![]() |
The PROSITE database of protein families and domains Release Notes Release 18, July 2003 |
Table of contents |
---|
(1) Introduction |
---|
The following table shows the growth of the database since its creation in 1989.
Rel. | Date | Doc | Entries | Note |
---|---|---|---|---|
1.0 | 03/89 | 58 | 60 | Only released in PC/Gene (Version 5.16) |
2.0 | 03/89 | 129 | 132 | Only released in PC/Gene (Version 6.00) |
3.0 | 05/89 | ? | 160 |   |
4.0 | 10/89 | ? | 202 | Printed release (EMBL Biocomputing document) |
5.0 | 04/90 | 296 | 338 |   |
6.0 | 11/90 | 375 | 433 |   |
7.0 | 05/91 | 441 | 508 |   |
8.0 | 11/91 | 530 | 605 |   |
9.0 | 06/91 | 580 | 689 |   |
10.0 | 12/92 | 635 | 803 |   |
11.0 | 10/93 | 715 | 927 |   |
12.0 | 06/94 | 785 | 1029 | First release to include profiles |
13.0 | 11/95 | 889 | 1167 |   |
14.0 | 12/97 | 997 | 1335 |   |
15.0 | 06/98 | 1014 | 1352 |   |
16.0 | 07/99 | 1034 | 1374 |   |
17.0 | 12/01 | 1108 | 1501 |   |
18.0 | 07/03 | 1200 | 1639 |   |
(2) Description of the changes made to PROSITE since release 17.0 |
---|
(see <PDB:1J5E; M>)where M is the PDB chain identifier.
A file containing all deleted AC (psdelac.txt) has been added to the prosite package:
(3) Forthcoming changes |
---|
Generally repeats possess high amino acid substitution rates and their identification is highly problematic. Even if the presence of a certain repeat family is known, the exact locations and the number of repetitive units often cannot be determined using current profile search. We have implemented a context dependant threshold that allows the detection of strongly divergent repeats when well characterized ones have already been identified.
This method will be implemented in ps_scan.pl, the reference tool to scan PROSITE and the following minor changes will be done in the profile format:
Tags 'R' and 'RR' will be introduced in the field TEXT of MA /CUT_OFF lines. This minor change are compatible with old version of pfsearch and pfscan. Example:
MA /CUT_OFF: LEVEL=0; SCORE=246; N_SCORE=8.5; MODE=1; TEXT='R'; MA /CUT_OFF: LEVEL=-1; SCORE=158; N_SCORE=5.8; MODE=1; TEXT='RR';
This Swiss-Prot modification will introduce a change in the size of PROSITE DR lines. As soon as Swiss-Prot introduces the 5-letter code in ID lines, we will extend PROSITE DR lines to 76 characters.
IP IPR001254; Trypsin;
GO GO:0003677; Molecular function: DNA binding (inferred from direct assay);
CC /TAXO-RANGE=ABEPV;
Some domains are clearly known to be absent from certain taxonomic range. For example no ubiquitination is observed in prokaryotes so we can exclude the presence of domains linked to ubiquitination in this kingdom. Actually there are no possibilities to distinguish between a range where we know that the presence of a given domain will never appear and a range where a domain was never found but where we cannot exlude its presence. To distinguish these two possibilities we change the TAXO-RANGE format as followed:
CC /TAXO-RANGE=A?EP-;
where '?' indicates that no matches were observed for a given signature but we cannot exclude its presence in this range.
and '-' indicates that we can exclude its presence.
The format will be:
CC /VERSION=n;where n is a digit number.
All signatures anterior to release 18.0 will have the version number 1. Version number will be incremented only when modification will appear in PA and MA lines.
(4) Status of the PROSITE files |
---|
prosuser.txt | User manual |
profile.txt | Description of the profile syntax |
psrelnot.txt | Release notes for the current release |
prosite.dat | Patterns, profiles and rules databases (updated weekly) |
prosite.doc | Documentation database for each pattern and profile (updated weekly) |
prosite.lis | List of documentation entries (updated weekly) |
pautindex.txt | Authors index (updated weekly) |
psdelac.txt | Deleted accession number index (updated weekly) |
experts.txt | List of on-line experts for PROSITE and Swiss-Prot (updated weekly) |
jourlist.txt | List of cited journals in PROSITE (updated weekly ) |
ps_98.txt | Announcement concerning PROSITE |
(5) FTP access to PROSITE |
---|
Organization | Swiss Institute of Bioinformatics (SIB) |
Address | ftp.expasy.org |
Directory | /databases/prosite/ |
(6) Acknowledgments |
---|