000README. This file describes the lastest available release of the PIR-International Protein Sequence Database on the NCBI anonymous FTP mirror server. The most current release of Protein Sequence Database and additional PIR databases can be found at the NBRF Anonymous FTP server NBRF.Georgetown.Edu. This machine runs VAX/VMS; directories are depicted in [] with "." separating sub-directories. Change Directory to [ANONYMOUS.PIR] for more information in the file 000README. Weekly updates of the databases and a variety of database search and analysis tools are available via the NBRF/PIR WWW Home page at URL http://pir.georgetown.edu/ ______________________________________________________________________________ The release is available in two formats: the VMS compatible (NBRF-PIR formatted) version and the ASCII CARD image (CODATA formatted) version. At this site all files originate in the path: /ncbi/ftp/repository/PIR VMS specific files are in the vms subdirectory and ASCII specific files are in the ascii subdirectory. VMS version ----------- The VMS version of the database contains the datasets PIR1, PIR2, PIR3 and NRL_3D in NBRF-PIR format split among two files (.REF and .SEQ) for each dataset. Please refer to the following documents for more information: 00TAPE_DOC.VMS - document describing lastest release 0PRFILE_DOC.VMS - document describing NBRF-PIR file format 0PROTEIN_DOC.VMS - document describing NBRF-PIR database file structure and format specification Files with the ".Z" extension have been compressed by LZCOMP. LZDCMP preserves the VAX/VMS file characteristics and is required to uncompress these files after binary transfer. The LZ suite of programs may be acquired from the PIR Anonymous FTP server at NBRF.Georgetown.Edu in directory [ANONYMOUS.COMPRESS]. For VMS systems the LZ.SHAR file contains the software; for non-VMS systems a series of .C and .H source files is available. The DCOMPRESS program may also be used to uncompress these files. ASCII version ------------- The ASCII version of the database contains the datasets PIR1, PIR2, PIR3 and NRL_3D in CODATA sequence exchange format. Please refer to the following documents for more information: 00TAPE_DOC.ASCII - document describing lastest release 0PROTEIN_DOC.ASCII - document describing CODATA exchange format specification Files with the ".Z" extension have been compressed by the Unix compatible "zcompress" program. Use LZDCMP or "zcompress -d" to uncompress selected files after a binary transfer. Not all files listed in the documents are available on the FTP server. If you would like access to data not presently available but listed in one of the documents or if you have suggestions or comments please notify: Peter McGarvey Ph.D. Email: mcgarvey@NBRF.Georgetown.edu Phone: (202) 687-2121