================================== ================================== NOTICE OF FILE NAME FORMAT CHANGE: ================================== ================================== In RefSeq release 49 (September 2011), the name format will change as follows - File Format Change Current name New name -------------------------------------------------------------------------------- All add decimal plant6.rna.gbff.gz plant.6.rna.gbff.gz genomic fasta add sub-part plant6.genomic.fna.gz plant.6.1.genomic.fna.gz 1. The file number will be preceded by a decimal '.' for all files. The file number in this example is '6'. 2. A sub-part number will be added to all genomic fasta files. The sub-part number in this example is '1'. Reason for this change: Large FASTA files will be split into smaller files to facilitate file transfer. The sub-part number has been added to provide a unique name to these files. To facilitate file name parsing, a sub-part number (1) will always be provided on genomic fasta files even if the file was not split. Details: When generating the various file formats for a given node of the RefSeq Release, we start with files in uncompressed binary ASN.1 format of no more than 500 megabytes. However, because the binary ASN.1 format for genomic records is very compact and efficient, output formats such as genomic FASTA can expand to *much* larger sizes. So, in order to make file transfer and processing easier for our users, we are introducing a "subpart number" in the filenames of genomic FASTA files. For example, file 'plant.6.genomic.bna.gz' is the 6th part of genomic ASN.1 data for the plant node. Genomic FASTA which is generated from this file is now broken up into multiple subparts. For example, 'plant.6.2.genomic.fna.gz' is the 2nd sub-part of genomic FASTA data that corresponds to the 6th part of plant node data in ASN.1 format. There might be multiple such files, depending on the degree of expansion:'plant.6.1.genomic.fna.gz' , 'plant.6.2.genomic.fna.gz', 'plant.6.3.genomic.fna.gz' , and etc.