UniGene

FILES IN THIS DIRECTORY
=======================

NOTE: The files in this directory have been renamed with "Hs" (for
Homo sapiens) appearing somewhere in the filename. A mouse version of
UniGene is under construction and the prefix "Mm" (Mus musculus) will
be used analogously for those files.

Hs.info
    Some statistics for the current build.

Hs.seq.all.Z
    Human transcript sequences derived from both known genes and ESTs
    that have been partitioned into clusters. The lines beginning with
    the # character delimit the clusters. The cluster identifier, which
    is NOT guaranteed to remain stable across UG builds, appears as
    Xx.99999, with Xx the two-letter organism abbreviation. Otherwise,
    the sequences are shown in FASTA style. The number following the #
    is the UniGene sequence ID; this number won't change from UG build
    to build, though the sequence may not remain in the same (or in
    any) cluster across UG builds. If the GB or dbEST sequence is
    updated, the UG sid remains the same. Note that individual clusters
    may be downloaded from the main UniGene website.

Hs.seq.uniq.Z
    One sequence selected from each UniGene cluster (the one with the
    longest region of high-quality sequence data). This file is
    intended to be used for BLAST/FASTA searching.

Hs.data.Z
    NOTE: This is a new file and the format is still undergoing
    revision. Send comments to Lukas Wagner (wagner@ncbi.nlm.nih.gov).

    Line types/qualifiers:

    ID            UniGene cluster ID
    TITLE         Title for the cluster
    GENE          Gene symbol
    CHROMOSOME    Chromosome
    CYTOBAND      Cytological band
    STS           STS
      NAME=       Name of STS
      ACC=        GenBank/EMBL/DDBJ accession number of STS
      DSEG=       GDB Dsegment number
    PROTSIM       Protein Similarity data
      ORG=        Organism
      PROTID=     Sequence ID of protein
      PCT=        Percent alignment
      ALN=        Length of aligned region (aa)
    SCOUNT        Number of sequences in the cluster
    SEQUENCE      Sequence
      ACC=        GenBank/EMBL/DDBJ accession number of sequence
      NID=        Unique nucleotide sequence identifier
      PID=        Unique protein sequence identifier (used for non-ESTs)
      CLONE=      Clone identifier (used for ESTs only)
      END=        End (5'/3') of clone insert read (used for ESTs only)
      LID=        Library ID; see Hs.lib.info for library name and tissue
    //            End of record

Hs.lib.info.Z
    Additional information regarding the LID field. Note that libraries
    may be browsed and downloaded via the Library Browser page,
    accessible through the main UniGene website.
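
EXAMPLE: READING Hs.data RECORDS

The following is a minimal Python sketch, not an official tool, of how
the Hs.data line types and qualifiers listed above might be read once
the file has been uncompressed. It rests on assumptions that this file
does not state: that each line starts with its line type followed by
whitespace, that qualifiers on a line are separated by semicolons, and
that STS, PROTSIM and SEQUENCE lines may repeat within a record. The
sample record and the names parse_qualifiers and parse_records are
hypothetical and exist only for illustration. Because the format is
still under revision, check these assumptions against the actual file.

```python
import io

# Line types assumed to repeat within one record (not stated in this README).
REPEATABLE = {"STS", "PROTSIM", "SEQUENCE"}


def parse_qualifiers(value):
    """Split 'KEY=val; KEY=val' into a dict.

    The ';' separator is an assumption about the file layout.
    """
    quals = {}
    for part in value.split(";"):
        key, sep, val = part.strip().partition("=")
        if sep:  # skip fragments that are not KEY=value
            quals[key] = val
    return quals


def parse_records(handle):
    """Yield one dict per record; a record ends at a '//' line."""
    record = {}
    for raw in handle:
        line = raw.strip()
        if not line:
            continue
        if line.startswith("//"):
            if record:
                yield record
            record = {}
            continue
        parts = line.split(None, 1)  # line type, then the rest of the line
        line_type = parts[0]
        value = parts[1].strip() if len(parts) > 1 else ""
        if line_type in REPEATABLE:
            record.setdefault(line_type, []).append(parse_qualifiers(value))
        else:
            record[line_type] = value
    if record:
        yield record  # tolerate a missing final '//'


# Hypothetical sample record, invented for illustration only.
SAMPLE = """\
ID          Hs.99999
TITLE       Example cluster
SCOUNT      2
SEQUENCE    ACC=X00001; NID=g0000001; PID=g0000002
SEQUENCE    ACC=X00002; NID=g0000003; CLONE=12345; END=5'; LID=1
//
"""

records = list(parse_records(io.StringIO(SAMPLE)))
for rec in records:
    print(rec["ID"], rec["TITLE"], len(rec["SEQUENCE"]))
```

To read a real file, the same parse_records function could be given an
open text file handle instead of the in-memory sample. The LID values
collected from SEQUENCE lines can then be looked up in Hs.lib.info.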