#### README ####

IMPORTANT: Please note you can download correlation data tables, 
supported by Ensembl, via the highly customisable BioMart and 
EnsMart data mining tools. See http://protists.ensembl.org/biomart/martview 
or http://www.ebi.ac.uk/biomart/ for more information.


##################
Fasta RNA dumps
#################

These files hold the transcript sequences corresponding to non-coding RNA genes (ncRNA).

------------
FILE NAMES
------------
The files are consistently named following this pattern:
<species>.<assembly>.<eg_version>.<sequence type>.<status>.fa.gz

<species>: The systematic name of the species. 
<assembly>: The assembly build name.
<eg_version>: The version of Ensembl Genomes from which the data was exported.
<sequence type>: ncrna for non-coding RNA sequences
<status>
  * 'ncrna' - all non-coding RNA genes

EXAMPLES
  for Human:
    Homo_sapiens.NCBI36.ncrna.all.fa.gz
      Transcript sequences for all ncRNA gene types.


-------------------------------
FASTA Sequence Header Lines
------------------------------
The FASTA sequence header lines are designed to be consistent across 
all types of Ensembl FASTA sequences.  This gives enough information 
for the sequence to be identified outside the context of the FASTA file. 

General format:

>ID SEQTYPE:STATUS LOCATION GENE

Example of an Ensembl RNA header:

>ENST00000347977 ncrna:miRNA chromosome:NCBI35:1:217347790:217347874:-1 gene:ENSG00000195671
   ^             ^     ^     ^                                          ^
   ID            |     |  LOCATION                            GENE: gene stable ID
                 |   STATUS
              SEQTYPE