#### README ####

IMPORTANT: Please note you can download subsets of data via the BioMart
data mining tool. See https://www.ensembl.org/info/data/biomart/ for more
information.

##################
Fasta RNA dumps
#################

These files hold the transcript sequences corresponding to non-coding RNA
genes (ncRNA).

------------
FILE NAMES
------------
The files are consistently named following this pattern:
<species>.<assembly>.<sequence type>.fa.gz

<species>: The systematic name of the species.
<assembly>: The assembly build name.
<sequence type>: ncrna for non-coding RNA sequences

EXAMPLES
  for Human:
    Homo_sapiens.NCBI36.ncrna.fa.gz
      Transcript sequences for all ncRNA gene types.

-------------------------------
FASTA Sequence Header Lines
------------------------------
The FASTA sequence header lines are designed to be consistent across all
types of Ensembl FASTA sequences.

Stable IDs for genes and transcripts are suffixed with a version if they
have been generated by Ensembl (this is typical for vertebrate species,
but not for non-vertebrates).

General format:

>TRANSCRIPT_ID SEQTYPE LOCATION GENE_ID GENE_BIOTYPE TRANSCRIPT_BIOTYPE

Example of an Ensembl ncRNA header:

>ENST00000347977.1 ncrna chromosome:NCBI35:1:217347790:217347874:-1 gene:ENSG00000195671.1 gene_biotype:miRNA transcript_biotype:miRNA
 ^                 ^     ^                                          ^                      ^                  ^
 TRANSCRIPT_ID     |     LOCATION                                   GENE_ID                GENE_BIOTYPE       TRANSCRIPT_BIOTYPE
                   SEQTYPE
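
The header layout described above can be split into named fields with a few lines of Python. The following is a minimal sketch, not an official Ensembl tool: the function names parse_header and iter_headers are hypothetical, and the labels given to the six colon-separated parts of LOCATION (coordinate system, assembly, region name, start, end, strand) are an assumption inferred from the example header rather than something this README states.

```python
import gzip
from typing import Dict, Iterator


def parse_header(line: str) -> Dict[str, object]:
    """Split one ncRNA FASTA header line into named fields (illustrative helper)."""
    if not line.startswith(">"):
        raise ValueError("not a FASTA header line: %r" % line)

    # Fields are whitespace-separated: TRANSCRIPT_ID SEQTYPE LOCATION, then key:value pairs.
    tokens = line[1:].split()
    if len(tokens) < 3:
        raise ValueError("header has fewer than three fields: %r" % line)

    record: Dict[str, object] = {
        "transcript_id": tokens[0],
        "seqtype": tokens[1],
    }

    # Assumption: LOCATION has six colon-separated parts, as in the example header.
    parts = tokens[2].split(":")
    if len(parts) != 6:
        raise ValueError("unexpected LOCATION layout: %r" % tokens[2])
    coord_system, assembly, region, start, end, strand = parts
    record["location"] = {
        "coord_system": coord_system,
        "assembly": assembly,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": int(strand),
    }

    # Remaining tokens look like "gene:...", "gene_biotype:...", "transcript_biotype:...".
    for token in tokens[3:]:
        key, sep, value = token.partition(":")
        if sep:
            record[key] = value
    return record


def iter_headers(path: str) -> Iterator[Dict[str, object]]:
    """Yield parsed headers from a gzip-compressed FASTA file."""
    with gzip.open(path, "rt") as handle:
        for line in handle:
            if line.startswith(">"):
                yield parse_header(line.rstrip("\n"))
```

For example, calling iter_headers("Homo_sapiens.NCBI36.ncrna.fa.gz") on a downloaded copy of the file named in the EXAMPLES section would yield one dictionary per transcript. Sequence lines are skipped, so this only inspects the headers.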