Rfam: annotating non-coding RNAs in complete genomes

S Griffiths-Jones, S Moxon, M Marshall… - Nucleic acids …, 2005 - academic.oup.com
Nucleic acids research, 2005academic.oup.com
Rfam is a comprehensive collection of non-coding RNA (ncRNA) families, represented by
multiple sequence alignments and profile stochastic context-free grammars. Rfam aims to
facilitate the identification and classification of new members of known sequence families,
and distributes annotation of ncRNAs in over 200 complete genome sequences. The data
provide the first glimpses of conservation of multiple ncRNA families across a wide
taxonomic range. A small number of large families are essential in all three kingdoms of life …
Abstract
Rfam is a comprehensive collection of non-coding RNA (ncRNA) families, represented by multiple sequence alignments and profile stochastic context-free grammars. Rfam aims to facilitate the identification and classification of new members of known sequence families, and distributes annotation of ncRNAs in over 200 complete genome sequences. The data provide the first glimpses of conservation of multiple ncRNA families across a wide taxonomic range. A small number of large families are essential in all three kingdoms of life, with large numbers of smaller families specific to certain taxa. Recent improvements in the database are discussed, together with challenges for the future. Rfam is available on the Web at http://www.sanger.ac.uk/Software/Rfam/ and http://rfam.wustl.edu/ .
Oxford University Press