Sequence Read Archive

Sequence Read Archive

Content
Description	FASTQ Sequences BAM data
Organisms	all
Contact
Research center	National Center for Biotechnology Information European Bioinformatics Institute DNA Data Bank of Japan
Access
Website	www.ncbi.nlm.nih.gov/sra/ www.ebi.ac.uk/ena/ trace.ddbj.nig.ac.jp/dra/index_e.html

National Center for Biotechnology Information
European Bioinformatics Institute

www.ncbi.nlm.nih.gov/sra/
www.ebi.ac.uk/ena/

The Sequence Read Archive (SRA, previously known as the Short Read Archive) is a bioinformatics database that provides a public repository for DNA sequencing data, especially the "short reads" generated by High-throughput sequencing, which are typically less than 1,000 base pairs in length. The archive is part of the International Nucleotide Sequence Database Collaboration (INSDC), and run as a collaboration between the NCBI, the European Bioinformatics Institute (EBI), and the DNA Data Bank of Japan (DDBJ).

The archive was established by the National Center for Biotechnology Information (NCBI) in 2007 in order to provide a repository for data produced by RNA-Seq and ChIP-Seq studies as well as large-scale studies including the Human Microbiome Project and the 1000 Genomes Project. Originally called the Short Read Archive, the name was changed in anticipation of future sequencing technologies being able to produce longer sequence reads.

The volume of data deposited in the Sequence Read Archive has grown rapidly. As of September 2010, 65% of the SRA was human genomic sequence, with another 16% relating to human metagenome sequence reads. Much of this data was deposited through the 1000 Genomes Project. In June 2011, the data contained within the SRA passed 100 Terabases of DNA in volume.

...
Wikipedia