 |
EMBL database-related SRS libraries
-
EMBL
The virtual library contains the data from the following libraries described below: EMBL (Release), EMBL (Updates), EMBL (Third Party Annotation)
-
EMBL (Contig)
EMBLCON entries include construct information for building sequences of chromosomes, genomes and other long sequences which can't be reasonably represented by a single entry. CON entries are also used to describe the supercontig assemblies of Whole Genome Shotgun (WGS) data. CON entries don't contain sequence data per se; the arrangement of the segments is described in the assembly section of a CON entry, line type CO, as in the following example:
CO join(Z99104.1:1..213080,Z99105.1:18431..
221160,Z99106.1:13061..209100, ...
For the full description of the CO lines, please refer to the User Manual.
- EMBL (Contigs expanded)
EMBLCONEXP is a library of EMBL-formatted flatfiles, obtained by expanding construct information from the entries in the CON division of the EMBL database. Flatfiles include full sequence and annotation from the underlying segments.
-
EMBL (Release)
Library contain the nucleotide sequence data from
the latest release of EMBL nucleotide sequence database. Standard entries and Whole Genome Shotgun (WGS) data are included; CON, expanded CON, TPA and Coding Sequences entries are not included into this library
-
EMBL (Updates)
Library contains entries that were newly created or updated since the latest release of EMBL nucleotide sequence database. Standard entries and Whole Genome Shotgun (WGS) are included into this library.
-
EMBL (Coding Sequences)
Library contains nucleotide sequences of the CDS (coding sequence) features, as
annotated in EMBL database. Data are distributed in a flatfile format, similar
to
that of the EMBL nucleotide sequence database (see the description of the format
here)
The CRC32 checksum is calculated and indexed for the nucleotide sequences of the
CDSs, in order to put together the groups of CDSs sharing checksum values
 |