ID   ABBA01065756; SV 1; linear; genomic DNA; WGS; HUM; 5355 BP.
AC   ABBA01065756;
PR   Project:PRJNA19621;
DT   05-JUN-2007 (Rel. 92, Created)
DT   23-AUG-2014 (Rel. 121, Last updated, Version 4)
DE   Homo sapiens CTG_1103276821598, whole genome shotgun sequence.
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
RN   [1]
RC   Publication Status: Online-Only
RP   1-5355
RX   PUBMED; 17803354.
RA   Levy S., Sutton G., Ng P.C., Feuk L., Halpern A.L., Walenz B.P.,
RA   Axelrod N., Huang J., Kirkness E.F., Denisov G., Lin Y., Macdonald J.R.,
RA   Pang A.W., Shago M., Stockwell T.B., Tsiamouri A., Bafna V., Bansal V.,
RA   Kravitz S.A., Busam D.A., Beeson K.Y., McIntosh T.C., Remington K.A.,
RA   Abril J.F., Gill J., Borman J., Rogers Y.H., Frazier M.E., Scherer S.W.,
RA   Strausberg R.L., Venter J.C.;
RT   "The Diploid Genome Sequence of an Individual Human";
RL   PLoS Biol. 5(10):E254-E254(2007).
RN   [2]
RP   1-5355
RA   Levy S., Sutton G., Ng P., Feuk L., Halpern A.L., Walenz B., Axelrod N.,
RA   Huang J., Kirkness E.F., Denisov G., Lin Y., MacDonald J.R., Wing A.,
RA   Pang C., Shago M., Stockwell T.B., Tsiamouri A., Bafna V., Bansal V.,
RA   Kravitz S.A., Busam D., Beeson K.Y., McIntosh T.C., Remington K., Gill J.,
RA   Borman J., Johnson J., Resnick A., Rogers Y.-H., Frazier M., Scherer S.W.,
RA   Strausberg R.L., Venter J.C.;
RT   ;
RL   Submitted (18-MAY-2007) to the INSDC.
RL   J Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850,
DR   MD5; b37b107090f25647d8e0acc95be470d2.
DR   ENA; ABBA010000000; SET.
DR   ENA; ABBA000000000; SET.
DR   ENA-CON; DS486227.
DR   ENA-CON; GL000186.
DR   BioSample; SAMN02981236.
CC   DNA Donor Name: J. Craig Venter | Date of Birth: October 14, 1946 |
CC   Sex: Male | Ethnicity: Caucasian | Descent: European - England
CC   This WGS project represents a composite haploid version of the
CC   genome where the highest scoring allele contained is represented in
CC   the consensus sequence.  The number of contigs may differ from
CC   those in the PLoS Biol. paper (PLoS Biology 2007 5: e254) because
CC   some short sequences were found to be foreign and thus were
CC   suppressed. Scaffolds DS486015-DS490530 represent the 4528
CC   scaffolds that are discussed in the paper.  There are fewer than
CC   listed in the paper because 12 of the original were determined to
CC   be foreign, so were omitted here.  Scaffolds DS490531-DS490620 are
CC   the remaining multi-component scaffolds, not in the set of 4528.
CC   The chromosomes are records CM000462-CM000485, assembled from the
CC   scaffolds.
FH   Key             Location/Qualifiers
FT   source          1..5355
FT                   /organism="Homo sapiens"
FT                   /mol_type="genomic DNA"
FT                   /sex="male"
FT                   /dev_stage="adult"
FT                   /db_xref="taxon:9606"
SQ   Sequence 5355 BP; 1135 A; 1581 C; 421 G; 2218 T; 0 other;
     ccattatatt cgagtctgtt ctacccagtc cattccattc tgttccattc cattcgattc        60
     cattccattc cattctattc catactattg cattcctttt gattccattc tatacgaaga       120
     aattcccttt gagaccattc ctttcgagtc tattctattt gagtccattt catttgattc       180
     cattactttt gtgtccattc cattctattc cattccattc cattccattc cattccattc       240
     cattccattc cattccattc cattctgttg ctttccattc cattgaatgc cattctgttt       300
     gatactattc ctttagtgtc cattccattc gagtccattc ctttccattc cattccaatt       360
     gatgccattc cttttgattc tattccattc aacactatac catttcatgc cattctatcc       420
     gattccgttc ctttctattc ctttccattc cattccactc cattccatta caatcgtttc       480
     cattccattc gagttcattt cactcagcct gataccattt gagtccattc cattacagtg       540
     cattccattc gagtccattc cattccagtc cattccattt gttatatttc cattactctc       600
     cattccattc tattcttttt tattccattc aattcctttc agtttgattt cattcttttc       660
     aattccattc cattcaactt cattacattc gagtccattc aatgccattc cattccgtta       720
     cagtcgagtc cattccattc gattccattt ttttcccatc cattcttccg agtgcattcc       780
     attccagtct atttgattcg attccattcc atttgattcc attatattca attccattcc       840
     actggatttc gctcggttcc attccattgc attccattct attccattcc actgtataac       900
     tttccattcc atttgattat tttccattgg attccattcc actcaaatca attgcattgc       960
     attccattac gttcgagtcc gttctattcc attccattca attctggtgg attccagtca      1020
     attccattcc atactaatgc tttccattcg attccattct attcgaataa attccattcg      1080
     aaaccttacc tttctagtcc attatatttg tgtctattct gtccgagtcc attacatttg      1140
     ggtccattcc attccattcc attccattcc attccattcg atgatattcc attcaattct      1200
     attccacttg agtccattcc actcgattaa atttctttcc gttcctttat attcaatgtc      1260
     attccttttg attctattcc atttgactgc attccattcc attctgttcc attcctttcc      1320
     attccattct attccttctc attccatttc attcctttcc accgccctcc tttccattcc      1380
     attcgagtcc attcaactcc aatgcattcc attccagtcc tttactttcg agtccattcc      1440
     attccattgc tttccattcg atatcttttc attacactcc attccattct atccctttct      1500
     actccataca ctttcattct ttcgattcat ttccatttgg ttccattcta ttatattcca      1560
     ttccaacaga gtccattccg ttccattcca ttccataaca tactgttcca ttcgattcca      1620
     atctgatcaa tttcatattt ttccagtcca atccattcca atccattcca ttccattcca      1680
     ctccattcca ttccattcaa tgtcattgca ttccgttcca ttccactaga ttccactaag      1740
     ttccattcca ttgcattcca ttctattcca ttccattcca tttgattaca ttacattcga      1800
     atccattctt ttcaaatcaa ttagatttca tttttttact ttcgagccag ttctattcca      1860
     ctccaatcca ttccggtcta ttccattcaa ttacaattct ttcgattcca ttccatacta      1920
     attcattcca ttcgattcca ttctatttga ataaattgct ttcaaaacct ttcctttcaa      1980
     gtccattcta tttgagtcca ttccattcga gtccattata tttggttcca ggccattgct      2040
     ttccaatcca ttcgatgtca ttccattcaa ttctgctcca tttgggtcca ttccattcca      2100
     ttccattcga gtccattcca ttccattcca ttccatctga tgccattcca ttagattctt      2160
     ttccattcga ttccattctg ttccattccg ttccatccaa ttccattccg ttctattcct      2220
     ttcctttcca ttctttccat tccaatcatt tcctttccat ttgagtccaa tcctctccag      2280
     tccattccat tccagtccca ttccattgcg gtctactcca atccagtcca taccattcca      2340
     ttcgagtcca atcttctcca ggccattcca ttccagtcca ttctattcca gtctattcca      2400
     ttccattcca ttccattcca ttccattcct ttctttttga tatctttcca ttacactcca      2460
     tttgattcta ttcctttcga ttctattcaa ttccattaca ttcatgtcca ttccatttga      2520
     tttcattgaa accgactcca ttccatttga gtccattcca ttccgtttgt tatgtttcca      2580
     ttaaactcta ttccattcta ttcctttcga ttctattcaa ttccattcca ttcgattcca      2640
     ttccattcca ttcggttcct ttccattcga ctgcattcca ttcgtgtcca ttgtattcca      2700
     atccattcca ttctattcca tttgattaca atctgtttga ttccattttg taccagtcca      2760
     ttccattcga gtgcattcca ttacagtcca tgccattcga ttacactcca ttcgattcca      2820
     gtccatgcca tttgatacca ttccatatga ttccattcaa ttcgattcca ttacactcga      2880
     ctctactcca ctcgattcca ctccgttccc ctttatggca tttaattcta ttccattcca      2940
     ttgcttacca ttctaatcca ttttattaca ttccatttga atgcattgca ttgaaataaa      3000
     ttacattgca atgcatttca ttcgagtcgg ttctatccca ttccattcca ttctggtaca      3060
     gtacgttcga ttccattcca tactatttca ttccattcga ttccattcaa cttgattcca      3120
     tttgatacca ttcctttcga ttccattcca tactactgca ttctattcaa ttccattcta      3180
     tttgaataaa ttccattcga gaaaattcct ttagagtgca ttacgtttga gtccattcat      3240
     tcgagtccat tatatttggg tccattctgt tccattaaaa tccattcgat gtcattccct      3300
     tttatgctgt tccattcaag tccattccat tggagtccat accaatccat accattccat      3360
     tccattcgaa accattccat tcgattttat tcgattcaac tccattccat tccattccat      3420
     tccattccat tccattccat tccatttcat cccattcctt tccattcttt tcctttccat      3480
     tgcattgcat tccattcgtt tctattccat tcgattccat tccattccat tcaatgccat      3540
     tccatttgat tctattccat tcgattccat ttcattccat ccaatttcat tccattctat      3600
     tcctttccat tccattccat tcctttccat tccattcaat tcgttatcat tccattcgag      3660
     tccattccac tccattccat tccatttgat tccattcctt tccagtccat tccatttgag      3720
     tcctttccaa accattacat tcgatcttcc cattactttc catttcattc tatacctttc      3780
     aatttgattc aattacattc cattcggttt cattccatta gactccattc cattcgtgtc      3840
     cattccactg cactccattc cattcctttc cattccattc cattttgttc cagtaaattc      3900
     cattcgagtc ctttccattc cagtccattc catttgattc cattccactc gattcaactc      3960
     cattctattg cattgcactc cattctattc cattccattc tatttgatta ctttccatta      4020
     gattccatta cattcgaatc aaatacattg caatccagta cattcgagtc cgttctattc      4080
     ccttccattc cattctattt tattccattc gattccattc catttgattc cattccatac      4140
     tattacatta aattcgatta cattctattt gaatgaattc cattcgagat cctttctttc      4200
     gagtgcattc tatttgagtc cattccattc gagtccattc catttggttc cattccattc      4260
     cattcaatgc cattctatcg tattctattg cattcgaatc cgttcaattc gaggccattc      4320
     tattccgttc catttcattc cattccattc gatgccattc cattcgttta tattccattc      4380
     gacttcattc cattccattt cccttccatc ccatttcatt cctttgtatt cctttccatt      4440
     caattccttt ccattccatt gtattccact ccattcgatt ccattccact ctattccatt      4500
     ccattcgttt ccattctatt cgagtcgatt ccattccttt ccattccatt caatgctttt      4560
     cattcgactg tattaccttc aactccattc cattccgtta cgttccttcc gatttcattc      4620
     cattctattc ctttccattc cattccagtc ctttccgttt aattccattc gtttccattc      4680
     cgttcaagtc cattccactc cagtccattc cattcgagtg tgttccattc cattacattc      4740
     cattcgagtc cattccattc taatacattc gatatctttc ctttacactc tatatctttc      4800
     cattacacta agttgcattc tattcttttg attccattca attccattct attcagttcc      4860
     attaaattcg actgcattcc attcgagtcc attcctttgc gttctattcc attctgttcc      4920
     cttccattcc aatccggtgg tttccatttt gtttcagtcc attccattcc agtccatttc      4980
     tttggattcc attcctttca attccactcc attctattcc attgaattcc attctactcc      5040
     attcaattcc attccatgtg attacattcc attcgtttcc attccattcc attcaattac      5100
     attgcaatcc attccattcc attccattgt attccagtcc cttccattcc agtccattcc      5160
     attcgattac attccattgg attccatttc atactattgc attccattag atttcattct      5220
     atttgtataa attccatttg aaaaaattcc tttcgagtcc attctatttg agtccattcc      5280
     attcgagtcc tttacatttg tgtccattcc attgatttcc gttggattcc attccgttcc      5340
     attccattcg atggc                                                       5355