ID   ABBA01065757; SV 1; linear; genomic DNA; WGS; HUM; 4540 BP.
AC   ABBA01065757;
PR   Project:PRJNA19621;
DT   05-JUN-2007 (Rel. 92, Created)
DT   23-AUG-2014 (Rel. 121, Last updated, Version 4)
DE   Homo sapiens CTG_1103276778610, whole genome shotgun sequence.
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
RN   [1]
RC   Publication Status: Online-Only
RP   1-4540
RX   PUBMED; 17803354.
RA   Levy S., Sutton G., Ng P.C., Feuk L., Halpern A.L., Walenz B.P.,
RA   Axelrod N., Huang J., Kirkness E.F., Denisov G., Lin Y., Macdonald J.R.,
RA   Pang A.W., Shago M., Stockwell T.B., Tsiamouri A., Bafna V., Bansal V.,
RA   Kravitz S.A., Busam D.A., Beeson K.Y., McIntosh T.C., Remington K.A.,
RA   Abril J.F., Gill J., Borman J., Rogers Y.H., Frazier M.E., Scherer S.W.,
RA   Strausberg R.L., Venter J.C.;
RT   "The Diploid Genome Sequence of an Individual Human";
RL   PLoS Biol. 5(10):E254-E254(2007).
RN   [2]
RP   1-4540
RA   Levy S., Sutton G., Ng P., Feuk L., Halpern A.L., Walenz B., Axelrod N.,
RA   Huang J., Kirkness E.F., Denisov G., Lin Y., MacDonald J.R., Wing A.,
RA   Pang C., Shago M., Stockwell T.B., Tsiamouri A., Bafna V., Bansal V.,
RA   Kravitz S.A., Busam D., Beeson K.Y., McIntosh T.C., Remington K., Gill J.,
RA   Borman J., Johnson J., Resnick A., Rogers Y.-H., Frazier M., Scherer S.W.,
RA   Strausberg R.L., Venter J.C.;
RT   ;
RL   Submitted (18-MAY-2007) to the INSDC.
RL   J Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850,
DR   MD5; 04b7b01d676d8db23348e112fbd38df7.
DR   ENA; ABBA010000000; SET.
DR   ENA; ABBA000000000; SET.
DR   ENA-CON; DS486227.
DR   ENA-CON; GL000186.
DR   BioSample; SAMN02981236.
DR   Ensembl-Scaffolds; ABBA01065757.1:1-4540; homo_sapiens.
CC   DNA Donor Name: J. Craig Venter | Date of Birth: October 14, 1946 |
CC   Sex: Male | Ethnicity: Caucasian | Descent: European - England
CC   This WGS project represents a composite haploid version of the
CC   genome where the highest scoring allele contained is represented in
CC   the consensus sequence.  The number of contigs may differ from
CC   those in the PLoS Biol. paper (PLoS Biology 2007 5: e254) because
CC   some short sequences were found to be foreign and thus were
CC   suppressed. Scaffolds DS486015-DS490530 represent the 4528
CC   scaffolds that are discussed in the paper.  There are fewer than
CC   listed in the paper because 12 of the original were determined to
CC   be foreign, so were omitted here.  Scaffolds DS490531-DS490620 are
CC   the remaining multi-component scaffolds, not in the set of 4528.
CC   The chromosomes are records CM000462-CM000485, assembled from the
CC   scaffolds.
FH   Key             Location/Qualifiers
FT   source          1..4540
FT                   /organism="Homo sapiens"
FT                   /mol_type="genomic DNA"
FT                   /sex="male"
FT                   /dev_stage="adult"
FT                   /db_xref="taxon:9606"
SQ   Sequence 4540 BP; 983 A; 1377 C; 325 G; 1855 T; 0 other;
     tggtgtccat tgtattccag tacattcaat tctggtccat tccattcgat accataactt        60
     tcgattccat tttatactat tgcgttctat tcgattcctt tctactcgaa taaattccat       120
     tcgagtccat tcctttctag tccattctgt ttgtgtctat tccgttcaag tccattacat       180
     ttgtgtccat tccattccat ttcattaaat tcaattccat tcgatgccat tccattcaat       240
     ccattccact tgattccatt ccatttgatt aatttccgtt ctgttccatt ccattatatt       300
     cgatgtcatt ccattcgatt ctgttccatt cgactccatt ccattccatt ccgttccatt       360
     cgattccatt tcattctatt ccttccctct ccatttcatt ccattccatg gaactccttt       420
     ccattccgtt caagttcatt caactcctgt ccattatatt cctctccttt cctttcgagt       480
     ccattccatt cgatatcttt tcattacact ccattccatt ctgtctcttt ctattcaatt       540
     cactctcatt ctttcgattc aattccattc ggttccattt cattatactc ctttccattc       600
     gagtccattc cattccattc cgttccgtta gattccaatc caatcgattc caattcattc       660
     cagtctaatc tcttcgagtc cattccattc cactccattc catttgattc cactctattc       720
     aatttcattc cactagattc cattccacta gattccactc agttccattc cattgcgttt       780
     cattctactc cctttcattg cattacattc cattccattt gattacatta ctttagattc       840
     ccttcctttc aaatcaatta cgttacattc tattacattc gagcccattc tattccactc       900
     caatccattc tgttccattc cattcgattc cagttctttc gattccattc cacactgttg       960
     cattccattc aattccattc tattcgaata aattgcgttc gagaccattc ctttcgagtc      1020
     cattacattt cattccatgc cattccattg cagtccattc gatgacattc ctttcaattc      1080
     tgctccattc gagtccattc cattcaagcc cattccattc catctgatgc cgtaccattc      1140
     gattctattc cactcgactg cattccattc cattccattc cattccattc cattccattc      1200
     cattccgttc cattccattc cgttccatcc aattccattc cattccattc tattcctttc      1260
     cattcatttc ctttccattc gagtacaatc ctctccagtc cattccattc cagtccattc      1320
     cattccattc ctttttattc gatatctttc catttcactc cattcgattc tattcctttc      1380
     gtttctgttc acttccattc cattcaattc cattcctttc aatttcattc cattcaactc      1440
     cattccattc gagtccatta cattccattt gatgtctttc cattacactc catttcattc      1500
     tattcctttc gattccattc aattccattc cattcgattc cataccactt ggatcctttc      1560
     tattcgactc cattccattt gagtcctttc cattccattc cattcaattc agttaggttc      1620
     gattcccatc tgtccgattc cgttttgttc cagtcaattg cattcgagtc cactgcattc      1680
     tagtccgtgc cattccattc cattccaatc tattacattc cattcaattc cagtccatgc      1740
     cgtttgatac cattccatac aattctattc catttgatta aattacactc cattctactc      1800
     cactcgattc cactccattc ccctttattg cattccattc tattccattc catttcttac      1860
     cattccaatc catttgatta cattccattt gaatccattc tattcaaatc aattacattg      1920
     caatccatat cattcaaatc ggttttattc cattccattc cattctggtc cattgcattc      1980
     tattccattc catacaattc cattccattc gattccattc cactggattc cactccattt      2040
     ccttttattg cattccattc ttatccattc cgttgcatac cattccattc catttcatta      2100
     cattcctttt gaatccattc atttcaaatc cattaaattg caatccatta cattcgagtc      2160
     gcttctattc cagtccattc cattccggtt cattccattc tattccattc cattcgatat      2220
     cattccatac taattcattc cattcgattc cattgtatta taatgaattc cattcgagac      2280
     cattcctttc aagtccattg tacttgagtc cattccactt gagtccattc catttgtgtc      2340
     ctttacattt gggtccaatc cattacattc ctatccattg cattccaatc catttcattc      2400
     catttgattc gacgtcattc cattctcttc tattgcattc gagtccatta cattttagtt      2460
     caatccattc cagtcagttc gataccattc cattagattc tataccattt gactccattc      2520
     ctttctattc cgttccaact gattccattc ctttccattc ctttcctttc cattcgatgt      2580
     catgccattc cattctacta cattcaagtc cattccattc gagtccattc cattgcattc      2640
     cattcgatgc cattacactc aattttattc cattctaatc cattctattg catccaattc      2700
     aattccaatc cattcctttg catttcattg cattccgttc gtttccattg cattcgagtc      2760
     aattccattc cattctatta cattcaatgc catttcattc gactctattc tattcgactc      2820
     cattccatac cattcccgtc catctgattt cattccattc tattcctttc aattctattg      2880
     cagtcctttc catttcattt cattcgtttc cattccattc aagtccattc tactcgagtc      2940
     cgttccactc cagtctgttc cattccagtc cattccattc gagaccattc cattccattc      3000
     ctgtcctttc cattcaaaga aattccttta cactcctatc aattctattc cttttgattc      3060
     tcatcaattc cattccattt gattccattc tatttgaatc cattccattc cattcttttc      3120
     ctttccattc ctttcctttt gattcaaatc cattccattc cattttcgtc ccatccattc      3180
     cattcgattt cattccatgc taggccattc catttgattc cattccattc gattctgttc      3240
     aagtcgattt cactccattc cattccgttg cattccattc tattccattc ctttgtattc      3300
     cattccattt cgttagatta tattccattc gattaaattc tattcgaatc aattacattg      3360
     caatccatta cattggactc catactattc tagtccattc tttcctgtca attacattcg      3420
     tttccattcc atttgattcc attaatttct attgccttcg atttgactct tttatattcg      3480
     aataaattcc attcaagaac attcctttcg tttccattct atttgaatcc attccattcg      3540
     agtccattcc atttaagtcc accacattcc attccattcc attcgaggcc attctaaatg      3600
     attctattcc atttgaatcc attccattcc attccattcc aatctattcc attccattcc      3660
     attccattcc attccattcc attccattcc attccattcc attccattcc attccaatca      3720
     attccattcc attccattcc attccattcc attccattcc attccgttcc attccattcc      3780
     attccattcc tttcatttcc ataccattag aatccattcc actccagtcc attcctttcg      3840
     tttccaatgt tttccagtcc attcaatttg agtctattac attcgattcg gtatatttcc      3900
     aagacactcc attccattct attcctttcg attccattca attccattct gtttgattcc      3960
     gttccattcg attccattac tttcaaatcc attccattca tatccattcc attccattcc      4020
     tttcagaaca attccaatcc attcggttcc atttttttcc actccaatct attcgagacc      4080
     attccattcc attgcattcc attccattcc ttttcattcc attccattcc attgaactcc      4140
     attgcactcg attccactct gttccattcc attgcacttc actctatacc attccattgc      4200
     attcctttcc attccatttg attacattcc attcgattcc attccattcg aatcaattac      4260
     attacaatcc gttacattct attcctttct attccagtat attccattcg atatctttcc      4320
     attaaactcc attccattct attcctttcg aatccattca attccattcc attcgattcc      4380
     attctattcg attccattcc gtttcactcc attccattca tgtcgattcg attccattct      4440
     attcctttcc attcctttcg attccaatcc gttctattca attttgttcc agtccattgc      4500
     attcgagtcc attcaattcc agtccattcc attcgattac                            4540