![]() |
EBI DbfetchID AACZ02115232; SV 1; linear; genomic DNA; WGS; MAM; 2824 BP. XX AC AACZ02115232; AACZ02000000; AACZ00000000; XX DT 19-MAR-2006 (Rel. 87, Created) DT 04-DEC-2008 (Rel. 98, Last updated, Version 2) XX DE Pan troglodytes chromosome 10 bld2_Cont125.85, whole genome shotgun DE sequence. XX KW WGS. XX OS Pan troglodytes (chimpanzee) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Pan. XX RN [1] RP 1-2824 RX DOI; 10.1038/nature04072 RX PUBMED; 16136131. RG Chimpanzee Sequencing and Analysis Consortium RA ; RT "Initial sequence of the chimpanzee genome and comparison with the human RT genome"; RL Nature 437(7055):69-87(2005). XX RN [2] RP 1-2824 RX DOI; 10.1038/nature04101 RX PUBMED; 16136134. RA Hughes J.F., Skaletsky H., Pyntikova T., Minx P.J., Graves T., Rozen S., RA Wilson R.K., Page D.C.; RT "Conservation of Y-linked genes during human evolution revealed by RT comparative sequencing in chimpanzee"; RL Nature 437(7055):100-103(2005). XX RN [3] RP 1-2824 RA Yang S.P., Hillier L.W., Chinwalla A.T., Fulton L.A., Huang X., RA Wilson R.K.; RT ; RL Submitted (26-NOV-2003) to the EMBL/GenBank/DDBJ databases. RL Genome Sequencing Center, Washington University School of Medicine, 4444 RL Forest Park, St. Louis, MO 63108, USA XX RN [4] RP 1-2824 RA Yang S.P., Hillier L.W., Chinwalla A.T., Fulton L.A., Glasscock J., RA Wallis J.W., Huang X., Mardis E.R., Wilson R.K.; RT ; RL Submitted (17-FEB-2006) to the EMBL/GenBank/DDBJ databases. RL Genome Sequencing Center, Washington University School of Medicine, 4444 RL Forest Park Parkway, St. Louis, MO 63108, USA XX DR EMBL-CON; CM000324. DR GOA; P0C594. DR InterPro; IPR000340; Dual-sp_phosphatase_cat-dom. DR InterPro; IPR000387; Dual-sp/Tyr_phosphatase. DR InterPro; IPR016130; Tyr_Pase_AS. DR InterPro; IPR020405; Dual-sp_phosphatase_famA. DR InterPro; IPR020417; Dual-sp_phosphatase. DR InterPro; IPR020422; Dual-sp_phosphatase_subgr_cat. DR UniProtKB/Swiss-Prot; P0C594; DUPD1_PANTR. XX CC Sequencing/Assembly: The whole genome shotgun sequence data were CC assembled and organized by the Washington University Genome CC Sequencing Center. The underlying whole genome shotgun data were CC generated at the Washington University School of Medicine and the CC Broad Institute. A 5 megabase region of chromosome 7 was finished CC at the Washington University Genome Sequencing Center CC (chr7:84674857-89461887). The chromosome Y sequence was finished at CC the Washington University Genome Sequencing Center with detailed CC mapping and extensive collaboration with David Page's group at the CC Whitehead Institute (The DNA Sequence of Chimpanzee Chromosome Y, CC unpublished; Hughes et al., Conservation of Y-linked genes during CC human evolution revealed by comparative sequencing in chimpanzee. CC Nature, 2005 437:100-3; PMID:16136134). The chromosome 21 sequence CC data was kindly provided by Todd Taylor and the Riken Genome CC Sciences Center (Watanabe et al., DNA sequence and comparative CC analysis of chimpanzee chromosome 22. Nature. 2004 May CC 27;429(6990):382-8. PMID: 15164055). CC This assembly covers about 97 percent of the genome and is based on CC 6X sequence coverage. It is composed of 246,375 contigs with an N50 CC length of 29 kb, and 44,454 supercontigs with an N50 length of 9.7 CC Mb. The total contig length, not including estimated gap sizes, is CC 2.97 Gb. Of that total, 2.82 Gb has been ordered and oriented CC along specific chimpanzee chromomes, 107Mb has been linked to CC chromosomes but is unplaced, and 50Mb remains unlinked (chrUn). CC The whole genome shotgun data from primary donor-derived reads CC (Clint, a captive-born male chimpanzee from the Yerkes Primate CC Research Center (Atlanta, USA)) were assembled using PCAP (Huang CC 2006) using stringent parameters derived by eliminating detectable CC global mis-assemblies (interchromosomal cross-overs determined by CC alignment of the chimpanzee genome against the human genome) larger CC than 50kb. CC The assembly data were aligned against the human genome at UCSC (B. CC Raney) utilizing BLASTZ (Schwartz 2003) to align and score CC non-repetitive chimpanzee regions against repeat-masked human CC sequence. Alignment chains differentiated between orthologous and CC paralogous alignments (Kent 2003) and only 'reciprocal best' CC alignments were retained in the alignment set. The chimpanzee AGP CC files were generated from these alignments in a manner similar to CC that already described (The Chimpanzee Genome Sequencing Consortium CC 2005). Centromeres were introduced into the chimp sequence at the CC positions of the centromeres in the human chromosomes. Ten CC documented/known human inversions (Yunis 1982) supported by the CC assembly were introduced into the ordering as was the separation of CC alignments to human chromosome 2 into chimpanzee chromosomes 2A and CC 2B. We removed the contigs from the WGS project that corresponded CC to the finished chromosome 21 and chromosome Y sequences and a 5 Mb CC finished region from chimpanzee chromosome 7 because they are CC represented by the corresponding finished sequences. The CC chromosome 21 sequence is GenBank Accession Number BA000046 and the CC chromosome Y sequence is GenBank Accession Numbers CC DP000054-DP000056 and AC163716.2. XX FH Key Location/Qualifiers FH FT source 1..2824 FT /organism="Pan troglodytes" FT /chromosome="10" FT /isolate="Yerkes chimp pedigree #C0471 (Clint)" FT /mol_type="genomic DNA" FT /sex="male" FT /db_xref="taxon:9598" XX SQ Sequence 2824 BP; 762 A; 639 C; 742 G; 680 T; 1 other; aaaaaaaaaa aaaaaattag ctgggcgtgg tagtgcatgc ctgtaatccc agctactcag 60 gaggctgagg caagagaatt gcttaaaccc aggaggtgga ggttgcagtg agtcaagatg 120 gtgccactgc actccagcct gggtgacaga aggagactcc atctcaaaaa aaaaaaaagt 180 aaacccttta tatgtgtgta tatgtttcct tatgttggca atagcccaca caaggtagta 240 gaagtatgca agtaaattta acactggtta gtcaggggta gaattaagag gagacaaagg 300 gagattatgc agtttgtctt tataaccctc tacctcattt aacctagtat aattaagcat 360 gtcatatttg taactttttc aattaaaaaa aagaaagacg aaaggcaagg gatgcttaac 420 ctggatcttc ctatggctag acgattatct ctaggtatta agcagcaaga tgtgtattaa 480 aatcacatat aggccgggca cggtggctca tacctttaat cccggcactt tgggaggctg 540 atgtgggcag atcaactgag gtcaggagtt ggagaccagc ctggccaaca tggtgaaacc 600 ctgcttctac taaaaataaa aaattagcca ggtgtggagg caggtgcctg taatcccagc 660 tacttgggaa gctgagacag aggattgctt gaacccagga ggcagaggtt gcagtgagcc 720 gaggtcaaac cactccaacc tgcacttcaa cgcgggtgac agggtgagac tgtctcgcaa 780 aaaaaaagaa aagaaaaaga aaataaatac aataatatat ggaggggtcc tacagcatct 840 gctccttggt gagatggaac atggctggag tccgggagag tggagggagg agcagttaag 900 aacaagcctc tgggttgggc agagctgcat tttaatccca gctctctgcc accgactgag 960 ctgtgggact tgggccagtt atttaactgc tctgtgcctc agtttcccta tctttataat 1020 gggagtaatt agcttctatc actgggttat caggagagaa aaatgagatc aagcatgcag 1080 gatgcttagc acagtgcctg gcatacagtg ttcaagaaag gatagtcgtt atcattatta 1140 atattattta cagtaggtgc atctcagttg tgtagggagt ggggaatggt ggttataaaa 1200 atctaaagga ggtgacagtg acctaaattt aggctcccat acttccaggc atatattgat 1260 gtgttcctgc accatcaacc taagaaatga ttttattctt gagggtatta tagctagcta 1320 aaataaaaga ctacatttcc caagctccct tgcagctagg ggcaggggaa gtctgggatg 1380 ggactttgct ttcagaagat gcccttctgc tctttctttc cctccaccct cctgctgtct 1440 gggtcctaga gttgatggct ggggtcctgg cggcaatctg ggaccctgca gcacctggag 1500 ggaagaagcc acgtccccag gacagtggcg cagaaacaca ggagtctgag tccctgatga 1560 ccctgcagtg gccgtgccag ccccagccca cctccctctg cacttatatg tgagagagaa 1620 ataaacctct atcttgccta agactttgtt taggttttct gtaatgtgaa ggtaagccca 1680 gccctaacta attgagttct cacagaacta caaattccaa agcccacggc taggaggtgg 1740 cacagggcgt ggcctgggga ggggctgctg cctgctccca agcgagcctg ggcctgagca 1800 cttactcatc accaatgtag agcttgggcc agacctcgtt gacgtgggtg tactggggac 1860 tgcccttcca gaagagccgc tccagctcaa aggctccagg ggtgcagtag tcctcctcct 1920 ccccttcctc ctccatcttc agcgacagcc tcttggcaga tgagtaggca ttcttgaggc 1980 ttgtcttcac ttctccagat gtcattttag agccaaggga ttttctctcc tttctgcagc 2040 tggttatgag agagaagcag gatgggggtt agcaggggac actgaggcag ggaataagct 2100 ttacaaaaag aggtgggaat aagcttatct taaaatttgg caagagtttc cccaacctgc 2160 tatgccccgg cttcctcttc ccaggccaca caggtaaaca aggaaggcca tgatggcagt 2220 taaatgacac ttccccaatg ttccaatctt tctagagaag gagagctggg gtcctcggac 2280 gtgggccaag gcgggctctg gagctaagaa atcctgcaag cagccagttc cttcccacct 2340 ttctttagac tttactcctt tggttcctac tttttccttt atctgactga aacctactgg 2400 tttgggccat ttgggaccca aaataggaag gcccagaaga tccctgacat tccaggcctg 2460 tggccctctc agtgtctgca gggagccatc agccagtgga aagagtacat cgccaaggcc 2520 atacctgccc gggcctgggt ctgggccaca cgcacatcag caggacttca gaagccttga 2580 gaggggagga aaccggctgc atgagggtta acctgtcaga agagctgaag gcaacgctca 2640 cgctctcaca ctgggagcct ggccatcact gtgttaactt accaagggaa ttggaagaga 2700 tgccccaaga tagaaagctg tgcctcgaga ctgaaatgtg catttggaga gcttgttagg 2760 ctctgccagt ggaatgtggg agaggacttt ccattgtccc caaggagagg aattgncagg 2820 gaat 2824 // ![]() |