ID   CP001395; SV 1; circular; genomic DNA; STD; PRO; 3653 BP.
XX
AC   CP001395; ABYZ01000000-ABYZ01000039;
XX
PR   Project:PRJNA29407;
XX
DT   05-FEB-2009 (Rel. 99, Created)
DT   26-AUG-2010 (Rel. 105, Last updated, Version 2)
XX
DE   Caldicellulosiruptor bescii DSM 6725 plasmid pATHE02, complete sequence.
XX
KW   .
XX
OS   Caldicellulosiruptor bescii DSM 6725
OC   Bacteria; Firmicutes; Clostridia; Thermoanaerobacterales;
OC   Thermoanaerobacterales Family III. Incertae Sedis; Caldicellulosiruptor.
OG   Plasmid pATHE02
XX
RN   [1]
RP   1-3653
RG   US DOE Joint Genome Institute
RA   Lucas S., Copeland A., Lapidus A., Glavina del Rio T., Tice H., Bruce D.,
RA   Goodwin L., Pitluck S., Sims D., Meincke L., Brettin T., Detter J.C.,
RA   Han C., Larimer F., Land M., Hauser L., Kyrpides N., Ovchinnikova G.,
RA   Kataeva I., Adams M.W.W.;
RT   "Complete sequence of plasmid2 of Caldicelulosiruptor becscii DSM 6725";
RL   Unpublished.
XX
RN   [2]
RP   1-3653
RG   US DOE Joint Genome Institute
RA   Lucas S., Copeland A., Lapidus A., Glavina del Rio T., Tice H., Bruce D.,
RA   Goodwin L., Pitluck S., Sims D., Meincke L., Brettin T., Detter J.C.,
RA   Han C., Larimer F., Land M., Hauser L., Kyrpides N., Ovchinnikova G.,
RA   Kataeva I., Adams M.W.W.;
RT   ;
RL   Submitted (26-JAN-2009) to the INSDC.
RL   US DOE Joint Genome Institute, 2800 Mitchell Drive B310, Walnut Creek, CA
RL   94598-1698, USA
XX
DR   GR; CP001395_GR.
DR   StrainInfo; 160269; 0.
XX
CC   URL -- http://www.jgi.doe.gov
CC   JGI Project ID: 4084710
CC   Source DNA available from Michael Adams (adams@bmb.uga.edu)
CC   Bacteria available from DSMZ: DSM 6725
CC   Contacts: Michael Adams (adams@bmb.uga.edu)
CC             David Bruce (microbe@cuba.jgi-psf.org)
CC   Annotation done by JGI-ORNL and JGI-PGF
CC   Finishing done by JGI-LANL
CC   Finished microbial genomes have been curated to close all gaps with
CC   greater than 98% coverage of at least two independent clones. Each
CC   base pair has a minimum q (quality) value of 30 and the total error
CC   rate is less than one per 50000.
CC   The JGI and collaborators endorse the principles for the
CC   distribution and use of large scale sequencing data adopted by the
CC   larger genome sequencing community and urge users of this data to
CC   follow them. It is our intention to publish the work of this
CC   project in a timely fashion and we welcome collaborative
CC   interaction on the project and analysis.
CC   (http://www.genome.gov/page.cfm?pageID=10506376)
CC   Meta information:
CC   Organism display name: Caldicellulosiruptor bescii Z-1320, DSM
CC   6725
CC   Culture Collection IDs: DSM 6725
CC   GOLD ID: Gi03121 http://genomesonline.org/GOLD_CARDS/Gi03121.html
CC   Sequencing Platforms: Sanger, 454
CC   Phenotypes: Thermoacidophile, Cellulose degrader
CC   Diseases: None
CC   Habitat: Fresh water, Hot spring
CC   Oxygen Requirement: Anaerobe
CC   Temperature Range: Thermophile
CC   Biotic Relationship: Free living
CC   Isolation: Hot spring on the Kamchatka peninsula in Russia.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3653
FT                   /organism="Caldicellulosiruptor bescii DSM 6725"
FT                   /plasmid="pATHE02"
FT                   /strain="DSM 6725"
FT                   /mol_type="genomic DNA"
FT                   /note="type strain of Caldicellulosiruptor bescii DSM 6725"
FT                   /db_xref="taxon:521460"
FT   gene            1125..2117
FT                   /locus_tag="Athe_2777"
FT   CDS             1125..2117
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /locus_tag="Athe_2777"
FT                   /product="conserved hypothetical protein"
FT                   /note="KEGG: cpy:Cphy_3144 hypothetical protein"
FT                   /db_xref="GOA:B9MSB0"
FT                   /db_xref="InterPro:IPR002104"
FT                   /db_xref="InterPro:IPR010998"
FT                   /db_xref="InterPro:IPR011010"
FT                   /db_xref="InterPro:IPR013762"
FT                   /db_xref="InterPro:IPR023109"
FT                   /db_xref="UniProtKB/TrEMBL:B9MSB0"
FT                   /inference="similar to AA sequence:KEGG:Cphy_3144"
FT                   /protein_id="ACM61829.1"
FT                   /translation="MGKPSIIKQVLNEFEKQIRFGESKHEAKREERERCEVTGETWNPA
FT                   RVEGIFSFSTYREYVKEALEFANWARTEKGCKDLEQARAYVSEYLQSHIDKGYSAWTVK
FT                   KEAAALAKLYHCRTTDFKVELPARHREEIERSRGYKDHDREFSKERNRDIIIFSKATGL
FT                   RRRELERVSSRDIFRGPDGRLYVHVSNGKGGRERDVHVLQKYEREVERIVREREGRDRL
FT                   FDRVPIRMDVHSYRREYARERYREVEREISRERKLFDRVEDLVRSRLTRLYPDRFREIG
FT                   ERQLTRELTRADGLYHRSDGREFDRLALWEVSNDLGHNRIDVVARHYLD"
FT   gene            complement(2218..2439)
FT                   /locus_tag="Athe_2778"
FT   CDS             complement(2218..2439)
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /locus_tag="Athe_2778"
FT                   /product="conserved hypothetical protein"
FT                   /note="KEGG: csc:Csac_2655 hypothetical protein"
FT                   /db_xref="UniProtKB/TrEMBL:B9MSB1"
FT                   /inference="similar to AA sequence:KEGG:Csac_2655"
FT                   /protein_id="ACM61830.1"
FT                   /translation="MKKLTIEFTREEAMYLLGYFTARAMEGYRFDEFEQGIIKKLADKC
FT                   NVEFVFENGKILQARYKGNLFYCTTPQE"
FT   gene            complement(2420..2749)
FT                   /locus_tag="Athe_2779"
FT   CDS             complement(2420..2749)
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /locus_tag="Athe_2779"
FT                   /product="hypothetical protein"
FT                   /db_xref="UniProtKB/TrEMBL:B9MSB2"
FT                   /inference="ab initio prediction:Prodigal:1.4"
FT                   /protein_id="ACM61831.1"
FT                   /translation="MLTKQELIQRLLEIKQLCFYIEEDEAKQIENIVEDLLSKIRNVKL
FT                   NQLKRLVHAERKKYPIGTAENLLFHNLYEKLKNVQPDDTERIDRLYKEYLSLVKGVKES
FT                   EKANN"
FT   gene            complement(2743..3192)
FT                   /locus_tag="Athe_2780"
FT   CDS             complement(2743..3192)
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /locus_tag="Athe_2780"
FT                   /product="hypothetical protein"
FT                   /db_xref="GOA:B9MSB3"
FT                   /db_xref="InterPro:IPR008813"
FT                   /db_xref="UniProtKB/TrEMBL:B9MSB3"
FT                   /inference="ab initio prediction:Prodigal:1.4"
FT                   /protein_id="ACM61832.1"
FT                   /translation="MKKIGYKKMVDPETGEVQTFILIGHDFEDTDFVKLPFISIKLIME
FT                   DKDLAKSAMRILSYIVQHKISFNNYTFALSYEYDIKGNIDMSKKQYHLAIKKLIEKDLL
FT                   IKIGRGRFMLNPRRIRYGRADQLRKFESEYDKIKKTEKGKGDKEC"
XX
SQ   Sequence 3653 BP; 1007 A; 708 C; 860 G; 1078 T; 0 other;
     aaaaaaatta gagcaggatt tgagcaagag gggcgctgca accccctttg aaaccgcttt        60
     cataacccga aaagctcttc acccaccgca tcaatattca tacccgcttg cataattatt       120
     catccctgat tgtaccacaa aacccagaaa agtcctggga aaatcatgaa gattcttcca       180
     ttttccaatt tccaggaagc gaattgaaat tcccagggat tttttgcctt tttccaggga       240
     atgaactgga atttccagcg gtttttccag ggacttaacc acacgaaagg tactttccgc       300
     catcaaaggc ttgcaaaccc ataggaaact actggcaggc aaaaccttcc acccagtagc       360
     aaatctcttc taccttgttt tgcgagcgtt agcagggagc tgtccgttag ctttgtgtta       420
     gcatccttct gttagccttt ggcacgttag tagacttttg ttagttctct cttcttgttt       480
     ttatcttcat tgtgattgtt ttccagaaga ctgcttgaca ttttgttatt gtagtgttat       540
     gatctatttc agaagcgaaa agtaaaccgc ggtagcgggt tactttttgc gtaatatttg       600
     gaattgtcag tttggcacgg gaaatggaat cgggcaggta tagggaaagg caatagccgt       660
     ggacggcgtt ccgattccgc tttgtttgtg cgttagcaca tgagtcttgc tgttatcacg       720
     ctctgctctg tggtagtaca ccgcaagggg tacaccattt ggagttttgc cgcatctgag       780
     agtttagcag tttgcaaaac aaagttcgca gttagaaaag ttaggcaagg ggcgatgacg       840
     tgcacgtctg acacgcttga cgtgcaccga gccgagagta gagcgtgatg acatagaaaa       900
     agcccagtag caggttgtgc gtaccgtgac atgctactgt ggaagacaag taaatagtca       960
     aaaaacggtg cgcttactgt tcgtgggctc atgggaatac cgtgagcatt ctggacaggt      1020
     tcaacagcct gaaagggcaa aacccccctt gaaagggttt aaatacacac gtcctttttg      1080
     tcgttttttg tcggtttaaa tatttatgca ggggatgata gggaatggga aagccgtcca      1140
     taatcaagca agtactcaat gaattcgaaa agcagataag gttcggggaa agtaagcatg      1200
     aagcaaagag agaagagcgg gagagatgtg aagttactgg agagacgtgg aatcctgctc      1260
     gtgtggaagg catttttagt ttctcaacct acagggagta cgttaaggag gcgttagaat      1320
     ttgcgaattg ggctcgtact gaaaaggggt gtaaggattt agaacaagca cgggcctatg      1380
     tgtctgaata tttgcagtcg catatagaca aagggtatag cgcgtggact gttaagaaag      1440
     aagcagcagc cctggcgaaa ctgtatcatt gtcgtacaac tgactttaaa gtagagcttc      1500
     ccgcaagaca cagggaagag attgagagaa gcaggggata caaagaccac gatagggagt      1560
     ttagcaaaga gaggaataga gacattatca tcttttcaaa agctactggg ctgagaagaa      1620
     gggaattgga aagagtgagt tctcgggata tctttcgtgg gcctgacgga agattatatg      1680
     tgcacgtgag caacggcaag ggcggtagag aaagggatgt tcatgttttg cagaaatacg      1740
     agagagaggt tgagaggata gtcagagagc gggaaggaag agacaggctg tttgacaggg      1800
     tccccataag gatggacgta cacagctata ggagggaata tgcaagagag cgttacagag      1860
     aagttgagcg tgagataagt cgtgagagaa agcttttcga cagagttgag gatctcgttc      1920
     gtagtaggct tacaaggctc tatcctgaca ggtttagaga aattggcgaa agacaactta      1980
     ctcgtgaact cacaagagct gatgggcttt atcatcgcag tgatggtagg gagtttgacc      2040
     gcctggcatt gtgggaagtt tcaaacgact tgggacataa tcgaattgac gttgttgcaa      2100
     gacactatct ggattaagcg aataaaggct caagaaagtg gataaaaaaa cagggggtat      2160
     attgatatat cccccctgtt ttttgtgcgt ctacaggacc ttatttgcgt ttcaaggcta      2220
     ttcttgtggg gtagtgcagt aaaaaaggtt gcctttgtat cttgcctgta gaatcttgcc      2280
     gttttcaaaa acaaattcaa cgttgcactt atcagcaagt ttcttgatta ttccctgctc      2340
     aaattcgtcg aacctgtacc cttccattgc tcttgctgtg aaataaccca gaaggtacat      2400
     tgcttcttca cgtgtaaact caattgttag ctttttcact ctcttttacc ccctttacga      2460
     gactgagata ttctttgtac aatctgtcaa tacgttctgt atcgtcaggt tgtacgtttt      2520
     tgagtttttc gtagagatta tgaaacaaca ggttctctgc tgttccaata gggtatttct      2580
     ttctttccgc gtggactagc cttttcagtt ggtttagctt tacgttccgg atcttgctca      2640
     gtaggtcttc gacgatattt tcaatctgtt ttgcttcgtc ttcttcgatg tagaagcata      2700
     gttgtttgat ttcaagtagc ctctgaataa gttcctgctt tgttaacatt ccttgtcacc      2760
     cttccccttt tcagtctttt ttattttgtc gtattctgat tcgaactttc tcagttggtc      2820
     agctctgccg taacgtatgc gtcgaggatt tagcatgaat cgtccacggc ctattttaat      2880
     caacaaatct ttctcgatta gctttttgat tgcaagatgg tattgctttt ttgacatgtc      2940
     gatgtttcct ttgatgtcgt attcgtatga aagtgcaaac gtgtagttgt taaacgagat      3000
     tttgtgttgt acaatgtatg agagaattcg cattgctgat tttgcaaggt ctttatcttc      3060
     cattatgagt ttgatagaaa tgaatggaag tttgacaaag tctgtgtctt caaaatcatg      3120
     tccgatcagt atgaaagttt gtacttcgcc tgtttcaggg tcgaccattt ttttgtaccc      3180
     tatcttcttc atttgaatcc cccttttttt acctttttcc aattctaaag ccattataat      3240
     acacttaaag taacttgtca agtaacttca ggtgtaaaaa attacacttc aggtgtaata      3300
     aattacactt tccattcagt attttcaagg ctttgtgggt aactttattc ttatctatgt      3360
     atatatcgcc tgcgttagca ggcttgaaaa tttccagtta ggataagcag gaacaacggt      3420
     cgctgacgct gaacactgac gaaatagctg acgccccaaa gtccacaaca gtgccaaacc      3480
     gataacaaaa acatgctaac gcaaacatag actaacgcac gactgacgtc gtgatgtgtg      3540
     tgtgggccta cctacacaca aaaagaacta acaacagctg actaacgtct gaagagctct      3600
     aacaacactt tgctaacgct gagctaacgg acagctcaac gttaacaccc gct             3653
//