ID   FJ959078; SV 1; circular; genomic DNA; STD; VRL; 2819 BP.
XX
AC   FJ959078;
XX
DT   10-JUL-2009 (Rel. 101, Created)
DT   18-SEP-2009 (Rel. 102, Last updated, Version 2)
XX
DE   Circovirus-like genome RW-B, complete genome.
XX
KW   .
XX
OS   Circovirus-like genome RW-B
OC   Viruses; ssDNA viruses; unclassified ssDNA viruses.
XX
RN   [1]
RP   1-2819
RX   DOI; 10.1099/vir.0.012955-0.
RX   PUBMED; 19570956.
RA   Rosario K., Duffy S., Breitbart M.;
RT   "Diverse circovirus-like genome architectures revealed by environmental
RT   metagenomics";
RL   J. Gen. Virol. 90(Pt 10):2418-2424(2009).
XX
RN   [2]
RP   1-2819
RA   Rosario K., Duffy S., Breitbart M.;
RT   ;
RL   Submitted (23-APR-2009) to the INSDC.
RL   College of Marine Science, University of South Florida, 140 7th Ave S, St.
RL   Petersburg, FL 33701, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2819
FT                   /organism="Circovirus-like genome RW-B"
FT                   /strain="RW-B"
FT                   /mol_type="genomic DNA"
FT                   /country="USA:Southwest Florida"
FT                   /isolation_source="reclaimed water"
FT                   /note="assembled from metagenome data"
FT                   /db_xref="taxon:642252"
FT   stem_loop       join(2794..2819,1..15)
FT                   /note="contains nonanucleotide sequence (tagtattac)"
FT   CDS             361..1590
FT                   /codon_start=1
FT                   /product="hypothetical protein"
FT                   /note="viral replication protein-like protein (PF02407);
FT                   putative Rep protein similar to circovirus replication
FT                   proteins; ORF1_Rep"
FT                   /db_xref="GOA:C6GIH7"
FT                   /db_xref="InterPro:IPR000605"
FT                   /db_xref="InterPro:IPR003365"
FT                   /db_xref="UniProtKB/TrEMBL:C6GIH7"
FT                   /protein_id="ACQ78158.1"
FT                   /translation="MDVDTTPVSRRGRPSSSSSSSSTGERFTARTFCVTVNNRDEVEQN
FT                   WPNAVQEPSEEFVERVRYVVWQRELAPETGRVHIQAYVELYMAAGIVMLKRLFNCPTMH
FT                   VEKRRGTQDQARNYCMKEDTRDGPDGGPFEYGTYSKTPGNGQGKRNDLTDAVEALEAGG
FT                   VEAVVSKHPNTYVRYHRGIHALYQAKLEAIASRQQRNVKTAVFYGEPGTGKTHCAFELA
FT                   RITGEEIYILNAPAGRNQSVWFNGYQNQKILVVDEMNGDWIGWQLLLRMTDKYPLQCQT
FT                   KGGMVWAMWEIVIFTSNAHWEDWYPYYAGGMDKAALKRRIHKVVKFYGSEASGNFTKKY
FT                   YDPDLPEANWTVATTDVIATFLRNGTVRPHTDEELEEVEDVVMNTQQESQGTQTTELVE
FT                   DSDDEMSETF"
FT   polyA_signal    1593..1598
FT   CDS             complement(1855..2691)
FT                   /codon_start=1
FT                   /product="hypothetical protein"
FT                   /note="weak similarity to geminivirus structural proteins;
FT                   ORF2"
FT                   /db_xref="GOA:C6GIH8"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="UniProtKB/TrEMBL:C6GIH8"
FT                   /protein_id="ACQ78159.1"
FT                   /translation="MSKRTNPYQRASLPPKKRYRADQQPAIPRLRLPYHLGGPLYVSRP
FT                   LPMARRGAPTGRYASNPRMGELKTVDYQFSAAYAVPYVGDSQPFPLLNCTNATGNIQCV
FT                   NLVQQGTGVAQRIGNKICMRSLRIRLRFVEAGTDDNQAPYFARIMLIYDRNPNSLYIAT
FT                   NNILANALQNNTLGNGTLDSSLNPTYYDRFVVLRDWYQTIPTFQESTSNVAPNWIIDDF
FT                   VNLKNLETTYNGTANPMTINQVSVGALLLFVISDQVAATSDAVALGGNLRLRFHDQ"
XX
SQ   Sequence 2819 BP; 723 A; 622 C; 749 G; 725 T; 0 other;
     acccgctcac ttccgttaac gtaaaaatgc tgggggtcgc tttggacccc cgcggagcgc        60
     agctcccttc taaagcgtgc cgtacctctc tctcctatgg cgtgccaagt taggaactag       120
     ggctggagcg agaccggata acggctccgg agctaaccgt ctgcggttag tgcgagcgta       180
     gcgagtagag ccgtttgtag gtagagcgta acccggagtt actggcctct tcttcggtcc       240
     ggtgttttgt gaaccccggg gtacacactc cgtaaaagga catcgctccg gcagaagtaa       300
     cttctgcctt ccccacttag acgcgcggct ggaaaaatct cgcaagaaaa aaccaagaca       360
     atggacgtag acactactcc tgtaagcaga cgtggtcgcc cttcctcctc ctcttccagc       420
     tcgagcactg gtgagcgttt cactgcgcgc accttttgcg tcaccgtcaa caacagggat       480
     gaagttgaac aaaattggcc caacgctgtt caagagccct ccgaagaatt tgttgagcgt       540
     gtgcgctatg tggtttggca acgggaactt gctccggaaa ctgggcgtgt acacatccaa       600
     gcttatgtgg aactttacat ggccgctgga attgttatgc taaaacgcct ctttaactgc       660
     cctacaatgc atgttgaaaa gcgccgtggc acccaagacc aagcccgtaa ttactgtatg       720
     aaagaagaca ctagggatgg accagatggt ggcccctttg aatacggtac ttactctaaa       780
     acccctggaa atggccaggg gaaacgtaat gaccttactg acgccgtgga ggctctcgaa       840
     gctggcgggg tagaagccgt cgtctcaaag cacccaaaca cctacgtgcg ttatcatcgc       900
     ggtatccatg ccctgtacca ggcgaaactc gaagctattg cttcccgaca gcaacggaat       960
     gtaaaaactg ctgtgtttta tggggagccg ggaactggaa aaacacattg tgcgttcgag      1020
     cttgcgagaa tcacggggga ggagatctac attttgaatg ctccggcggg gaggaaccaa      1080
     tcagtctggt tcaacggcta ccagaaccag aagatactag tggtggacga gatgaatggg      1140
     gactggattg gatggcagct cctgctgagg atgacggaca aatatccttt acaatgccag      1200
     acgaagggtg gaatggtttg ggcgatgtgg gagattgtga tctttacttc taatgcccac      1260
     tgggaggact ggtatcctta ctacgctggg ggaatggaca aagcggcgct caagcgacgc      1320
     attcataaag tggttaagtt ctacggatcc gaagcttctg ggaatttcac aaagaagtac      1380
     tatgaccctg atcttcccga agctaattgg actgtggcca ccacggatgt aattgcaaca      1440
     ttcctgagaa atggcactgt taggcctcat accgacgagg aactcgagga ggtggaagac      1500
     gtagtaatga acacccagca agaatcccag ggaactcaaa caacagaatt agtggaagat      1560
     tctgacgacg aaatgtctga aactttttaa agaataaaca accggatttt tctaagtttt      1620
     gattccgctg aaggtgcacc agcgagcgaa gcgagcgatg gcgaagtttt gattccgctg      1680
     aaactaagag tactaaaatt tctaagtttt gattccgcag aagggttctc aaaggtaggc      1740
     attctaaggt tactccctac aattatgttg acgtcagcga aggctcgcgt gagcgagccg      1800
     tagccgctgg cccccaacct aagtaacaac taaggtgtac acgaaggagt tgttttattg      1860
     atcatgaaat ctaaggcgta agtttcctcc tagcgcaact gcatcgctag ttgcagccac      1920
     ttgatctgag ataacaaaaa gcaagagtgc tccgacagat acttgattaa tggtcatagg      1980
     gtttgctgtt ccgttgtacg ttgtttccaa gtttttgagg ttgacgaaat catcaatgat      2040
     ccagtttgga gctacgttag acgttgattc ttggaatgta ggaattgttt ggtaccaatc      2100
     cctaagcact acgaatctat cataataagt agggttcaag gatgaatcca aagttccgtt      2160
     tcctaaggtg ttattttgga gagcgttggc taggatgttg tttgttgcta tgtacaacga      2220
     gttagggtta cggtcgtaaa taagcatgat tctagcaaaa taaggtgctt gattatcatc      2280
     ggttcccgct tctacaaagc gtagcctaat acggagagat ctcatgcaga tcttgtttcc      2340
     aattctttgc gccactccag tgccttgttg tactaagttt acgcactgga tatttcctgt      2400
     ggcgttggtg cagttgagta atgggaatgg ttgggaatct ccaacatagg gaactgcgta      2460
     agcggcggag aactggtaat ccacggtttt taactctccc attcttgggt tgcttgcata      2520
     tcttccggtt ggtgctcccc ttctagccat gggtaggggt cttgagacat agagaggtcc      2580
     gccaaggtgg taaggcaggc ggagacgtgg gatagctggc tgctgatcag ctcgatatct      2640
     ttttttaggt ggcaaacttg ctctttgata agggttagtt cgtttagaca tgcgtcttgg      2700
     cgttctagtg ttggacgatc cagggttgac gtattttccg tgtaatttgc acgctagtta      2760
     actgtaggtt taactgttaa aaaagacgtt taacggtcaa agtgagcggc gttagtatt       2819
//