ID FJ959077; SV 1; circular; genomic DNA; STD; VRL; 2162 BP. XX AC FJ959077; XX DT 10-JUL-2009 (Rel. 101, Created) DT 18-SEP-2009 (Rel. 102, Last updated, Version 2) XX DE Circovirus-like genome RW-A, complete genome. XX KW . XX OS Circovirus-like genome RW-A OC Viruses; ssDNA viruses; unclassified ssDNA viruses. XX RN [1] RP 1-2162 RX DOI; 10.1099/vir.0.012955-0. RX PUBMED; 19570956. RA Rosario K., Duffy S., Breitbart M.; RT "Diverse circovirus-like genome architectures revealed by environmental RT metagenomics"; RL J. Gen. Virol. 90(Pt 10):2418-2424(2009). XX RN [2] RP 1-2162 RA Rosario K., Duffy S., Breitbart M.; RT ; RL Submitted (23-APR-2009) to the INSDC. RL College of Marine Science, University of South Florida, 140 7th Ave S, St. RL Petersburg, FL 33701, USA XX FH Key Location/Qualifiers FH FT source 1..2162 FT /organism="Circovirus-like genome RW-A" FT /strain="RW-A" FT /mol_type="genomic DNA" FT /country="USA:Southwest Florida" FT /isolation_source="reclaimed water" FT /note="assembled from metagenome data" FT /db_xref="taxon:642251" FT stem_loop join(2137..2162,1..20) FT /note="contains nonanucleotide sequence (cagtattac)" FT CDS 39..1079 FT /codon_start=1 FT /product="putative Rep protein" FT /note="viral replication protein-like protein (PF02407); FT similar to circovirus replication proteins; ORF1_Rep" FT /db_xref="GOA:C6GIH4" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR003365" FT /db_xref="UniProtKB/TrEMBL:C6GIH4" FT /protein_id="ACQ78155.1" FT /translation="MTSPQGKYWCFTHNNPLVDGDTFLSSLKAYFPKLTYIVFQLERIT FT TPHFQGYMEFSTLVRLSALKRFHPGIHWERRRGSQQDAIDYCSRETYKGEDKGRVDGPW FT EWGTRAETHQGQRSDLASAIEALRDGGIRRVAEENPEALVRYSRGIQFLASLTPPPKDP FT PEVYLLFGPTGVGKTKRFFDSEPDGWASPVTDGLWFDGYMGQDAALFDDFCGKYTKLGM FT AQFLRVIDRYRVQLATKGGFTWFNPKRIYVTSNFHPLDWWDWSGRQQQYPALERRFTHV FT YWWKRPGLLVSLRRPDPEASQDVLDGIDNDDQWRHFWDGVGRAQLALDIATGRLVSNAP FT EDYFNF" FT polyA_signal complement(1201..1206) FT CDS complement(1215..2000) FT /codon_start=1 FT /product="hypothetical protein" FT /note="weak similarity to circovirus structural proteins; FT ORF2" FT /db_xref="UniProtKB/TrEMBL:C6GIH5" FT /protein_id="ACQ78156.1" FT /translation="MGKYTKRRSYGRKSRLKRRRSVKQRRIRRIKRRSGIYTKRTRFPA FT KNPFGDKAYVKLRFNRASYISGDGASSQTTTTYRTINNLADTWLSFESVSKGFLTYPKL FT FRRYKVNGVMVKFTVYQLKNSTGSGEFPSLAWILPYSTLDGTPTVFAQNAIALKSQRHT FT AWANVANWGNGGRSTTVKKFFKMKSLVGANYPSTDIDYSGTTDVLANPYNAPAIEWRFL FT AGISSVAEVALPTTQSYHYNLEMTYYVEFWEQTWEAQGL" FT CDS complement(1460..2005) FT /codon_start=1 FT /product="hypothetical protein" FT /note="ORF3" FT /db_xref="UniProtKB/TrEMBL:C6GIH6" FT /protein_id="ACQ78157.1" FT /translation="MEWESTQSEDLTDVSLDSSEEGLSNSDAYEGSSDEAESTQSEQDF FT PQRIPSETRLMSSSGSIELATSQEMERPLKRQRLTEPSTTSLTHGYPLSQSQKDFSPIL FT SSSEGIKSTESWSNSQSTNSRTPPEAGSFQALHGSCHIRRSTEHLPSSRKTLSRLSLNA FT IRRGLMSQIGGMGGGQRQ" XX SQ Sequence 2162 BP; 473 A; 575 C; 539 G; 575 T; 0 other; acccaccact tcgttcactt tgttaacgag gttcactgat gacatcccct caagggaaat 60 attggtgttt tacgcataat aaccctttgg tcgatggaga taccttcttg tcgtcgctca 120 aggcctactt cccgaagctt acgtacattg tcttccaact cgagcgcatc actacccctc 180 acttccaggg atacatggag ttttctacgc tcgtcagatt gtctgccctc aagaggtttc 240 accctggcat tcactgggag cgacgacgtg gcagtcagca agatgctatc gactactgct 300 cccgtgagac ttacaaaggc gaggacaaag gccgtgtgga cggtccttgg gagtggggca 360 cgcgtgctga gacccaccaa ggacagcgct ccgatctcgc cagtgcgatt gaagccttgc 420 gcgatggagg catccgccgc gttgccgagg agaatcccga ggcattggtg cgctactccc 480 gtggtattca gttcctcgct tcccttacgc cgccccctaa agaccccccc gaagtctacc 540 tcctctttgg tccgaccggc gttggtaaga caaaacgctt ttttgattcc gagcccgatg 600 gctgggcctc ccccgttaca gatggcctct ggttcgatgg ctatatggga caagacgctg 660 ccctcttcga cgacttctgt ggcaagtaca ccaagcttgg aatggcccag tttctccgag 720 tcattgatcg ataccgcgtt caattggcca ctaagggagg atttacctgg tttaacccca 780 aacgaatcta tgtcacctcc aatttccatc cccttgactg gtgggactgg tctggccgcc 840 aacaacagta ccccgccctt gagagaagat tcacccacgt ttactggtgg aaacgacctg 900 gacttctggt ctcccttcgc cgacccgatc ctgaggcctc tcaggacgtc ctcgatggaa 960 ttgataacga cgaccaatgg agacactttt gggatggagt gggacgagcc caactggcgt 1020 tggacattgc caccggaaga ctggtaagta atgcccccga agactatttc aacttttaag 1080 ggtacatcca tctacagcaa cccgcccccc gtttggccgc gacgagcgcc agcgaggcgc 1140 tggccggtga atatggaagc gtagcggtgt agtagcgaaa gtccttaata gtgtaaatca 1200 tttattaaca tcaatcataa accctgtgct tcccaagtct gctcccagaa ctctacatag 1260 taagtcattt caagattgta atgatagctt tgggttgtcg gtaaagccac ttcagctact 1320 gatgatattc cggccaggaa cctccattcg atcgcgggtg cgttgtacgg atttgcgaga 1380 acgtcagttg ttcctgagta gtcaatatct gtgcttgggt agttcgctcc aaccaacgac 1440 ttcatcttga agaacttctt tactgtcgtt gacctccccc cattccccca atttgcgaca 1500 ttagcccacg ccgtatggcg ttgagactta agcgcgatag cgttttgcgc gaagacggta 1560 ggtgttccgt cgagcgtcga atatggcaag atccatgcaa ggcttggaaa ctccccgctt 1620 ccggtggagt tcttgagttg gtagactgtg aatttgacca tgactccgtt gactttatac 1680 cttcggaaga gcttaggata ggtgagaaat ccttttgaga ctgactcaaa ggatagccat 1740 gtgtcagcga ggttgttgat ggttcggtaa gtcgttgtcg tttgagagga cgctccatct 1800 cctgagatgt agctagctcg attgaacctg agcttgacat aagccttgtc tccgaaggga 1860 ttctttgcgg gaaatcttgt tcgctttgtg tagattccgc ttcgtcgctt gatccttcgt 1920 atgcgtcgct gtttgacaga ccttcttcgc ttgagtcgag acttacgtcc gtaagatctt 1980 cgctttgtgt actttcccat tccatatgca aaagaagaaa agaatgatta acctcattca 2040 cttcgatcac attgttcaca caggcggccc ccctttccga ctgtcggact tttttttcct 2100 gtggggaaac cgaacggttt tttctggact gttcggaagt gaacgaggtg gtggtcagta 2160 tt 2162 //