ID   FJ959080; SV 1; circular; genomic DNA; STD; VRL; 2072 BP.
XX
AC   FJ959080;
XX
DT   10-JUL-2009 (Rel. 101, Created)
DT   18-SEP-2009 (Rel. 102, Last updated, Version 2)
XX
DE   Circovirus-like genome RW-D, complete genome.
XX
KW   .
XX
OS   Circovirus-like genome RW-D
OC   Viruses; ssDNA viruses; unclassified ssDNA viruses.
XX
RN   [1]
RP   1-2072
RX   DOI; 10.1099/vir.0.012955-0.
RX   PUBMED; 19570956.
RA   Rosario K., Duffy S., Breitbart M.;
RT   "Diverse circovirus-like genome architectures revealed by environmental
RT   metagenomics";
RL   J. Gen. Virol. 90(Pt 10):2418-2424(2009).
XX
RN   [2]
RP   1-2072
RA   Rosario K., Duffy S., Breitbart M.;
RT   ;
RL   Submitted (23-APR-2009) to the INSDC.
RL   College of Marine Science, University of South Florida, 140 7th Ave S, St.
RL   Petersburg, FL 33701, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2072
FT                   /organism="Circovirus-like genome RW-D"
FT                   /strain="RW-D"
FT                   /mol_type="genomic DNA"
FT                   /country="USA:Southwest Florida"
FT                   /isolation_source="reclaimed water"
FT                   /note="assembled from metagenome data"
FT                   /db_xref="taxon:642254"
FT   stem_loop       join(2048..2072,1..17)
FT                   /note="contains nonanucleotide sequence (aagtattac)"
FT   CDS             29..1078
FT                   /codon_start=1
FT                   /product="hypothetical protein"
FT                   /note="similar to structural proteins; ORF1"
FT                   /db_xref="UniProtKB/TrEMBL:C6GII1"
FT                   /protein_id="ACQ78162.1"
FT                   /translation="MRRRPAYVGPSWGGSRIGGANIPMNTRAKRRRRKIIPLLWNSGSK
FT                   LANNLNLGFGSRGRGPGSMVSRIRNYKMEISVKPVGTGSSYSYYRYTRRPDKGSRIVKT
FT                   QQAPLFSVFNSGSRMTSLQGEQGYTTLSVMTGAKLRSLYLETSANQDGRFWVGYARCRW
FT                   MFQNQSEATTHLTIYEFTTRRDSNVGPSLAFQSGLASIQAGVGSNASDIGATPFMTPRF
FT                   TENFRILKKYSVELAQGRSHIHTALYRMNKNYSDSLYQMDGSSDVVLGGWTRGLFVIAH
FT                   GTPYNSFATKTNVSTTPVAIDIVANETYHTYTNLQNKSIFEVQSDFPTFTDGRIIDIGS
FT                   GDPEAVDEV"
FT   TATA_signal     1074..1079
FT   CDS             1147..2067
FT                   /codon_start=1
FT                   /product="putative Rep protein"
FT                   /note="replication-associated protein similar to circovirus
FT                   Reps; viral replication protein-like protein (PF02407);
FT                   ORF2_Rep"
FT                   /db_xref="GOA:C6GII2"
FT                   /db_xref="InterPro:IPR000605"
FT                   /db_xref="InterPro:IPR003365"
FT                   /db_xref="UniProtKB/TrEMBL:C6GII2"
FT                   /protein_id="ACQ78163.1"
FT                   /translation="MLAQAPGRYRNWVITVNNWTENDYKLALLSPYRYIIIGRERGECN
FT                   TPHLQIYLQLYHAKSFDTVKQNFFPRAHLEASHSKPRQARAYCTKEHYFEYGEMSTQGK
FT                   RTDLQVAQELLDSGISIRQALESEMITSMGALAAYEKLQKYYTVHRPRPRVVWIYGPAG
FT                   SGKTDKAYSISGPDVYKSDLIKEGWFDGYDRHRSIIIDDLEIDRDDKKTFGLLLSLLDK
FT                   NPQKVNVKGSSASILADTIVITCQQAPWHIWYHPSDNMSLPFKHMATREEIERDVDLRQ
FT                   IMRRITEIIHLTDTDKVKYPEVTEV"
XX
SQ   Sequence 2072 BP; 677 A; 390 C; 428 G; 577 T; 0 other;
     acctttagtc acttctgtgc aacatactat gagaagaaga cccgcatacg taggccccag        60
     ctggggcggt tcacgcatcg gcggcgcaaa tattccgatg aatacaagag ctaaacgccg       120
     ccgtagaaag ataataccct tactgtggaa ttcaggcagt aagttagcca ataacctaaa       180
     cttaggtttt ggctctagag gaagaggacc tggctcaatg gttagcagga tccgtaatta       240
     taaaatggaa ataagtgtaa aaccagtagg tactggttca tcttattctt attaccgtta       300
     tacgcgcaga cctgacaagg gatcgcgtat tgtaaaaaca caacaggctc cattgtttag       360
     tgttttcaat tctggctcac gaatgacatc tttacaaggt gagcaaggat atacaactct       420
     ttcagtaatg actggtgcaa agctccgttc attatatctt gaaacatctg ccaaccaaga       480
     tggtagattt tgggttggct atgcacgttg tagatggatg ttccagaacc aatctgaggc       540
     tacaacgcat ttaacaattt atgaatttac aacacgacgt gattcaaacg tcggtcctag       600
     cttagctttc caatctggat tagcttcaat tcaagctggt gtaggttcaa atgcctctga       660
     tataggagct acaccgttca tgacaccacg ttttacagaa aactttagaa ttctgaaaaa       720
     gtatagtgtt gaactagctc aaggtcgctc acatattcat actgcactat acagaatgaa       780
     taaaaactat agcgactcac tatatcagat ggatggatca tctgatgtag tgctaggtgg       840
     atggacacgt gggttatttg taatcgccca cggtacacca tataacagtt ttgcaacgaa       900
     gacaaatgtc tcgacaaccc ctgttgcgat agacattgtc gctaatgaaa cgtatcatac       960
     ttacacaaat ttacaaaaca aatcaatatt tgaagttcaa tctgacttcc caacattcac      1020
     agatggtcga attatcgaca tcggtagtgg tgatccggaa gcagttgatg aggtataaaa      1080
     tgtacaattt tacacatata gacgattggc aggcctgggc cattctcctt tggtttaacc      1140
     aaggacatgc tggcccaagc gcctggtaga tatagaaatt gggttataac agttaataat      1200
     tggactgaaa acgattataa acttgcactt ttatctcctt atcgttatat aataataggt      1260
     agggagagag gtgagtgcaa cacaccgcat ttacaaattt atttacaatt gtatcatgcg      1320
     aagtcatttg acactgttaa acaaaacttt tttcctcgag ctcatttaga agcaagccat      1380
     agtaaacctc gtcaggctcg agcgtattgt actaaagagc actatttcga gtacggggaa      1440
     atgtctaccc aaggtaaacg aacagatcta caggtagctc aagagctact agattctggt      1500
     attagtatac gacaagcatt agaatccgaa atgattacat caatgggtgc cttagcagca      1560
     tatgagaaat tacaaaaata ttataccgtt catcgtccta gaccgcgagt ggtatggatt      1620
     tacggtccgg ctggatctgg aaagactgat aaggcatata gcatcagcgg tccagatgtt      1680
     tataaatccg acttaataaa ggagggatgg tttgatggtt atgatagaca tcgatcgatt      1740
     atcatcgacg atttggaaat tgatagagat gataagaaaa cattcgggtt attactttca      1800
     ctattagata aaaatccaca aaaagtaaat gtcaaaggtt cgtctgcatc aatattggct      1860
     gataccattg tcatcacctg tcaacaggct ccctggcata tttggtatca tcctagtgac      1920
     aatatgtctt tgccttttaa gcatatggct acaagagaag aaatagaaag agatgtcgat      1980
     ttaagacaga tcatgcgtag aattacagaa ataatacatt taacagatac agataaagta      2040
     aaatatccag aagtgactga agtataagta tt                                    2072
//