ID   FJ959084; SV 2; circular; genomic DNA; STD; VRL; 1739 BP.
XX
AC   FJ959084;
XX
DT   10-JUL-2009 (Rel. 101, Created)
DT   18-SEP-2009 (Rel. 102, Last updated, Version 3)
XX
DE   Circovirus-like genome SAR-A, complete genome.
XX
KW   .
XX
OS   Circovirus-like genome SAR-A
OC   Viruses; ssDNA viruses; unclassified ssDNA viruses.
XX
RN   [1]
RP   1-1739
RX   DOI; 10.1099/vir.0.012955-0.
RX   PUBMED; 19570956.
RA   Rosario K., Duffy S., Breitbart M.;
RT   "Diverse circovirus-like genome architectures revealed by environmental
RT   metagenomics";
RL   J. Gen. Virol. 90(Pt 10):2418-2424(2009).
XX
RN   [2]
RP   1-1739
RA   Rosario K., Duffy S., Breitbart M.;
RT   ;
RL   Submitted (23-APR-2009) to the INSDC.
RL   College of Marine Science, University of South Florida, 140 7th Ave S, St.
RL   Petersburg, FL 33701, USA
XX
RN   [3]
RC   Sequence update by submitter
RP   1-1739
RA   Rosario K., Duffy S., Breitbart M.;
RT   ;
RL   Submitted (03-AUG-2009) to the INSDC.
RL   College of Marine Science, University of South Florida, 140 7th Ave S, St.
RL   Petersburg, FL 33701, USA
XX
CC   On Aug 3, 2009 this sequence version replaced gi:229562111.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1739
FT                   /organism="Circovirus-like genome SAR-A"
FT                   /strain="SAR-A"
FT                   /mol_type="genomic DNA"
FT                   /isolation_source="Sargasso Sea"
FT                   /note="assembled from metagenome data"
FT                   /db_xref="taxon:642258"
FT   stem_loop       join(1720..1739,1..11)
FT                   /note="contains nonanucleotide sequence (tagtattac)"
FT   TATA_signal     54..59
FT   CDS             126..956
FT                   /codon_start=1
FT                   /product="hypothetical protein"
FT                   /note="similar to structural proteins; ORF1"
FT                   /db_xref="UniProtKB/TrEMBL:C6GIJ0"
FT                   /protein_id="ACQ78171.2"
FT                   /translation="MPGYNFRSNPVKKPSYRFRSKSKPRSQPNKSVKKEVKKNTKDIKL
FT                   LKSVGFQYAPFQQREVGTISNFLHTSLLTAPNNWGGIFRMHGVSNEDLPRQYMMKSVDV
FT                   NWIAQAEASEVGNLWLQVFVVSLKHKMANQVLTRTTRLTNLEEGLDYTSQSAGTAFALQ
FT                   GDLGYKLNPDLYTIHYHSGQRRIGESTVGESPVAVTNINNGTTMGSCRIKWPHTFKNDE
FT                   TSDAGFKELTYQNIEPKQHLYLLVMSNTSGSVITGGELFFSSRAQFNGHVNNPN"
FT   CDS             complement(1143..1682)
FT                   /codon_start=1
FT                   /product="putative Rep protein"
FT                   /note="replication-associated protein similar to circovirus
FT                   Reps; viral replication protein-like protein (PF02407)"
FT                   /db_xref="GOA:C6GIJ1"
FT                   /db_xref="InterPro:IPR003365"
FT                   /db_xref="UniProtKB/TrEMBL:C6GIJ1"
FT                   /protein_id="ACQ78172.2"
FT                   /translation="MPRVNPAKAWCFTLNNYTENEHGALVQRFSDFDDKYYFIVGCEIG
FT                   AQGTPHLQGYIEKKVGRFRPLPCFEVLRDGKNAMHFERAKGNRKQNYNYCSKDGDFITN
FT                   IDKPIMTYSEAKDIWKETNGISNDKYDAATINEAAMHIEYMDLYDCYTDEGQKKFMVRY
FT                   KAMMQARYPEKETVEI"
XX
SQ   Sequence 1739 BP; 498 A; 369 C; 354 G; 518 T; 0 other;
     acccaatccc cttatgtcgt tttgcaccac ctgcaccact gcaccattct agatataaag        60
     acaacgtgtc atatagaatg cacccactct ttcgccattt ttcttttcat ctattttaag       120
     aacaaatgcc tggctataac tttcgttcta accctgttaa aaaaccaagt tatcgatttc       180
     gtagtaagtc taagccgcgc tctcagccta ataagagcgt aaagaaagaa gtgaagaaaa       240
     acaccaaaga tatcaagttg ttaaaatctg tcgggtttca gtatgcacca tttcagcaac       300
     gtgaggtagg taccatttca aatttcctac acactagtct tcttaccgca ccaaacaact       360
     ggggcggcat cttccgtatg catggcgtta gcaatgaaga cttgcctagg cagtatatga       420
     tgaaatccgt agatgtaaat tggatagcac aggctgaggc ttctgaagtc ggtaatcttt       480
     ggttacaagt gtttgttgtg tctcttaaac ataagatggc aaaccaagtt ttgacaagaa       540
     ctacacgatt aacaaacctt gaagaaggac ttgactacac tagtcaaagt gctggcaccg       600
     ctttcgctct acagggtgac cttggttaca agcttaaccc tgatttgtat actatacatt       660
     atcattctgg acagcgccgt attggcgagt ccactgtagg cgagagtccc gttgctgtga       720
     caaatatcaa caatggtaca actatgggtt cttgtagaat taaatggcca catacattca       780
     aaaatgatga gacttctgat gctgggttca aggagctaac atatcaaaat attgaaccaa       840
     agcagcattt gtacctcttg gtgatgtcta atacttcagg tagtgttatc acgggtggag       900
     agttgttctt ctcatcccga gcacagttca acggtcatgt taataatccc aactagacgg       960
     agtattgagt tagggaaaaa aaaccaacag ttttttttac cacttggatt ccaaaggtac      1020
     cctttggcct ccggaggaac taacagaatt tgtcgccaga caaacataga tgtgaaaatc      1080
     ggtgaattct ttataccccg tatggggtac aaagaagagg gctttgccct caggctttag      1140
     cgtcatatct cgacagtttc tttttcggga tagcgagcct gcatcattgc tttgtagcgc      1200
     accatgaact ttttctgacc ttcgtcagtg taacaatcat ataagtccat gtactcgatg      1260
     tgcatggcgg cctcgttaat ggtcgctgca tcatacttat cgttcgagat accattggtt      1320
     tctttccaaa tgtccttggc ttcgctatag gtcatgatag gtttgtcgat gttagtgata      1380
     aaatctccat ctttcgaaca atagttataa ttttgtttac ggtttccttt tgcgcgctca      1440
     aagtgcattg cattcttgcc gtcgcgtaaa acttcaaagc atggcagcgg cctgaaacga      1500
     ccaacttttt tttcaatata tccttgtaaa tgcggtgtac cctgtgcacc aatttcacaa      1560
     ccgacgataa aataatattt gtcgtcaaaa tccgaaaacc gctgcaccag tgcaccatgt      1620
     tcattttcgg tataattatt gagtgtaaaa caccaagctt tggccggatt tactctgggc      1680
     atgattgtag gtagaaaaaa gttagtacgc gtgaaaattg gggattggta aatagtatt       1739
//