ID FJ959085; SV 1; circular; genomic DNA; STD; VRL; 1833 BP. XX AC FJ959085; XX DT 10-JUL-2009 (Rel. 101, Created) DT 18-SEP-2009 (Rel. 102, Last updated, Version 2) XX DE Circovirus-like genome SAR-B, complete genome. XX KW . XX OS Circovirus-like genome SAR-B OC Viruses; ssDNA viruses; unclassified ssDNA viruses. XX RN [1] RP 1-1833 RX DOI; 10.1099/vir.0.012955-0. RX PUBMED; 19570956. RA Rosario K., Duffy S., Breitbart M.; RT "Diverse circovirus-like genome architectures revealed by environmental RT metagenomics"; RL J. Gen. Virol. 90(Pt 10):2418-2424(2009). XX RN [2] RP 1-1833 RA Rosario K., Duffy S., Breitbart M.; RT ; RL Submitted (23-APR-2009) to the INSDC. RL College of Marine Science, University of South Florida, 140 7th Ave S, St. RL Petersburg, FL 33701, USA XX FH Key Location/Qualifiers FH FT source 1..1833 FT /organism="Circovirus-like genome SAR-B" FT /strain="SAR-B" FT /mol_type="genomic DNA" FT /isolation_source="Sargasso Sea" FT /note="assembled from metagenome data" FT /db_xref="taxon:642259" FT stem_loop join(1801..1833,1..26) FT /note="contains nonanucleotide sequence (tagtattac)" FT polyA_signal 38..43 FT CDS 63..1181 FT /codon_start=1 FT /product="hypothetical protein" FT /note="similar to structural proteins; ORF1" FT /db_xref="UniProtKB/TrEMBL:C6GIJ2" FT /protein_id="ACQ78173.1" FT /translation="MVRMTTGRRKTFRSIKKVATRGKDVIDTVIEATEKGKEFVDNISQ FT MPVKEVMTTAAKLAQSKKSPIGATGLNLLPKEARSVVSSTDAIGDFTSSASMFMYRPPR FT KSNRAGFTKYQLKTGATRTNSSLSNQVGTSDMNILDAIQVENNPDTDAKYSNLSVKEAF FT DNYLLAAGLKDTAGTSYDQKIQQTSIHVSSLQSELVITNNNNDTVMVDLYELVPQHTLG FT PSQYVNENQATGYMSPVWTWTQGLSDTVMVDDALSRSYFQANPFNSSLFSRTWKIVKQV FT RANISGGSTHRHKSAYMINKTVSYPEMAQFTTRGGKFAGWNPTFMIVQQGVPTSAAQEG FT VASSITVRMNAQLNYEASAVEQARVIVFDSKT" FT CDS 1193..1795 FT /codon_start=1 FT /product="putative Rep protein" FT /note="replication-associated protein similar to circovirus FT Reps; viral replication protein-like protein (PF02407); FT ORF2_Rep" FT /db_xref="GOA:C6GIJ3" FT /db_xref="InterPro:IPR003365" FT /db_xref="UniProtKB/TrEMBL:C6GIJ3" FT /protein_id="ACQ78174.1" FT /translation="MASQHGWVFTLNNPSIEEYPLKFEETSSGRRYPIAKWELPRATLA FT FLGCGRERGQSNTPHLQGLLISSRPVSLKSLKKLNPRAHWEPMRGSFKQAREYCEKEGK FT FVQWTKSGESVASLSRYVQTDLEIFQMIQQSDTDLNKLSVTMLKQEKKLAELSKRIEDL FT TKLQSHFLLKQDNFQNSILKLLSQKKILNTLSDEDLI" XX SQ Sequence 1833 BP; 603 A; 384 C; 406 G; 440 T; 0 other; acccaaagac tcgtacaccc cccgccgccc cgcccccaat aaataactca aagaggtata 60 atatggtcag aatgacaaca ggacgaagaa aaactttccg ttccataaaa aaggttgcga 120 caagaggtaa agatgtcata gacaccgtta tcgaagcaac cgaaaaaggc aaagaatttg 180 tcgataatat ttctcaaatg cccgttaagg aagtaatgac gacagctgcc aagctagcac 240 aatcgaagaa atctccgatt ggagccaccg ggttaaatct tttaccaaaa gaggcaagaa 300 gcgtagtctc gagcaccgac gcgatcggag attttacctc atcggcgagc atgttcatgt 360 atcgtccacc acgaaaaagt aatcgtgctg gatttactaa gtaccaattg aaaactggtg 420 cgactcgaac gaattcctca ctgagcaatc aggtaggaac gtcagatatg aatatcttag 480 acgctatcca ggttgagaac aatcctgaca cggatgcaaa atactcaaac ctttcagtaa 540 aggaagcctt cgataattat ctattagcgg ctggcctgaa ggatactgct ggtacgtctt 600 acgaccagaa gatccagcag acgtcaatcc atgtatcatc attgcagtcg gagttagtaa 660 taacgaacaa caataacgac accgtaatgg tggatttgta tgagttggta ccccaacata 720 cactaggtcc gagtcaatat gttaacgaga accaagcgac aggttatatg tcgccggtct 780 ggacctggac acaagggttg tctgacacag tcatggtcga cgacgcactt agtcgttcgt 840 acttccaagc aaacccgttc aattcctcgt tattttcgag aacgtggaaa attgtaaaac 900 aggttcgtgc caacataagt ggtggatcaa cgcaccgtca caaatcggct tatatgatta 960 ataagacggt gagctatcct gagatggctc aatttacaac tcgaggtggt aaatttgctg 1020 gatggaaccc tacattcatg atagtacaac aaggtgtacc tacttcggca gcacaagaag 1080 gagtagcgtc aagcataaca gttcgaatga atgctcagct gaattatgag gcttcagcag 1140 ttgaacaagc acgtgtgatt gtctttgact cgaaaaccta agtggcgtct ggatggcgag 1200 tcagcatggt tgggttttca ccctaaacaa tccaagtatc gaagagtacc cattgaagtt 1260 tgaagaaaca tctagcggta gacgataccc tatcgccaag tgggagttgc ctcgtgcaac 1320 tctcgctttc ttaggttgtg gtagagaacg aggtcagtcg aatacaccac acctacaagg 1380 acttttaata agttcacgac cagtttcatt aaagagtttg aagaagttaa accctagggc 1440 ccactgggaa ccaatgcgag gttcgttcaa gcaagctcgt gaatactgcg aaaaagaagg 1500 caagtttgtc caatggacaa aatccggtga gtcagtagca agtctgtcac gttatgttca 1560 gactgatttg gaaatatttc aaatgatcca acaatcagat actgatttaa acaagttatc 1620 ggtaacgatg cttaagcaag aaaaaaaact tgcagaatta tctaaaagaa tagaagattt 1680 aacaaagctt cagtctcact ttctcctaaa acaagacaat ttccagaatt ccattcttaa 1740 attattatcg caaaagaaaa ttttgaacac attatcagac gaagatctga tataatagta 1800 ggtggggggt gggagagacg aagggctagt att 1833 //