ID AB193315; SV 2; circular; genomic DNA; STD; VRL; 6000 BP. XX AC AB193315; XX DT 08-JUL-2005 (Rel. 84, Created) DT 04-OCT-2009 (Rel. 102, Last updated, Version 5) XX DE Chaetoceros salsugineum DNA virus DNA, complete genome. XX KW . XX OS Chaetoceros salsugineum DNA virus OC Viruses; unassigned viruses; Bacillariodnavirus. XX RN [1] RP 1-6000 RA Nagasaki K., Tomaru Y., Takao Y., Nishida K., Shirai Y., Suzuki H., RA Nagumo T., Mizumoto H.; RT ; RL Submitted (19-OCT-2004) to the INSDC. RL Contact:Keizo Nagasaki National Research Institute of Fisheries and RL Environment of Inland Sea, Harmful Algae Control division; 2-17-5, RL Maruishi, Ohno, Saeki, Hiroshima 739-0452, Japan XX RN [2] RX DOI; 10.1128/AEM.71.7.3528-3535.2005. RX PUBMED; 16000758. RA Nagasaki K., Tomaru Y., Takao Y., Nishida K., Shirai Y., Suzuki H., RA Nagumo T.; RT "Previously unknown virus infects marine diatom"; RL Appl. Environ. Microbiol. 71(7):3528-3535(2005). XX FH Key Location/Qualifiers FH FT source 1..6000 FT /organism="Chaetoceros salsugineum DNA virus" FT /lab_host="Chaetoceros salsugineum" FT /mol_type="genomic DNA" FT /country="Japan:Fukuoka, Ariake Sea" FT /note="The viral genome consists of a single molecule of FT covalently closed circular DNA (6,000 bases: the present FT sequence data) as well as a segment of linear DNA (997 bp). FT The linear segment is complementary to a portion of the FT closed circle creating a partially double-stranded genome." FT /note="CsNIV" FT /note="synonym: Chaetoceros salsugineum nuclear inclusion FT virus" FT /db_xref="taxon:674980" FT misc_feature 1..45 FT /note="The adenine proportions of the regions 1 is 82.22 FT %." FT misc_feature 19..1015 FT /note="a partially double-stranded genome." FT CDS complement(93..719) FT /codon_start=1 FT /transl_table=1 FT /product="hypothetical protein" FT /note="open reading frame 4" FT /db_xref="UniProtKB/TrEMBL:Q2L6L9" FT /protein_id="BAE79192.1" FT /translation="MRNFERIMNQVEDEHNLPNHVYANLMDDSPACIATMASDRQWAGA FT ELKEIAGKHYFYAFNTYYDIDQLTNRAERYFLEQKWTLMMYNEFTMYMDMLICGYKFNP FT HYTAKLFWAPTLQRCVDAPIIDIWYQFIYPDEVHYVIDLTSDDGADVPFDTDSIATFDV FT VGPIDNDILDVWTTPTVPAIIDVPNLRIDLRDGHEIIDLTNDIIS" FT misc_feature 1192..1229 FT /note="The adenine proportions of the regions 2 is 79.54 FT %." FT CDS complement(1297..3018) FT /codon_start=1 FT /transl_table=1 FT /product="putative replication-associated protein" FT /note="open reading frame 1" FT /db_xref="GOA:Q2L6L8" FT /db_xref="InterPro:IPR000605" FT /db_xref="UniProtKB/TrEMBL:Q2L6L8" FT /protein_id="BAE79193.1" FT /translation="MNSSSVSFQGFWFWNIFVIDAKSQFTLNAQILIIMEQCPLEDQIT FT ASSSTTPLDSNNRELDEWLDLVMPIEGGSCAEFETPSDISGGGGLDPPEEGVEECIEDD FT SPPQPHPQIHLAPPVYKQDQLPLGLPLDPDDDHLDDITSCMIGDYRSVTKADLLSRCIV FT TFFPKDNDRRWLKPETYFGPNPDNFQCWCGQFEICPRTGALHAHIYFECVRSRRLRFVR FT TAALFRKYHHRVHIKKARTVSKKQRQSAINYVLDDAKRAPGETIFTWHGCKFPVAYDDG FT CKHGRSKKTITQSKEENTEEQRLWIESKPRSWTWDQIVHESDHSKKLLCTCSWGQKYHA FT GRRASDPRRTISNVIILYGAGGTGKTTTAHDWDPREGETQKERYYRRNPDDGHFWGAGR FT SAYNGQRIVHFEEFTGQEAFSRLKEVCDIGKSGPTINIKNSGGELNHETVIFTSNVHPA FT GWFHKLWKDDPKQFHPFWRRVTEVRFYPTHRADGALNIPDQDHPPAFIDQTGEWREFGG FT DYQKALAHADTYWPLKMDSLEPQSMLFNPGGVSNYSEPPFFSYCKTGRDPTNIWGR" FT CDS 1444..2034 FT /codon_start=1 FT /transl_table=1 FT /product="hypothetical protein" FT /note="open reading frame 5" FT /db_xref="UniProtKB/TrEMBL:Q2L6L7" FT /protein_id="BAE79194.1" FT /translation="MSKSLLVIPAELAPLASLVYKGRRVVLIGDVQGPIGPVGRIETYF FT CNTTPERVELLWIIFPKFVEPASRMNVTSENNGFMVQLTTRIFDIDGGTGLSNVAHFFQ FT SAKGLLACKFLEVDNPLPIVGRSASSPEMAIVRVSTIIALFLGFTLTGVPIVGSRGLTS FT TTGTVEDDNIGDGSSWVRGTAPCMVFLTPTAGA" FT CDS complement(1752..2060) FT /codon_start=1 FT /transl_table=1 FT /product="hypothetical protein" FT /note="open reading frame 6" FT /db_xref="UniProtKB/TrEMBL:Q2L6L6" FT /protein_id="BAE79195.1" FT /translation="MRAITPKSSYAPAVGVKNTMQGAVPLTQEEPSPMLSSSTVPVVLV FT RPRLPTIGTPVRVKPRKSAIIVETLTMAISGELADRPTMGKGLSTSRNLQAKRPLAD" FT CDS 3506..4684 FT /codon_start=1 FT /transl_table=1 FT /product="hypothetical protein" FT /note="open reading frame 2" FT /db_xref="UniProtKB/TrEMBL:Q2L6L5" FT /protein_id="BAE79196.1" FT /translation="MARKKSTPRRRKAVKRRRTVRRRQSPKARVRSTTTKAKRRISPSG FT SGSQHLTVRKQPFSNATSQPKILDGALTSSLSRRLQNVIGLTNGNGGLGTEIMHIFFAP FT TLGIPLIAMNSAEGVALRPSSSADPFFIGFPGQTIKFDYVSSGTTPPATGNLVTFSNEC FT GFSKWRIVSQGLRMELANSDEENDGWFEAVRFNWRNNPADICFTPIDGTLGGAKTTDFA FT VAPSPVGMYALKDMAMVEQPGYTTGLLKDLKNHEFMLHPQSTTHDPIILEQSYEGTIGT FT TAADDMYYSVTSEVFELGNTVRGNTMKNSLVDNNMDWIYLRLHCRTNNGTTSNGSKLIV FT NAIQNLEVSFNPSSDFAAFQTINKMHPQQKKVDDQLNNSAEASNKRQKTGGG" FT CDS 4810..5796 FT /codon_start=1 FT /transl_table=1 FT /product="hypothetical protein" FT /note="open reading frame 3" FT /db_xref="UniProtKB/TrEMBL:Q2L6L4" FT /protein_id="BAE79197.1" FT /translation="MQKARDEFNLSLNLGGAAEPGYWSEYLDQADIETFHQQAYANADG FT LAIRINPVTGYKELYVSGSRGVRDHIQNLAEGLSRGIDDYDEWLEAYGAGKKAETLWTE FT TGWGKARAADPMDQQAFEWAKTALSGSELARDYWTQYIDTVIEAEGVEVVYGHSRGAAT FT ISGLKSNVKKIGLDGAMYIAKEDTEFTNLANANLLIPQLPGVVDYAISGGYKHNVYLPN FT RAFHDVARGKDVEKKPKPTGEASAAQRKKQRRQRPRAQKSRERVKELDKLLGRKTDDQK FT KAQYLRNRRKYFKWKKHYKDAKRLGKIVYESYKKHQELKSGVYRRRR" XX SQ Sequence 6000 BP; 1696 A; 1267 C; 1513 G; 1524 T; 0 other; aaaaaataaa aagaaaaaag aaaagaaaag aaagaaaaga taaaatagat atataattaa 60 gtcctaagta ttgttattgc tattagaatt agttaagaga taatgtcatt ggtaagatcg 120 atgatttcat ggccatcgcg gagatcgata cggagattgg ggacatcgat gatagcaggg 180 acggtaggag tagtccaaac gtccaagatg tcattgtcaa tgggaccaac aacgtcgaaa 240 gtagcaatgg agtcggtatc gaacggtaca tcggcgccat cgtcagaggt taagtcgata 300 acatagtgaa cttcgtcggg atagatgaac tgataccaga tgtcaatgat aggagcatcg 360 acgcaacgct gaagagttgg agcccagaac aacttagcag tgtaatgtgg attgaacttg 420 tagccgcaga taagcatgtc catgtacata gtgaactcgt tgtacatcat gagggtccac 480 ttctgctcta ggaagtaacg ctcggcacgg ttggtcaact gatcgatgtc atagtaggtg 540 ttaaaggcat agaagtaatg tttaccagcg atctctttga gctcggcacc ggcccactga 600 cggtcagacg ccattgtggc gatacaagca gggctatcgt ccataaggtt ggcatacaca 660 tggttaggga ggttgtgctc gtcctctact tggttcataa tacgttcgaa gttcctcatg 720 attgatcact tttgtgtttt atgagatgat aaaagtgcaa atgtgaaaat tatgaagctg 780 aagtacgact aaaatccact ctacgcttct cgaatgaccg atgaccgggg ttcggttctc 840 gatttggtac gattcaagtg ggacttgaat agttcttccg cttagaaagt ccgattcccc 900 gatacgaact ccgttaaccc gaagcacgtg ctaacggtta accgtcatat aacggtttaa 960 cggtttttgc tagggttttg ctagggttaa gggcaagtta gggttagggt cgcaccgcga 1020 ggtaacaagg ttagtaggac gaagggagtg gttctcccgt aggacaagga cagacggcag 1080 tgtacgaagg ggacctagca acaccaagta cacttaggag aggctggact aagtgagttg 1140 tatttgtgat cacctttagg tgatcatcag gtactacgag agcgtagcgg taaaaaaata 1200 aaaaaaaaga attaagaaag agaaaaaaag gaaaacaggg aacttaattt agctgtgtaa 1260 cagcgctatg gggtgactaa aaaagactaa gaatctctaa cgtccccaga tgttagtagg 1320 atctctacca gtcttacagt atgaaaagaa ggggggttca gaatagttgc taaccccacc 1380 ggggttaaag agcatggatt gaggctctaa actatccatc ttaagaggcc aataagtatc 1440 tgcatgagca agagcctttt ggtaatcccc gccgaactcg cgccactcgc cagtttggtc 1500 tataaaggcc ggagggtggt cttgatcggg gatgttcagg gccccatcgg ccctgtgggt 1560 aggatagaaa cgtacttctg taacacgacg ccagaaaggg tggaactgct ttggatcatc 1620 tttccaaagt ttgtggaacc agccagccgg atgaacgtta ctagtgaaaa taacggtttc 1680 atggttcagc tcaccaccag aatttttgat attgatggtg ggaccggact ttccaatgtc 1740 gcacacttct ttcaatcggc taaaggcctc ttggcctgta aattcctcga agtggacaat 1800 cctttgccca ttgtaggccg atctgccagc tccccagaaa tggccatcgt cagggtttcg 1860 acgataatag cgctctttct gggtttcacc ctcacggggg tcccaatcgt gggcagtcgt 1920 ggtcttacca gtaccaccgg caccgtagag gatgataaca ttggagatgg ttcttcttgg 1980 gtcagaggca cggcgccctg catggtattt ttgaccccaa ctgcaggtgc ataagagctt 2040 tttggagtga tcgctctcat ggactatttg gtcccatgtc caggagcgtg gcttcgattc 2100 aatccataaa cgttgctctt ccgtgttctc ttccttcgac tgagtaatgg tctttttgga 2160 ccgtccatgt ttacacccgt cgtcataggc gactggaaac ttgcagccat gccaagtaaa 2220 gatggtttca ccaggagcac gcttggcgtc atcgagtaca taattgatcg cagattgacg 2280 ttgtttttta gaaaccgtgc gagctttctt tatatggacc ctatggtgat acttgcggaa 2340 taatgctgct gtacgcacga agcgcagacg ccgacttctg acgcactcaa agtatatatg 2400 agcatgtaag gcaccagtcc taggacatat ttcaaactgt ccacaccagc attggaaatt 2460 gtcagggtta ggtccgaaat aggtctcagg cttaagccaa cgcctatcgt tgtccttagg 2520 gaagaaggtg actatgcagc ggctaagtaa atcagcctta gtgacagatc tgtaatcccc 2580 tatcatgcaa gacgtaatat cgtcgagatg atcatcatca gggtctagcg ggagccccaa 2640 agggagttgg tcctgcttat acactggagg tgctagatga atctgtggat gcggttgtgg 2700 aggtgagtcg tcttcgatac actcctccac cccctcctcc ggtgggtcta acccgcctcc 2760 acctgatata tctgagggcg tttcgaattc tgcacatgag ccgccttcta tcggcatcac 2820 caagtccagc cactcgtcta gctcgcggtt gttcgagtct agtggcgttg tcgatgatga 2880 tgcggtgatt tgatcttcca atggacattg ttccatgatg attaaaattt gagcattgag 2940 tgtgaactgt gacttggcat cgatcacaaa aatgttccag aaccagaagc cctggaagct 3000 cactgaactg gaattcattt gtgttcctac acgactcaaa taactgattt aacttaacag 3060 tttctacgtt acccgttagg ctaaaagggg tgagcccgag gcttgttcga atctcgccaa 3120 ctggcctgcc aggaacccct ttttggagca ggctcccgcc gctcgataca ttttcacgtg 3180 agcttgcgag cgtagaaaat gtgtaggcgg gactagcgac gcattggagc gatctgattg 3240 gtctagacaa acgggtacag ttaagggctt aagtagtaca taagccccag ccacgtgcta 3300 acggctgtgc agcaaaacgg ataaaataac attcttcgtt ttggtgcaca gcacgtagaa 3360 actgtaccag gagcctacgt acactttttg aatcccacca ccaaaagtga gtaggcgacc 3420 ccccaaaagt gtacacttta taaagggtac accccccgct ttagttttag gtaccgctca 3480 taaaacaaga agcagtactg taagtatggc gcgtaaaaaa agcacaccaa ggagaagaaa 3540 agcggttaaa cgccggcgta ccgttaggag acgtcagtct cctaaagcac gggttcgtag 3600 tacaactacg aaagccaaaa ggcgtatatc gccctcgggc tcaggctcac agcacttaac 3660 tgtgagaaag cagccctttt cgaacgcaac ttcacaacca aagattcttg acggagccct 3720 tacctcctcg ttatcgagga ggcttcagaa cgtgatcggt ttgacaaatg ggaatggggg 3780 tctcggaacc gagatcatgc acattttctt cgcccccacg cttggaattc cattgatagc 3840 aatgaattct gctgaagggg tcgctctccg tccatcatct tccgcagacc catttttcat 3900 cggattcccc ggtcaaacta ttaagtttga ctatgtttcg tccggtacta cacctccagc 3960 tacaggaaac ctagtgacgt ttagtaacga atgtgggttt tctaaatgga gaatcgtgag 4020 ccaagggctt cgaatggagt tagctaacag tgacgaagag aatgatgggt ggtttgaagc 4080 ggttcgcttt aactggagga acaatccagc ggatatttgt ttcactccga ttgatggtac 4140 gctaggtggg gccaagacga cggactttgc tgtagcccct agtccggttg gcatgtacgc 4200 attaaaggac atggcaatgg tggaacagcc tgggtacact acaggcttac tcaaggattt 4260 gaagaatcat gagtttatgt tacacccaca gagtacgact catgatccaa tcatattgga 4320 gcagagctac gagggtacga ttggaaccac cgcggcagat gatatgtatt actctgttac 4380 ttccgaagtc tttgagcttg ggaatacagt aagaggtaat actatgaaga actcgttagt 4440 tgataataac atggattgga tttacctccg actccattgt aggacaaata acggtactac 4500 ttctaacggt tctaaactga tagttaacgc tatccagaac cttgaagtat cttttaatcc 4560 ctccagtgat ttcgcggcat ttcaaactat aaataaaatg catccacaac agaagaaggt 4620 cgatgatcag cttaataatt cagctgaagc ttctaacaag aggcaaaaga ccggcggagg 4680 ttagtgatgt attttgttta gagtttgtct ggtgtatttt taacgtaata ttttgtttcc 4740 tcacatacag aatatgaagg agaaaatatt gaagatcctc ctgaattggt tgaggaagtt 4800 agccctgata tgcagaaggc ccgagacgaa ttcaatctca gtctcaatct cggaggagcg 4860 gcagaaccag gatattggag tgaatatctt gatcaggccg atattgagac gttccaccaa 4920 caagcttacg ccaatgcgga tgggctcgct atccgtatca acccggttac cggatacaaa 4980 gaactctacg tctccggaag ccggggagta agggatcaca tacagaatct agccgaaggc 5040 ctatcacgag gaatcgatga ttacgatgag tggttagagg cctatggcgc aggaaagaag 5100 gctgagacat tatggacgga aactggttgg ggaaaagcac gtgctgcgga tcctatggat 5160 cagcaggcct ttgaatgggc caagactgct ctaagtggct cagaacttgc acgtgattat 5220 tggactcagt atatagatac tgttattgag gctgaaggcg tagaagttgt ctacggccat 5280 agtcgtggcg cagccacgat ttcaggctta aagtccaatg ttaagaagat cggtcttgac 5340 ggagctatgt atatagcaaa ggaagacacc gaatttacta atttagctaa tgctaatctt 5400 cttataccgc agcttccggg tgttgtcgat tatgcgatat ccggtggcta taagcataat 5460 gtttacctac ccaacagggc atttcatgat gttgcccgtg gtaaagacgt tgagaagaaa 5520 ccgaagccaa ctggcgaagc cagtgcagct caaagaaaga aacaacgtcg ccaaagacca 5580 cgcgcacaga agtctcgcga gagagtgaag gagcttgata agcttttagg ccgtaaaacg 5640 gacgatcaga aaaaagctca atacttgcgc aacagacgta agtatttcaa gtggaagaag 5700 cattataaag atgcgaagcg tcttggaaag atcgtatacg aatcttacaa gaaacatcag 5760 gaactgaaga gtggagttta ccgccgtagg cggtagtcag tatagaagat cgagtctgta 5820 ctgacgtaag catccgtgat tgacaattac cacgacttaa aagaattttt agaagttttg 5880 gtatgttagt taatttaagg atgtaagctc ccccctttag cgacatttag gtgggtagtg 5940 ttttaaactc taagggatga gtgagttcac taccctttta tttttaggga gaatggggtt 6000 //