ID HQ407395; SV 1; circular; genomic DNA; STD; VRL; 2768 BP. XX AC HQ407395; XX DT 14-DEC-2010 (Rel. 107, Created) DT 14-DEC-2010 (Rel. 107, Last updated, Version 1) XX DE Tobacco curly shoot virus isolate WSF1 segment A, complete sequence. XX KW . XX OS Tobacco curly shoot virus OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RP 1-2768 RA Singh M., Mandal B., Varma A.; RT "First report of a new strain of Tobacco curly shoot virus associated with RT satellite molecules infecting wild sunflower (Helianthus spp.) in India"; RL Unpublished. XX RN [2] RP 1-2768 RA Singh M., Mandal B., Varma A.; RT ; RL Submitted (15-OCT-2010) to the INSDC. RL Advanced Centre for Plant Virology, Indian Agricultural Research Institute, RL Pusa Road, New Delhi 284001, India XX DR MD5; c529587d03054e09e031c53270081838. XX FH Key Location/Qualifiers FH FT source 1..2768 FT /organism="Tobacco curly shoot virus" FT /segment="A" FT /host="wild sunflower" FT /isolate="WSF1" FT /mol_type="genomic DNA" FT /country="India" FT /collection_date="2010" FT /db_xref="taxon:180526" FT gene 135..482 FT /gene="AV2" FT CDS 135..482 FT /codon_start=1 FT /gene="AV2" FT /product="AV2" FT /note="pre-coat protein" FT /db_xref="GOA:E5LCF1" FT /db_xref="InterPro:IPR002511" FT /db_xref="InterPro:IPR005159" FT /db_xref="UniProtKB/TrEMBL:E5LCF1" FT /protein_id="ADR79361.1" FT /translation="MWDPLVNEFPETVHGFRCMLAVKYLQLVEKTYSPDTLGHDLIRDL FT ISVIRARNYVEATSRYNHFHARFEGTPTSQLRQPICEPCSCPHCPRHQSKSMGEQAHEQ FT KAQDVQDVQKS" FT gene 295..1065 FT /gene="AV1" FT CDS 295..1065 FT /codon_start=1 FT /gene="AV1" FT /product="AV1" FT /note="coat protein" FT /db_xref="GOA:E5LCF2" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="UniProtKB/TrEMBL:E5LCF2" FT /protein_id="ADR79362.1" FT /translation="MSKRPADIIISTPASKVRRRLNFDSPYVSRAPAPIVRVTKARAWA FT NRPMNRKPRMYRMYRSPDVPRGCEGPCKVQSFESRHDIQHIGKVMCVSDVTRGTGLTHR FT VGKRFCVKSVYVLGKIWMDENIKTKNHTNSVMFFLVRDRRPVDKPQDFGEVFNMFDNEP FT STATVKNVHRDRYQVLRKWHATVTGGQYASKEQALVKKFVRVNNYVVYNQQEAGKYENH FT SENALMLYMACTHASNPVYATLKIRIYFYDSVTN" FT gene complement(1062..1466) FT /gene="C3" FT CDS complement(1062..1466) FT /codon_start=1 FT /gene="C3" FT /product="C3" FT /db_xref="GOA:E5LCF3" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:E5LCF3" FT /protein_id="ADR79363.1" FT /translation="MDSRTGELITAAQAENGVFIWEIQNPLYFKITEHNNRPFLMEADI FT ITVQIQFNYNLRKALGIHKCFLAYRIWMTSQPPTGQFLRVFKTQVLKYLNSLGIISINN FT VIRAVDHVLWNVLEHIVYVDQSSSIKFNIY" FT gene complement(1207..1611) FT /gene="C2" FT CDS complement(1207..1611) FT /codon_start=1 FT /gene="C2" FT /product="C2" FT /db_xref="GOA:E5LCF4" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:E5LCF4" FT /protein_id="ADR79364.1" FT /translation="MRPSSPSKAHSTQVPIKVQHKIAKKGTRRRRVDLPCGCSYFIALA FT CHDHGFTHRGTHHCSSSREWRVYLGDSKSPVFQDNRTQQPAISYGSGHHHRPNTVQLQP FT EESVGDTQVFSSLPNLDDFTASDWSILKGL" FT gene complement(1514..2617) FT /gene="C1" FT CDS complement(1514..2617) FT /codon_start=1 FT /gene="C1" FT /product="C1" FT /db_xref="GOA:E5LCF5" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:E5LCF5" FT /protein_id="ADR79365.1" FT /translation="MAPPNKFRINAKNYFLTYPHCSLTKEEALSQLKNLETPTNKLFIR FT ICRELHEDGSPHLHVLIQFEGKFQCKNQRFFDLTSPTRSAHFHPNIQGAKSSTDVKTYM FT EKDGDVLDYGVFQVDGRSARGGCQSANDAYAEAINSGSKHEALNILREKAPKGLCFYNF FT IILKFKFSIGFFTPPMEGLCFSFFYLLLFESSSLKNLKEWTCLKNVNECRLRGLLRPMS FT IIIEGDSRTGKTMWARSLGPHNYLCGHLDLSPKVYNNDAWYNVIDDVDPHYLKHFKEFM FT GAQRDWQSNTKYGKPVQIKGGIPTIFLCNPGPNSSYKEFLDEEKNAALKNWALKNATFI FT TLEGPLYSGSNQSAAQDSQEGDEASTC" FT gene complement(2167..2460) FT /gene="C4" FT CDS complement(2167..2460) FT /codon_start=1 FT /gene="C4" FT /product="C4" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:E5LCF6" FT /protein_id="ADR79366.1" FT /translation="MDLLTCMFSFNSKENSNAKTKGSSISHPQPGQHISIRTFRELRAQ FT QMSKPTWKKTETCLIMEFSKSMEDQLEEVANLPTTHMPKQSIQGQNMRPSIY" XX SQ Sequence 2768 BP; 747 A; 535 C; 620 G; 866 T; 0 other; accggatggc cgcgaatttt ttttgtggcc cccgcagcgc atttacatgt ggaccaatga 60 aattggctcc tcgtgactta attgttttgt ggtcccctat ttaaacttgc tcaccaagta 120 gtgcactccg cactatgtgg gatccattag taaacgagtt tcccgaaacc gttcacggtt 180 ttagatgtat gttagcagtt aaatatctgc agttagtaga gaagacttat tcgcctgaca 240 cattagggca cgatttaatt agggatttaa tttcagttat tagggctaga aattatgtcg 300 aagcgaccag cagatataat catttccacg cccgcttcga aggtacgccg acgtctcaac 360 ttcgacagcc catatgtgag ccgtgctcct gcccccattg tccgcgtcac caaagcaaga 420 gcatgggcga acaggcccat gaacagaaag cccaggatgt acaggatgta cagaagtcct 480 gatgtcccta gaggatgtga aggtccatgt aaggtccaat cttttgagtc tagacatgac 540 attcagcata taggtaaagt tatgtgtgtc agtgatgtta cgcgtggaac tgggctgact 600 catcgagtgg gtaaaaggtt ttgtgttaaa tccgtttatg tcttgggtaa gatctggatg 660 gatgaaaata ttaagaccaa gaaccacact aacagtgtta tgtttttttt agttagggat 720 cgtaggcctg tcgataaacc tcaagatttt ggagaggtct ttaacatgtt tgataatgag 780 cccagtacgg ctactgtgaa gaatgttcat cgtgataggt atcaagtgct tcggaaatgg 840 catgcaactg ttactggtgg tcaatatgcg tcaaaggaac aagctcttgt gaagaagttt 900 gttagggtta ataattatgt tgtgtataac cagcaagaag ctggcaagta tgaaaatcat 960 tctgagaatg ctttaatgtt gtatatggca tgtactcatg cttctaaccc agtgtatgct 1020 actttgaaga tacggatata cttctatgat tccgtaacaa attaatatat attgaatttt 1080 attgaagaag attggtctac atatacaata tgttctaata cattccacaa tacatgatca 1140 actgcacgaa ttacattatt aatactgata attcctaaac tatttaaata cttaagcact 1200 tgggtcttaa agacccttaa gaattgacca gtcggaggct gtgaagtcat ccagattcgg 1260 taggctagaa aacacttgtg tatccccaac gctttcctca ggttgtaatt gaactgtatt 1320 tggacggtga tgatgtccgc ttccataaga aatggccggt tgttgtgttc tgttatcttg 1380 aaatacaggg gattttgaat ctcccagata aacacgccat tctctgcttg agctgcagtg 1440 atgagttccc ctgtgcgtga atccatggtc gtggcaggct aatgctatga agtatgaaca 1500 cccacaaggg agatcaacac gtcgacgcct cgtccccttc ttggctatct tgtgctgcac 1560 tttgattgga acctgagtag agtgggcctt cgagggtgat gaaggtcgca ttctttaaag 1620 cccagttctt gagtgcagca ttcttctcct catccaagaa ctctttataa ctggaattgg 1680 gtcctggatt gcagaggaag atagtgggaa ttccgccttt aatttgaact ggctttccgt 1740 atttcgtgtt gctttgccag tccctttggg cccccatgaa ttccttaaag tgctttaggt 1800 agtggggatc tacgtcatca atgacgttgt accaggcatc attattgtag accttaggac 1860 taaggtctaa gtgaccacac aggtaattat gtggacccaa agatctagcc cacatggtct 1920 ttcctgttct actatcaccc tctatgataa tactcatggg tctcaatagg ccgcgcaggc 1980 ggcactcatt cacatttttc aagcaggtcc attctttcaa gtttttcagg gaacttgatt 2040 caaaaagaag aagataaaaa aaggagaaac ataaaccctc cataggaggc gtaaaaaatc 2100 ctatactaaa tttgaatttt aaaattatga aattgtaaaa acataatcct tttggagcct 2160 tctcccttaa tatattgagg gcctcatgtt ttgaccctga attgattgct tcggcatatg 2220 cgtcgttggc agattggcaa cctcctctag ctgatcttcc atcgacttgg aaaactccat 2280 aatcaagcac gtctccgtct ttttccatgt aggttttgac atctgttgag ctcttagctc 2340 cctgaatgtt cggatggaaa tgtgctgacc tggttgggga tgtgagatcg aagaaccttt 2400 ggtttttgca ttggaatttt ccttcgaatt gaatgagaac atgcaggtga ggagatccat 2460 cttcgtgtag ttccctacag attctgatga ataatttatt ggttggggtt tctaggtttt 2520 ttaattggga aagtgcttcc tctttcgtta gggagcagtg tgggtatgtg aggaaataat 2580 ttttggcatt tatcctgaat ttattaggag gagccattga cttggtcaat cggtgtctag 2640 caaacttggc tatgcaattg gtgtctggtg tcttatttat atgtgtacac ttaatggcat 2700 atttgtaatt ttggaaagtg ctttaattca aatttcaaaa ttcccaaagc ggccatccgt 2760 ataatatt 2768 //