ID   EU636992; SV 1; linear; genomic RNA; STD; VRL; 5682 BP.
XX
AC   EU636992;
XX
DT   21-JUN-2010 (Rel. 105, Created)
DT   21-JUN-2010 (Rel. 105, Last updated, Version 1)
XX
DE   Cucurbit aphid-borne yellows virus from China: Xinjiang, complete genome.
XX
KW   .
XX
OS   Cucurbit aphid-borne yellows virus
OC   Viruses; Riboviria; Luteoviridae; Polerovirus.
XX
RN   [1]
RP   1-5682
RA   Xiang H., Shang Q., Yang D., Han C., Li D., Yu J.;
RT   "Sequence analysis of CABYV isolated from Xinjiang in China";
RL   Unpublished.
XX
RN   [2]
RP   1-5682
RA   Han C.;
RT   ;
RL   Submitted (14-APR-2008) to the INSDC.
RL   Plant Pathology, China Agricultural University, No.2 Yuanmingyuan West
RL   Road, Haidian District, Beijing 100193, China
XX
DR   MD5; 079d1cc7ac0916cdcf74670efcf50abd.
DR   EuropePMC; PMC4337425; 24372390.
DR   EuropePMC; PMC4422679; 25946037.
DR   EuropePMC; PMC4677746; 26673519.
DR   EuropePMC; PMC5712577; 29238357.
DR   RFAM; RF01074; RF_site1.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..5682
FT                   /organism="Cucurbit aphid-borne yellows virus"
FT                   /host="cantaloupe"
FT                   /mol_type="genomic RNA"
FT                   /country="China:Xinjiang"
FT                   /db_xref="taxon:91753"
FT   5'UTR           1..20
FT   CDS             21..740
FT                   /codon_start=1
FT                   /product="P0 protein"
FT                   /note="ORF1"
FT                   /db_xref="GOA:D7EZI7"
FT                   /db_xref="InterPro:IPR006755"
FT                   /db_xref="UniProtKB/TrEMBL:D7EZI7"
FT                   /protein_id="ACF48403.1"
FT                   /translation="MQIESVQQQLIFRPTRRASIEDRRLNTAYFLINHVFFLAQNGSKI
FT                   LFRLFLARLPLLISEQLSGDYVYTPGASKRIILARFHRHCGAPLPSSSAVDLRLPATKD
FT                   VARFFLARHYSRIMGERLQRNQTSLFGGYAEFAKFINVWCSSISCRLRESAPRDFSSSY
FT                   IFVELSNLGLSLRNLVFTSGVYNRDALARVALHVHRIYGEDGGLDFWRLANLPSKSWPF
FT                   NDARYLEGSVVQKILQR"
FT   CDS             join(142..1488,1488..3311)
FT                   /codon_start=1
FT                   /ribosomal_slippage
FT                   /product="P1 protein-P2 fusion protein"
FT                   /note="ORF1-ORF2"
FT                   /db_xref="GOA:D7EZI8"
FT                   /db_xref="InterPro:IPR000382"
FT                   /db_xref="InterPro:IPR001795"
FT                   /db_xref="InterPro:IPR007094"
FT                   /db_xref="InterPro:IPR009003"
FT                   /db_xref="UniProtKB/TrEMBL:D7EZI8"
FT                   /protein_id="ACF48405.1"
FT                   /translation="MEAKYFSAFFLLAFLCSLASSYQGTMFIPLEPVNASYWLDSTAIA
FT                   VPPSPHQVQLIYDCPPQKMLRDFSSRDITRELWGRGYNETRLAFSEAMQNLQNLLMSGV
FT                   RQSRAGLESLLHVTFRAVTYLWSSLIWASACAIWYLLREYTIEMLSLASLYMSTVYMVR
FT                   MAAWIFGDLPIFLLKAGLSMMRGISKALWFKRSYNAEKSVEGFLSFKIPQSPPKHSVLQ
FT                   VQYKDGSHAGYATCVTLYNGTNGLLTAHHVAVPGSKIVSTRNGNKIPLSEFRSIMESEK
FT                   RDLVLLAGPPNWEGTLACKAVHFQDAQSLCKSKATFYAYDGEEWTSSNADIVGIAQGKT
FT                   HASVLSNTDAGHSGTPYFNGRTVLGVHVGGAKEENANYMAPIPGIYGLTSPSYEFETTA
FT                   PQGRLFTQEEIEELIEEFSFSEITSIMGHRRFHQMHDSQRHQADYEYESGKRAGGGVRR
FT                   NNRTRCHPCGWAHKWRRTVSCCSLLPEGLHIRTSEQTGEPHCFCSIPHHQCFHRHFVGN
FT                   QTSNNGQDRRPFNREAGGSSLGEPSHEETEVAQTRQEKFEELTTDFSSFFDAQYTWEQG
FT                   GEEAPGFDKVGFLPQFYHAKQKKSSNWGDKICEQHPEMGDLTKGFGWPQFGAKAELKSL
FT                   RLQAARWLERAQSVKIPPTEEREHVIERCCRAYQAARTNGPMATRGDRLSWDNFLQDFK
FT                   QAVLSLEMDAGIGVPYIAYGKPTHRGWVEDKKLLPILARLTFSRLQKMLEVRYDDLSPA
FT                   ELVREGLCDPIRVFVKGEPHKQSKLDEGRYRLIMSVSLVDQLVARVLFQNQNKREIALW
FT                   RVVPPKPGFGLSTDEQVAEFMQILSAQVGLTPSELITEWRSHMIATDCSGFDWSVSDWL
FT                   LEDDMEVRNRLTLDLNETTRRLRAAWLYCISNSVLCLSDGTLLAQRIPGVQKSGSYNTS
FT                   SSNSRIRVMAAYHCGAEWAMAMGDDALESACSNLERYKSLGFKVEESSKLEFCSHIFEK
FT                   EDLAIPVNKAKMLYKLIHGYEPECGNVEVLINYLAACFSILNELRSDPSLVETLHQWLV
FT                   LPVQPQKI"
FT   CDS             142..2037
FT                   /codon_start=1
FT                   /product="P1 protein"
FT                   /note="ORF1"
FT                   /db_xref="GOA:D7EZI9"
FT                   /db_xref="InterPro:IPR000382"
FT                   /db_xref="InterPro:IPR009003"
FT                   /db_xref="InterPro:IPR018019"
FT                   /db_xref="UniProtKB/TrEMBL:D7EZI9"
FT                   /protein_id="ACF48404.1"
FT                   /translation="MEAKYFSAFFLLAFLCSLASSYQGTMFIPLEPVNASYWLDSTAIA
FT                   VPPSPHQVQLIYDCPPQKMLRDFSSRDITRELWGRGYNETRLAFSEAMQNLQNLLMSGV
FT                   RQSRAGLESLLHVTFRAVTYLWSSLIWASACAIWYLLREYTIEMLSLASLYMSTVYMVR
FT                   MAAWIFGDLPIFLLKAGLSMMRGISKALWFKRSYNAEKSVEGFLSFKIPQSPPKHSVLQ
FT                   VQYKDGSHAGYATCVTLYNGTNGLLTAHHVAVPGSKIVSTRNGNKIPLSEFRSIMESEK
FT                   RDLVLLAGPPNWEGTLACKAVHFQDAQSLCKSKATFYAYDGEEWTSSNADIVGIAQGKT
FT                   HASVLSNTDAGHSGTPYFNGRTVLGVHVGGAKEENANYMAPIPGIYGLTSPSYEFETTA
FT                   PQGRLFTQEEIEELIEEFSFSEITSIMGHRRFHQMHDSQRHQADYEYESGNGQAAASAE
FT                   TTAPDATPAVGRTNGDAQSPAAPSSPKVYTYAPPNKRENRTASAASPTTSASTDTLSEI
FT                   KQAIMDKIDVHSIEKQVVQALANQAMKKPKSRRRGRRSSKNSQPTSQVSSMPSTPGNKA
FT                   GKKPPDSTRSDSSPSFTTLNKRKARIGETKSANSTQKWVISQRASGGPSSARKPN"
FT   misc_feature    3312..3510
FT                   /note="intergenic non-coding region (NCR)"
FT   CDS             3511..5517
FT                   /codon_start=1
FT                   /transl_except=(pos:4108..4110,aa:OTHER)
FT                   /product="P3-P5 readthrough protein"
FT                   /db_xref="GOA:D7EZJ0"
FT                   /db_xref="InterPro:IPR001517"
FT                   /db_xref="InterPro:IPR002929"
FT                   /db_xref="UniProtKB/TrEMBL:D7EZJ0"
FT                   /protein_id="ACF48408.1"
FT                   /translation="MNTAVARNQNAGRRRRRNQRSIRRDRVVVVNPSGGPPRGRRQRRN
FT                   RRRPNRGGRARGRSPGETFVFSKDNLTGSSTGSITFGPSLSESPAFSSGILKAYHEYKI
FT                   IMVQLEFISEASSTSSGSISYELDPHCKLSSLQSTINKFGITKNGLRRWAAKQINGMEW
FT                   HDATEDQFKILYKGNGSSSVAGSFRITIKCQVQNPKXVDGSSPPPPSPSPTPPPPPPPQ
FT                   PQPQPCAQRFWGYEGNPQNKILTAENSRNIDSRPLNFVQMYKWEDEKWDKVNLQAGYSR
FT                   NDRRCMETYLTIPADKGKFHVYLEADGEFVVKHIGGELDGSWLGNIAYDVSQRGWNIGN
FT                   YKGCKIKNYQSNTTFVAGHPDATMNSKSFDSARAVEVDWYASFELECDDEEGSWAIYPP
FT                   PIQKDSSYNYTVSYGNYTEKYCEWGAISVSIDEDNNGSAPRRIPRKGAMAWSTPEPSFS
FT                   GDESQRQDFKTPSPEERGSDTLESEETKEEENLLDRFEEENIPDVDDEDIWKGISRGSE
FT                   TGTTEDDRASTSSRLRGNLKPQGLPKPQPTRTITKFNPNPDLVEAWRPDLAPGYSKADV
FT                   AAATVIAGGSIHEGRDMLRRRDEKVMDSRKKWGILSSASSLTSGSLKKLSAQSEKLAKL
FT                   TTGERAEFERIKNSHGKTVAAEYLEVVLADKTS"
FT   CDS             3511..4110
FT                   /codon_start=1
FT                   /product="P3 coat protein"
FT                   /note="ORF3"
FT                   /db_xref="GOA:D7EZJ1"
FT                   /db_xref="InterPro:IPR001517"
FT                   /db_xref="UniProtKB/TrEMBL:D7EZJ1"
FT                   /protein_id="ACF48406.1"
FT                   /translation="MNTAVARNQNAGRRRRRNQRSIRRDRVVVVNPSGGPPRGRRQRRN
FT                   RRRPNRGGRARGRSPGETFVFSKDNLTGSSTGSITFGPSLSESPAFSSGILKAYHEYKI
FT                   IMVQLEFISEASSTSSGSISYELDPHCKLSSLQSTINKFGITKNGLRRWAAKQINGMEW
FT                   HDATEDQFKILYKGNGSSSVAGSFRITIKCQVQNPK"
FT   CDS             3539..4114
FT                   /codon_start=1
FT                   /product="P4 movement protein"
FT                   /note="ORF3-ORF5"
FT                   /db_xref="InterPro:IPR001964"
FT                   /db_xref="UniProtKB/TrEMBL:D7EZJ2"
FT                   /protein_id="ACF48407.1"
FT                   /translation="MQGGGGEEISALYGATAWLWSTPLGDHRAEDDNEETADALIEEAE
FT                   LEEGAQAKHSYFQRTISRAVPQEVSPSGRLYQRAQHSALEYSRPTMNIRSSWSSWSSSP
FT                   RPLPPPRAPSLMSWTPTASLAPSNPRLINLESPRMDCDVGQLSRSTGWNGMTQLRTNSR
FT                   SSIKGTDLPRLQAASGSPSSARSKTRNR"
FT   3'UTR           5518..5682
XX
SQ   Sequence 5682 BP; 1560 A; 1442 C; 1420 G; 1260 T; 0 other;
     acaaaagata cgagcgggtg atgcaaattg agtctgtcca gcaacaacta atattcagac        60
     caacccgacg agcgagcata gaagatcgaa ggttaaacac agcatacttc ctgatcaacc       120
     acgttttctt tttggcgcaa aatggaagca aaatactttt ccgccttttt cttgctcgcc       180
     ttcctctgct cattagcgag cagttatcag gggactatgt ttatacccct ggagccagta       240
     aacgcatcat actggctcga ttccaccgcc attgcggtgc ccccctcccc tcatcaagtg       300
     cagttgatct acgactgccc gccacaaaag atgttgcgcg atttttcctc gcgcgacatt       360
     actcgagaat tatgggggag aggttacaac gaaaccagac tagccttttc ggaggctatg       420
     cagaatttgc aaaatttatt aatgtctggt gttcgtcaat ctcgtgcagg cttagagagt       480
     ctgctccacg tgacttttcg agcagttaca tatttgtgga gctctctaat ctgggcctca       540
     gcctgcgcaa tttggtattt acttcgggag tatacaatcg agatgctctc gctcgcgtcg       600
     ctttacatgt ccaccgtata tatggtgagg atggcggctt ggatttttgg cgacttgcca       660
     atcttccttc taaaagctgg cctttcaatg atgcgaggta tctcgaaggc tctgtggttc       720
     aaaagatctt acaacgctga gaagtctgtt gaaggatttc tctcattcaa gataccacaa       780
     agccctccca aacactcggt attacaggtc caatacaaag atggatctca tgccgggtac       840
     gctacctgcg tgacgctcta caatggaacg aacgggcttc tgactgccca tcacgtggct       900
     gtcccaggta gtaaaattgt ctccaccagg aatggaaaca agatcccact ctctgaattt       960
     agatcaatca tggaatctga aaagagggat ctcgtgcttt tagctggacc ccccaactgg      1020
     gagggtactc tagcttgtaa agcagtccac tttcaagatg cccaaagtct ttgcaaatca      1080
     aaagcaacct tctacgctta tgacggagaa gagtggactt catccaatgc tgacatagtc      1140
     ggcattgcgc agggcaaaac ccacgcttca gtgttaagta acacagacgc gggtcatagt      1200
     ggcaccccat acttcaatgg tagaactgtc ttgggtgtcc acgtcggtgg ggcaaaagaa      1260
     gaaaatgcca attatatggc cccgatacct ggaatttatg gtctcactag cccaagttat      1320
     gaatttgaga ccacggcacc ccaaggacgc ctgttcacac aggaagaaat agaagaactc      1380
     attgaggaat tctccttcag tgagataact tccatcatgg gacatcgccg attccatcaa      1440
     atgcacgact cgcagcgaca ccaagctgat tatgagtacg agtcgggaaa cgggcaggcg      1500
     gcggcgtccg ccgaaacaac cgcacccgat gccacccctg cggttgggcg cacaaatgga      1560
     gacgcacagt ctcctgctgc tccctcctcc ccgaaggtct acacatacgc acctccgaac      1620
     aaacgggaga accgcactgc ttctgcagca tcccccacca ccagtgcttc caccgacact      1680
     ttgtcggaaa tcaaacaagc aataatggac aagatcgacg tccattcaat cgagaagcag      1740
     gtggttcaag ccttggcgaa ccaagccatg aagaaaccga agtcgcgcag acgcggcagg      1800
     agaagttcga agaactcaca accgacttct caagtttctt cgatgcccag tacacctggg      1860
     aacaaggcgg ggaagaagcc cccggattcg acaaggtcgg attcctcccc cagttttacc      1920
     acgctaaaca aaagaaaagc tcgaattggg gagacaaaat ctgcgaacag cacccagaaa      1980
     tgggtgatct cacaaagggc ttcgggtggc cccagttcgg cgcgaaagcc gaactaaaat      2040
     cgctgcggct gcaagccgcg cgttggctgg aacgtgccca gtcagttaaa atccccccaa      2100
     ctgaggagcg ggagcacgtt atagagagat gctgtcgggc ataccaagca gccagaacta      2160
     atggcccgat ggcaacgaga ggagatcgac tttcctggga taacttccta caggatttta      2220
     aacaggcggt cctctcgtta gaaatggacg caggcatagg agttccgtac atagcttacg      2280
     gcaaaccgac ccaccgcggg tgggttgaag ataagaagct ccttccgata ttggctcgcc      2340
     ttaccttcag ccggctacag aagatgttgg aggtaaggta tgatgacttg tcgcctgcgg      2400
     agcttgtgcg agagggtctc tgtgacccta ttcgagtgtt tgtcaaaggt gaaccgcaca      2460
     agcaatccaa attagatgaa ggccgctacc gcctcatcat gagtgtctct ctagtggatc      2520
     aactggtagc ccgggtttta ttccaaaatc agaacaagcg cgagatagcg ttgtggagag      2580
     tggttccccc caaacccggt tttggcttgt ctacggatga gcaagtggcg gagttcatgc      2640
     agattctctc cgcccaggtt gggctcacgc cttcggaatt gattaccgag tggcgatccc      2700
     acatgatagc aactgactgc tccggttttg actggagcgt ttcggactgg ctccttgaag      2760
     atgatatgga ggtccgaaac cgcctgacgc tggatttaaa cgaaaccacg cgccgtttgc      2820
     gcgctgcatg gttatattgc atttcaaaca gcgtcctctg cctttcagac ggaacattat      2880
     tagcgcagag aatcccaggc gtgcagaaga gtggcagcta caatacgtca tcaagcaatt      2940
     cccgtattcg ggtgatggct gcctaccact gcggcgcaga atgggcaatg gcgatgggcg      3000
     acgatgccct ggagtcagct tgctcgaacc tcgagcgtta taaatcgctc ggtttcaaag      3060
     tcgaggagtc ctcaaaactg gaattctgtt ctcacatctt tgagaaagag gacctcgcca      3120
     ttccggtcaa caaagcaaag atgctttaca agctcataca tggctatgaa ccggaatgtg      3180
     gcaatgtgga agtgctgatt aattacttgg ccgcctgttt ctcaattctc aatgagttgc      3240
     ggtctgatcc ttcccttgtc gaaactctcc accagtggct ggtccttcca gtgcagccac      3300
     aaaagatata aggggagtat aaagaacact agccaagcac acacgagttg caagcgttgg      3360
     aagtacaagt ctcgttacca agagtccaca caatagatta caaatttctc gcaggatttt      3420
     ctagcggtct attgtctgca gtaccagtta cggtaatagg actctatttt gtctacctaa      3480
     agatttcagc acacgtgcga tcaattgtta atgaatacgg ccgtggctag aaatcaaaat      3540
     gcagggaggc ggaggcgaag aaatcagcgc tctatacggc gcgaccgcgt ggttgtggtc      3600
     aacccctctg ggggaccacc gcgcggaaga cgacaacgaa gaaaccgccg acgccctaat      3660
     cgaggaggca gagctagagg aaggagccca ggcgaaacat tcgtattttc aaaggacaat      3720
     ctcacgggca gttccacagg aagtatcacc ttcgggccgt ctctatcaga gagcccagca      3780
     ttcagctctg gaatactcaa ggcctaccat gaatataaga tcatcatggt ccagctggag      3840
     ttcatctccg aggcctcttc cacctcctcg ggctccatct cttatgagtt ggacccccac      3900
     tgcaagctta gctccctcca atccacgatt aataaatttg gaatcaccaa gaatggattg      3960
     cgacgttggg cagctaagca gatcaacggg atggaatggc atgacgcaac tgaggaccaa      4020
     ttcaagatcc tctataaagg gaacggatct tcctcggttg caggcagctt caggatcacc      4080
     atcaagtgcc aggtccaaaa cccgaaatag gtagacggca gctccccccc cccccccagt      4140
     cctagcccga ctccaccacc tccaccacct cctcagcctc aaccccagcc ttgcgctcag      4200
     cgcttttggg gttatgaagg caacccacaa aataagatac tcacggcaga aaattcgagg      4260
     aacattgatt cccggccgtt gaactttgtg caaatgtaca agtgggagga tgaaaagtgg      4320
     gacaaggtca acttacaagc cggatactcc cgcaacgatc gacgttgcat ggagacttat      4380
     ctcacaattc cagcagacaa aggaaaattt cacgtttatc ttgaagctga tggtgagttc      4440
     gtcgtcaaac atattggcgg cgagttagac ggtagttggc ttggaaatat cgcttacgat      4500
     gtttcccaga gaggttggaa tataggaaat tacaaaggct gtaaaataaa aaattatcaa      4560
     tccaatacaa cctttgtggc aggacacccc gatgctacaa tgaattcaaa aagttttgac      4620
     tcagcacggg cagttgaggt cgattggtac gcttccttcg aattagaatg tgatgatgaa      4680
     gaaggaagtt gggctatata ccccccccct atacagaaag attcttcgta taactacacc      4740
     gtttcatacg ggaattacac ggagaaatat tgcgagtggg gggctatttc agtctcaatc      4800
     gatgaagata acaatgggag cgcgcctaga agaataccac gcaagggggc aatggcgtgg      4860
     tctacccccg agccgtcttt ttcgggggat gaaagtcaaa gacaagactt taaaactcct      4920
     tcgcccgaag agcgaggttc cgatactctg gaatcggaag aaacgaagga ggaggaaaac      4980
     cttctagata ggtttgagga ggaaaatata cccgatgtcg atgatgaaga tatttggaag      5040
     ggtatttccc ggggttctga aaccgggacc acagaagatg atcgagcgtc aacaagctct      5100
     cgtcttcgtg gtaacctcaa gccacaaggc ctgccaaaac cacagcccac ccggactata      5160
     actaagttca atcctaatcc agatttggtg gaggcgtgga ggcctgacct agcacctgga      5220
     tactccaaag cggacgtcgc agcagcgacc gtgatcgcag gagggagcat ccacgaggga      5280
     cgagatatgc tcaggcgaag agacgagaaa gtgatggata gcaggaagaa atggggaatt      5340
     ctttcctcag cctcctctct cactagcggt tctctaaaga aactcagcgc gcagtcggag      5400
     aagcttgcca agttgacaac tggtgagcgt gcggaatttg agcgaatcaa gaactcgcat      5460
     ggcaagactg ttgcagcaga gtatctcgaa gtggtgctag ctgataaaac ctcataaccg      5520
     ctctgtggag acgagcgtga ctccacccgg catccagtgg gcccgaccaa atcactgatg      5580
     acatcaagcc aaagatgtaa aattggaacg actccgaaag gataggcaac gaacgttccc      5640
     accttagtgg aaacaggggg actccccctg gcatttcggt gt                         5682
//