ID   AY197548; SV 2; linear; genomic DNA; STD; VRL; 7448 BP.
XX
AC   AY197548;
XX
DT   02-JUL-2003 (Rel. 76, Created)
DT   16-APR-2005 (Rel. 83, Last updated, Version 5)
XX
DE   Enzootic nasal tumour virus of goats, complete genome.
XX
KW   .
XX
OS   Enzootic nasal tumor virus 2
OC   Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
OC   Ortervirales; Retroviridae; Orthoretrovirinae; Betaretrovirus.
XX
RN   [1]
RP   1-7448
RX   DOI; 10.1099/vir.0.19125-0.
RX   PUBMED; 12867657.
RA   Ortin A., Cousens C., Minguijon E., Pascual Z., Villarreal M.P.,
RA   Sharp J.M., Heras Mde L.;
RT   "Characterization of enzootic nasal tumour virus of goats: complete
RT   sequence and tissue distribution";
RL   J. Gen. Virol. 84(Pt 8):2245-2252(2003).
XX
RN   [2]
RP   1-7448
RA   Cousens C., Ortin A.;
RT   ;
RL   Submitted (16-DEC-2002) to the INSDC.
RL   Virology, Moredun Research Institute, Bush Loan, Penicuik, Midlothian EH26
RL   0PZ, UK
XX
RN   [3]
RC   Nucleotide and amino acid sequences updated by submitter
RP   1-7448
RA   Cousens C., Ortin A.;
RT   ;
RL   Submitted (30-JUL-2003) to the INSDC.
RL   Virology, Moredun Research Institute, Bush Loan, Penicuik, Midlothian EH26
RL   0PZ, UK
XX
DR   MD5; 9c9e7c49876d87690d1bd14f59613d93.
DR   EuropePMC; PMC3293530; 22235118.
DR   EuropePMC; PMC3909617; 24150961.
DR   EuropePMC; PMC5530571; 28747230.
DR   RFAM; RF01097; RF_site8.
DR   RFAM; RF01417; RSV_RNA.
XX
CC   On Jul 30, 2003 this sequence version replaced gi:32395333.
XX FH Key Location/Qualifiers FH FT source 1..7448 FT /organism="Enzootic nasal tumor virus 2" FT /mol_type="genomic DNA" FT /proviral FT /db_xref="taxon:2913605" FT repeat_region 14..125 FT /rpt_type=LONG_TERMINAL_REPEAT FT /note="U5 region" FT CDS 256..2094 FT /codon_start=1 FT /product="gag protein" FT /db_xref="GOA:Q7SMF1" FT /db_xref="InterPro:IPR000721" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR003322" FT /db_xref="InterPro:IPR008916" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR036875" FT /db_xref="InterPro:IPR038124" FT /db_xref="UniProtKB/TrEMBL:Q7SMF1" FT /protein_id="AAO85304.2" FT /translation="MGQMHSRQLFVHMLSVMLTHRGITVSKPKLINFLSFVEEVCPWFP FT REGTVNLETWKKVGEQIRTHYTLHGPEKVPVETFSFWTLIRDCLDFDNDELKRLGNLLK FT QEEDPXHIPDPEPGYDAPPPPPPSLKLRPSDNDDSLSSTDEEELAEEAAKYHQEDWGFL FT AQEKGALTSKNELVECLKNLTVVLQNSGAKFSNNHVKPPSAPPFPPAYAPSVVAGLDPP FT PGPSLPTEIMSPLQKALKQAQRLGEVVSDFSFAFPVFENNNQRYYEXLPFKQLKELKIA FT CSQYGPTAPFTIAMIENLGTQALPPNDWKQIARAGLSGGDYLLWKSEFFEQCARIADVN FT RQQNIQTSYEMLIGEGPYQATDTQLNFLPGAYPQISNAARQAWKKLPSSSTKTEDLSKI FT RQGPDEPYQDFVARLLDAIGKIMSDEQAGMLLAKQLAFENANSACQAALRPYRKKGDLS FT DFIRICADIGPSYMQGIAMAAALQGKSIKEVLFQQQARNKKGPQRSGNSGCFVCGQPGH FT RAAVCPQKQQNPVNTPNLCPRCKKGKHWARDCRSKTDVQGNPLPPVSGNWVRGQPLAPK FT QCYGATLQVPKEPLQTSVEPQEAAQDWTSVPPPIQY" FT CDS <1929..2855 FT /codon_start=1 FT /product="gag-pro fusion" FT /note="translated via frameshift" FT /db_xref="GOA:Q7SMF0" FT /db_xref="InterPro:IPR000467" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR029054" FT /db_xref="InterPro:IPR033704" FT /db_xref="InterPro:IPR034170" FT /db_xref="InterPro:IPR036157" FT /db_xref="UniProtKB/TrEMBL:Q7SMF0" FT /protein_id="AAO85305.2" FT /translation="SFAPGFGKLGEGPAPGPETMLWGNXAGSKRTIADLCRATRGSAGL FT DLCATSYTVLTPEMGVQTLATGVFGPLPPGTAGLLLGRSSASLKGILIHPGVIDSDYTG FT 
EIKILASAPNKIIVINAGQRIAQLLLVPLVIQGKTINRDRQDKGFGSSDAYWVQNVTEA FT RPELELRIDGKLFRGVLDTGADISVISEKYWPTTWPKQIAISTLQGIGQTTNPDQSSSL FT LTWTDKDGHTGQFKPYILPHLPVNLWGRDILSKMGVYLYNPSPTVTDLMLDQGLLSNQG FT LGKQHQGIVLPLDLKPNQERRGLGCFP" FT misc_feature 1947 FT /note="predicted start of pro" FT CDS <2822..5434 FT /codon_start=1 FT /product="gag-pro-pol fusion" FT /note="translated via frameshift" FT /db_xref="GOA:Q7SME9" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001037" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003308" FT /db_xref="InterPro:IPR010661" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR017856" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036862" FT /db_xref="UniProtKB/TrEMBL:Q7SME9" FT /protein_id="AAO85306.2" FT /translation="SRTKRFGVFSLGTSDSPVTHADPIDWKSGEPVWVDQWPLTQEKLS FT AAQQLVQEQLRLGHIEPSTSAWNSPIFVIKKKSGKWRLLQDLRKVNETMMHMGALQPGL FT PTPSAIPDRSYIIVIDLKDCFYTIPLAPQDCKRFAFSLPSVNFKEPMQRYQWKVLPQGM FT TNSPTLCQKFVAAAIAPVRQCFPQLYLVHYMDDILLAHINEHLLYQAFSLLKQHLSLNG FT LVXADEKIQTHFPYNYLGFHLYPRAYTTQLVKLQTDHLKTLNDFQKLLGDINWIRPYLK FT LPTYILQPLFDILKGNSDPASPRILSPEGRSALQFVEEAIRQQQITYCDYQRSWGLYIL FT STPRAPTGVLYQDKPLRWIYLSATPAKHLLPYYELIAKIVAKGRYEAIQYFGLEPPFIC FT VPYALEQQDWLFRYSDNWAIAFANYPGRIIHHYPSDKLLQFASSHAFIFPKIVRQQPIP FT EATLIFTDGSSNGTAALIINQQTYYAHTNFSSAQVVELFAVHQALLTVSTSFNLFTDSS FT YVVGALQMIETVPIIGTTSPEVLNLFTLIQQALHCRQHPCFFGHIRAHSTLPGALVRGN FT HTADVLTKQVFFQSAIDAARKSHNLHHQNSHSLRLQFKISREAARQIVKSCSTCPQFFV FT LPQYGVNPRGLRPNNLWQTDVTHIPHFGRLKYVHASIDTFSNFLMASLHTGESTRHCIQ FT HLLFCFSISGIPQTLKTDNGPGYTSRSFQRFCLSFQIHHKTGIPYNPQGQGIVERAHQR FT LKHQLLKQKKGSDLYSPSPHNALNHALYVLNFLTLDAEGNSAAQRFWGERSSCKKPLVR FT WKDPLTNLWYGPDPVLIWGRGHVCVFPQDAEAPRWIPERLVRAAEEFPDTSNASNDTE" FT misc_feature 2846 FT /note="approximate start of pol" FT misc_feature 4458..5063 FT /note="open reading for putative ORFx protein; present in FT related viruses; no product has been identified" FT CDS 5322..7190 FT /codon_start=1 FT 
/product="envelope protein" FT /db_xref="GOA:Q7SME8" FT /db_xref="InterPro:IPR000328" FT /db_xref="UniProtKB/TrEMBL:Q7SME8" FT /protein_id="AAO85307.2" FT /translation="MFVFFHRMPKHRVGYRKGWYARQRNSLTHQMRRMTLSEPTSELPT FT QRQIEALMRYAWNEAHVQPPVTPSNILIMLLLLLQRMQSGEAAAFWAYIPDPPTIQSLG FT WDKEVVPVYVNDTSLLGGKSDIHISPQQANISFYGLTTQYPMCFSYQSQHPHCIQVSAD FT ISYPRVTISGIDEKTGKRSYRDGTGPLDIPFCDKHLSIGIGIDTPWTLCRARVASVYNI FT NNANTTFLWDWAPGGTPDFPEYRGQHPPIFSVNTAQVYQTELWKLLAAFGHGNSLYLQS FT NVSGSKYGDVGVTGFLYPRACVPYPFMLIQGHMEITLSLNIYHLNCSNCILTNCIRGVA FT KGEQVIIVKQPAFVMLPVEITEGWYDETALELLQRINTALSRTERSVSLIVLGIVSLIT FT LIATAVTASVSLAQSIQAAHTVDSLSYNVTKVMGTQEDIDRKIEDRLSALYDVVRVLGE FT QVQSISFRMKIQCHANYKWICVTKKAYNASDFPWDKVKKHLQGIWFNTNISLDLLQLHN FT EILDIENAPKATLNIADTVDNFLQNLFSNFPNLHSLWQSILAVIIIVSVIIIVICLVPC FT LLRGLFRDFLHMRAELLHLKYRHMMQYRQLMELLGNKERGAAGDNP" FT misc_feature 5341..5342 FT /note="splice acceptor; suggests that the envelope protein FT may begin further downstream" FT repeat_region 7171..7435 FT /rpt_type=LONG_TERMINAL_REPEAT FT /note="U3 region" FT regulatory 7410..7417 FT /regulatory_class="TATA_box" FT regulatory 7427..7433 FT /regulatory_class="polyA_signal_sequence" XX SQ Sequence 7448 BP; 2114 A; 1604 C; 1488 G; 2235 T; 7 other; acaaggcatc agccattttg gtctgatcct ctcaacccca tcttttgtct gtcttttttt 60 ctcagcgagg gcgctccgtt ctttccttat acaggtgcat tccttgtcag tgctggtcgc 120 ggcaggtggc gcccaacgtg gggctcgaca gttttcctcg ccactactct tgtaaattgg 180 atagagtgag tatatgcgtg agtacgtaaa ttatattata gtggagtagt aaggtatatt 240 gttgagagta taaatatggg acaaatgcat agtcgccagc tgtttgtaca tatgttgtct 300 gtaatgctaa cacatagggg aattactgtt tctaaaccta aattaatcaa ttttctttcg 360 tttgttgaag aggtctgccc ttggttcccc agagaaggta cagtgaattt ggaaacatgg 420 aagaaggtcg gggaacaaat ccgaactcat tatactttgc atggccctga aaaggttcct 480 gttgaaactt tttccttttg gacactaatt cgtgattgtt tggattttga taatgatgag 540 ttaaaacgtt taggaaattt attaaaacag gaggaagatc ctstccatat tcctgaccca 600 gagcctggtt atgatgctcc tcctcctcct cctccgtccc tgaaattgcg accttcagat 660 
aatgatgatt cactttcctc tacagatgag gaggaattag ccgaagaagc tgctaaatac 720 catcaagaag attggggttt tttagcacaa gagaaggggg cattaacatc taaaaatgaa 780 ttggttgaat gtctgaaaaa cctcactgtt gttttacaga actcaggagc taagttttct 840 aacaatcacg tcaaacctcc ctctgctccg ccattcccgc ctgcttacgc tccttctgtt 900 gtggccggtc ttgatccccc tccagggcct tctttaccga ctgagatcat gtctccgcta 960 caaaaggcat tgaaacaagc acagcgactt ggtgaggttg tctctgattt ttcttttgct 1020 tttcctgttt ttgaaaataa caaccagcgt tattacgaay ctttgccttt taagcaatta 1080 aaagagttaa agattgcctg ttcccaatat ggtcccaccg cgcctttcac cattgctatg 1140 atagaaaatt tgggcactca agctttaccc ccaaatgatt ggaaacaaat agctagggct 1200 ggtctttcag ggggagatta tttgttgtgg aaatctgaat tttttgagca atgcgctcgt 1260 atagctgatg ttaaccgaca acaaaacata caaacctcct atgaaatgtt gattggtgaa 1320 ggcccttacc aggctactga tactcaactt aattttttac ctggtgcata tccacagata 1380 tcaaatgctg ctcggcaggc atggaaaaaa cttcctagct ctagtactaa gacagaagat 1440 ctttccaaaa tccgacaagg acctgatgaa ccttatcaag attttgtggc acgacttttg 1500 gatgctatag gtaaaataat gtcagatgaa caggctggga tgttattagc aaagcagttg 1560 gcatttgaaa atgccaactc tgcctgtcaa gctgccttaa gaccttatcg taaaaagggg 1620 gatttgtccg attttattcg tatctgtgcc gatatcgggc cctcctacat gcaaggcatt 1680 gctatggcag cagcattaca aggaaagagt ataaaggagg tacttttcca gcaacaggcc 1740 cggaataaga aaggaccaca aagatcaggt aattcgggtt gttttgtttg tggacaacct 1800 ggccatcggg ctgcagtgtg ccctcaaaaa caacaaaacc ctgttaacac tcctaatttg 1860 tgcccacgat gtaaaaaagg aaagcattgg gcacgggatt gccgttctaa aacggatgtt 1920 caaggtaatc ctttgccccc ggtttcggga aactgggtga ggggccagcc cctggccccg 1980 aaacaatgtt atggggcaac rctgcaggtt ccaaaagaac cattgcagac ctctgtcgag 2040 ccacaagagg cagcgcagga ttggacctct gtgccacctc ctatacagta ttaactcccg 2100 agatgggggt tcaaaccctt gccacaggag tgtttgggcc tttacctcca gggacagctg 2160 gattgctctt ggggcgcagt agtgcgtctt taaaaggaat acttattcat cctggtgtga 2220 ttgattctga ttatacagga gagataaaaa tattagcctc cgctcctaac aaaattattg 2280 tsatcaatgc aggacaacgt atagctcaac tccttttagt cccattagtt atacaaggaa 2340 aaacaattaa 
tagagaccgt caagataaag gtttcggatc atccgacgct tattgggtac 2400 aaaatgttac tgaggcacga ccagaacttg aattacgcat tgatggcaag ctttttcgtg 2460 gagtacttga tacgggggct gatatcagtg ttatttctga aaagtattgg cctactacat 2520 ggcctaagca gatagcaatc tctactcttc agggcattgg ccaaactact aatccggatc 2580 aaagctcgtc ccttcttact tggacagata aagatggcca tacggggcaa tttaagcctt 2640 atattttgcc ccatctccca gttaatctat ggggacgcga tatcttaagt aaaatgggtg 2700 tctatttata taatccttca cccaccgtaa cagatttaat gttagatcaa gggttacttt 2760 caaatcaggg cttaggtaaa cagcatcaag gtatcgtctt gccccttgat ttaaaaccta 2820 atcaagaacg aagaggtttg gggtgttttc cctagggacc tctgattctc ctgtgacaca 2880 tgccgacccc attgattgga aatctgggga gccggtatgg gtcgatcagt ggcccctaac 2940 acaggaaaaa ctctctgccg cacaacagct ggtgcaggaa cagctgaggc ttggacatat 3000 tgagccctct acctctgctt ggaactcccc gatttttgtg ataaaaaaga agtcagggaa 3060 atggagatta ctacaagatc ttcgtaaggt aaacgaaacc atgatgcata tgggagccct 3120 gcaacctggg ttacccactc cctctgctat acctgatagg tcttatatca ttgttataga 3180 cttaaaagat tgtttttaca ctattcctct tgcacctcaa gattgtaaaa gatttgcctt 3240 tagtttacca tctgttaatt ttaaagagcc tatgcaacgt tatcaatgga aagttctccc 3300 gcaaggcatg actaatagcc ctacgctgtg tcaaaaattt gtggcagcag caatagctcc 3360 cgttcggcaa tgttttccgc agttatactt ggttcattac atggacgata tattgcttgc 3420 ccatattaat gaacatttgt tatatcaggc cttctctctt cttaaacaac atttaagcct 3480 taatggtctt gtcaycgctg atgaaaaaat tcagactcat tttccttata attatttagg 3540 tttccattta tatcctcgtg cttatactac ccagttagta aaattacaga ccgaccattt 3600 aaaaacttta aacgattttc aaaaacttct aggagacatt aactggatac gaccttattt 3660 aaagttacct acctatattt tgcaaccatt gtttgatatc cttaaaggta actctgatcc 3720 tgcgtcaccc cgaatacttt ctccagaagg gcgatcagct ctacaatttg tagaggaagc 3780 gattagacaa caacaaatca cttattgcga ttatcaacgg tcatggggtt tatatatact 3840 ctctacccct cgagcaccca cgggggttct ctatcaagat aaacctttgc gatggatata 3900 tctatctgct actccagcta aacacctgct cccctattat gagcttatcg caaaaattgt 3960 agcgaaggga cgttatgagg ccatccagta tttcgggctg gaacccccct ttatttgtgt 4020 tccttatgcc ttggaacaac 
aggattggct ttttcgatat tcagataatt gggctatagc 4080 ttttgcaaat tatccgggac ggattattca tcattacccc tctgataaat tgttacaatt 4140 tgctagttct catgctttca tttttccaaa aatagttcgc caacaaccta ttcctgaagc 4200 aacacttata tttacagatg ggtcttccaa tggtactgca gctttaatta ttaatcaaca 4260 aacttattat gcacatacca acttttcttc tgcacaggtt gtggaattat ttgcagttca 4320 ccaggcgctg ctaactgtgt ctacctcttt taatttattt acagatagtt cctatgtggt 4380 cggcgcttta caaatgattg aaactgttcc aattattggt actacttctc ctgaagtttt 4440 aaacttattt acattaatcc aacaggccct ccactgtcgc cagcacccct gcttctttgg 4500 acatattcgc gcacattcca cccttcctgg tgccctcgta cgcggcaacc atactgcgga 4560 tgttcttacc aaacaagtgt ttttccaatc agctattgat gcagctcgaa aatcccataa 4620 cttacatcac caaaatagtc attctttacg gttgcaattt aaaatttccc gagaagctgc 4680 acgacaaatt gttaaatcct gctctacttg tcctcaattt tttgttctcc ctcaatatgg 4740 agtcaatcct cgaggtttac gtcctaacaa cctttggcaa acggacgtta ctcatattcc 4800 tcactttggs cgccttaaat atgtkcacgc ctctattgac accttttcca attttctcat 4860 ggcttccctt catactggag aatcgacacg ccactgtatt caacatttgt tattttgttt 4920 ttctatttca ggaatcccgc aaaccctcaa aacggacaat ggacctggtt atactagtcg 4980 ttcttttcaa cggttctgtc tttcctttca aattcatcat aaaacaggaa ttccctataa 5040 cccacaggga caagggattg tagaacgagc tcatcaacgt cttaaacatc aacttttaaa 5100 acagaaaaag gggagtgatt tgtatagccc ttcaccacat aatgccttga accatgctct 5160 ttatgtttta aattttttaa ctttagatgc ggaaggtaat tcagccgccc agcgtttttg 5220 gggagaacga tcctcatgta aaaaaccact tgtacgatgg aaggatccac ttaccaatct 5280 gtggtatgga ccagaccctg tattgatatg gggacgaggg catgtttgtg tttttccaca 5340 ggatgccgaa gcaccgcgtt ggataccgga aaggctggta cgcgcggcag aggaattccc 5400 tgacacatca aatgcgtcga atgacactga gtgagcctac gagtgagcta cctacccaga 5460 gacagattga ggcgctaatg cgatacgctt ggaatgaggc tcatgtacaa cctccagtga 5520 cacctagtaa tatattgatc atgttgttat tattattaca gcgaatgcaa agtggggagg 5580 ctgcggcttt ttgggcatat attcccgatc cacccacgat tcaatcctta ggatgggata 5640 aggaagtagt acctgtctat gtcaatgata caagtctttt gggaggaaaa tcagatattc 5700 atatttctcc ccagcaagct aatatctctt 
tttatggtct tactacacaa tatcctatgt 5760 gcttttctta tcaatcacaa catcctcact gtatacaggt gtcagctgat atatcttatc 5820 ctcgagtgac tatttctggc attgatgaaa aaaccggaaa aagatcgtac cgtgacggaa 5880 ccggacctct cgacattccg ttttgcgaca aacatctaag catcggcata ggaatagaca 5940 ctccttggac tttatgtcga gcccgggttg catcggtgta caacatcaac aatgccaata 6000 ccaccttttt atgggattgg gcacctgggg gaacacctga tttccctgaa tatcgaggac 6060 agcatccacc cattttctct gtaaacactg ctcaagtata tcaaacagaa ctgtggaaac 6120 ttttggctgc ttttggtcat ggcaatagcc tatacttaca atctaatgtt agcgggagta 6180 aatatggaga tgtaggagtt acgggattct tatatccacg agcatgtgtc ccttacccat 6240 ttatgttaat acaaggccat atggaaataa cactgtcatt gaacatttat catttaaatt 6300 gttctaattg catacttacc aattgcataa gaggtgttgc caaaggagaa caagttataa 6360 tagtaaaaca acctgctttt gtaatgttac ctgttgaaat aactgaaggc tggtatgatg 6420 agactgcttt agaattacta caacgcatta acacggctct tagccgcact gaaagaagtg 6480 tgagcctgat tgttctgggt atagtatctt taatcaccct tatagcaact gctgttaccg 6540 cttctgtatc tttagcacaa tccattcaag ctgctcatac tgtagattcc ttgtcatata 6600 acgttactaa ggtgatggga actcaggaag atatagatag aaaaatagaa gatagattat 6660 cagctttata tgatgtggtt agagttctag gagaacaagt tcagagcatt agtttccgta 6720 tgaaaattca atgtcatgct aattataagt ggatttgtgt tacaaaaaag gcttataatg 6780 catctgattt tccgtgggat aaggtgaaaa aacatttaca aggaatttgg tttaatacta 6840 atatttcttt agatcttcta caattacata atgagattct tgacattgaa aatgctccga 6900 aagctacctt gaatatagct gacaccgttg ataattttct acagaattta ttttctaatt 6960 tccctaacct tcattcgtta tggcagagta tacttgctgt gattataatt gtgagcgtta 7020 taattattgt aatttgttta gttccttgcc tccttcgtgg tctctttcga gatttcctac 7080 atatgagagc tgaattgcta catttgaaat atagacatat gatgcaatac cgacagctaa 7140 tggagctttt aggaaataaa gagaggggag ctgcggggga caacccgtga agggttaagt 7200 cctgggacct ccttggcaaa atgccaggac cttgggaaag tacctaaact ccctgtcccg 7260 ccaccctcag gaagtcttaa aagctcttac ggctcaaatg atcggtgttg gcataacttc 7320 atagaaaaga gcaggaaatc tgattacata aggatccggt gattgtattc tgaaaataag 7380 cagataagat caagtcacgt ataatactct atatatacta 
tgacacaata aaacaacaag 7440 gcatcagc 7448 //