ID M37980; SV 1; linear; genomic RNA; STD; VRL; 7286 BP. XX AC M37980; XX DT 01-AUG-1991 (Rel. 28, Created) DT 24-APR-2025 (Rel. 144, Last updated, Version 9) XX DE Avian leukosis virus, complete genome. XX KW . XX OS Avian leukosis virus OC Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes; OC Ortervirales; Retroviridae; Orthoretrovirinae; Alpharetrovirus. XX RN [1] RP 1-7286 RX PUBMED; 1311072. RA Bieth E., Darlix J.L.; RT "Complete nucleotide sequence of a highly infectious avian leukosis virus"; RL Nucleic Acids Res. 20(2):367(1992). XX RN [2] RP 1-7286 RA Bieth E.; RT ; RL Submitted (24-AUG-1990) to the INSDC. RL CRBGC du CNRS, 118 route de Narbonne, Toulouse 31062 Cedex, France XX DR MD5; 503a20fd625b5b80397f562b531324e4. XX FH Key Location/Qualifiers FH FT source 1..7286 FT /organism="Avian leukosis virus" FT /mol_type="genomic RNA" FT /note="submitted name=Avian leukosis virus - RSA" FT /db_xref="taxon:11864" FT 5'UTR 1..371 FT misc_feature 1..21 FT /note="5' terminal redundancy" FT misc_feature 1 FT /note="CAP site" FT regulatory 10..41 FT /regulatory_class="ribosome_binding_site" FT misc_feature 22..101 FT /note="U5 region" FT repeat_region 91..101 FT /rpt_type=TERMINAL FT /note="5'imperfect repeat" FT misc_feature 102..371 FT /note="leader sequence" FT protein_bind 102..120 FT /bound_moiety="primer" FT misc_feature 210..270 FT /note="dimer promoting sequence" FT misc_feature 218..248 FT /note="encapsidation element" FT regulatory 331..371 FT /regulatory_class="ribosome_binding_site" FT gene 372..6872 FT /gene="env" FT CDS join(372..388,5069..6872) FT /codon_start=1 FT /gene="env" FT /product="envelope protein" FT /protein_id="AAA91268.1" FT /translation="MEAVIKAFLTGYPGKTSKKDSKEKPLATSKKDPEKTPLLPTRVNY FT ILIIGVLVLCEVTGVRADVHLLEQPGNLWITWANRTGQTDFCLSTQSATSPFQTCLIGI FT PSPISEGDFKGYVSDNCTTLGTDRLVSSADFTGGPDNSTTLTYRKVSCLLLKLNVSMWD FT EPHELQLLGSQSLPNITNIAQISGITGGCVGFRPQGVPWYLGWSRQEATRFLLRHPSFS FT KSTEPFTVVTADRHNLFMGSEYCGAYGYRFWNMYNCSQVGRQYRCGNARSPRPGLPEIQ FT CTRRGGKWVNQSQEINESEPFSFTVNCTASSLGNASGCCGKAGTILPGKWVDSTQGSFT FT KPKALPPAIFLICGDRAWQGIPSRPVGGPCYLGKLTMLAPKHTDILKVLVNSSRTGIRR FT KRSTSHLDDTCSDEVQLWGPTARIFASILAPGVARAQALREIERLACWSVKQANLTTSF FT LGDLLDDVTSIRHAVLQNRAAIDFLLLAHGHGCEDVAGMCCFNLSDHSESIQKKFQLMK FT EHVNKIGVDSDPIGSWLRGLFGGIGEWAVHLLKGLLLGLVVILLLVVCLPCLLQIVCGN FT IRKMINNSISYHTEYKKLQKACGQPESRIV" FT gene 372..2474 FT /gene="MAp19" FT misc_feature 372..2474 FT /gene="MAp19" FT /note="3' imperfect repeat; matrix protein" FT gene 372..1029 FT /gene="trans-acting factor" FT CDS join(372..387,674..1029) FT /codon_start=1 FT /gene="trans-acting factor" FT /product="trans-acting factor" FT /protein_id="AAA91267.1" FT /translation="MEAVIREGGSLPQVRSASRNQQRSGESTKGRKWEKQLRSEMRRWR FT RRKWPHLKPLAHPAISAEQLLAVIAPQPRPLLLLMWGVVCILPWRGWESSRARGVTHLG FT GRNSQGRSQGTRVWPLGRP" FT protein_bind 423..440 FT /gene="trans-acting factor" FT /bound_moiety="nucleocapsid" FT protein_bind 502..511 FT /gene="trans-acting factor" FT /bound_moiety="nucleocapsid" FT misc_feature 512..540 FT /gene="trans-acting factor" FT /note="dimer linkage structure" FT misc_feature 536..556 FT /gene="trans-acting factor" FT /note="dimer linkage structure" FT misc_feature 701..792 FT /gene="trans-acting factor" FT /note="negative splicing regulator" FT protein_bind 805..818 FT /gene="trans-acting factor" FT /bound_moiety="nucleocapsid" FT misc_feature 817..876 FT /gene="trans-acting factor" FT /note="enhancer domain" FT gene 837..902 FT /gene="p2" FT protein_bind 880..897 FT /gene="p2" FT /bound_moiety="nucleocapsid" FT gene 903..1088 FT /gene="p10" FT gene 1836..2102 FT /gene="NCp12" FT misc_feature 1836..2102 FT /gene="NCp12" FT /note="nucleocapsid protein" FT protein_bind 1941..1948 FT /gene="NCp12" FT /bound_moiety="nucleocapsid" FT gene 2103..2474 FT /gene="PRp15" FT misc_feature 2103..2474 FT /gene="PRp15" FT /note="neutral protease large subunit" FT protein_bind 2388..2400 FT /gene="PRp15" FT /bound_moiety="nucleocapsid" FT gene <2495..5182 FT /gene="pol" FT CDS <2495..5182 FT /codon_start=1 FT /gene="pol" FT /product="polymerase" FT /protein_id="AAA91269.1" FT /translation="TVALHLAIPLKWKPDHTPVWIDQWPLPEGKLVALTQLVEKELQLG FT HIEPSLSCWNTPVFVIRKASGSYRLLHDLRAVNAKLVPFGAVQQGAPVLSALPRGWPLM FT VLDLKDCFFSIPLAEQDREAFAFTLPSVNNQAPARRFQWKVLPQGMTCSPTICQLVVGQ FT VLEPLRLKHPSLRMLHYMDDLLLAASSHDGLEAAGEEVISTLERAGFTISPDKIQREPG FT VQYLGYKLGSTYVAPVGLVAEPRIATLWDVQKLVGSLQWLRPALGIPPRLMGPFYEQLR FT GSDPNEAREWNLDMKMAWREIVQLSTTAALERWDPALPLEGAVARCEQGAIGVLGQGLS FT THPRPCLWLFSTQPTKAFTAWLEVLTLLITKLRASAVRTFGKEVDILLLPACFREDLPL FT PEGILLALRGFAGKIRSSDTPSIFDIARPLHVSLKVRVTDHPVPGPTAFTDASSSTHKG FT VVVWREGPRWEIKEIADLGASVQQLEARAVAMALLLWPTTPTNVVTDSAFVAKMLLKMG FT QEGVPSTAAAFILEDALSQRSAMAAVLHVRSHSEVPGFFTEGNDVADSQATFQAYPLRE FT AKDLHTALHIGPRALSKACNISMQQAREVVQTCPHCNSAPALEAGVNPRGLGPLQIWQT FT DFTLEPRMAPRSWLAVTVDTASSAIVVTQHGRVTSVAAQHHWATAIAVLGRPKAIKTDN FT GSCFTSKSTREWLARWGIAHTTGIPGNSQGQAMVERANRLLKDKIRVLAEGDGFMKRIP FT TSKQGELLAKAMYALNHFERGENTKTPIQKHWRPTVLTEGPPVKIRIETGEWEKGWNVL FT VWGRGYAAVKNRDTDKVIWVPSRKVKPDVTQKDEVTKKDEASPLFAGISDWIPWEDEQE FT GLQGETASNKQERPGEDTLAANES" FT misc_feature 2495..5179 FT /gene="pol" FT /note="reverse transcriptase beta-subunit" FT misc_feature 2495..4210 FT /gene="pol" FT /note="reverse transcriptase alpha-subunit" FT protein_bind 2950..2964 FT /gene="pol" FT /bound_moiety="nucleocapsid" FT gene 4211..5179 FT /gene="INp32" FT misc_feature 4211..5179 FT /gene="INp32" FT /note="integrase" FT protein_bind 4747..4758 FT /gene="INp32" FT /bound_moiety="nucleocapsid" FT gene 5244..6260 FT /gene="SU" FT misc_feature 5244..6260 FT /gene="SU" FT /note="surface protein" FT protein_bind 5712..5727 FT /gene="SU" FT /bound_moiety="nucleocapsid" FT protein_bind 6187..6201 FT /gene="SU" FT /bound_moiety="nucleocapsid" FT gene 6261..6869 FT /gene="TM" FT misc_feature 6261..6869 FT /gene="TM" FT /note="transmembrane protein" FT protein_bind 6414..6425 FT /gene="TM" FT /bound_moiety="nucleocapsid" FT repeat_region 7037..7047 FT /rpt_type=TERMINAL FT /note="3'imperfect repeat" XX SQ Sequence 7286 BP; 1752 A; 1768 C; 2114 G; 1652 T; 0 other; gccatttgac cattcaccac attggtgtgc acctgggttg atggccggac cgttgattcc 60 ctgacgacta cgagcacctg catgaagcag aaggcttcat ttggtgaccc cgacgtgata 120 gttagggaat agtggtcggc cacagacggc gtggcgatcc tgtctccatc cgtctcgtct 180 atcgggaggc gagttcgatg accctggtgg agggggctgc ggcttaggga ggcagaagct 240 gagtaccgtc ggagggagct ccagggcccg gagcgactga cccctgccga gaactcagag 300 ggtcgtcgga agacggagag tgagcccgac gaccacccca ggcacgtctt tggtcggcct 360 gcggatcaag catggaagcc gtcattaagg tgatttcgtc cgcgtgtaaa acctattgcg 420 ggaaaatctc tccttctaag aaggaaatag gggccatgtt gtccctgtta caaaaggaag 480 ggttgcttat gtctccctca gatttatatt ccccggggtc ctgggatccc atcactgcgg 540 cgctctccca gcgggcaatg gtacttggga aatcgggaga gttaaaaacc tggggattgg 600 ttttgggggc attgaaggcg gctcgagagg aacaggttac atctgagcaa gcaaagtttt 660 ggttgggatt agggggaggg agggtctctc ccccaggtcc ggagtgcatc gagaaaccag 720 caacggagcg gcgaatcgac aaaggggagg aagtgggaga aacaactgcg cagcgagatg 780 cgaagatggc gccggagaaa atggccacac ctaaaaccgt tggcacatcc tgctatcagt 840 gcggaacagc tactggctgt aattgcgcca cagcctcggc ccctcctcct ccttatgtgg 900 ggagtggttt gtatccttcc ctggcggggg tgggagagca gcagggccag gggggtgaca 960 caccttgggg ggcggaacag ccaagggcgg agccagggca cgcgggtctg gcccctgggc 1020 cggccctgac tgactgggca aggatcaggg aggagcttgc gagtacaggt ccgcccgtgg 1080 tggccatgcc tgtagtgatt aagacagagg gacccgcctg gacccctctg gagccaaaat 1140 tgatcacaag actggctgat acggtcagga ccaagggctt acgatccccg atcactatgg 1200 cagaagtgga agcgctcatg tcctccccgt tgctgccgca tgacgtcacg aatctaatga 1260 gagtgatttt aggacctgcc ccatatgcct tatggatgga cgcttgggga gtccaactcc 1320 agacggttat agcggcagcc actcgcgacc cccgacaccc agcgaacggt caagggcggg 1380 gggaacggac taacttggat cgattaaagg gcttagctga tgggatggtg ggcaacccac 1440 agggtcaggc cgcattatta agaccggggg aattggttgc tattacggcg tcggctctcc 1500 aggcgtttag agaagttgcc cggctggcgg aacctgcagg tccatgggcg gacatcacgc 1560 agggaccatc tgagtccttt gttgattttg ccaatcggct tataaaggcg gttgaggggt 1620 cagatctccc gccttccgcg cgggctccgg tgatcattga ctgctttagg cagaagtcac 1680 agccagatat tcagcagctt atacgggcag caccctccac gctgaccacc ccaggagaga 1740 taatcaaata tgtgctagac aggcagaaga ttgcccctct tacggatcaa ggcatagccg 1800 cggccatgtc gtctgctatc cagcccttag ttatggcagt agtcaataga gagagggatg 1860 gacaaactgg gtcgggtggt cgtgcccgag ggctctgcta cacttgtgga tccccgggac 1920 attatcaggc gcagtgcccg aaaaaacgaa agtcaggaaa cagccgtgag cgatgtcagc 1980 tgtgtgacgg gatgggacac aacgctaaac agtgcagaag gcgggatggc aaccagggcc 2040 aacgcccagg aaaaggcctc tcttcggggt cgtggcccgt ctctgagcag cctgccgtct 2100 cgttagcgat gacaatggaa cataaagatc gccccttggt tagggtcatt ctgactaaca 2160 ctgggagtca tccggtcaaa cagcgttcgg tgtatatcac cgcgctgttg gactctggag 2220 cggacatcac tattatttca gaggaggact ggcccaccga ttggccagtg atggaggccg 2280 cgaacccgca gatccatggg ataggagggg gaattcccat gcgaaaatcc cgggatatga 2340 tagaggtggg ggttattaac cgagacgggt ctttggagcg acccctgctc ctcttccccg 2400 cagtagctat ggttagaggg agtatcctag gaagagattg tctgcagggc ctagggctcc 2460 gcttgacaaa tttataggga gggccactgt tcttactgtt gcgctacatc tggctattcc 2520 gctcaaatgg aagccagacc acacgcctgt gtggattgac cagtggcccc ttcctgaagg 2580 taaacttgta gcgctaacgc aattagtgga aaaagaatta cagttaggac atatagaacc 2640 ttcacttagt tgttggaaca cacctgtctt tgtgatccgg aaggcttccg ggtcttatcg 2700 cttattgcat gacttgcgcg ctgttaacgc caagcttgtt ccttttgggg ccgtccaaca 2760 gggggcgcca gttctctccg cgctcccgcg tggctggccc ctgatggtcc tagacctcaa 2820 ggattgcttc ttttctattc ctcttgcgga acaagatcgc gaagcttttg catttacgct 2880 cccctctgtg aataaccagg cccccgctcg aagattccaa tggaaggtct tgccccaagg 2940 gatgacctgt tctcccacta tctgtcagtt ggtggtgggt caggtacttg agcccttgcg 3000 actcaagcac ccatctctgc gcatgttgca ttatatggat gatcttttgc tagccgcctc 3060 aagtcatgat gggttggaag cggcagggga ggaggttatc agtacattgg aaagagccgg 3120 gttcaccatt tcgcctgata agatccagag ggaacccgga gtacaatatc ttgggtacaa 3180 gttaggcagt acgtatgtag cacccgtagg cctggtagca gaacccagga tagccacctt 3240 gtgggatgtt caaaagctgg tggggtcact tcagtggctt cgcccagcgt taggaatccc 3300 gccacgactg atgggcccct tctatgagca gttacgaggg tcagatccta acgaggcgag 3360 ggaatggaat ctagacatga aaatggcctg gagagagatc gtacagctta gcaccactgc 3420 tgccttggaa cgatgggacc ctgccctgcc tttggaggga gcggtcgcta ggtgtgaaca 3480 gggggcaata ggggtcctgg gacagggact gtccacacac ccaaggccat gtttgtggtt 3540 attctccacc caacccacca aggcgtttac tgcttggtta gaagtgctca cccttttgat 3600 tactaagcta cgcgcttcgg cagtgcgaac ctttggcaag gaggttgata tcctcctgtt 3660 gcctgcatgc tttcgggagg accttccgct cccggagggg atcctgttag cccttagggg 3720 gtttgcagga aaaatcagga gtagtgacac gccatctatt tttgacattg cgcgtccact 3780 gcatgtttct ctgaaagtga gggtcaccga ccaccctgta ccgggaccca ctgcctttac 3840 cgacgcctcc tcaagcaccc ataaaggggt ggtagtctgg agggagggcc caaggtggga 3900 gataaaagaa atagctgatt tgggggcaag tgtacaacaa ctggaagcac gcgctgtggc 3960 catggcactt ctgctgtggc cgacaacgcc cactaatgta gtgactgact ccgcgtttgt 4020 cgcgaaaatg ttactcaaga tgggacagga gggagtcccg tctacagcgg cggcttttat 4080 tttagaggat gcgttaagcc aaaggtcagc catggccgcc gttctccacg tgcggagtca 4140 ttctgaagtg ccagggtttt tcacagaagg aaatgacgtg gcagatagcc aagccacctt 4200 tcaagcgtat cccttgagag aggctaaaga tcttcatact gctctccata ttggaccccg 4260 cgcgctatcc aaagcgtgta atatatctat gcagcaggct agggaggttg ttcagacctg 4320 cccgcattgt aattcagccc ctgcgttgga ggccggggta aaccctaggg gtttgggacc 4380 cctacagata tggcagacag actttacgct tgagcctaga atggcccccc gttcctggct 4440 cgctgttact gtggataccg cctcatcggc gatagtcgta actcagcatg gccgtgtcac 4500 atcggttgct gcacaacatc attgggccac ggctatcgcc gttttgggaa gaccaaaggc 4560 cataaaaaca gataatgggt cctgtttcac gtctaaatcc acgcgagagt ggctcgcgag 4620 atgggggata gcacacacca ccgggattcc gggtaattcc cagggtcaag ctatggtaga 4680 gcgggccaac cggctcctga aagataagat ccgtgtgctt gcggaggggg acggctttat 4740 gaaaagaatc cccaccagca aacaggggga actactagcc aaggcaatgt atgccctcaa 4800 tcactttgag cgcggtgaaa acacaaaaac accgattcaa aaacactgga gacctaccgt 4860 tcttacagaa ggacccccgg ttaaaatacg aatagagaca ggggagtggg aaaaaggatg 4920 gaatgtgctg gtctggggac gaggttatgc agctgtgaaa aacagggaca ctgataaggt 4980 tatttgggta ccctctcgga aagttaaacc ggatgtcacc caaaaggatg aggtgactaa 5040 gaaagatgag gcgagccctc tttttgcagg catttctgac tggataccct gggaagacga 5100 gcaagaagga ctccaaggag aaaccgctag caacaagcaa gaaagacccg gagaagacac 5160 ccttgctgcc aacgagagtt aattatattc tcattattgg tgtcctggtc ttgtgtgagg 5220 ttacgggggt aagagctgat gtccacttac tcgagcagcc agggaacctt tggattacat 5280 gggccaaccg tacaggccaa acggattttt gcctctctac acagtcagcc acctcccctt 5340 ttcaaacatg tttgataggt atcccgtccc ctatttccga aggtgatttt aagggatacg 5400 tctctgataa ttgcaccacc ttgggaactg atcggttagt ctcgtcagcc gactttactg 5460 gcggacctga caacagtacc accctcactt atcggaaggt ctcatgcttg ttgttaaagc 5520 tgaatgtctc tatgtgggat gagccacatg aactacagct gttaggttcc cagtctctcc 5580 ctaacattac taatattgct cagatttccg gtataaccgg gggatgcgta ggtttcagac 5640 cacaaggggt tccttggtat ctaggttggt ctagacagga ggccacgcgg tttctcctta 5700 gacacccctc tttctctaaa tccacggaac cgtttacggt ggtgacagcg gataggcaca 5760 atctttttat ggggagtgag tactgcggtg catatggcta cagattttgg aacatgtata 5820 actgctcaca ggtggggcgg cagtaccgct gtggtaatgc gcgcagcccc cgcccgggtc 5880 ttcctgaaat ccagtgtaca aggagaggag gcaaatgggt taatcaatca caggaaatta 5940 atgagtcgga gccgttcagc tttacggtga actgtacagc tagtagtttg ggtaatgcca 6000 gtgggtgttg cggaaaagca ggcacgattc tcccgggaaa gtgggtcgac agcacacaag 6060 gtagtttcac caaaccaaaa gcgctaccac ccgcaatttt cctcatttgt ggggatcgcg 6120 catggcaagg aattcccagt cgtccggtag ggggcccctg ctatttaggc aagcttacca 6180 tgttagcacc taagcataca gatattctca aggtgcttgt caattcatcg cggacaggta 6240 taagacgtaa acgaagcacc tcacacctgg atgatacatg ctcagatgaa gtgcagcttt 6300 ggggtcctac agcaagaatc tttgcatcta tcttagcccc gggggtagca cgtgcgcaag 6360 ccttaagaga aattgagaga ctagcctgtt ggtccgttaa acaggctaac ttgacaacat 6420 cattcctcgg ggacttattg gatgatgtca cgagtattcg acacgcggtc ctgcagaacc 6480 gagcggctat tgacttcttg cttctggctc acggccatgg ctgtgaggac gttgccggaa 6540 tgtgttgttt caatctgagt gatcacagtg agtctataca gaagaagttc cagctaatga 6600 aggaacatgt caataagatc ggcgtggaca gcgacccaat cggaagttgg ctgcgaggat 6660 tattcggggg aataggggaa tgggccgttc atttgctgaa aggactgctt ttggggcttg 6720 tagttatttt gttgctagta gtgtgcctgc cttgcctttt gcaaatcgta tgcggtaaca 6780 tcagaaagat gattaataac tccatcagct accacacgga atataagaag ctgcaaaagg 6840 cctgtgggca gcctgaaagc agaatagtat aaggcagtac atgggtggtg gtatagcgct 6900 tgcgagtcgg gttgtaacgg ggcatggctt aactaagggg actatggcat gtataggcgc 6960 aaagcggggc ttcggttgta cgcggttagg agtcccctca ggatatagta gtttcgcttt 7020 tgcataggga gggggaaatg tagtcttatg caatactctt gtagtcttgc aacatggtaa 7080 cgatgagtta gcaacatgcc ttacaaggag agaaaaagca ccgtgcatgc cgattggtgg 7140 aagtaaggtg gtacgatcgt gccttattag gaaggcaaca gacgggtctg acatggattg 7200 gacgaaccac tgaattccgc attgcagaga tattgtattt aagtgcctag ctcgatacaa 7260 taaacgccat ttgaccattc accaca 7286 //