ID EU030001; SV 1; linear; genomic DNA; STD; VRL; 8406 BP. XX AC EU030001; XX DT 05-AUG-2007 (Rel. 92, Created) DT 05-AUG-2007 (Rel. 92, Last updated, Version 1) XX DE RD114 retrovirus isolate SC3C, complete genome. XX KW . XX OS RD114 retrovirus OC Viruses; Ortervirales; Retroviridae; Orthoretrovirinae; Gammaretrovirus; OC unclassified Gammaretrovirus. XX RN [1] RP 1-8406 RA Ghani K., Caron M.-C., Caruso M.; RT "RD114-based retroviral vectors for gene therapy"; RL Unpublished. XX RN [2] RP 1-8406 RA Ghani K., Caron M.-C., Caruso M.; RT ; RL Submitted (11-JUL-2007) to the INSDC. RL Medical Biology, Laval University, 9, rue McMahon, Quebec, QC G1R 2J6, RL Canada XX DR MD5; cb478712b455e4878378a32c64966699. DR EuropePMC; PMC2786842; 19812152. DR EuropePMC; PMC2822922; 20075164. DR EuropePMC; PMC3421742; 22674983. DR EuropePMC; PMC4734002; 26637454. DR EuropePMC; PMC5625498; 28768854. DR RFAM; RF01073; GP_knot1. XX FH Key Location/Qualifiers FH FT source 1..8406 FT /organism="RD114 retrovirus" FT /isolate="SC3C" FT /mol_type="genomic DNA" FT /country="USA" FT /proviral FT /db_xref="taxon:11834" FT repeat_region 1..482 FT /rpt_type=LONG_TERMINAL_REPEAT FT gene 947..6166 FT /gene="gag" FT CDS 947..6166 FT /codon_start=1 FT /transl_except=(pos:2597..2599,aa:Glu) FT /gene="gag" FT /product="gag-pol precursor polyprotein" FT /db_xref="GOA:A7LKA6" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR000840" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR001969" FT /db_xref="InterPro:IPR001995" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR003036" FT /db_xref="InterPro:IPR008919" FT /db_xref="InterPro:IPR010999" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR015416" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036875" FT /db_xref="InterPro:IPR036946" FT /db_xref="InterPro:IPR040643" FT /db_xref="InterPro:IPR041577" FT /db_xref="UniProtKB/TrEMBL:A7LKA6" FT /protein_id="ABS71856.1" FT /translation="MGQTLTTPLTLTLTHFSDVQARAHNLSLEVRKGRWRTYCSSEWPT FT LSVGWPRDGTFDLSIILQVKTKVMDPGPRGHPDQIAYILTWEDLIRNPPAWVKPFLPSC FT PLSQSTLLPLKTSKDRASTQPSAPPKPPKPVLPDESQKDPPLLDALSSPPHNPLLQPPP FT YNPPLAPALTPVGPTSPPASSSSSLSPTSSPACTSAPSSTPAPPDLTPQTPPQTPRLRL FT RRLDDPNGPPTWQSSLFPLRTVNRTVQYWPFSASDLYNWKTHNPSFSQEPQALTSLIES FT ILLTHQPTWDDCQQLLQVLLTTEERQRVLLEARKNVPGPGGFPTQLPNEIDEGFPLTRP FT DWDYETAPGRESLRIYRQALLAGLKGAGKRPTNLAKVRTIIQGKEESPAAFMERLLEGF FT RMYTPFTPEAPEHKATVAMSFIDQAASDIKGKLQRLDGIQTYGLQELVREAEKVYNKRE FT TPEEKEARLAKEQEAREERRDRKRDKHLTKILAAVVTENRTGKSGETKRRPKIEKDQCA FT YCKERGHWIKDCPKRPRDSKKSTPVLTLGEESEEGRQGSGAPPEPRLTLSVGGHPTTFL FT VDTGAQHSVLTKANGPLSSRTSWVQGATGRKIHKWTNRRTVDLGQGTVTHSFLVVPECP FT YPLLGRDLLTKLGAQIHFSETGAQVLNRDGQPIQVLTVSLHDEHRLFETPVTTNLLEAW FT LQDFPQAWAETGGLGRAKCQAPIIIDLKPTAMPVSIRQYPMSKEAHMGIQPHITRFLEL FT GVLRPCRSPWNTPLLPVKKPGTRDYRPVQDLREVNKRTMDIHPTVPNPYNLLSTLSPDR FT TWYTVLDLKDAFFCLPLAPQSQELFAFEWRDPERGISGQLTWTRLPQGFKNSPTLFDEA FT LHRDLTDFRTQHPEVTLLQYVDDLLLAAPTEEACTRGTKHLLRELGDKGYRASAKKAQI FT CQTKVTYLGYILSEGKRWLTPGRIETVARIPPPQSPREVREFLGTAGFCRLWIPGFAEL FT AAPLYALTKESAPFTWQEKHQSAFEALKEALLSAPALGLPDTSKPFTLFIDEKQGIAKG FT VLTQKLGPWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVKDSAKLTLGQPLTVITPH FT ALEAIVRQPPDRWITNARLTHYQALLLDTDRVQFGPPVTLNPATLLPAPKDQQSAHDCR FT QVLAETHGTREDLKDQELPDADHSWYTDGSSYIDSGTRRAGAAVVDGHHIIWAQSLPPG FT TSAQKAELIALTKALELSEGKKANIYTDSRYAFATAHTHGSIYERRGLLTSEGKEIKNK FT AEIIALLKALFLPRRVAIIHCPGHQKGQDPIATGNRQADQVARQVAVAETLTLTKLEET FT NLTTNKYAYTPEDQEEAKAIGAILNQDTKDWEKEGKIVLPRKEALAMIQQMHAWTHLSN FT RKLKSLIEKTDFLIPKAGTLIEQVTSACKVCQQVNAGATRVPEGKRTRGNRPGVYWEID FT FTEVKPHYAGYKYLLVFVDTFSGWVEAYPTRQETAHVVAKKILEEIFPRFGLPKVIGSD FT NGPAFVSQVSQGLARTLGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWR FT RLLSLALLRARNTPNRFGLTPYEILYGGPPPLSTLLNSFSPSDPKTDLQARLKGLQAVQ FT AQIWTPLAELYRPGHPQTSHPFQVGDSVYVRRHRSQGLEPRWKGPYIVLLTTPTAIKVD FT GIAAWIHASHAKAAPKTPGPETPKTWKLRRSENPLKIRLSRV" FT CDS 6201..7898 FT /codon_start=1 FT /product="envelope glycoprotein" FT /db_xref="GOA:A7LKA7" FT /db_xref="InterPro:IPR018154" FT /db_xref="UniProtKB/TrEMBL:A7LKA7" FT /protein_id="ABS71857.1" FT /translation="MKLPTGMVILCSLIIVRAGFDDPRKAIALVQKQHGKPCECSGGQV FT SEAPPNSIQQVTCPGKTAYLMTNQKWKCRVTPKNLTPSGGELQNCPCNTFQDSMHSSCY FT TEYRQCRANNKTYYTATLLKIRSGSLNEVQILQNPNQLLQSPCRGSINQPVCWSATAPI FT HISDGGGPLDTKRVWTVQKRLEQIHKAMHPELQYHPLALPKVRDDLSLDARTFDILNTT FT FRLLQMSNFSLAQDCWLCLKLGTPTPLAIPTPSLTYSLADSLANASCQIIPPLLVQPMQ FT FSNSSCLSSPFINDTEQIDLGAVTFTNCTSVANVSSPLCALNGSVFLCGNNMAYTYLPQ FT NWTGLCVQASLLPDIDIIPGDEPVPIPAIDHYIHRPKRAVQFIPLLAGLGITAAFTTGA FT TGLGVSVTQYTKLSHQLISDVQVLSGTIQDLQDQVDSLAEVVLQNRRGLDLLTAEQGGI FT CLALQEKCCFYANKSGIVRNKIRTLQEELQKRRESLASNPLWTGLQGFLPYLLPLLGPL FT LTLLLILTIGPCVFSRLMAFINDRLNVVHAMVLAQQYQALKAEEEAQD" FT repeat_region 7925..8406 FT /rpt_type=LONG_TERMINAL_REPEAT XX SQ Sequence 8406 BP; 2236 A; 2488 C; 1902 G; 1780 T; 0 other; tgagaagtca gaacccccca cctttgctac ataaataacc gctttcattt cgcttctgta 60 aaaccgctta tgcgccccac cctaaccgct ttcatttcgc ttctgtaaaa ccgcttatgc 120 gccccaccct agccggaaag tccccagccg ctacgcaacc cgggccccga gttgcatcag 180 ccgcttcgca acccgggctc cgagttgcat cagccgaaag aaacttcatt tcccaagctt 240 ccccgggacg aaattaccca caaccccaac caccaccgag cagagagcct atcgccagcc 300 tgtatgcaaa tgtaactcaa aatggtataa aagacctgta accccgttta tcggggctct 360 cccgctttct aacactgggg agccctggtg caccagtaaa gactctctgc cgacgtcgga 420 gtgccgcgtg gttctttgcg ccaactctca ttccataggg cctaggagtt tggctcctaa 480 catttggtgc attggccggg aaaccgaggg aaggcaagag aaccccggcc ccgtcggtgg 540 atgactctcc ggcccgggcc actgacggac tgactgaaga cagacttctg tgcctttgtc 600 tgcacacagg tattgtgttc tatgttttgt ctctgaactg tgaattctga ggccggccgg 660 ccacctccgt aggagtgttg aaggaggaca gacgtgtcct gaaccttcac accccagccc 720 cgagggacgc ctcggtggtg tttgtagggg agaactgaca agttcgtcag actccccaaa 780 tccgtaggca ggcctcccct gccatctgaa tctgaatact ggccgcggct ccttggttat 840 tgtctagtct gtgttgtctg gtctgtgtcg acattgtagt catctttgtg tatgtgtgtg 900 taagtgtgct ggtgtggacg cgtggacagg acaggacgac acagccatgg gacagacatt 960 gactactcct ctaactttga ctctgactca tttctcggac gtccaagcaa gagctcacaa 1020 tctttccctt gaggttcgga aaggtcgatg gcgaacttat tgttcatcag agtggcccac 1080 cctatccgta gggtggcccc gggatgggac ttttgacctc tctattatat tacaggttaa 1140 gacgaaggta atggatcccg gaccacgagg tcatccagat caaatagcct atatcctcac 1200 gtgggaggat ctcattcgga atcctccggc atgggtgaaa cccttcctcc cttcctgccc 1260 cctatctcag tctacccttc tccccctgaa aacctcaaaa gaccgagcct caacccaacc 1320 ttcagcccca cccaaaccgc ccaaaccagt tcttcctgat gagtcccaaa aggaccctcc 1380 tctcctagac gctttgtcct cgccgcctca caaccccctc ctacagcctc caccctataa 1440 tccgcccctg gcccccgcct tgaccccggt gggccctacg tcccctcctg cctccagttc 1500 ctcctccctc tccccaactt cttcccccgc ctgtacgtca gccccgtcct ctaccccagc 1560 cccacccgat ctaactcccc agaccccacc tcagaccccc cgcctccgtc tccggcgatt 1620 ggatgaccca aacggccctc ccacttggca atcttccctg tttcctcttc ggacagtcaa 1680 ccggactgtc caatattggc cattctctgc ctccgatctt tataattgga aaactcataa 1740 cccttccttt tcccaagaac cccaggccct aacctcactg atagaatcca tcctcctcac 1800 acaccagccc acctgggacg actgccagca actcctacag gtccttttga ctacagagga 1860 aaggcaacga gttctcctgg aggcccggaa aaatgtgcca gggcctggag gctttccaac 1920 ccagctcccc aatgaaatag atgaggggtt tccccttact cgcccggact gggactatga 1980 aacggctcca ggtagggaga gtctccgaat ctatcgccag gctctgttgg cgggtctcaa 2040 gggggcagga aaacggccca ccaatttggc caaggtaaga actataattc aaggaaaaga 2100 ggaaagcccc gcagccttta tggaacggct tctagagggg tttcggatgt atactccatt 2160 tacgcccgaa gctccagaac ataaggctac agtggccatg tcgttcatag atcaggcagc 2220 atccgatata aaaggaaaac tccaaaggtt agatgggatc caaacctatg ggttgcaaga 2280 actagttagg gaggcagaaa aggtatataa caagagggaa actccagaag aaaaggaggc 2340 taggctagca aaagagcagg aagcacggga ggaacgaaga gatcgaaaga gagataaaca 2400 tctaactaaa atcctggcag ctgtagtgac agaaaataga acaggaaagt caggggaaac 2460 gaagaggcgg cccaaaatag aaaaagacca atgtgcctat tgcaaagagc gcgggcactg 2520 gatcaaagac tgccctaagc gccctagaga ctcaaagaaa tctacccccg tactcacctt 2580 gggcgaagag agcgagtagg gacgtcaggg ctctggagcc ccccccgagc cccggctaac 2640 tttatctgta ggggggcatc ccaccacctt cctggtggac acgggagctc aacactcggt 2700 cttaaccaag gcaaatgggc ccctgtcctc gcgtacctcc tgggtccaag gagcgacagg 2760 aaggaaaatt cacaaatgga ctaaccgccg cacggttgac ctagggcaag ggacagtgac 2820 acactccttt ctggtggtac ccgaatgccc gtaccccctt ctgggacgag acctcctgac 2880 caagcttgga gcccagatac acttctccga aacgggggcc caagtgctaa accgggatgg 2940 ccagcctatc caagtcttga ctgtgtcctt acacgacgaa catagacttt ttgaaacccc 3000 ggtcaccact aacctcctcg aagcctggct gcaggacttt ccccaggcct gggcagaaac 3060 gggagggctc ggtcgagcca aatgtcaggc cccaatcata attgacctaa agcccacggc 3120 aatgcctgta tctatcagac agtatcccat gagcaaggag gctcatatgg gcattcagcc 3180 acacattacc aggtttctag agcttggggt cctgcgacct tgccgctcac cttggaatac 3240 ccctctctta ccagtgaaaa aacctggtac tcgggactac aggcccgtcc aagacttaag 3300 ggaagtcaac aaaaggacta tggatatcca tcctacggtc cccaatccct ataacttgct 3360 cagcactctg agcccagacc gcacctggta cacagtactg gacctaaaag atgcattctt 3420 ttgcttaccc ttggcccccc agagccagga actgtttgct ttcgaatgga gggaccctga 3480 aagaggaatc tcaggccaat taacctggac ccgcttaccc caggggttca aaaactcccc 3540 cactctcttt gatgaggctc tccacaggga cctaactgac ttccggactc aacacccaga 3600 agtaactcta ctccaatatg tggatgacct tcttctggcc gcccccacag aggaagcctg 3660 cacacgaggt accaaacatc tgcttcgaga attgggagac aaaggatacc gggcatccgc 3720 taagaaggcc cagatttgcc aaaccaaagt aacctatctg gggtacatac taagtgaggg 3780 gaaaagatgg ctcacccctg ggcgaataga aactgtggct cgcattccgc cgccccagag 3840 ccccagagag gtacgtgaat tcctgggaac tgctgggttc tgtcgcttat ggatacctgg 3900 gttcgctgag ctagccgccc ccctctatgc cctcacgaaa gagagtgcac ccttcacctg 3960 gcaggaaaaa catcagtcag cctttgaggc cctaaaagag gccctccttt ccgccccggc 4020 tctcgggttg ccagacacct ccaagccctt taccctcttc atagacgaga aacaaggaat 4080 tgccaaagga gttctaaccc aaaaactagg gccctggaag agaccagtag catacctgtc 4140 caaaaagttg gaccctgtgg cagcggggtg gcccccatgt cttcgtatca tggcagccac 4200 tgctatgctg gtcaaggact ctgccaaatt aacccttgga cagccactaa ctgttatcac 4260 cccgcatgct ttagaggcca tagtgcggca accaccagac cgatggataa ccaacgcacg 4320 cctaactcac taccaggccc tcctactgga cacagaccgc gtccaatttg gacctccggt 4380 taccttgaac cctgccacgt tgctaccggc accgaaagac caacagagcg cacacgattg 4440 tcggcaagta ctggctgaga cccatgggac acgggaagat cttaaagatc aagagctccc 4500 agatgcggat cactcttggt atacggacgg gagcagttac atcgactcag gtacccggag 4560 ggcgggagcg gcggtagtag atggccacca catcatatgg gcacaatcac tacctcctgg 4620 cacgtctgcc caaaaggctg agctgatagc actcaccaag gccctagagc tatccgaagg 4680 gaaaaaggct aacatatata cagatagccg atatgccttt gcaacggctc atacacatgg 4740 gagtatttat gaaagaaggg gcctactaac ctcagaagga aaagaaatta aaaataaagc 4800 tgaaatcatt gcattattaa aggccctttt tcttccccgg agagtagcca taattcattg 4860 ccccgggcat cagaaaggac aagacccaat tgcaacagga aatagacagg cagaccaagt 4920 agccaggcaa gtcgctgtgg cagaaacgtt gactttgacc aagcttgaag aaaccaatct 4980 tacgaccaac aaatatgctt acaccccaga agaccaggaa gaggcaaagg ccataggggc 5040 tatactgaat caggacacta aagactggga aaaagaagga aagatagtcc ttccccgaaa 5100 ggaagcccta gcaatgatcc aacaaatgca cgcctggaca catttaagta atcggaagct 5160 gaagtcactg attgaaaaga ctgacttctt aatccctaag gcaggcaccc tcatagaaca 5220 ggtgacctcc gcctgtaagg tctgtcaaca ggtaaacgct ggggctaccc gagtgccaga 5280 aggaaaacgg actcgtggta accgcccagg agtctattgg gaaatagatt ttactgaagt 5340 aaagcctcac tatgcggggt ataagtacct attagtgttt gtagacacct tttcaggatg 5400 ggtagaagcc taccccaccc gacaggaaac ggcacacgtg gtagccaaga agattttaga 5460 agaaatcttc cccagatttg gacttcccaa ggtaatcggg tcagataacg ggccggcctt 5520 cgtttcccag gtaagtcagg ggctcgccag gacactgggg attaattgga aattacactg 5580 tgcatatagg ccccagagct caggacaggt agaaagaatg aatagaacaa taaaagagac 5640 ccttactaaa ttgaccttag agactggctt aaaagattgg agacgcctcc tatctctggc 5700 tttgttaaga gccagaaata cgcccaaccg tttcgggctc accccatatg aaatccttta 5760 tgggggaccc ccccctttgt caaccttgct caattccttc tccccctccg atcctaagac 5820 tgatttacaa gcccgactaa aagggctgca agcggtgcag gcccaaatct ggacacccct 5880 ggccgaattg taccggccag gacatccaca aactagccac ccatttcagg tgggagactc 5940 cgtgtacgtc cggcggcacc gctctcaagg attggagcct cgttggaagg gaccttacat 6000 cgtcctgctg accacgccca ccgccataaa ggttgacggg atcgccgcct ggattcacgc 6060 atcgcacgcc aaggcagccc caaaaacccc tggaccagaa actcccaaaa cctggaagct 6120 ccgccgttcg gagaaccctc ttaagataag actctcccgt gtctgactgc taatccacct 6180 tgtccctgta ctaacccaaa atgaaactcc caacaggaat ggtcatttta tgtagcctaa 6240 taatagttcg ggcagggttt gacgaccccc gcaaggctat cgcattagta caaaaacaac 6300 atggtaaacc atgcgaatgc agcggagggc aggtatccga ggccccaccg aactccatcc 6360 aacaggtaac ttgcccaggc aagacggcct acttaatgac caaccaaaaa tggaaatgca 6420 gagtcactcc aaaaaatctc acccctagcg ggggagaact ccagaactgc ccctgtaaca 6480 ctttccagga ctcgatgcac agttcttgtt atactgaata ccggcaatgc agggcgaata 6540 ataagacata ctacacggcc accttgctta aaatacggtc tgggagcctc aacgaggtac 6600 agatattaca aaaccccaat cagctcctac agtccccttg taggggctct ataaatcagc 6660 ccgtttgctg gagtgccaca gcccccatcc atatctccga tggtggagga cccctcgata 6720 ctaagagagt gtggacagtc caaaaaaggc tagaacaaat tcataaggct atgcatcctg 6780 aacttcaata ccacccctta gccctgccca aagtcagaga tgaccttagc cttgatgcac 6840 ggacttttga tatcctgaat accactttta ggttactcca gatgtccaat tttagccttg 6900 cccaagattg ttggctctgt ttaaaactag gtacccctac ccctcttgcg atacccactc 6960 cctctttaac ctactcccta gcagactccc tagcgaatgc ctcctgtcag attatacctc 7020 ccctcttggt tcaaccgatg cagttctcca actcgtcctg tttatcttcc cctttcatta 7080 acgatacgga acaaatagac ttaggtgcag tcacctttac taactgcacc tctgtagcca 7140 atgtcagtag tcctttatgt gccctaaacg ggtcagtctt cctctgtgga aataacatgg 7200 catacaccta tttaccccaa aactggacag gactttgcgt ccaagcctcc ctcctccccg 7260 acattgacat catcccgggg gatgagccag tccccattcc tgccattgat cattatatac 7320 atagacctaa acgagctgta cagttcatcc ctttactagc tggactggga atcaccgcag 7380 cattcaccac cggagctaca ggcctaggtg tctccgtcac ccagtataca aaattatccc 7440 atcagttaat atctgatgtc caagtcttat ccggtaccat acaagattta caagaccagg 7500 tagactcgtt agctgaagta gttctccaaa ataggagggg actggaccta ctaacggcag 7560 aacaaggagg aatttgttta gccttacaag aaaaatgctg tttttatgct aacaagtcag 7620 gaattgtgag aaacaaaata agaaccctac aagaagaatt acaaaaacgc agggaaagcc 7680 tggcatccaa ccctctctgg accgggctgc agggctttct tccgtacctc ctacctctcc 7740 tgggacccct actcaccctc ctactcatac taaccattgg gccatgcgtt ttcagtcgcc 7800 tcatggcctt cattaatgat agacttaatg ttgtacatgc catggtgctg gcccagcaat 7860 accaagcact caaagctgag gaagaagctc aggattgagc ttccgggaca aaagcagggg 7920 ggaatgagaa gtcagaaccc cccacctttg ctacataaat aaccgctttc atttcgcttc 7980 tgtaaaaccg cttatgcgcc ccaccctaac cgctttcatt tcgcttctgt aaaaccgctt 8040 atgcgcccca ccctagccgg aaagtcccca gccgctacgc aacccgggcc ccgagttgca 8100 tcagccgctt cgcaacccgg gctccgagtt gcatcagccg aaagaaactt catttcccaa 8160 gcttccccgg gacgaaatta cccacaaccc caaccaccac cgagcagaga gcctatcgcc 8220 agcctgtatg caaatgtaac tcaaaatggt ataaaagacc tgtaaccccg tttatcgggg 8280 ctctcccgct ttctaacact ggggagccct ggtgcaccag taaagactct ctgccgacgt 8340 cggagtgccg cgtggttctt tgcgccaact ctcattccat agggcctagg agtttggctc 8400 ctaaca 8406 //