ID   AF050215; SV 1; linear; genomic DNA; STD; FUN; 6980 BP.
XX
AC   AF050215;
XX
DT   01-JUL-1998 (Rel. 56, Created)
DT   14-FEB-2020 (Rel. 143, Last updated, Version 2)
XX
DE   Candida albicans Tca2 retrotransposon gag polyprotein (gag) and pol
DE   polyprotein (pol) genes, complete cds.
XX
KW   .
XX
OS   Candida albicans
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC   Saccharomycetales; Debaryomycetaceae; Candida/Lodderomyces clade; Candida.
XX
RN   [1]
RP   1-6980
RA   Goodwin T.J.D., Poulter R.T.M.;
RT   "Temperature- and strain-dependent expression of the pCal retrotransposon
RT   of Candida albicans";
RL   Unpublished.
XX
RN   [2]
RP   1-6980
RA   Goodwin T.J.D., Poulter R.T.M.;
RT   ;
RL   Submitted (20-FEB-1998) to the INSDC.
RL   Department of Biochemistry, University of Otago, Cumberland Street,
RL   Dunedin, New Zealand
XX
DR   MD5; 9bcd1faaab4cb11484fbc93cfc35b178.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..6980
FT                   /organism="Candida albicans"
FT                   /chromosome="3"
FT                   /map="between CDR1 and DYN1"
FT                   /strain="hOG759"
FT                   /mol_type="genomic DNA"
FT                   /note="contains ICTV exemplar Candida albicans Tca2 virus"
FT                   /db_xref="taxon:5476"
FT   misc_feature    202..206
FT                   /note="5' target site duplication"
FT   repeat_region   207..6632
FT                   /rpt_family="Tca2 retrotransposon"
FT                   /rpt_type=DISPERSED
FT   repeat_region   207..486
FT                   /rpt_type=LONG_TERMINAL_REPEAT
FT                   /note="5' long terminal repeat"
FT   primer_bind     487..518
FT                   /note="complements tRNA Arg(UCU) fragment"
FT   gene            604..1578
FT                   /gene="gag"
FT   CDS             604..1578
FT                   /codon_start=1
FT                   /transl_table=12
FT                   /gene="gag"
FT                   /product="gag polyprotein"
FT                   /db_xref="UniProtKB/TrEMBL:O13307"
FT                   /protein_id="AAC24820.1"
FT                   /translation="MSSAKNDDNEGKVMESVDQANAISKVDEHIKARFNMLFIKFNDLP
FT                   KLAVGNQKSVDKWNEEFKYFHVAYPDVLEFLLDYNPKDKFKVKKVEGIYFTGWCLQMCL
FT                   QSIFDRFRLIMISKLPKHLQKEANLIKAAYDAVTKSKDYTITSKILSKFVNVEHELVVC
FT                   YNLPYLSQVEEKLEEILYNTSNVVDEYVRSLPNLIGQVLYFNHVKKSEALSLFLNIHAS
FT                   YYSKWIQADNDTSVLPSCSTIAEEMCDHPDYARLVDIPSNKYELNLIVSLPAPEKPKGK
FT                   PEENSSEQSQKKNSKSRKRNKKHPKSDNDKGEKEKEKEKTSSE"
FT   misc_feature    1576..1578
FT                   /gene="gag"
FT                   /note="potentially suppressed stop codon"
FT   gene            <1579..6309
FT                   /gene="pol"
FT   CDS             <1579..6309
FT                   /codon_start=1
FT                   /transl_table=12
FT                   /gene="pol"
FT                   /product="pol polyprotein"
FT                   /note="includes protease, integrase, reverse transcriptase,
FT                   RNase H"
FT                   /db_xref="GOA:O74209"
FT                   /db_xref="InterPro:IPR001584"
FT                   /db_xref="InterPro:IPR012337"
FT                   /db_xref="InterPro:IPR013103"
FT                   /db_xref="InterPro:IPR036397"
FT                   /db_xref="InterPro:IPR039537"
FT                   /db_xref="UniProtKB/TrEMBL:O74209"
FT                   /protein_id="AAC24821.1"
FT                   /translation="KTGAASINCVMNIHNCSKTTFPVENSHSLNASLNVMNFKGLRFNK
FT                   YLVYDTGATISVVNNKDILSNVKDATIEVSVADGATLEADCIGDLIIRVGIVSITLENT
FT                   LYLPESSFNLVSLKQIEERGFNVLITKESVIVFNQNVAPTIIASRKNAADLYMGPQFSE
FT                   ESLECDFDYDGLADMLSNANQDDKDKSSMNEMSEYQEHDYSSRALINSLTEVDVLDVEI
FT                   SPYGVEQLLPTGDKNDIYNFHLMSNHMSIEKILLLQKYQGLVLHTSKESLQKIADCKVC
FT                   LLSNAKQRSHNHHSERKASRRHERLHCDTLGPFRSENNKWYLTSVIDEHTGYIEGIITK
FT                   DRKVKDLLIQRLKIWNNRFNDKVAYFRSDNAPEFPQPSDLAEFGIWRETIAAYSPELNG
FT                   LAEVVNKLILQQIYRIVVTLGPQILKLIYYVIQYSITMINHTPRRSLKGQTPYGCYYQL
FT                   SEGNFYRFPFAIDCVVTFSNAIEKNRYGVTSTKGAPSSIMGAVIGYASDCFSYYVLLKN
FT                   MRCDIILSPNVRILRSYEVINSYLKNLSTTPMSHIVPMAEGIQGRQSGAQYEVRGTYVE
FT                   SEYDNTNDVMHMPKESYSVQPASFTLTTGNSSNEYVINDDPVQITIENPDDFSNPLQLT
FT                   EESHDMVSEVKSDENPKPSLHELTPGDNPVSKPPQLGTETSVIGKSKEPITNHTKDAPS
FT                   IQGRDHKRSESTAQVGLSHQPQTGTPASEESKLSGTDHFGVDVVKETVSEDWHTSDYPE
FT                   TSAEDEQQNPSLSANKNRVTEKIDEGENISFPGGDDDSVVINSNVEQSNVETEDAGNSP
FT                   IQDEVSQEGRILNEQTDIVDTVAKVIENEKISPINSLDDHTELATDSGNDSNSTESDIQ
FT                   SKNEISPVINEKNTEIIQKHIESILADKRLDEFETYNVDEIENVINDDDIAEANPLPDE
FT                   NNDVQMNESFDNNHSMSRAKKKYTFEKEVNEKIAGTKHSLDTTDPREAIRVLNTGETKR
FT                   IEPKKREVPITVKLNKRSQYKSPYVTRSGRTVINPKRYLHAVVNKIDYNDPGWIKSMNA
FT                   ELEKFRSKDVYEEVPIPTGVKPISMGWVHTEKIDSLKGVVRKSRCVVHGNRQKEKLDYD
FT                   PFSVSSPVIDLVTIRLLTIIGCELGMTIQHLDVESAYLNASITHSNPIYVFPPKSVPLK
FT                   KNHCWLLKRSVYGLKQSGFEWYHTIKRVLEDIGFNQVLHNDGLFHIEYEEGSVIYLGLY
FT                   VDDILMVGSSQKVIDNFVDQLRDHFEVKVFGEISNYLGIEFRKTESGYILSQEKFLKKL
FT                   LKDFKLDDSYGKNIPWIPNDKYEKVAIIRENVNPENDFEKVPNETLLDPDAKKLYQSGV
FT                   GSLLWAATNTRPDISVVVNSLGSKSANPNVHDYEKLIYCLRYIKNSMGYHIEYKRNRLN
FT                   IPPKSFVIECFSDASFAPGLDRKSISGTLIYVNGNLVQWATKKQTVIAQSSAACEMLAL
FT                   NYTMLKAIEIKNNLMDLGFEVGKIHCHQDNQAVIKVLRNNYCHPHRPIDICYKFLRQLI
FT                   NDKVFSISYVKTNDNYADCMTKCLSRAKFKAFVEGMIKRLDLEDNQTSIQNAITAE"
FT   misc_feature    1579..1586
FT                   /gene="pol"
FT                   /note="purine-rich tract"
FT   misc_feature    1587..1661
FT                   /gene="pol"
FT                   /note="encodes putative pseudoknot"
FT   misc_feature    3661..3671
FT                   /gene="pol"
FT                   /note="polypurine tract 2"
FT   misc_feature    6342..6352
FT                   /note="polypurine tract 1"
FT   repeat_region   6353..6632
FT                   /rpt_type=LONG_TERMINAL_REPEAT
FT                   /note="3' long terminal repeat"
FT   misc_feature    6633..6637
FT                   /note="3' target site duplication"
XX
SQ   Sequence 6980 BP; 2447 A; 1121 C; 1338 G; 2074 T; 0 other;
     aataaatcat tcaattgttt ctatcttcaa acaccgccaa ataagctctt tataaagcat        60
     aattggtata tactggtctc tgctaaccag ctgcaatcaa gtttaatggg atcaattaac       120
     tatagtgttg tctgatacac gtctaattgt gcaaaagtac acacacaaac caaaagtgtg       180
     agaaaaaaaa atcactaagg gacacgtgtt ggtttgtgca ctattttgtg tcagaaactg       240
     atcaatgaaa atgatggtta ttatgagaat ggaaaatttt tccatcacac atcaggtgat       300
     gacagaacta aactatattg tgtagtataa ataagggtat gaaataccaa catcccagaa       360
     tatcaacgag atagaaggga ggagtttcaa tatatatctt gtgaataata acttcgttct       420
     aattcactat acacaactag acgtgtacac gctcaatctc aggtaaagaa agtttatatt       480
     ccatcagatt agaagtcgat agtgataatc atttcgtccc aaattagcgt tgtataaatt       540
     cagtcctcag atttgtatta ttgattgata gtttcgaagt ttgaaggtac agaatttcac       600
     aagatgagtt ccgcaaagaa tgatgataac gaagggaagg tcatggaaag tgttgatcaa       660
     gctaatgcta ttagtaaggt ggatgaacat atcaaggcta gattcaatat gcttttcata       720
     aaatttaatg acttacctaa gttggccgtc ggtaatcaga aaagcgtgga taaatggaat       780
     gaagaattta aatatttcca cgttgcttac cccgatgttt tggaattttt gcttgactat       840
     aatcctaaag ataaattcaa ggttaaaaag gtagaaggta tttattttac tggttggtgt       900
     ttacaaatgt gtttacagtc catttttgat aggttcagat tgatcatgat ttctaagcta       960
     ccaaagcact tgcaaaagga agcaaactta atcaaagctg cttatgatgc tgttactaaa      1020
     tctaaagatt ataccattac tagtaagatc ttgctgaagt ttgtaaacgt tgaacatgag      1080
     ttagtggttt gctataacct tccatatttg ctgcaggtgg aagagaaact tgaggaaata      1140
     ctctacaaca cttcaaacgt tgtcgatgag tatgtccgta gtcttccaaa tctcataggt      1200
     caagtcttgt acttcaatca tgtgaagaaa tcagaggctt taagtttgtt tttgaatatt      1260
     catgcctcat actactcaaa gtggattcaa gctgacaatg atacatcagt actcccaagt      1320
     tgctctacca tagctgaaga aatgtgtgat catcctgatt atgctagatt ggttgacatt      1380
     ccaagcaaca aatatgaact taatcttatt gttagtttac cagcaccaga gaaaccaaaa      1440
     ggaaaaccag aggagaactc actggaacaa tctcaaaaga agaacctgaa atcaagaaag      1500
     agaaataaga aacatccaaa atcagataac gataaaggtg aaaaagaaaa agaaaaagaa      1560
     aaaacttcac tggaatgaaa aacaggtgct gcttctatta attgtgtaat gaatatacat      1620
     aattgcagca aaaccacgtt tccagtagaa aattctcatt ctcttaatgc ttctttgaac      1680
     gtaatgaatt ttaaaggttt aaggtttaac aagtatctag tgtatgatac tggtgccaca      1740
     atatctgttg tgaacaataa agatatattg ctgaatgtta aggacgcaac aattgaagtt      1800
     tctgttgctg atggtgctac attagaagca gattgtattg gtgatctaat tatcagagtc      1860
     ggtattgtct cgattacgtt agagaataca ttgtatttac cagaaagttc ctttaatctt      1920
     gtgagtttga aacaaattga agaacgagga tttaatgttc ttattactaa agaatcagtg      1980
     attgtattta accaaaatgt ggctcctact attattgctt caaggaagaa tgctgctgat      2040
     ctttatatgg gtcctcaatt cagtgaagaa tctttagaat gtgattttga ttatgatggt      2100
     ttggcagata tgttgtccaa tgctaaccaa gatgacaaag ataaatcaag tatgaatgaa      2160
     atgtcagaat atcaagaaca tgattatagt tctcgagcat taataaattc tttgacggag      2220
     gttgatgttt tagatgttga aatttcccca tatggagttg aacaattgct accaactgga      2280
     gataagaacg atatttataa tttccatttg atgtcaaatc atatgtccat tgagaaaatc      2340
     ttgttgttac aaaaatacca gggtctcgta cttcacactt caaaagagag tcttcaaaag      2400
     attgctgatt gtaaggtatg tctattatcg aatgccaaac agagaagtca caatcatcat      2460
     tcagaaagaa aagcctcgag aagacatgag agacttcatt gtgatactct cggtccattt      2520
     aggtccgaaa ataacaagtg gtatttaacg tctgttatag atgaacatac gggttacatt      2580
     gaaggaatta ttactaaaga cagaaaggta aaggatctct taattcaacg attaaagatc      2640
     tggaataatc ggtttaacga taaggtggca tacttcagaa gtgataatgc tcctgagttc      2700
     ccacaacctt ctgatttagc tgagttcggt atttggaggg agactatagc ggcatatctg      2760
     cctgagctta atggtctcgc cgaggttgtt aataaattga ttttacaaca gatttacagg      2820
     atcgttgtga cacttggtcc acaaatactc aagttgattt attatgtgat tcaatattct      2880
     attacaatga tcaaccacac tccacgtcgt tcactcaagg gacaaacccc ttatggttgc      2940
     tattatcaat taagtgaggg aaatttctac cggtttcctt ttgccatcga ttgtgtcgtt      3000
     acatttagta atgccatcga aaagaaccgt tacggagtta catcaactaa aggagctcct      3060
     tcatcgatca tgggtgctgt gattggctac gctagcgatt gttttagtta ttacgtgttg      3120
     ctaaaaaata tgcggtgtga tattatcctt agccctaatg tccgtatatt gcgaagctat      3180
     gaggttatta actcctatct caaaaactta tccactacac ctatgtcaca cattgttcct      3240
     atggctgaag gtatccaggg aaggcaactg ggcgctcagt acgaggtacg cggaacatat      3300
     gtggaaagtg aatatgacaa tacaaatgac gtgatgcaca tgcccaaaga gtcatattca      3360
     gttcagccag catcgtttac tttaactacg ggtaacagtt ctaacgaata tgttataaat      3420
     gatgatccag tacagattac cattgagaat cccgatgatt tttctaaccc tcttcaacta      3480
     actgaagaat cacacgatat ggtatccgaa gtaaaatcgg atgagaatcc taaacccagt      3540
     ctccacgagc taacacctgg ggataatccg gtgtctaaac ctcctcaact tggtaccgag      3600
     acttcagtaa tagggaagtc taaagagcct attacaaacc acacaaagga cgccccttcc      3660
     atccagggga gggaccataa acgcctggaa tctactgctc aggttggact atcacaccaa      3720
     ccccagactg gtactcccgc ttcggaggag tcaaaattgt caggaacaga tcatttcggt      3780
     gtcgacgttg ttaaagaaac agtctcagaa gattggcata cttctgacta cccagaaact      3840
     agtgctgaag atgaacagca aaatccctcg ttactggcta ataagaatcg ggtaactgaa      3900
     aaaatagatg agggagaaaa tatttcattt ccggggggtg atgatgattc tgtcgtgatc      3960
     aactcaaatg ttgagcaatc taatgttgaa acagaggatg ctggtaacag tccaattcaa      4020
     gacgaagttt ctcaagaggg aagaatactt aatgaacaaa ctgatatagt tgatactgtt      4080
     gctaaagtta ttgagaatga aaaaatctct cctattaatt cattagatga tcatactgaa      4140
     cttgctacag actcgggaaa tgatagcaat tcaacagaat ccgacattca atcgaaaaat      4200
     gaaatatcac cagtgattaa tgagaaaaat actgaaataa tccaaaaaca cattgaaagt      4260
     atccttgctg ataagagatt ggatgaattt gaaacgtata atgttgatga aattgagaat      4320
     gtgattaatg acgatgacat tgctgaagct aatccactac cagatgaaaa taatgatgtt      4380
     cagatgaatg agagttttga taataatcat agcatgtcac gagcaaagaa gaaatacaca      4440
     tttgagaaag aagttaacga aaaaattgct ggtactaaac attcacttga tacaactgat      4500
     ccaagagaag caatcagagt gttaaatact ggtgaaacca agagaatcga acccaagaaa      4560
     agagaggtgc ctatcactgt gaaattaaac aaaagatcgc aatacaagtc accatatgtt      4620
     acaagaagtg gtagaacggt tataaacccc aagaggtatt tacatgcggt cgtcaacaaa      4680
     atcgactata atgatccggg atggataaag tcaatgaatg ctgaactaga gaaatttaga      4740
     tcaaaagatg tttacgaaga agttccaatt cccaccggtg tgaagcctat atctatgggt      4800
     tgggtacata ctgagaaaat tgattctctc aaaggtgttg ttcggaaatc acgttgtgtt      4860
     gtccatggca acagacaaaa ggaaaaattg gattatgacc cttttagtgt tagttcacct      4920
     gttatagatc ttgtgactat aagattattg acaataatag gttgtgaatt aggaatgaca      4980
     attcaacatt tagacgtcga gtcggcgtat ctaaatgcct ctattactca ttcaaatcca      5040
     atttatgtct ttcctcctaa atcagtacct ttgaagaaaa accattgttg gttattgaaa      5100
     cgttctgtct atgggttaaa acagtcgggt tttgaatggt atcacactat caaaagagta      5160
     ttggaagaca ttggttttaa tcaagtttta cacaatgatg gtttatttca cattgaatat      5220
     gaagagggat cagtaatata tttaggttta tatgttgatg atattcttat ggttggaagt      5280
     tcacaaaaag ttattgataa ttttgtggat caattgagag atcattttga agttaaagtg      5340
     tttggtgaaa tatcaaatta tcttggtatt gaatttcgta aaaccgaatc tggttatatt      5400
     ttatctcaag aaaaatttct caagaaatta cttaaggatt tcaaactaga tgactcatat      5460
     gggaaaaaca taccctggat tccgaatgac aaatatgaaa aggttgcaat aattcgtgaa      5520
     aacgttaatc cagagaatga ttttgaaaag gttccgaatg agacattgct tgaccctgat      5580
     gctaaaaaac tataccaaag tggtgttggc ctgcttttat gggctgccac aaacacacgt      5640
     ccagatatat cggtcgtagt gaattcgttg ggttctaaat ctgcaaatcc aaatgtccat      5700
     gattatgaga aattgattta ttgtcttagg tatatcaaaa atagcatggg atatcacatt      5760
     gagtacaaaa gaaacagatt gaatatacca ccaaaatcat ttgttatcga atgtttcagt      5820
     gatgcgtcat ttgcaccagg attggataga aaatctatta gtggaacttt gatttatgtg      5880
     aatggaaatt tggtgcaatg ggcgaccaaa aaacaaacgg tcatagcaca aagctcagca      5940
     gcttgtgaaa tgttggctct aaattataca atgttgaaag ctatcgaaat aaaaaacaat      6000
     ttaatggatt tgggttttga agtaggtaag atacattgtc atcaagacaa ccaagctgtg      6060
     attaaagttt tgagaaataa ctattgtcac ccacatcgac caatagatat ctgctataag      6120
     tttctacgcc aattgatcaa tgataaagta ttttcaatat cctatgtgaa gacaaatgat      6180
     aattacgccg attgtatgac taagtgtcta agtcgtgcta aattcaaagc attcgttgag      6240
     ggtatgataa aacggttaga cctagaagat aatcaaacac tgatacaaaa tgcaataacg      6300
     gcagaataag tggatttatc attactatta tcgtaatgct caatcagggg agtgttggtt      6360
     tgtgcactat tttgtgtcag aaactgatca atgaaaatga tggttattat gagaatggaa      6420
     aatttttcca tcacacatca ggtgatgaca gaactaaact atattgtgta gtataaataa      6480
     gggtatgaaa taccaacatc ccagaatatc aacgagatag aagggaggag tttcaatata      6540
     tatcttgtga ataataactt cgttctaatt cactatacac aactagacgt gtacacgctc      6600
     aatctcaggt aaagaaagtt tatattccat caacacgaaa agtaaacaaa tctgaaaaac      6660
     acgaaggctt tcaaaaatag tgatccagaa tattaaaaac agaatcttta tttgaaattt      6720
     aattcaacac aatggaagaa ctggtcacac ccttattatc agtcaatcaa ttatatgact      6780
     gtataatagg ctttctacct tttaatgaaa atgtgccgaa atttacacag tgtgaagatg      6840
     tattgaccaa gtttatatct aacaacaacc aagatacttt gtaccttatc aagtctattg      6900
     aagagaaaac atccagaata tcaaatgatc tttcagactt ggatatggga tttaaagatt      6960
     tagatacttt tgttattata                                                  6980
//