ID HQ912790; SV 1; circular; genomic DNA; STD; VRL; 7679 BP. XX AC HQ912790; XX DT 20-MAR-2011 (Rel. 108, Created) DT 26-JUL-2011 (Rel. 109, Last updated, Version 2) XX DE Camelus dromedarius papillomavirus type 1, complete genome. XX KW . XX OS Camelus dromedarius papillomavirus type 1 OC Viruses; dsDNA viruses, no RNA stage; Papillomaviridae; OC Deltapapillomavirus; unclassified Deltapapillomavirus. XX RN [1] RP 1-7679 RX DOI; 10.1099/vir.0.031039-0. RX PUBMED; 21471319. RA Ure A.E., Elfadl A.K., Khalafalla A.I., Gameel A.A., Dillner J., RA Forslund O.; RT "Characterization of the complete genomes of Camelus dromedarius RT papillomavirus types 1 and 2"; RL J. Gen. Virol. 92(Pt 8):1769-1777(2011). XX RN [2] RP 1-7679 RA Ure A.E., Elfadl A.K., Khalafalla A.I., Gameel A.A.R., Dillner J., RA Forslund O.; RT ; RL Submitted (21-JAN-2011) to the INSDC. RL Department of Medical Microbiology, Malmo University Hospital Lund RL University, Entrance 78, Malmo, Skane 205 02, Sweden XX FH Key Location/Qualifiers FH FT source 1..7679 FT /organism="Camelus dromedarius papillomavirus type 1" FT /host="Camelus dromedarius" FT /mol_type="genomic DNA" FT /country="Sudan" FT /isolation_source="lip" FT /collection_date="Aug-2009" FT /note="acronym: CdPV1" FT /db_xref="taxon:996650" FT CDS 1..387 FT /codon_start=1 FT /product="E6" FT /db_xref="GOA:F2YGG8" FT /db_xref="InterPro:IPR001334" FT /db_xref="UniProtKB/TrEMBL:F2YGG8" FT /protein_id="ADZ53049.1" FT /translation="MPLKKWSDYNNLPCLFCGKLLDGVEALRCEWKKINVVTRHGADWA FT VCTLCLKQALLIESRLYATVWVKPSLTGDDEKDLVIRCHICGGILSDSEKDRHFVHKEP FT YIYRRGHLRGRCYDCTSDGLRATN" FT CDS 365..727 FT /codon_start=1 FT /product="E7" FT /db_xref="GOA:F2YGG9" FT /db_xref="InterPro:IPR000148" FT /db_xref="UniProtKB/TrEMBL:F2YGG9" FT /protein_id="ADZ53050.1" FT /translation="MVSGPPTEKDLPEEPVCKLPTVELDFEPIAPEVPETASPRPPSPA FT TAFVDFECPVIIKGSSRKCYYIASKCADCTQVMHYAVRTSPVTIWNLQQLLTRDLHLLC FT PTCEQKTRRRKRNGRW" FT CDS join(714..729,3180..3592) FT /codon_start=1 FT /product="E1^E4" FT /note="the E1^E4 product is generated by splicing resulting FT in a fusion protein containing five amino acids from the FT N-terminus of E1" FT /db_xref="UniProtKB/TrEMBL:F2YGH0" FT /protein_id="ADZ53053.1" FT /translation="MADGEDKPKPGAAHEDPQQKPCAQETPSTDKQDGPKEPARDLDKQ FT PCLPCLTLILWNPPAWVPSENATTGPRHTLHPSPFYGAQGQVNSYLPSPLCRARYRKVW FT HEHRKDLKRRRLRTRRKRQQRVPPLNLNCSGARRTRAY" FT CDS 714..2606 FT /codon_start=1 FT /product="E1" FT /db_xref="GOA:F2YGH1" FT /db_xref="InterPro:IPR001177" FT /db_xref="InterPro:IPR014000" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR016393" FT /db_xref="UniProtKB/TrEMBL:F2YGH1" FT /protein_id="ADZ53051.1" FT /translation="MADGEGTSVGGRDAYILLEASCSENESEPGSYISTGDADSEGEDL FT IDNASVRSDAQGNHLAIFQQLEKKAGEQQLLNLKRKLNVSGSSESPSAEDLGPTLGAGA FT LRPVAKRRLFASLENTTDLSASSYEAKGVAPPLSAQVLSPTCSIGLGTSRVYSGSSGKE FT NSRSSGEGGLHLDVLRSKNRESCKLACFKDAFVASFADLTRVFHSDKTTNAQWVTAAFD FT VDEKLYEAAILRLKKHCTYLQACRRCRKKGSVSVFLTEFTVAKCRETVLKLFADLLTLE FT VTKLISQPPKIKGLCAALFWFKNAMCSDTQTFGTVPKWISVQTSVTENSAEALKFDFGL FT MVQWAWDQNFSEESAIAYNYALQAETDRNAKAWLACSNQARYVKDAAAMVRHYKRAAML FT SLSMSAYIKQRCDLITGAGTWLNVANVLKYHGIEPIVFVNALRNWLKGTPKKNCITIIG FT PSNTGKSMFCHSLISFLGGNVLSFANHLTHFWLAPLSETRAALIDDATHKCWRYFDTYL FT RSVLDGYHIQVDRKHRAPVQIKAPPLLVTSNVDVSTEPSLQYLHSRLVTFYFSQPFPVD FT ENGDPLFKITHADWKCFFERLWGRLDLSDQEDEGDDGASRRTFTCNARGPNGAD" FT CDS 2548..3834 FT /codon_start=1 FT /product="E2" FT /db_xref="GOA:F2YGH2" FT /db_xref="InterPro:IPR000427" FT /db_xref="InterPro:IPR001866" FT /db_xref="InterPro:IPR012677" FT /db_xref="UniProtKB/TrEMBL:F2YGH2" FT /protein_id="ADZ53052.1" FT /translation="MVPAAERLLATQEAQMELIEQDSPSLSCHVKYWGLVRAEMALLCA FT ARLKGHKVLGLCPVPPSSVSTAKAKQAILMELELKSLQETQWGRDAWLLTECSYETYSA FT PPTCTFKKLPRIVEVMFDGNPANRTWHTCWEQIYTRQPDGWVCSKGGADGEGLFVQDSN FT GGRAYYQYFADDAKRFSATGQWTVIDKDECFVSSTTSTASEFSDSNYRQTETGGSTRGP FT PAETVCTGDTVDGQAGRTERTSPRLGQTAVSSVSHTYPLESPCVGPIRKRNYRPATHSA FT PVTFLRCSGPGELLPAVPTLPCPVQKSLARTSEGPETPPSPDSSQASAAGATSEFELFG FT GQKNACLLISGNANQVKCHRYRLRKYRRKRYEHITTTWWTVSESGSVRLGSATILVTFS FT TPTQREDFLKAVSLPCGMTVKPISVMCDF" FT CDS 4038..5543 FT /codon_start=1 FT /product="L2" FT /note="minor capsid protein" FT /db_xref="GOA:F2YGH3" FT /db_xref="InterPro:IPR000784" FT /db_xref="UniProtKB/TrEMBL:F2YGH3" FT /protein_id="ADZ53054.1" FT /translation="MAAAARRRVRRANPYDLYRTCKANNTCPPDIIPLVEGNTIADQIL FT KWGSWAVYLGGLGIGTGVGSFGGRAPTTASVKAGIENALVRIFAGGKSGNLPRPSAVDI FT PLDTLGVSGGAGDTSVSVGVSEVSPVVVPEGVGPVDAAANADTLQDTVITYIDSLTDKH FT ALIDIRPLEGGDNTQVYTSSTFSNPAFEGASQQPVLGETTNLENIFVGGRSLGDTGGES FT IELTEFNGPHTSTPADGRSDVQVKGRGNWFSRRYYTQVSMDPSVFESATKEYGFENPAY FT EGDSFTVSDEALTHFRPVLPELQDATHITASRLLRGRSGRVGVDRVASTKTIGTRSGVR FT IGGQVHLRHSLSSIGSDVELLPLHLINSDTPLTLETAFVEAPDASESIDAEGFPVSYSE FT EDLLDDDVSLPHGHLVIGSRFLNETVPMPEYTSVYKAIVPVDVSDILRPSTDPNWTNVV FT PEPGGDMTPGIVIDLSDDYYRHYYLHPSLLKKKKSKVRKLWYA" FT CDS 5555..7057 FT /codon_start=1 FT /product="L1" FT /note="major capsid protein" FT /db_xref="GOA:F2YGH4" FT /db_xref="InterPro:IPR002210" FT /db_xref="InterPro:IPR011222" FT /db_xref="UniProtKB/TrEMBL:F2YGH4" FT /protein_id="ADZ53055.1" FT /translation="MALWQPGQKLYLPPTPVTKVLCTDTYVTRREIYYHAETERLLSVG FT HPYYSLKYGDGKEIPKVSPNQYRVFKVLLPDPNQFALPDKTLHDPSKERLVWALVGIQV FT SRGQPLGAAITGHPYWNVYTDAENMSRRAPGAQTKDDRKQGGLDSKQTQVLSVGCIPAT FT GEYWDIAKACAGADLPAGSCPPLELKNKVIGDGDMMEIGFGAANFKALNQSKSDVPLDI FT ENTTRIYPDYLKMSEEASGNSLFFFARKEQVYTRHIYSRGGLEEPEKPPEGYILKRNGG FT DTAVNNAYMGVPSGSLVSTDSQIFNRPYWIYMAQGMNNGIAWNNTLYVTVGDNTRGINI FT GISVSKNGQTPTEFDSQNINMFQRHVEEYKLAFILQLCSVQLTSETVAHLQFSDPTVLE FT NWQIGMQPPVSSILEDSYRYITSAATKCPDQVNPPKPNDPYEKNTFWLVDLKEKLSLDL FT DQFPLGRRFLAQRGLGCRSRPPKRVANSKHEGVPAKKRKRTR" FT polyA_signal 7085..7090 FT /note="late polyadenylation site" FT protein_bind 7352..7363 FT /bound_moiety="E2" FT /note="putative E2 binding site" FT protein_bind 7381..7392 FT /bound_moiety="E2" FT /note="putative E2 binding site" FT protein_bind 7453..7464 FT /bound_moiety="E2" FT /note="putative E2 binding site" FT protein_bind 7471..7482 FT /bound_moiety="E2" FT /note="putative E2 binding site" FT protein_bind complement(7505..7510) FT /bound_moiety="E1" FT /note="putative E1 binding site" FT protein_bind 7517..7522 FT /bound_moiety="E1" FT /note="putative E1 binding site" FT protein_bind 7569..7580 FT /bound_moiety="E2" FT /note="putative E2 binding site" FT TATA_signal 7649..7654 FT protein_bind 7662..7673 FT /bound_moiety="E2" FT /note="putative E2 binding site" XX SQ Sequence 7679 BP; 2165 A; 1618 C; 1899 G; 1997 T; 0 other; atgcctctta aaaagtggtc tgattacaat aaccttccat gtttattctg tggtaagctg 60 cttgatgggg ttgaagccct cagatgcgag tggaaaaaga tcaatgttgt tactaggcac 120 ggagcagact gggctgtttg cacactctgc ttgaagcaag cactattaat tgaaagtaga 180 ctttatgcca ctgtgtgggt caagccatca ttgacaggag acgacgagaa ggaccttgta 240 attcgctgtc atatttgtgg aggcatcctg tctgatagcg aaaaggacag acattttgtt 300 cataaggagc cttatattta tcgaagaggc catctccgag ggcggtgcta cgattgtaca 360 agtgatggtc tcagggccac caactgaaaa agacctgccg gaggaacctg tatgcaaatt 420 accaactgta gagctggact ttgaacctat tgcacctgag gtacctgaga cagcgtctcc 480 aagacctcca tcaccggcga ctgcatttgt ggattttgag tgccccgtga taatcaaagg 540 aagttctaga aagtgctact atattgcatc caagtgcgcg gactgcacac aggtgatgca 600 ctatgctgta cggacgtccc cagtaacaat ttggaacctg cagcaacttt tgacaagaga 660 cctgcacctg ctttgtccca cgtgtgagca gaagaccagg cgccgcaagc gcaatggccg 720 atggtgaagg tactagtgtg ggcgggaggg atgcttacat tctacttgag gcgtcttgca 780 gcgaaaacga aagcgaaccg gggtcatata taagcacagg cgatgcggac agtgagggcg 840 aagatcttat tgataatgcg tctgtgcgtt ctgacgcgca ggggaatcac ctggctatct 900 tccagcagtt agagaaaaag gcgggagagc aacaattgtt aaacttgaaa agaaaactta 960 atgttagcgg cagtagtgag tctccttctg cggaggacct cggtccaacg ctaggtgctg 1020 gcgctctgcg cccagtcgct aaaagacgtt tgtttgcgtc tctagaaaat actactgacc 1080 tttccgcctc cagctatgaa gctaaaggtg ttgctccgcc gctgtcggcg caggtattat 1140 cgccgacctg tagtattggg ctgggtacta gccgtgtata tagcgggagt tcgggcaagg 1200 aaaatagtag aagctcaggg gagggcggac ttcacttgga tgtgttgcgc tcaaaaaacc 1260 gggagtcttg caagttagct tgctttaaag atgcttttgt tgcaagtttt gcagatttaa 1320 caagggtatt tcatagtgac aaaactacca acgcgcagtg ggttaccgcg gcttttgatg 1380 ttgatgaaaa gttgtacgag gcggccattt tgcgcttaaa gaaacactgc acttatttac 1440 aggcctgcag gcggtgccgt aaaaaaggca gtgtatccgt tttccttacg gaatttactg 1500 tcgcaaagtg tcgtgaaact gtgctgaaac tctttgcaga cttattaacc cttgaggtca 1560 ccaagcttat ttcacagcct ccgaaaatta agggattgtg tgctgctttg ttctggttca 1620 aaaatgcaat gtgctcagat acccaaacat ttggaactgt gcctaagtgg atatctgtac 1680 aaacctctgt gacagaaaat tcagcagagg ccttgaaatt tgattttggt ctaatggtgc 1740 agtgggcttg ggaccaaaat ttttctgagg aatcagccat agcctataat tatgctttac 1800 aggcagaaac agatcgaaat gcaaaagcct ggttggcatg ttccaatcaa gccagatatg 1860 tgaaagatgc tgctgctatg gtcaggcatt ataaaagagc tgctatgctg agcctatcca 1920 tgtctgccta cataaagcaa cgttgtgacc tcattacagg ggctggtaca tggttgaatg 1980 tagcaaatgt cctaaaatac catggtatag aaccaatagt ttttgtaaat gccttacgga 2040 actggctaaa aggaacacct aaaaaaaatt gtataaccat aattggccca agcaatacag 2100 ggaagtcaat gttttgtcac tctttgatat ccttccttgg gggaaatgta ctttcttttg 2160 ctaaccacct cacacacttt tggctagcgc ctctatcaga aactagagcg gctctaatag 2220 atgatgccac acataaatgc tggagatatt ttgataccta cctcagaagt gtcttagacg 2280 gttaccatat acaggtagac aggaaacaca gagcccctgt gcaaattaag gcccctcctt 2340 tgttagttac aagcaatgta gacgtgtcca cagaaccctc tttgcaatat ttgcatagta 2400 ggttagttac cttttacttc tcacagcctt ttccagtgga tgagaatggg gatcccttat 2460 ttaaaataac tcatgcagat tggaaatgtt tctttgaaag gctctgggga cgattagacc 2520 tcagcgatca agaagacgag ggggacgatg gtgccagccg cagaacgttt acttgcaacg 2580 caagaggccc aaatggagct gattgaacag gatagcccaa gcctcagttg ccatgtgaaa 2640 tattggggcc tagtgagagc agaaatggct ttgctatgtg ctgccagatt gaagggacat 2700 aaagttttag gcttgtgccc ggtaccccca agttctgtgt ctactgctaa ggctaagcag 2760 gccattctca tggaattgga attaaaatct ctgcaggaga cacaatgggg gagggacgct 2820 tggctactga ccgagtgcag ctacgagaca tatagtgccc cgcccacctg cacattcaaa 2880 aaactgccgc gaattgtgga agtgatgttt gatgggaatc cagcaaatcg cacctggcac 2940 acctgctggg aacaaattta tacaagacaa cctgacggct gggtgtgttc gaaaggaggt 3000 gcagatggtg aaggcttgtt tgtacaggat agcaatggcg gtcgcgcata ctaccagtat 3060 tttgcagatg atgcgaaaag gtttagtgcg acaggacagt ggactgtaat tgataaggac 3120 gagtgctttg tgtcatctac aacttcaaca gcctctgaat tctctgattc aaattacaga 3180 caaaccgaaa ccgggggcag cacacgagga cccccagcag aaaccgtgtg cacaggagac 3240 accgtcgacg gacaagcagg acggaccgaa agaaccagcc cgagacttgg acaaacagcc 3300 gtgtcttccg tgtctcacac ttatcctctg gaatccccct gcgtgggtcc catcagaaaa 3360 cgcaactaca ggcccgcgac acactctgca cccgtcacct ttctacggtg ctcagggcca 3420 ggtgaactcc tacctgccgt ccccactttg ccgtgcccgg tacagaaaag tttggcacga 3480 acatcggaag gacctgaaac gccgccgtct ccggactcgt cgcaagcgtc agcagcgggt 3540 gccacctctg aatttgaact gttcgggggc cagaagaacg cgtgcctact gatttcaggt 3600 aatgccaatc aagttaaatg tcatcgatac agactgcgta aataccgtcg aaaaaggtac 3660 gaacacatca ccactacctg gtggacagta tctgaatcag gcagtgtccg ccttggttct 3720 gctacaatac tggtgacttt ttctacacct actcaaagag aggacttttt gaaagctgtg 3780 tcactgccat gcggaatgac tgtgaaacct atatcagtta tgtgtgactt ttagacaact 3840 gcctacatat gtggtggaaa tggcttttgt tcattgctat gtatatttct agtgcaaatg 3900 tagtgtgtgt tctggttgac atttgggatg cttgtgcatt gtatgtgctg ttacttatat 3960 ttctgtattt gcatcctgga gttgtaaaat tacctgtacg ttttgctttg taacatttta 4020 tccaaaattc agtaaacatg gctgctgctg cacgtcgtag agtacgcaga gctaatcctt 4080 atgatttgta tagaacctgc aaggcaaata acacctgccc cccggacatc attccacttg 4140 tggagggtaa cactattgca gaccaaattc tgaagtgggg cagctgggct gtgtatttgg 4200 gtggacttgg tattgggaca ggagtaggct cttttggggg acgggcacct accacagcat 4260 cagtgaaagc tggcattgag aatgcgcttg taagaatttt tgcaggtggc aaaagtggta 4320 accttcccag accatctgca gtggatattc ctttagacac attgggtgtt tcaggggggg 4380 caggggacac aagtgtgtct gtgggtgtat cagaggtcag tccagttgta gttcctgaag 4440 gtgtaggtcc agtagatgct gctgcaaacg ctgacacatt gcaagacaca gttatcactt 4500 atattgatag cctaacagat aaacatgcat taatagatat taggccttta gaaggagggg 4560 acaatacaca agtttacaca agcagtacat tcagcaatcc agcctttgaa ggggcaagtc 4620 agcagcctgt actaggtgaa acaacaaact tggaaaatat ttttgttggt ggtcgcagtc 4680 taggcgatac aggaggtgag tctattgaat tgacggagtt taatggtcct cacacaagca 4740 cacctgcaga cggtcggtct gatgtgcaag ttaaaggccg tggtaactgg ttcagtagaa 4800 ggtattatac tcaggtgagt atggacccat ctgtttttga aagtgcaacc aaagagtatg 4860 ggtttgagaa cccagcctat gaaggtgaca gctttactgt aagtgatgag gcccttactc 4920 attttagacc agtactacct gagttacagg atgcaacaca tataacagcc tcgcgcctgc 4980 tacgtggtcg aagtggccgt gttggcgttg atagggtggc cagtactaaa actattggta 5040 ccagaagtgg agtacgtatt gggggacagg tgcacttaag gcattctttg agcagcatag 5100 ggagtgacgt ggaactgctt ccgttgcatc ttataaacag tgatacacca ctgactctgg 5160 aaaccgcttt tgtggaagca cctgacgcat ctgaaagtat agatgcagag gggtttcctg 5220 tttcatacag tgaagaggat ttattggatg acgacgtgtc acttcctcat ggtcatctgg 5280 taattgggtc tcggttttta aatgagactg tgccaatgcc tgaatatacc tcggtgtaca 5340 aagctattgt tcctgttgac gtgtcagaca tacttaggcc cagtactgat cctaattgga 5400 ccaatgtagt accagaaccg ggtggagata tgacacctgg aatagtcata gacctgtcag 5460 atgattatta taggcattat tatcttcacc ctagcctact taaaaaaaaa aagagcaagg 5520 tgcgtaaact atggtatgct taattgtctt gcagatggcg ctatggcagc ctggacagaa 5580 gctgtatctg cctcctacac ctgtgacaaa ggttctatgc acagacacct atgtaacccg 5640 cagggaaata tattaccacg cagagacaga aaggctttta agtgtaggac atccttatta 5700 ctctttaaag tatggtgatg gcaaagaaat acctaaagtg tcccccaatc agtatcgtgt 5760 tttcaaagtg ctattacccg accctaatca atttgctcta ccagataaaa cgttacatga 5820 tccaagtaaa gagaggctcg tgtgggcttt agtaggcatt caggtgtcaa gaggtcaacc 5880 ccttggcgct gcaattacag gccatcctta ttggaatgtg tatactgatg ctgaaaatat 5940 gagtagaagg gctcctggtg cacaaacaaa agatgataga aaacaaggtg gtttagattc 6000 taaacaaaca caggtactat cggtagggtg cattcctgcc acaggtgagt attgggacat 6060 agctaaagca tgtgcaggag ctgatctacc tgccggatca tgtccaccac tagaattaaa 6120 aaataaagta attggggatg gggatatgat ggaaattggg tttggggctg caaatttcaa 6180 ggccttaaat cagtctaaat cagatgtacc tttggatatt gaaaatacaa ctcgcatcta 6240 tccagattat ctaaaaatgt cagaagaggc ttctgggaat agtttgtttt tttttgcccg 6300 caaggaacag gtttacacaa ggcacattta cagccgcggt ggtcttgagg agcctgagaa 6360 accgcctgaa ggatacatat taaaaagaaa tggaggtgat acagctgtta ataatgccta 6420 catgggtgtt ccaagtggtt ccttagtaag cactgatagc caaattttca acaggccata 6480 ctggatatac atggctcaag gcatgaataa tggaatagcc tggaacaata cattatatgt 6540 aactgtaggt gacaacacac ggggcatcaa tataggcatt tctgtttcta aaaatggaca 6600 aacacctaca gagtttgata gccagaacat caacatgttt caaagacatg ttgaagaata 6660 taagctggct tttattttac agctatgctc agtgcaactc acctctgaaa ctgttgccca 6720 tttgcaattt agtgatccca ctgtgcttga aaattggcag ataggcatgc aaccccctgt 6780 gtcttctata ttagaagaca gttatagata tattacatcc gctgccacca aatgcccaga 6840 tcaagtcaac ccccctaaac caaatgatcc atatgaaaaa aatacatttt ggctagtaga 6900 tctcaaggaa aaattgtctt tggatctaga ccagtttccc ctaggacgta ggtttcttgc 6960 acagcgcggt ttagggtgca gaagcagacc ccctaaacgt gtcgcaaatt caaagcatga 7020 aggtgttcct gcaaaaaaaa ggaaacgcac acggtgaggt gggactaagg tatgtaagtg 7080 ggcgaataaa gtgtttaaag atgactctgc tgtctgtgtt tattgcctac gcccttgttt 7140 attgactata aattggaaac agtcttgatg agagtcttga agctgtgtga agtggtgcca 7200 gtatcgtgtg aagctgtgcc aaaaagccgc gcattatgca gcgttgacag tttgaagata 7260 ggactagacg gtgagtaata tctggactgt gagaacctgc tgacacagct gcgcgccaaa 7320 taatcgctga cggtgctgaa atggcgccaa cacctccaac ggttatgtct tcgcgccaaa 7380 accggtctcg gtatggttca aatttaaact tgcatggaca cgccaagttc ccgcccaata 7440 aatttgaact gcaccgtcgg cggtgtagga accgcctagg gttcctaatt ttttacagag 7500 agagattgtt gttaacaaca atcagaccgt agccgtttgt gatgttctgc caacatattt 7560 gaatcagcac cgggaacggt tagtaggaaa ctgttcaaca tgaattttta ttggaactcg 7620 cctggcaaac acgcacatcc tcgcagcata taaagatgag aaccgcctga ggttcacag 7679 //