ID M23122; SV 1; circular; genomic DNA; STD; VRL; 5098 BP. XX AC M23122; XX DT 23-NOV-1989 (Rel. 21, Created) DT 17-APR-2005 (Rel. 83, Last updated, Version 7) XX DE BK polyomavirus strain AS, complete genome. XX KW agnoprotein; complete genome; large T-antigen protein; KW major capsid protein; minor capsid protein; small T-antigen protein; KW T-antigen protein; VP1; VP2 protein; VP3 protein. XX OS BK polyomavirus OC Viruses; dsDNA viruses, no RNA stage; Polyomaviridae; Polyomavirus. XX RN [1] RP 1-5098 RX PUBMED; 2536111. RA Tavis J.E., Walker D.L., Gardner S.D., Frisque R.J.; RT "Nucleotide sequence of the human polyomavirus AS virus, an antigenic RT variant of BK virus"; RL J. Virol. 63(2):901-911(1989). XX CC Draft entry and computer-readable sequence for [1] kindly submitted CC by J.E.Tavis, 15-MAR-1989. The early genes are the small and large CC T-antigen, while the late genes include the agnogene, and the major CC and minor capsid proteins (VP1, and VP2,3). It appears that this CC may not be a distinct human polyomavirus but rather an antigenic CC variant of BKV. The author suggest that its name be changed from CC ASV to BKV(AS) to better reflect this relationship. XX FH Key Location/Qualifiers FH FT source 1..5098 FT /organism="BK polyomavirus" FT /strain="AS" FT /mol_type="genomic DNA" FT /db_xref="taxon:10629" FT rep_origin 1 FT /note="center of origin of replication" FT CDS 211..435 FT /codon_start=1 FT /product="agnoprotein" FT /db_xref="GOA:P14998" FT /db_xref="InterPro:IPR002643" FT /db_xref="UniProtKB/Swiss-Prot:P14998" FT /protein_id="AAA46879.1" FT /translation="MFCEPKNLVVLRQLSRQASVKVGKTWTGTKKRAQRIFIFILELLL FT EFCRGEDSVDGKNKSTTALPAVKDSVKDS" FT CDS 470..1525 FT /codon_start=1 FT /product="minor capsid protein VP2" FT /db_xref="GOA:P14997" FT /db_xref="InterPro:IPR001070" FT /db_xref="UniProtKB/Swiss-Prot:P14997" FT /protein_id="AAA46880.1" FT /translation="MGAALALLGDLVASVSEAAAATGFSVAEIAAGEAAAAIEVQIASL FT ATVEGITTTSEAIAAIGLTPQTYAVIAGAPGAIAGFAALIQTVTGISSLAQVGYRFFSD FT WDHKVSTVGLYQQSGMALELFNPDEYYDILFPGVNTFVNNIQYLDPRHWGPSLFATISQ FT ALWHVIRDDIPAITSQELQRRTERFFRDSLARFLEETTWTIVNAPINFYNYIQDYYSNL FT SPIRPSMVRQVAEREGTHVNFGHTYSIDNADSIEEVTQRMDLRNKESVHSGEFIEKTIA FT PGGANQRTAPQWMLPLLLGLYGTVTPALEAYEDGPNQKKRRVSRGSSQKAKGTRASAKT FT TNKRRSRSSRS" FT CDS 827..1525 FT /codon_start=1 FT /product="minor capsid protein VP3" FT /db_xref="GOA:P14997" FT /db_xref="InterPro:IPR001070" FT /db_xref="UniProtKB/Swiss-Prot:P14997" FT /protein_id="AAA46881.1" FT /translation="MALELFNPDEYYDILFPGVNTFVNNIQYLDPRHWGPSLFATISQA FT LWHVIRDDIPAITSQELQRRTERFFRDSLARFLEETTWTIVNAPINFYNYIQDYYSNLS FT PIRPSMVRQVAEREGTHVNFGHTYSIDNADSIEEVTQRMDLRNKESVHSGEFIEKTIAP FT GGANQRTAPQWMLPLLLGLYGTVTPALEAYEDGPNQKKRRVSRGSSQKAKGTRASAKTT FT NKRRSRSSRS" FT CDS 1410..2498 FT /codon_start=1 FT /product="major capsid protein VP1" FT /db_xref="GOA:P14996" FT /db_xref="InterPro:IPR000662" FT /db_xref="InterPro:IPR011222" FT /db_xref="UniProtKB/Swiss-Prot:P14996" FT /protein_id="AAA46882.1" FT /translation="MAPTKRKGECPGAAPKKPKEPVQVPKLLIKGGVEVLEVKTGVDAI FT TEVECFLNPEMGDPDDNLRGYSQHLSAENAFESDSPDRKMLPCYSTARIPLPNLNEDLT FT CGNLLMWEAVTVKTEVIGITSMLNLHAGSQKVHENGGGKPVQGSNFHFFAVGGDPLEMQ FT GVLMNYRTKYPQGTITPKNPTAQSQVMNTDHKAYLDKNNAYPVECWIPDPSRNENTRYF FT GTYTGGENVPPVLHVTNTATTVLLDEQGVGPLCKADSLYVSAADICGLFTNSSGTQQWR FT GLARYFKIRLRKRSVKNPYPISFLLSDLINRRTQKVDGQPMYGMESQVEEVRVFDGTEQ FT LPGDPDMIRYIDRQGQLQTKMV" FT CDS complement(join(2568..4400,4747..4989)) FT /codon_start=1 FT /product="large T-antigen protein" FT /db_xref="GOA:P14999" FT /db_xref="InterPro:IPR001623" FT /db_xref="InterPro:IPR003133" FT /db_xref="InterPro:IPR010932" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR016392" FT /db_xref="InterPro:IPR017910" FT /db_xref="UniProtKB/Swiss-Prot:P14999" FT /protein_id="AAA46878.1" FT /translation="MDKVLNREESMELMDLLGLERAAWGNLPLMRKAYLKKCKEFHPDK FT GGDEDKMKRMNTLYKKMEQDVKVAHQPDFGTWNSSEVPTYGTEEWESWWSSFNEKWDED FT LFCHEDMFASDEEATADSQHSTPPKKKRKVEDPKDFPSDLHQFLSQAVFSNRTLACFAV FT YTTKEKAQILYKKLMEKYSVTFISRHMCAGHNIIFFLTPHRHRVSAINNFCQKLCTFSF FT LICKGVNKEYLLYSALTRDPYHIIEESIQGGLKEHDFNPEEPEETKQVSWKLITEYAVE FT TKCEDVFLLLGMYLEFQYNVEECKKCQKKDQPYHFKYHEKHFANAIIFAESKNQKSICQ FT QAVDTVLAKKRVDTLHMTREEMLTERFNHILDKMDLIFGAHGNAVLEQYMAGVAWLHCL FT LPKMDSVIFDFLHCVVFNVPKRRYWLFKGPIDSGKTTLAAGLLDLCGGKALNVNLPMER FT LTFELGVAIDQYMVVFEDVKGTGAESKDLPSGHGINNLDSLRDYLDGSVKVNLEKKHLN FT KRTQIFPPGLVTMNEYPVPKTLQARFVRQIDFRPKIYLRKSLQNSEFLLEKRILQSGMT FT LLLLLIWFRPVADFSKDIQSRIVEWKERLDSEISMYTFSRMKYNICMGKCILDITREED FT SETEDSGHGSSTESQSQCSSQVSDTSAPDSENPHSQELHLCKGFQCFKRPKTPPPK" FT intron complement(4401..4746) FT /note="large T-antigen protein intron" FT CDS complement(4471..4989) FT /codon_start=1 FT /product="small T-antigen protein" FT /db_xref="GOA:P15000" FT /db_xref="InterPro:IPR001623" FT /db_xref="InterPro:IPR003354" FT /db_xref="UniProtKB/Swiss-Prot:P15000" FT /protein_id="AAA46883.1" FT /translation="MDKVLNREESMELMDLLGLERAAWGNLPLMRKAYLKKCKEFHPDK FT GGDEDKMKRMNTLYKKMEQDVKVAHQPDFGTWNSSEVCADFPLCPDTLYCKEWPICSKK FT PSVHCPCMLCQLRLRHLNRKFLRKEPLVWIDCYCIDCFTQWFGLDLTEETLQWWVQIIG FT ETPFRDLKL" XX SQ Sequence 5098 BP; 1542 A; 987 C; 993 G; 1576 T; 0 other; gcctcggcct cttatatatt ataaaaaaaa aggccacagg gaggagctgc ttacccatgg 60 aatgcagcca aaccatgacc tcaagaagca agtgcatgac tgggcagcca gccagtggca 120 gttaatagtg aaaccccgcc cctaacattc tcaaataaac acaagaggaa gtggaaactg 180 tccaaaggag tggaaagcag ccagacagac atgttttgcg agcctaagaa tcttgtggtt 240 ttgcgccagc tgtcacgaca agcttcagtg aaagttggta aaacctggac tggaactaaa 300 aaaagagctc agaggatttt tatttttatt ttagagcttt tgctggaatt ttgtagaggt 360 gaagacagtg tagacgggaa aaacaaaagt accactgctt tacctgctgt aaaagactct 420 gtaaaagact cctaggtaag taatgctttt tttttgtatt ttcaggttga tgggtgctgc 480 tctagcactt ttgggggacc tagttgccag tgtatctgag gctgctgctg ccacaggatt 540 ttcagtggct gaaattgctg ctggggaggc tgctgctgcc atagaagttc aaattgcatc 600 ccttgctact gtagagggca taacaactac ctcagaggct atagctgcta taggcctaac 660 acctcaaaca tatgctgtaa ttgctggtgc tccaggggct attgctgggt ttgctgcttt 720 aattcaaact gttactggta ttagttcttt ggctcaagta gggtataggt tttttagtga 780 ttgggatcac aaagtttcca ctgtaggcct ttatcagcaa tcaggcatgg ctttggaatt 840 gtttaaccca gatgagtact atgatatttt gtttcctggt gtaaatactt ttgttaataa 900 tattcaatat ctagatccta ggcattgggg tccttctttg tttgctacta tttcccaggc 960 tttgtggcat gttattagag atgatatacc tgctataact tcacaagaat tgcaaaggag 1020 aacagagaga ttttttaggg actctttggc tagatttttg gaagaaacca cctggacaat 1080 tgtaaatgcc cccataaact tttataatta tattcaggat tattattcta atttgtcccc 1140 tattaggcct tcaatggtta ggcaagtagc tgaaagggaa ggtacccatg taaattttgg 1200 ccatacctac agcatagata atgctgacag tatagaagaa gttacccaaa gaatggattt 1260 aagaaataag gaaagtgtac attcaggaga gtttatagaa aaaactattg ccccaggagg 1320 tgctaatcaa agaactgctc ctcaatggat gttgcctttg cttctaggcc tgtacgggac 1380 tgtaacacct gctcttgaag catatgaaga tggccccaac caaaagaaaa ggagagtgtc 1440 caggggcagc tcccaaaaag ccaaaggaac ccgtgcaagt gccaaaacta ctaataaaag 1500 gaggagtaga agttctagaa gttaaaactg gggtagatgc tataacagag gtagaatgct 1560 ttctaaaccc agaaatgggg gatccagatg ataaccttag gggctatagt cagcacctaa 1620 gtgctgaaaa tgcctttgag agtgatagcc cagacagaaa aatgcttcct tgttacagta 1680 cagcaagaat tccactgccc aacctaaatg aggacctaac ctgtggaaat ctactaatgt 1740 gggaggctgt aactgtaaaa acagaggtta ttggaataac tagcatgctt aaccttcatg 1800 cagggtccca aaaagttcat gagaatggtg gaggtaaacc tgtccaaggc agtaatttcc 1860 acttttttgc tgtgggtgga gaccccttgg aaatgcaggg agtgctaatg aattacagaa 1920 caaagtaccc acaaggtact ataaccccta aaaaccctac agctcagtcc caggtaatga 1980 atactgatca taaggcctat ttggacaaaa acaatgctta tccagttgag tgctggattc 2040 ctgatcctag tagaaatgaa aatactaggt attttggaac ttacacagga ggggaaaatg 2100 ttcctccagt acttcatgtt accaacacag ctaccacagt gttgctggat gaacagggtg 2160 tggggcctct gtgtaaagct gatagcctgt atgtttcagc tgctgatatt tgtgggctgt 2220 ttactaacag ctctgggaca caacagtgga gaggccttgc aagatatttt aagattcgcc 2280 tgagaaaaag atctgtgaag aatccttacc caatttcctt tttgctaagt gaccttataa 2340 acaggagaac ccaaaaagtg gatgggcagc ctatgtatgg tatggaatct caggttgagg 2400 aggtaagggt gtttgatggc acagaacagc ttccagggga cccagatatg ataagatata 2460 ttgacagaca aggacaattg caaacaaaaa tggtttaaac aggtgcttta ttgtacatat 2520 atatgcttaa taaatgctgc ttttgtataa cacagttgaa gcttctgtta ttttgggggt 2580 ggtgttttag gccttttaaa acactgaaag cctttacaca aatgtaactc ttggctgtga 2640 gggttttctg aatcaggggc tgaagtatct gagacttggg aagagcattg tgattgggat 2700 tcagtgcttg atccatgtcc agagtcttca gtttctgaat cttcttctct tgtaatatca 2760 agaatacatt ttcccatgca tatattatat ttcatccttg aaaaagtata catacttatc 2820 tcagaatcca gcctttcctt ccattcaaca attctagact gtatatcttt tgaaaaatca 2880 gctacaggcc taaaccaaat tagtagtagc aaaagggtca ttccactttg taatattctt 2940 ttttcaagta aaaactcaga gttttgcagg gactttctta aatatatttt gggtctaaaa 3000 tctatctgtc ttacaaatct agcctgaaga gttttaggga caggatactc attcattgta 3060 actaaccctg gtggaaatat ttgtgttctt ttgtttaaat gtttcttttc taaattaacc 3120 ttaacacttc catctagata atccctcaaa ctgtctaaat tgtttattcc atgtcctgaa 3180 ggcaaatcct ttgattcagc tcctgtccct tttacatctt caaaaacaac catgtactga 3240 tcaatagcca cacccagttc aaaagttagc ctttccatgg gtaaatttac atttaaagct 3300 ttacctccac ataagtctaa taaccctgca gctaaggttg ttttgccact atcaattgga 3360 cctttaaata accagtatct tcttttaggt acattaaaaa caacacagtg aagaaaatca 3420 aaaataacag aatccatttt aggtagcaaa caatgtagcc aagcaacccc tgccatatat 3480 tgttctagta cagcatttcc atgagctcca aatattaaat ccattttatc taatatatga 3540 ttaaatcttt ctgttagcat ttcttccctg gtcatatgaa gggtatctac tcttttttta 3600 gctaatactg tatctactgc ttgctgacaa atactttttt gatttttact ttctgcaaaa 3660 ataatagcat ttgcaaaatg cttttcatga tacttaaagt ggtaaggttg atcttttttt 3720 tgacactttt tacactcctc tacattgtat tgaaattcta aatacatacc caataataaa 3780 aacacatcct cacactttgt ttctactgca tattcagtaa ttaatttcca agacacctgc 3840 tttgtttctt caggctcctc tgggttaaag tcatgctcct ttaagccccc ttgaatgctt 3900 tcctctatta tatggtatgg atccctagtt aaggcactgt atagtaagta ttccttatta 3960 acacccttac aaattaaaaa actaaaagta cacagctttt gacagaaatt attaattgca 4020 gaaactctat gtctatgtgg agttaaaaag aatataatat tatgaccagc acacatgtgt 4080 ctactgataa aagttacaga atatttttcc ataagttttt tatacagaat ttgagctttt 4140 tctttagtgg tatacacagc aaaacaggca agtgttctat tactaaatac agcttgacta 4200 agaaactggt gtagatcaga gggaaagtct ttagggtctt ctacctttct tttttttttg 4260 ggtggtgttg agtgttggga atctgctgtt gcttcttcat cactggcaaa catatcctca 4320 tggcagaata aatcttcatc ccatttttca ttaaaggagc tccaccagga ctcccactct 4380 tctgttccat aggttggcac ctataaaaaa aaaataatta cttagggtct tcttttaatt 4440 tactactttt ctaaatataa attagttacc ttaaagcttt agatctctga agggagtttc 4500 tccaattatt tggacccacc attgcagggt ttcttcagtg aggtctaagc caaaccactg 4560 tgtgaagcaa tcaatgcagt agcaatctat ccaaaccaat ggctcttttc ttaaaaattt 4620 tctatttaaa tgccttaatc ttagctgaca tagcatgcaa gggcaatgca ctgaaggctt 4680 tttggaacaa ataggccatt ccttgcagta caaagtatct gggcaaagag gaaaatcagc 4740 acaaacctct gagctattcc aggttccaaa atcaggctga tgagctacct ttacatcctg 4800 ctccattttt ttatataaag tattcattct cttcatttta tcctcgtcgc cccctttgtc 4860 agggtgaaat tccttacact tttttaaata ggcttttctc attaagggaa ggtttcccca 4920 ggcagctctt tcaaggccta aaaggtccat gagctccatg gattcttccc tgtttaagac 4980 tttatccatt tttgcaaaaa attgcaaaag aatagggatt tccccaaata gttttgctag 5040 gcctcagaaa aagcctccac acccttacta cttcagagaa agggtggagg cagaggcg 5098 //