ID J02275; SV 1; linear; genomic DNA; STD; VRL; 5149 BP. XX AC J02275; M12520-M12521; M14704; XX DT 15-FEB-1992 (Rel. 31, Created) DT 14-NOV-2006 (Rel. 89, Last updated, Version 5) XX DE Minute virus of mice, complete genome. XX KW alternative splicing; capsid protein; complete genome; KW nonstructural protein. XX OS Minute virus of mice OC Viruses; Parvoviridae; Parvovirinae; Protoparvovirus. XX RN [1] RP 1-5149 RX DOI; 10.1093/nar/11.4.999. RX PUBMED; 6298737. RA Astell C.R., Thomson M., Merchlinsky M., Ward D.C.; RT "The complete DNA sequence of minute virus of mice, an autonomous RT parvovirus"; RL Nucleic Acids Res. 11(4):999-1018(1983). XX RN [2] RP 1-5149 RX PUBMED; 3502703. RA Astell C.R., Gardiner E.M., Tattersall P.; RT "DNA sequence of the lymphotropic variant of minute virus of mice, MVM(i), RT and comparison with the DNA sequence of the fibrotropic prototype strain"; RL J. Virol. 57(2):656-669(1986). XX RN [3] RX PUBMED; 3783817. RA Morgan W.R., Ward D.C.; RT "Three splicing patterns are used to excise the small intron common to all RT minute virus of mice RNAs"; RL J. Virol. 60(3):1170-1174(1986). XX DR MD5; 3d89a3a24846698a0f96e6c27a67aca9. DR EuropePMC; PMC115897; 11222696. DR EuropePMC; PMC2812381; 19955311. DR EuropePMC; PMC296056; 14645570. DR EuropePMC; PMC3078135; 21525999. DR EuropePMC; PMC3255873; 22013064. DR EuropePMC; PMC3486466; 22933276. DR EuropePMC; PMC3807388; 23903839. DR EuropePMC; PMC3838256; 24109231. DR EuropePMC; PMC4118071; 25081268. DR EuropePMC; PMC4178727; 25078696. DR EuropePMC; PMC4178750; 25078698. DR EuropePMC; PMC4254310; 25194919. DR EuropePMC; PMC5850361; 29385689. DR GOA; P0DJZ2. DR PDB; 6CIT; X-ray. DR UniProtKB/Swiss-Prot; P0DJZ2; NS2_MUMIP. XX CC The parvoviridae family cantains two groups that infect mammalian CC hosts: (i) defective (helper-dependent) adeno-associated viruses, CC and (ii) autonomous (helper-independent) parvoviruses. MVM is a CC member of the latter group. Both groups have been demonstrated to CC package both plus and minus strands (in separate particles) of the CC ss-DNA genome, though the minus strand is more typically packaged CC in the latter group. CC The sequence below corresponds to the plus (+) strand, also CC referred to as the C-strand. The minus (-) strand is also referred CC to as the V-strand. CC The 3' and 5' termini both exhibit the potential for forming stable CC 'fold-back' hairpins; these sequences appear to play a role in CC replication [1]. CC The left and right halves of the genome encode two distinct, but CC overlapping transcriptional units. The transcripts can be CC summarized [1] (1 map unit (mu) = 51 bp): CC R1 (4.8 kb): 4.5 mu - 46 mu; 46+ mu - 95 mu CC R2 (3.3 kb): 4.5 mu - 10.7 mu; 38 mu - 46 mu; 46+ mu - 95 mu CC R3 (3.0 kb): 40 mu - 46 mu; 46+ mu - 95 mu CC R3 is the major transcript. CC There are two major open reading frames, both on the plus (or C) CC strand. The left side ORF (261-2279) probably encodes a non-capsid CC protein of 85 kd; the right side ORF probably encodes the viral CC capsid proteins, VP1 (or A, 83 kd), VP2 (or B, 64 kd), and VP3 (or CC C, 61 kd). But because of uncertainties about the precise splice CC points in the transcripts, the exact starts, stops and (possible) CC intron boundaries are not known. CC revision 4804 4870 a-65bp-a in [2]; aa in [1] [2] CC revises [1]. CC [3] sites; splice sites. XX FH Key Location/Qualifiers FH FT source 1..5149 FT /organism="Minute virus of mice" FT /lab_host="mouse l (variant A-9) cell" FT /strain="MVM(p)" FT /mol_type="genomic DNA" FT /db_xref="taxon:10794" FT CDS 114..2279 FT /codon_start=1 FT /gene="NS1" FT /product="nonstructural protein" FT /note="putative" FT /db_xref="GOA:Q84365" FT /db_xref="InterPro:IPR001257" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR021076" FT /db_xref="InterPro:IPR021972" FT /db_xref="InterPro:IPR027417" FT /db_xref="UniProtKB/TrEMBL:Q84365" FT /protein_id="AAA67108.1" FT /translation="MISGSGSLNQGAKRKWAWFKVYKQLLKSVTYLFFHSVSRDAQKES FT NQLTMAGNAYSDEVLGATNWLKEKSNQEVFSFVFKNENVQLNGKDIGWNSYKKELQEDE FT LKSLQRGAETTWDQSEDMEWETTVDEMTKKQVFIFDSLVKKCLFEVLNTKNIFPGDVNW FT FVQHEWGKDQGWHCHVLIGGKDFSQAQGKWWRRQLNVYWSRWLVTACNVQLTPAERIKL FT REIAEDNEWVTLLTYKHKQTKKDYTKCVLFGNMIAYYFLTKKKISTSPPRDGGYFLSSD FT SGWKTNFLKEGERHLVSKLYTDDMRPETVETTVTTAQETKRGRIQTKKEVSIKTTLKEL FT VHKRVTSPEDWMMMQPDSYIEMMAQPGGENLLKNTLEICTLTLARTKTAFDLILEKAET FT SKLTNFSLPDTRTCRIFAFHGWNYVKVCHAICCVLNRQGGKRNTVLFHGPASTGKSIIA FT QAIAQAVGNVGCYNAANVNFPFNDCTNKNLIWVEEAGNFGQQVNQFKAICSGQTIRIDQ FT KGKGSKQIEPTPVIMTTNENITVVRIGCEERPEHTQPIRDRMLNIHLTHTLPGDFGLVD FT KNEWPMICAWLVKNGYQSTMASYCAKWGKVPDWSENWAEPKVPTPINLLGSARSPFTTP FT KSTPLSQNYALTPLASDLEDLALEPWSTPNTPVAGTAETQNTGEAGSKACQDGQLSPTW FT SEIEEDLRACFGAEPLKKDFSEPLNLD" FT mRNA 200..>2279 FT /gene="NS1" FT CDS 261..2279 FT /codon_start=1 FT /gene="NS1" FT /product="nonstructural protein" FT /db_xref="GOA:P03134" FT /db_xref="InterPro:IPR001257" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR021076" FT /db_xref="InterPro:IPR021972" FT /db_xref="InterPro:IPR027417" FT /db_xref="PDB:3WRN" FT /db_xref="PDB:3WRO" FT /db_xref="PDB:3WRQ" FT /db_xref="PDB:3WRR" FT /db_xref="PDB:3WRS" FT /db_xref="PDB:4PP4" FT /db_xref="PDB:4R94" FT /db_xref="UniProtKB/Swiss-Prot:P03134" FT /protein_id="AAA67109.1" FT /translation="MAGNAYSDEVLGATNWLKEKSNQEVFSFVFKNENVQLNGKDIGWN FT SYKKELQEDELKSLQRGAETTWDQSEDMEWETTVDEMTKKQVFIFDSLVKKCLFEVLNT FT KNIFPGDVNWFVQHEWGKDQGWHCHVLIGGKDFSQAQGKWWRRQLNVYWSRWLVTACNV FT QLTPAERIKLREIAEDNEWVTLLTYKHKQTKKDYTKCVLFGNMIAYYFLTKKKISTSPP FT RDGGYFLSSDSGWKTNFLKEGERHLVSKLYTDDMRPETVETTVTTAQETKRGRIQTKKE FT VSIKTTLKELVHKRVTSPEDWMMMQPDSYIEMMAQPGGENLLKNTLEICTLTLARTKTA FT FDLILEKAETSKLTNFSLPDTRTCRIFAFHGWNYVKVCHAICCVLNRQGGKRNTVLFHG FT PASTGKSIIAQAIAQAVGNVGCYNAANVNFPFNDCTNKNLIWVEEAGNFGQQVNQFKAI FT CSGQTIRIDQKGKGSKQIEPTPVIMTTNENITVVRIGCEERPEHTQPIRDRMLNIHLTH FT TLPGDFGLVDKNEWPMICAWLVKNGYQSTMASYCAKWGKVPDWSENWAEPKVPTPINLL FT GSARSPFTTPKSTPLSQNYALTPLASDLEDLALEPWSTPNTPVAGTAETQNTGEAGSKA FT CQDGQLSPTWSEIEEDLRACFGAEPLKKDFSEPLNLD" FT exon 2002..2280 FT /gene="VP" FT /number=1 FT /note="major transcription start site" FT exon 2006..2280 FT /gene="VP" FT /number=1 FT /note="minor transcription start site" FT exon 2009..2280 FT /gene="VP" FT /number=1 FT /note="minor transcription start site" FT intron 2281..2398 FT /gene="VP" FT /note="alternative intron" FT intron 2281..2376 FT /gene="VP" FT /note="alternative intron" FT CDS 2286..2354 FT /codon_start=1 FT /gene="VP" FT /product="unknown protein" FT /note="ORF1; putative" FT /db_xref="UniProtKB/TrEMBL:Q76W05" FT /protein_id="AAA67110.1" FT /translation="MAPPAKRAKRGKGLRDGWLVGY" FT exon <2286..2316 FT /gene="VP" FT /number=1 FT CDS join(2286..2316,2399..4557) FT /codon_start=1 FT /gene="VP1" FT /db_xref="GOA:P03137" FT /db_xref="InterPro:IPR001403" FT /db_xref="InterPro:IPR013607" FT /db_xref="InterPro:IPR016184" FT /db_xref="InterPro:IPR036952" FT /db_xref="PDB:4ZPY" FT /db_xref="UniProtKB/Swiss-Prot:P03137" FT /protein_id="AAA67111.1" FT /translation="MAPPAKRAKRGWVPPGYKYLGPGNSLDQGEPTNPSDAAAKEHDEA FT YDQYIKSGKNPYLYFSAADQRFIDQTKDAKDWGGKVGHYFFRTKRAFAPKLATDSEPGT FT SGVSRAGKRTRPPAYIFINQARAKKKLTSSAAQQSSQTMSDGTSQPDSGNAVHSAARVE FT RAADGPGGSGGGGSGGGGVGVSTGSYDNQTHYRFLGDGWVEITALATRLVHLNMPKSEN FT YCRIRVHNTTDTSVKGNMAKDDAHEQIWTPWSLVDANAWGVWLQPSDWQYICNTMSQLN FT LVSLDQEIFNVVLKTVTEQDLGGQAIKIYNNDLTACMMVAVDSNNILPYTPAANSMETL FT GFYPWKPTIASPYRYYFCVDRDLSVTYENQEGTVEHNVMGTPKGMNSQFFTIENTQQIT FT LLRTGDEFATGTYYFDTNSVKLTHTWQTNRQLGQPPLLSTFPEADTDAGTLTAQGSRHG FT TTQMGVNWVSEAIRTRPAQVGFCQPHNDFEASRAGPFAAPKVPADITQGVDKEANGSVR FT YSYGKQHGENWASHGPAPERYTWDETSFGSGRDTKDGFIQSAPLVVPPPLNGILTNANP FT IGTKNDIHFSNVFNSYGPLTAFSHPSPVYPQGQIWDKELDLEHKPRLHITAPFVCKNNA FT PGQMLVRLGPNLTDQYDPNGATLSRIVTYGTFFWKGKLTMRAKLRANTTWNPVYQVSAE FT DNGNSYMSVTKWLPTATGNMQSVPLITRPVARNTY" FT intron 2317..2398 FT /gene="VP" FT /note="VP intron (alt.)" FT CDS 2332..2361 FT /codon_start=1 FT /gene="VP" FT /product="unknown protein" FT /note="ORF3; putative" FT /db_xref="UniProtKB/TrEMBL:Q76W04" FT /protein_id="AAA67112.1" FT /translation="MVGWWGINV" FT CDS 2354..2398 FT /codon_start=1 FT /gene="VP" FT /product="unknown protein" FT /note="ORF2; putative" FT /db_xref="UniProtKB/TrEMBL:Q76W03" FT /protein_id="AAA67113.1" FT /translation="MFNYLFYRPEITWF" FT exon 2399..>4557 FT /gene="VP1" FT /number=2 FT CDS 2794..4557 FT /codon_start=1 FT /gene="VP1" FT /note="VP2" FT /db_xref="GOA:Q84367" FT /db_xref="InterPro:IPR001403" FT /db_xref="InterPro:IPR016184" FT /db_xref="InterPro:IPR036952" FT /db_xref="PDB:1Z14" FT /db_xref="UniProtKB/TrEMBL:Q84367" FT /protein_id="AAA67114.1" FT /translation="MSDGTSQPDSGNAVHSAARVERAADGPGGSGGGGSGGGGVGVSTG FT SYDNQTHYRFLGDGWVEITALATRLVHLNMPKSENYCRIRVHNTTDTSVKGNMAKDDAH FT EQIWTPWSLVDANAWGVWLQPSDWQYICNTMSQLNLVSLDQEIFNVVLKTVTEQDLGGQ FT AIKIYNNDLTACMMVAVDSNNILPYTPAANSMETLGFYPWKPTIASPYRYYFCVDRDLS FT VTYENQEGTVEHNVMGTPKGMNSQFFTIENTQQITLLRTGDEFATGTYYFDTNSVKLTH FT TWQTNRQLGQPPLLSTFPEADTDAGTLTAQGSRHGTTQMGVNWVSEAIRTRPAQVGFCQ FT PHNDFEASRAGPFAAPKVPADITQGVDKEANGSVRYSYGKQHGENWASHGPAPERYTWD FT ETSFGSGRDTKDGFIQSAPLVVPPPLNGILTNANPIGTKNDIHFSNVFNSYGPLTAFSH FT PSPVYPQGQIWDKELDLEHKPRLHITAPFVCKNNAPGQMLVRLGPNLTDQYDPNGATLS FT RIVTYGTFFWKGKLTMRAKLRANTTWNPVYQVSAEDNGNSYMSVTKWLPTATGNMQSVP FT LITRPVARNTY" FT unsure 3194 FT /gene="VP1" FT /note="c in one clone; g in another clone" FT old_sequence 3521..3527 FT /gene="VP1" FT /citation=[1] FT unsure 3728 FT /gene="VP1" FT /note="g in one clone; a in another clone" FT repeat_region 4740..4804 FT /note="repeat copy A" FT repeat_region 4805..4869 FT /note="repeat copy B" XX SQ Sequence 5149 BP; 1718 A; 1045 C; 1124 G; 1262 T; 0 other; atttttagaa ctgaccaacc atgttcacgt aagtgacgtg atgacgcgcg ctgcgcgcgc 60 gccttcggac gtcacacgtc acttacgttt cacatggttg gtcagttcta aaaatgataa 120 gcggttcagg gagtttaaac caaggcgcga aaaggaagtg ggcgtggttt aaagtatata 180 agcaactact gaagtcagtt acttatcttt tctttcattc tgtgagtcga gacgcacaga 240 aagagagtaa ccaactaacc atggctggaa atgcttactc tgatgaagtt ttgggagcaa 300 ccaactggtt aaaggaaaaa agtaaccagg aagtgttctc atttgttttt aaaaatgaaa 360 atgttcaact gaatggaaaa gatatcggat ggaatagtta caaaaaagag ctgcaggagg 420 acgagctgaa atctttacaa cgaggagcgg aaactacttg ggaccaaagc gaggacatgg 480 aatgggaaac cacagtggat gaaatgacca aaaagcaagt attcattttt gattctttgg 540 ttaaaaaatg tttatttgaa gtgcttaaca caaagaatat atttcctggt gatgttaatt 600 ggtttgtgca acatgaatgg ggaaaagacc aaggctggca ctgccatgta ctaattggag 660 gaaaggactt tagtcaagct caagggaaat ggtggagaag gcaactaaat gtttactgga 720 gcagatggtt ggtaacagcc tgtaatgtgc aactaacacc agctgaaaga attaaactaa 780 gagaaatagc agaagacaat gagtgggtta ctctacttac ttataagcat aagcaaacca 840 aaaaagacta taccaagtgt gttctttttg gaaacatgat tgcttactat tttttaacta 900 aaaagaaaat aagcactagt ccaccaagag acggaggcta ttttcttagc agtgactctg 960 gctggaaaac taacttttta aaagaaggcg agcgccatct agtgagcaaa ctatacactg 1020 atgacatgcg gccagaaacg gttgaaacca cagtaaccac tgcgcaggaa actaagcgcg 1080 gcagaattca aactaaaaaa gaagtttcta ttaaaactac acttaaagag ctggtgcata 1140 aaagagtaac ctcaccagag gactggatga tgatgcagcc agacagttac attgaaatga 1200 tggctcaacc aggtggagaa aacctgctga aaaatacgct agagatttgt acactaactc 1260 tagccagaac caaaacagca tttgacttaa ttttagaaaa agctgaaacc agcaaactaa 1320 ccaacttttc actgcctgac acaagaacct gcagaatttt tgcttttcat ggctggaact 1380 atgttaaagt ttgccatgct atttgctgtg ttttaaacag acaaggaggc aaaagaaata 1440 ctgttttatt tcatggacca gccagcacag gcaaatctat tattgcacaa gccatagcac 1500 aagcagttgg caatgttggt tgctataatg cagccaatgt aaactttcca tttaatgact 1560 gtaccaacaa gaacttgatt tgggtagaag aagctggtaa ctttggacag caagtaaacc 1620 agtttaaagc catttgctct ggtcaaacta ttcgcattga tcaaaaagga aaaggcagca 1680 aacagattga accaacacca gtcatcatga ccacaaatga gaacattaca gtggtcagaa 1740 taggctgcga agaaagacca gaacacactc aaccaatcag agacagaatg cttaacattc 1800 atctaacaca taccttgcct ggtgactttg gtttggttga caaaaatgaa tggcccatga 1860 tttgtgcttg gttggtaaag aatggttacc aatctaccat ggcaagctac tgtgctaaat 1920 ggggcaaagt tcctgattgg tcagaaaact gggcggagcc aaaggtgcca actcctataa 1980 atttactagg ttcggcacgc tcaccattca cgacaccgaa aagtacgcct ctcagccaga 2040 actatgcact aactccactt gcatcggatc tcgaggacct ggctttagag ccttggagca 2100 caccaaatac tcctgttgcg ggcactgcag aaacccagaa cactggggaa gctggttcca 2160 aagcctgcca agatggtcaa ctgagcccaa cttggtcaga gatcgaggag gatttgagag 2220 cgtgcttcgg tgcggaaccg ttgaagaaag acttcagcga gccgctgaac ttggactaag 2280 gtacgatggc gcctccagct aaaagagcta aaagaggtaa gggtttaagg gatggttggt 2340 tggtggggta ttaatgttta attacctgtt ttacaggcct gaaatcactt ggttttaggt 2400 tgggtgcctc ctggctacaa gtacctggga ccagggaaca gccttgacca aggagaacca 2460 accaatccat ctgacgccgc tgccaaagag cacgacgagg cctatgatca atacatcaaa 2520 tctggaaaaa atccttacct gtacttctct gctgctgatc aacgctttat tgaccaaacc 2580 aaggacgcca aagactgggg aggcaaggtt ggtcactact tttttagaac caagcgcgct 2640 tttgcaccta agcttgctac tgactctgaa cctggaactt ctggtgtaag cagagctggt 2700 aaacgcacta gaccacctgc ttacattttt attaaccaag ccagagctaa aaaaaaactt 2760 acttcttctg ctgcacagca aagcagtcaa accatgagtg atggcaccag ccaacctgac 2820 agcggaaacg ctgtccactc agctgcaaga gttgaacgag cagctgacgg ccctggaggc 2880 tctgggggtg ggggctctgg cgggggtggg gttggtgttt ctactgggtc ttatgataat 2940 caaacgcatt atagattctt gggtgacggc tgggtagaaa ttactgcact agcaactaga 3000 ctagtacatt taaacatgcc taaatcagaa aactattgca gaatcagagt tcacaataca 3060 acagacacat cagtcaaagg caacatggca aaagatgatg ctcatgagca aatttggaca 3120 ccatggagct tggtggatgc taatgcttgg ggagtttggc tccagccaag tgactggcaa 3180 tacatttgca acaccatgag ccagcttaac ttggtatcac ttgatcaaga aatattcaat 3240 gtagtgctga aaactgttac agagcaagac ttaggaggtc aagctataaa aatatacaac 3300 aatgacctta cagcttgcat gatggttgca gtagactcaa acaacatttt gccatacaca 3360 cctgcagcaa actcaatgga aacacttggt ttctacccct ggaaaccaac catagcatca 3420 ccatacaggt actatttttg cgttgacaga gatctttcag tgacctacga aaatcaagaa 3480 ggcacagttg aacataatgt gatgggaaca ccaaaaggaa tgaattctca attttttacc 3540 attgagaaca cacaacaaat cacattgctc agaacagggg acgaatttgc cacaggtact 3600 tactactttg acacaaattc agttaaactc acacacacgt ggcaaaccaa ccgtcaactt 3660 ggacagcctc cactgctgtc aacctttcct gaagctgaca ctgatgcagg tacacttact 3720 gctcaaggga gcagacatgg aacaacacaa atgggggtta actgggtgag tgaagcaatc 3780 agaaccagac ctgctcaagt aggattttgt caaccacaca atgactttga agccagcaga 3840 gctggaccat ttgctgcccc aaaagttcca gcagatatta ctcaaggagt agacaaagaa 3900 gccaatggca gtgttagata cagttatggc aaacagcatg gtgaaaattg ggcttcacat 3960 ggaccagcac cagagcgcta cacatgggat gaaacaagct ttggttcagg tagagacacc 4020 aaagatggtt ttattcaatc agcaccacta gttgttccac caccactaaa tggcattctt 4080 acaaatgcaa accctattgg gactaaaaat gacattcatt tttcaaatgt ttttaacagc 4140 tatggtccac taactgcatt ttcacaccca agtcctgtat accctcaagg acaaatatgg 4200 gacaaagaac tagatcttga acacaaacct agacttcaca taactgctcc atttgtttgt 4260 aaaaacaatg cacctggaca aatgttggtt agattaggac caaacctaac tgaccaatat 4320 gatccaaacg gagccacact ttctagaatt gttacatacg gtacattttt ctggaaagga 4380 aaactaacca tgagagcaaa acttagagct aacaccactt ggaacccagt gtaccaagta 4440 agtgctgaag acaatggcaa ctcatacatg agtgtaacta aatggttacc aactgctact 4500 ggaaacatgc agtctgtgcc gcttataaca agacctgttg ctagaaatac ttactaacta 4560 accatgcttt ttctttctgt acttcatata ttattaagac taataaagat acaacataga 4620 aatataatat tacgtataga tttaagaaat agaataatat ggtacttagt aactgttaaa 4680 aataatagaa cctttggaat aacaagatag ttagttggtt aatgttagat agaataagaa 4740 gatcatgtat aatgaataaa agggtggaag ggtggttggt aggttaatgt tagatagaat 4800 aagaagatca tgtataatga ataaaagggt ggaagggtgg ttggtaggta ttcccttaga 4860 cttgatgtta aggaccaaaa aaataataaa acttttttaa aactcaacca agactactgt 4920 ctattcagtg aaccaactga accattagta ttactatgtt tttagggtgg gagggtggga 4980 gatacatgtg ttcgctatga gcgaactggt actggttggt tgctctgctc aaccaaccag 5040 accggcaaag ccggtctggt tggttgagcg caaccaacca gtaccagttc gctcatagcg 5100 aacacatgta tctcccaccc tcccacccta aaaacatagt aatactaat 5149 //