ID U29144; SV 1; linear; genomic DNA; STD; VRL; 3302 BP. XX AC U29144; XX DT 15-AUG-1995 (Rel. 44, Created) DT 04-MAR-2000 (Rel. 63, Last updated, Version 4) XX DE Arctic ground squirrel hepatitis B virus (ASHV), complete genome. XX KW core protein, DNA polymerase, surface antigen, X protein. XX OS Arctic ground squirrel hepatitis B virus OC Viruses; Hepadnaviridae; Orthohepadnavirus. XX RN [1] RP 1-3302 RX PUBMED; 8676441. RA Testut P., Renard C.A., Terradillos O., Vitvitski-Trepo L., Tekaia F., RA Degott C., Blake J., Boyer B., Buendia M.A.; RT "A new hepadnavirus endemic in arctic ground squirrels in Alaska"; RL J. Virol. 70(7):4210-4219(1996). XX RN [2] RP 1-3302 RA Testut P., Buendia M.A.; RT ; RL Submitted (14-JUN-1995) to the INSDC. RL Patrice Testut, Departement des Retrovirus, Institut Pasteur, 28 rue du RL Docteur Roux, 75724 Paris Cedex 15, France XX DR MD5; e3885a90ad084c9907abaefd7c8458cc. DR EuropePMC; PMC3647427; 23631923. DR EuropePMC; PMC4724926; 14760764. DR EuropePMC; PMC5320732; 28222808. DR RFAM; RF01047; HBV_epsilon. XX CC On May 16, 1996 this sequence version replaced gi:939692. XX FH Key Location/Qualifiers FH FT source 1..3302 FT /organism="Arctic ground squirrel hepatitis B virus" FT /host="Spermophilus parryii" FT /mol_type="genomic DNA" FT /note="obtained by direct sequencing of PCR-amplified DNA FT from the liver of an infected Arctic ground squirrel FT (Spermophilus parryii), Fairbanks, Alaska; this species FT name was not published at the time of submission; FT submitter-supplied organism name synonym=Arctic squirrel FT hepatitis virus" FT /db_xref="taxon:41952" FT CDS 1..654 FT /codon_start=1 FT /gene="preC/C" FT /product="e antigen precursor" FT /note="protein has a signal peptide for secretion" FT /db_xref="GOA:Q64896" FT /db_xref="InterPro:IPR002006" FT /db_xref="InterPro:IPR013195" FT /db_xref="InterPro:IPR036459" FT /db_xref="UniProtKB/Swiss-Prot:Q64896" FT /protein_id="AAB08030.1" FT /translation="MYLFHLCLVFACVSCPTVQASKLCLGWLWDMDIDPYKEFGSSYQL FT LNFLPLDFFPELNALVDTATALYEEELTGREHCSPHHTAIRQALVCWEELTRLIAWMSA FT NINSEEVRRVIVAHVNDTWGLKVRQNLWFHLSCLTFGQHTVQEFLVSFGVRIRTPAPYR FT PPNAPILSTLPEHTVIRRRGSARVVRSPRRRTPSPRRRRSQSPRRRPQSPASNC" FT CDS 91..654 FT /codon_start=1 FT /gene="C" FT /product="core protein" FT /note="nucleocapsid" FT /db_xref="GOA:Q64897" FT /db_xref="InterPro:IPR002006" FT /db_xref="InterPro:IPR036459" FT /db_xref="UniProtKB/Swiss-Prot:Q64897" FT /protein_id="AAB08031.1" FT /translation="MDIDPYKEFGSSYQLLNFLPLDFFPELNALVDTATALYEEELTGR FT EHCSPHHTAIRQALVCWEELTRLIAWMSANINSEEVRRVIVAHVNDTWGLKVRQNLWFH FT LSCLTFGQHTVQEFLVSFGVRIRTPAPYRPPNAPILSTLPEHTVIRRRGSARVVRSPRR FT RTPSPRRRRSQSPRRRPQSPASNC" FT CDS 497..3130 FT /codon_start=1 FT /gene="pol" FT /product="polymerase" FT /note="primase/spacer/reverse transcriptase/RNaseH" FT /db_xref="GOA:Q64898" FT /db_xref="InterPro:IPR000201" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001462" FT /db_xref="InterPro:IPR037531" FT /db_xref="UniProtKB/Swiss-Prot:Q64898" FT /protein_id="AAB08032.1" FT /translation="MHPFSQLFRNIQSLGEEEVQELLGPPEDALPLLAGEDLNHRVAGL FT NLQLPTADLDWVHQTNAITGLYSTQTAKFNPEWKQPDFPKIHLSEDLFLNYNNFCGPLT FT VNEKRKLKLNFPARFFPKATKYFPLSKGIKNNYPDFSIEHFFAAATYLWTLWESGILYL FT RKNQTTLTFKGKPYSWGHRQLEQHNGQQHESHLQSRESSSMVASSGHILHKQHASGPSS FT FPTRDLPNNFFGESQKSARTGGSVREKIQTNRLGFPGKSKITIGQQGSSQVSSPRSKSS FT NFRNQTQANHSSWNQRHPTWYSTTSNTTQSRQREETYSSDSAFKRHSPSFEHEKSEPSS FT SGLCGGTESLNNTGTSTFCLWRSFYNTEPCGAYCLHHIVSSLEDWGPCTISGDVTIRSP FT RTPRRITGGVFLVDKHPHNSSESRLVVDFSQFSRGHTRVHWPKFAVLNLQALANLLSTN FT LQWLSLDVSAAFYHIPVSPAAVPHLLVGSPGLERFTPSMSHTTIHGNNSKLQTMHNLCS FT RHLFNSLLLLFKTYGRKLHLLAHPFIMGFRKLPMGVGLSPFLLAQFTSALASMVRRNFP FT HCVAFAYMDDLVLGARTHEHLTAIYSHICSVFLDLGIHLNVAKTKWWGHHLHFMGYVIT FT GAGILPQDKHVQKVSTYLKSIPLNKPLDYKICERLTGILNYVAPFTKCGYAALLPLYQA FT TSRTAFVFSSLYHSWLLSLYAELWPVARQRGVVCSVSDATPTGWGICTTYQLISPTGAF FT ALPIATADVIAACLARCWTGARLLGTDNSVVLSGKLTSYPWLLACVANWILRGTSFCYV FT PSAANPADLPSRGLLPALHPVPTLRFRPQLSRISLWAASPPVSPRRPVRVAWASPVQNS FT EPWFPP" FT CDS 1059..2342 FT /codon_start=1 FT /gene="preS1/preS2/S" FT /product="large envelope protein" FT /db_xref="GOA:Q64899" FT /db_xref="InterPro:IPR000349" FT /db_xref="UniProtKB/TrEMBL:Q64899" FT /protein_id="AAB08033.1" FT /translation="MGNNMKVTFNPEKVAAWWPAVGTYYTNSTPQDPPVFQPGIYQTTS FT LVNPKNQQELEAVLEKRYKQIDWDSLVNQKLPLVSRVPPKSPPQDQRAQTFEIKPRPII FT VPGIRDIPRGIVPPQTPPNRDKGRKPTPQTPPLRDTHPHLNMKNQSPHLQGFAEGLRVL FT TTPEHQHSAYGDPFTTLSPVVPTVSTTLSPPLKIGDPVQSAEMSPSGLLGLLAGLQVVY FT FLWTNILTIAQSLDWWWTSLSFPGGIPECTGQNLQFLTCKHLPTSCPPTCNGFRWMYLR FT RFIIYLLVLLLCLIFLLVLLDWKGLLPVCPIQPSTETTVNCRQCTISAQDTFSTPYCCC FT LKPTAGNCTCWPIPSSWALGSYLWEWALVRFSWLSLLVPLLQWLGGISLTVWLLLIWMT FT WFWGPVLMSILPPFIPIFALFFLIWAYI" FT CDS 1494..2342 FT /codon_start=1 FT /gene="preS2/S" FT /product="middle envelope protein" FT /db_xref="GOA:Q64900" FT /db_xref="InterPro:IPR000349" FT /db_xref="UniProtKB/TrEMBL:Q64900" FT /protein_id="AAB08034.1" FT /translation="MKNQSPHLQGFAEGLRVLTTPEHQHSAYGDPFTTLSPVVPTVSTT FT LSPPLKIGDPVQSAEMSPSGLLGLLAGLQVVYFLWTNILTIAQSLDWWWTSLSFPGGIP FT ECTGQNLQFLTCKHLPTSCPPTCNGFRWMYLRRFIIYLLVLLLCLIFLLVLLDWKGLLP FT VCPIQPSTETTVNCRQCTISAQDTFSTPYCCCLKPTAGNCTCWPIPSSWALGSYLWEWA FT LVRFSWLSLLVPLLQWLGGISLTVWLLLIWMTWFWGPVLMSILPPFIPIFALFFLIWAY FT I" FT CDS 1674..2342 FT /codon_start=1 FT /gene="S" FT /product="small envelope protein" FT /note="surface antigen" FT /db_xref="GOA:Q64901" FT /db_xref="InterPro:IPR000349" FT /db_xref="UniProtKB/TrEMBL:Q64901" FT /protein_id="AAB08035.1" FT /translation="MSPSGLLGLLAGLQVVYFLWTNILTIAQSLDWWWTSLSFPGGIPE FT CTGQNLQFLTCKHLPTSCPPTCNGFRWMYLRRFIIYLLVLLLCLIFLLVLLDWKGLLPV FT CPIQPSTETTVNCRQCTISAQDTFSTPYCCCLKPTAGNCTCWPIPSSWALGSYLWEWAL FT VRFSWLSLLVPLLQWLGGISLTVWLLLIWMTWFWGPVLMSILPPFIPIFALFFLIWAYI FT " FT CDS 2875..3291 FT /codon_start=1 FT /gene="X" FT /product="X protein" FT /db_xref="GOA:Q64902" FT /db_xref="InterPro:IPR000236" FT /db_xref="UniProtKB/Swiss-Prot:Q64902" FT /protein_id="AAB08036.1" FT /translation="MAARLCCQLDSSRDVVLLRPFGSESGGPAVSRPSAGSASRADSPL FT PSAAESHLPLGRLPACFASPSGPCCLGFTCAEFGAMVSTMNFVTWHAKRQLGMPTKDLW FT TPYVRNQLLTKWEEGTIDSRLPLFVLGGCRHKYM" XX SQ Sequence 3302 BP; 830 A; 827 C; 688 G; 957 T; 0 other; atgtatcttt ttcacctgtg ccttgttttt gcctgtgttt catgtcctac tgttcaagcc 60 tccaagctgt gccttggatg gctttgggac atggacatag atccctataa agaatttggt 120 tcatcctacc agttgttgaa ttttcttcct ttggacttct ttcctgaact caatgccttg 180 gtggacactg ctactgctct ctatgaagaa gaattaacag gtagggagca ctgctctcct 240 catcacacag ctatcagaca agctttagtt tgctgggaag aattaacaag attaattgcg 300 tggatgagtg ctaacattaa ttcagaagaa gtaagaagag ttatagttgc tcatgtcaat 360 gacacttggg gacttaaagt taggcagaat ttatggtttc acttatcctg tctgactttt 420 gggcaacaca cagtgcagga atttttagtc agctttggag taaggatcag aactccggct 480 ccttatagac ctcctaatgc acccattctc tcaactcttc cggaacatac agtcattagg 540 agaagaggaa gtgcaagagt tgttaggtcc cccagaagac gcactccctc tcctcgcagg 600 agaagatctc aatcaccgcg tcgcaggcct caatctccag cttccaactg ctgatcttga 660 ttgggtgcat caaactaatg ctataacggg tctttattct actcagacag ctaagtttaa 720 tcctgaatgg aaacaacctg attttccaaa aattcacttg tctgaagatt tatttctaaa 780 ctacaacaat ttttgtggtc ctcttacagt taatgaaaaa aggaaattaa aattaaattt 840 tcctgctagg ttttttccca aggctactaa atattttccc ctttccaaag gaataaaaaa 900 taattatcct gatttctcta tagaacactt ttttgcagct gcaacttatt tatggacttt 960 gtgggaatca ggaatcttgt atttgaggaa aaatcaaact actctcactt ttaagggtaa 1020 accatattct tggggacaca gacagctaga gcaacataat gggcaacaac atgaaagtca 1080 ccttcaatcc agagaaagta gcagcatggt ggccagcagt gggcacatat tacacaaaca 1140 gcacgcctca ggaccctcca gttttccaac cagggattta ccaaacaact tctttggtga 1200 atcccaaaaa tcagcaagaa ctggaggcag tgttagagaa aagatacaaa caaatagatt 1260 gggattccct ggtaaatcaa aaattaccat tggtcagcag ggttcctccc aagtctcctc 1320 cccaagatca aagagctcaa actttcgaaa tcaaacccag gccaatcata gttcctggaa 1380 tcagagacat cccacgtggt atagtaccac ctcaaacacc acccaatcga gacaaaggga 1440 ggaaacctac tcctcagact ccgcctttaa gagacactca ccctcatttg aacatgaaaa 1500 atcagagccc tcatcttcag ggctttgcgg agggactgag agtcttaaca acaccggaac 1560 atcaacattc tgcttatgga gatcctttta caacactgag ccctgtggtg cctactgtct 1620 ccaccacatt gtctcctccc ttgaagattg gggaccctgt acaatcagcg gagatgtcac 1680 catcaggtct cctaggactc ctcgccggat tacaggtggt gtatttcttg tggacaaaca 1740 tcctcacaat agctcagagt ctcgattggt ggtggacttc tctcagtttt ccagggggca 1800 taccagagtg cactggccaa aatttgcagt tcttaacttg caagcacttg ccaacctctt 1860 gtccaccaac ctgcaatggc tttcgctgga tgtatctgcg gcgttttatc atatacctgt 1920 tagtcctgct gctgtgcctc atcttcttgt tggttctcct ggactggaaa ggtttactcc 1980 cagtatgtcc catacaacca tccacggaaa caacagtaaa ttgcagacaa tgcacaatct 2040 ctgctcaaga caccttttca actccttact gttgttgttt aaaacctacg gcaggaaatt 2100 gcacctgttg gcccatccct tcatcatggg ctttaggaag ctacctatgg gagtgggcct 2160 tagtccgttt ctcttggctc agtttactag tgcccttgct tcaatggtta ggaggaattt 2220 ccctcactgt gtggcttttg cttatatgga tgacttggtt ttgggggccc gtactcatga 2280 gcatcttacc gccatttatt cccatatttg ctctgttttt cttgatttgg gcatacattt 2340 aaatgtagcc aaaactaaat ggtggggaca tcatttacat ttcatgggtt atgttattac 2400 tggagcagga attttacccc aagataaaca tgtgcaaaaa gtatcaacat atttgaaatc 2460 cattccactc aacaaacctt tagattataa aatctgtgaa aggttaacag gcattctgaa 2520 ttatgttgct ccttttacta aatgtggtta tgctgctctc cttcctttgt atcaagctac 2580 ttcgcgtacg gcatttgtgt tttcttctct ctaccacagc tggttgctgt ccctttatgc 2640 tgagttgtgg cctgttgcca ggcaacgtgg cgtggtgtgc tctgtgtctg acgcaacccc 2700 cactggttgg ggcatttgca ccacctatca actcatttcc ccgacgggcg cttttgccct 2760 gccgatcgcc accgcggacg tcatcgccgc ctgccttgct cgctgctgga caggagctcg 2820 gctgttgggc actgacaact ccgtggttct ttcgggcaaa ctgacttcct atccatggct 2880 gctcgcctgt gttgccaact ggattcttcg cgggacgtcg ttctgctacg tcccttcggc 2940 agcgaatccg gcggacctgc cgtctcgagg ccttctgccg gctctgcatc ccgtgccgac 3000 tctccgcttc cgtccgcagc tgagtcgcat ctccctttgg gccgcctccc cgcctgtttc 3060 gcctcgccgt ccggtccgtg ttgcctgggc ttcacctgtg cagaattcgg agccatggtt 3120 tccaccatga actttgtcac ttggcatgca aaacgccaac tgggcatgcc aacaaaggac 3180 ctttggactc cttatgtaag aaatcaatta ttaaccaaat gggaggaggg tactattgat 3240 tctagattac cactgtttgt attagggggc tgtaggcata aatacatgta actgccgcaa 3300 tc 3302 //