Dbfetch

ID   X04615; SV 1; circular; genomic DNA; STD; VRL; 3215 BP.
XX
AC   X04615;
XX
DT   07-JUN-1987 (Rel. 12, Created)
DT   17-NOV-2004 (Rel. 81, Last updated, Version 6)
XX
DE   Hepatitis B virus genome, subtype ayr
XX
KW   C gene; circular; core antigen; DNA polymerase; e antigen; genome;
KW   overlapping genes; P gene; S gene; surface antigen; X gene.
XX
OS   Hepatitis B virus
OC   Viruses; Hepadnaviridae; Orthohepadnavirus.
XX
RN   [1]
RP   1-3215
RX   PUBMED; 3783127.
RA   Okamoto H., Imai M., Shimozaki M., Hoshi Y., Iizuka H., Gotanda T.,
RA   Tsuda F., Miyakawa Y., Mayumi M.;
RT   "Nucleotide Sequence of a Cloned Hepatitis B Virus Genome ,Subtype
RT   ayr:Comparison with Genomes of the Other Three Subtypes";
RL   J. Gen. Virol. 67:2305-2314(1986).
XX
DR   MD5; 1a9e5d598b3a8cc55c540376d3a90948.
DR   EuropePMC; PMC1187008; 11322169.
DR   EuropePMC; PMC1352377; 16401352.
DR   EuropePMC; PMC1395421; 16501106.
DR   EuropePMC; PMC1773865; 14570734.
DR   EuropePMC; PMC1774609; 15951551.
DR   EuropePMC; PMC1829020; 17182753.
DR   EuropePMC; PMC20452; 9576957.
DR   EuropePMC; PMC26492; 10677515.
DR   EuropePMC; PMC2729789; 19187787.
DR   EuropePMC; PMC3116493; 21569595.
DR   EuropePMC; PMC3122685; 21411575.
DR   EuropePMC; PMC3842526; 24348634.
DR   EuropePMC; PMC3970604; 23903967.
DR   EuropePMC; PMC400381; 15113905.
DR   EuropePMC; PMC4202270; 25028473.
DR   EuropePMC; PMC4539793; 26300928.
DR   EuropePMC; PMC4543549; 26288093.
DR   EuropePMC; PMC4971884; 24759937.
DR   EuropePMC; PMC5013995; 27031749.
DR   EuropePMC; PMC5308760; 27486882.
DR   EuropePMC; PMC5368141; 28332361.
DR   EuropePMC; PMC5370826; 27537600.
DR   EuropePMC; PMC5459465; 28582431.
DR   EuropePMC; PMC5656792; 28640739.
DR   EuropePMC; PMC5739625; 29285238.
DR   EuropePMC; PMC5788064; 29267855.
DR   EuropePMC; PMC5841821; 29474353.
DR   EuropePMC; PMC6160064; 30235666.
DR   EuropePMC; PMC6192773; 30132518.
DR   EuropePMC; PMC6354056; 30518174.
DR   EuropePMC; PMC6389180; 29486716.
DR   EuropePMC; PMC6542474; 25599103.
DR   GOA; P0C767.
DR   InterPro; IPR013195; Hepatitis_B_virus_capsid_N.
DR   InterPro; IPR036459; Viral_capsid_core_dom_sf_HBV.
DR   RFAM; RF01047; HBV_epsilon.
DR   UniProtKB/Swiss-Prot; P0C767; HBEAG_HBVCJ.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3215
FT                   /organism="Hepatitis B virus"
FT                   /strain="ayr"
FT                   /mol_type="genomic DNA"
FT                   /db_xref="taxon:10407"
FT   CDS             <1..1623
FT                   /note="P gene product (AA 304-843); circular molecule"
FT                   /db_xref="GOA:Q69028"
FT                   /db_xref="InterPro:IPR000201"
FT                   /db_xref="InterPro:IPR000477"
FT                   /db_xref="InterPro:IPR001462"
FT                   /db_xref="InterPro:IPR037531"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q69028"
FT                   /protein_id="CAA28286.1"
FT                   /translation="LHNIPPSSARSQSEGPIFSCWWLQFRNSKPCSDYCLTHIVNLLED
FT                   WGPCTEHGEHNIRIPRTPARVTGGVFLVDKNPHNTTESRLVVDFSQFSRGSTHVSWPKF
FT                   AVPNLQSLTNLLSSNLSWLSLDVSAAFYHIPLHPAAMPHLLVGSSGLPRYVARLSSTSR
FT                   NINYQHGTMQNLHDSCSRNLYVSLLLLYKTFGRKLHLYSHPIILGFRKIPMGVGLSPFL
FT                   LAQFTSAICSVVRRAFPHCLAFSYMDDVVLGAKSVQHLESLFTSITNFLLSLGIHLNPN
FT                   KTKRWGYSLNFMGYVIGSWGTLPQEHIVQKLKQCFRKLPVNRPIDWKVCQRIVGLLGFA
FT                   APFTQCGYPALMPLYACIQSKQAFTFSPTYKAFLCKQYLNLYPVARQRSGLCQVFADAT
FT                   PTGWGLAIGHRRMRGTFVAPLPIHTAELLAACFARSRSGAKLIGTDNSVVLSRKYTSFP
FT                   WLLGCAANWILRGTSFVYVPSALNPADDPSRGRLGLYRPLLHLPFRPTTGRTSLYAVSP
FT                   SVPSHLPDRVHFASPLHVAWRPP"
FT   CDS             155..835
FT                   /note="S gene product, put.surface antigen (AA 1-226)"
FT                   /db_xref="GOA:Q76R62"
FT                   /db_xref="InterPro:IPR000349"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q76R62"
FT                   /protein_id="CAA28287.1"
FT                   /translation="MESTTSGFLGPLLVLQAGFFLLTRILTIPQSLDSWWTSLNFLGGA
FT                   PTCPGQNSQSPTSNHSPTSCPPTCPGYRWMCLRRFIIFLFILLLCLIFLLVLLDYQGML
FT                   PVCPLLPGTSTTSTGPCRTCTIPAQGTSMFPSCCCTKPSDGNCTCIPIPSSWAFARFLW
FT                   EWASVRFSWLSLLVPFVQWFVGLSPTVWLSAIWMMWYWGPSLYNILSPFLPLLPIFFCL
FT                   WVYI"
FT   CDS             1374..1838
FT                   /note="X gene product, (AA 1-154)"
FT                   /db_xref="GOA:Q69027"
FT                   /db_xref="InterPro:IPR000236"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q69027"
FT                   /protein_id="CAA28288.1"
FT                   /translation="MAARLCCQLDPARDVLCLRPVGAESRGRPVSGPFGPLPSPSSSAV
FT                   PADHGAHLSLRGLPVCAFSSAGPCALRFTSARSMETTVNAHQVLPKVLHKRTLGLSAMS
FT                   TTDLEAYFKDCLFKDWEELGEEIRLKVFVLGGCRHKLVCSPAPCNFFPSA"
FT   misc_feature    1804..1806
FT                   /note="translation initiation codon of pre-C region"
FT   CDS             1901..2452
FT                   /note="C gene product, core and e antigens, (AA 1-183)"
FT                   /db_xref="GOA:Q76R61"
FT                   /db_xref="InterPro:IPR002006"
FT                   /db_xref="InterPro:IPR036459"
FT                   /db_xref="PDB:5E00"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q76R61"
FT                   /protein_id="CAA28289.1"
FT                   /translation="MDIDPYKEFGASVELLSFLPSDFFPSIRDLLDTASALYREALESP
FT                   EHCSPHHTALRQAILCWGELMNLATWVGSNLEDPASRELVVSYVNVNMGLKIRQLLWFH
FT                   ISCLTFGRETVLEYLVSFGVWIRTPPAYRPPNAPILSTLPETTVVRRRGRSPRRRTPSP
FT                   RRRRSQSPRRRRSQSRESQC"
FT   CDS             2307..>3215
FT                   /note="P gene product, put.DNA polymerase (AA 1-303)"
FT                   /db_xref="GOA:Q69028"
FT                   /db_xref="InterPro:IPR000201"
FT                   /db_xref="InterPro:IPR000477"
FT                   /db_xref="InterPro:IPR001462"
FT                   /db_xref="InterPro:IPR037531"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q69028"
FT                   /protein_id="CAA28290.1"
FT                   /translation="MPLSYQHFRKLLLLDDEAGPLEEELPRLADEGLNRRVAEDLNLGN
FT                   LNVSIPWTHKVGNFTGLYSSTVPVFNPDWKTPSFPHIHLQEDIINRCQQYVGPLTVNEK
FT                   RRLKLIMPARFYPNLTKYLPLDKGIKPYYPEYAVNHYFKTRHYLHTLWKAGILYKRETT
FT                   RSASFCGSPYSWEQELQHGRLVFQTSTRHGDESFCSQSSGILSRSPVGPCVRSQLKQSR
FT                   LGLQPQQGSLARGKSGRSGSIWSRVHPTTRRPFGVEPSGSGHIDNTASSTSSCLHQSAV
FT                   RKTAYSHLSTSKRQSSSGHAVE"
FT   misc_feature    2713..2715
FT                   /note="translation start codon of pre-S (O) region"
FT   misc_feature    2848..2850
FT                   /note="translation start codon of pre-S(1) region"
FT   misc_feature    3205..3207
FT                   /note="translation start codon of pre-S (2) region"
XX
SQ   Sequence 3215 BP; 725 A; 874 C; 710 G; 906 T; 0 other;
     ctccacaaca ttccaccaag ctctgctaga tcccagagtg aggggcctat attttcctgc        60
     tggtggctcc agttccggaa cagtaaaccc tgttccgact actgcctcac ccatatcgtc       120
     aatcttctcg aggactgggg accctgcacc gaacatggag agcacaacat caggattcct       180
     aggacccctg ctcgtgttac aggcggggtt tttcttgttg acaagaatcc tcacaatacc       240
     acagagtcta gactcgtggt ggacttctct caattttcta gggggagcac ccacgtgtcc       300
     tggccaaaat tcgcagtccc caacctccaa tcactcacca acctcttgtc ctccaacttg       360
     tcctggctat cgctggatgt gtctgcggcg ttttatcata ttcctcttca tcctgctgct       420
     atgcctcatc ttcttgttgg ttcttctgga ctaccaaggt atgttgcccg tttgtcctct       480
     acttccagga acatcaacta ccagcacggg accatgcaga acctgcacga ttcctgctca       540
     aggaacctct atgtttccct cttgttgctg tacaaaacct tcggacggaa actgcacttg       600
     tattcccatc ccatcatcct gggctttcgc aagattccta tgggagtggg cctcagtccg       660
     tttctcctgg ctcagtttac tagtgccatt tgttcagtgg ttcgtagggc tttcccccac       720
     tgtttggctt tcagctatat ggatgatgtg gtattggggg ccaagtctgt acaacatctt       780
     gagtcccttt ttacctctat taccaatttt cttttgtctt tgggtataca tttgaaccct       840
     aataaaacca aacgttgggg ctactccctt aacttcatgg gatatgtaat tggaagttgg       900
     ggtactttac cgcaggaaca tattgtacaa aaactcaagc aatgttttcg aaaattgcct       960
     gtaaatagac ctattgattg gaaagtatgt caaagaattg tgggtctttt gggctttgct      1020
     gcccctttta cacaatgtgg ctatcctgcc ttgatgcctt tatatgcatg tatacaatct      1080
     aagcaggctt tcactttctc gccaacttac aaggcctttc tgtgtaaaca atatctaaac      1140
     ctttaccccg ttgcccggca acggtcaggt ctctgccaag tgtttgctga cgcaaccccc      1200
     acgggttggg gcttggccat aggccatcgg cgcatgcgtg gaacctttgt ggctcctctg      1260
     ccgatccata ctgcggaact cctagcagct tgttttgctc gcagccggtc tggagcgaaa      1320
     cttatcggaa ccgacaactc agttgtcctc tctcggaaat acacctcctt tccatggctg      1380
     ctaggctgtg ctgccaactg gatcctgcgc gggacgtcct ttgtctacgt cccgtcggcg      1440
     ctgaatcccg cggacgaccc gtctcggggc cgtttgggcc tctaccgtcc ccttcttcat      1500
     ctgccgttcc ggccgaccac ggggcgcacc tctctttacg cggtctcccc gtctgtgcct      1560
     tctcatctgc cggaccgtgt gcacttcgct tcacctctgc acgtagcatg gagaccaccg      1620
     tgaacgccca ccaggtcttg cccaaggtct tacacaagag gactcttgga ctctcagcaa      1680
     tgtcaacgac cgaccttgag gcatacttca aagactgttt gtttaaagac tgggaggagt      1740
     tgggggagga gattaggtta aaggtctttg tactaggagg ctgtaggcat aaattggtct      1800
     gttcaccagc accatgcaac tttttcccct ctgcctaatc atctcatgtt catgtcctac      1860
     tgttcaagcc tccaagctgt gccttgggtg gctttggggc atggacattg acccgtataa      1920
     agaatttgga gcttctgtgg agttactctc ttttttgcct tctgacttct ttccttctat      1980
     tcgagatctc ctcgacaccg cctctgctct gtatcgggag gccttagagt ctccggaaca      2040
     ttgttcacct caccatacag cactcaggca agctattctg tgttggggtg agttgatgaa      2100
     tctggccacc tgggtgggaa gtaatttgga agacccagca tccagggaat tagtagtcag      2160
     ctatgtcaat gttaatatgg gcctaaaaat tagacaacta ttgtggtttc acatttcctg      2220
     ccttactttt ggaagagaaa ctgtccttga gtatttggtg tcttttggag tgtggattcg      2280
     cactcctccc gcttacagac caccaaatgc ccctatctta tcaacacttc cggaaactac      2340
     tgttgttaga cgacgaggca ggtcccctag aagaagaact ccctcgcctc gcagacgaag      2400
     gtctcaatcg ccgcgtcgca gaagatctca atctcgggaa tctcaatgtt agtatccctt      2460
     ggactcataa ggtgggaaac tttactgggc tttattcttc tactgtacct gtctttaatc      2520
     ctgattggaa aactccctcc tttcctcaca ttcatttaca ggaggacatt attaatagat      2580
     gtcaacaata tgtgggccct ctgacagtta atgaaaaaag gagattaaaa ttaattatgc      2640
     ctgctaggtt ctatcctaac cttaccaaat atttgccctt ggacaaaggc attaaaccgt      2700
     attatcctga atatgcagtt aatcattact tcaaaactag gcattattta catactctgt      2760
     ggaaggctgg cattctatat aagagagaaa ctacacgcag cgcctcattt tgtgggtcac      2820
     catattcttg ggaacaagag ctacagcatg ggaggttggt cttccaaacc tcgacaaggc      2880
     atggggacga atctttctgt tcccaatcct ctgggattct ttcccgatca ccagttggac      2940
     cctgcgttcg gagccaactc aaacaatcca gattgggact tcaaccccaa caaggatcac      3000
     tggccagagg caaatcaggt aggagcggga gcatttggtc cagggttcac cccaccacac      3060
     ggaggccttt tggggtggag ccctcaggct cagggcatat tgacaacact gccagcagca      3120
     cctcctcctg cctccaccaa tcggcagtca ggaagacagc ctactcccat ctctccacct      3180
     ctaagagaca gtcatcctca ggccatgcag tggaa                                 3215
//