ID   M95589; SV 1; circular; genomic DNA; STD; VRL; 3018 BP.
XX
AC   M95589;
XX
DT   09-DEC-1992 (Rel. 34, Created)
DT   06-JAN-2004 (Rel. 78, Last updated, Version 4)
XX
DE   Ross' goose hepatitis B virus, complete genome.
XX
KW   .
XX
OS   Ross's goose hepatitis B virus
OC   Viruses; Hepadnaviridae; Avihepadnavirus.
XX
RN   [1]
RP   1-3018
RA   Shi H., Cullen J.M., Newbold J.E.;
RT   "A novel isolate of duck hepatitis B virus";
RL   Unpublished.
XX
DR   MD5; 257c9a60ce4c4d86dea8592266e51855.
DR   EuropePMC; PMC113909; 11119585.
DR   EuropePMC; PMC135997; 11861843.
DR   EuropePMC; PMC140978; 12525630.
DR   EuropePMC; PMC2620893; 19004940.
DR   EuropePMC; PMC4065881; 17206758.
DR   EuropePMC; PMC548436; 15708992.
DR   RFAM; RF01313; AHBV_epsilon.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3018
FT                   /organism="Ross's goose hepatitis B virus"
FT                   /mol_type="genomic DNA"
FT                   /db_xref="taxon:259931"
FT   CDS             join(2515..3018,1..414)
FT                   /codon_start=1
FT                   /product="precore/core protein"
FT                   /db_xref="GOA:Q67850"
FT                   /db_xref="InterPro:IPR002006"
FT                   /db_xref="InterPro:IPR036459"
FT                   /db_xref="UniProtKB/TrEMBL:Q67850"
FT                   /protein_id="AAA45747.1"
FT                   /translation="MWDLRLHPSPFGAACQGIFTSSLLLFLVTVPLVCTIVYDSCLCMD
FT                   INASRALANVYDLPDDFFPKIDDLVRDAKDALEPYWRNDSIKKHVLIATHFVDLIEDFW
FT                   QTTQGMHEIAEALRAIIPATTAPVPQGFLVQHEEAEEIPLGELFRYQEERLTNFQPDYP
FT                   VTARIHAHLKAYAKINEESLDRARRLLWWHYNCLLWGEPNVTNYISRLRTWLSTPEKYR
FT                   GKDAPTIEAITRPIQVAQGGRNKTQGVRKSRGLEPRRRRVKTTIVYGRRRSKSRERRAP
FT                   TPQRAGSPLPRTSRDHHRSPSPRE"
FT   CDS             170..2527
FT                   /codon_start=1
FT                   /product="polymerase"
FT                   /db_xref="GOA:Q67851"
FT                   /db_xref="InterPro:IPR000201"
FT                   /db_xref="InterPro:IPR000477"
FT                   /db_xref="InterPro:IPR001462"
FT                   /db_xref="UniProtKB/TrEMBL:Q67851"
FT                   /protein_id="AAA45748.1"
FT                   /translation="MPQPLKQSLDQSKWRREAEIRLRELENLVDSNLGEEELKPQLSMG
FT                   EDVQSPGKGEPLHPSVRAPLSRVLLGTTTDLPRLGNKDPARHKLGKLSGLYQMKGCEFN
FT                   PQWKVPDISDTHFNLDIINECPSRNWKYLTPAKFWPKSISYFPAHSGVKPKYPDDVAGH
FT                   EQIVGQYLTKLFEAGILYKRESKHLVTFKGTPYQWERQYLVNQPADLHGAATSKINGRK
FT                   KSRRSGTPPSTIGRKDDPKRDGHMVRKISYHGTRYGPCANNGRDKHHATTRGLAGRSRE
FT                   ETGTNQPSSPCRSGDKLDTGRGRQGPRIFQKISRRETKGNHHHSSHKSTENSVGAETRR
FT                   SSPQHATTVPTSRTSGTGYSGHQNTESPEENVFYLRGNTSWPNRITGRIFLVDKNSRNT
FT                   AEARLVVDFSQFSKGKHAMRFPKYWSPNLSTLRRILPVGMPRISLDLSQAFYHLPLNPA
FT                   CSSRLAISDGQHVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVWTFTYMDDFLLC
FT                   HPNARHLNSRSHAVCSFLQELGVRINFDKTTPSPVTEIKFLGYLIDDKFMKIEDQRWNE
FT                   LRQVIKKIQIGKWYDWKCIQRFIGHLNFILPFTKGNVEMLKPMYHAVTHKVNFSFSSSY
FT                   RTLLYKLTMGVCKLRLNPKVSLPLPRVATDATLTHGAISHITGGCAVFTFSKVRDIHIQ
FT                   ELLMACLAKLMIKPRCLLTDSTFVIHKRYQTLPWHFAVLAKQLMQNIQLYFVPSKYNPA
FT                   DGPTRHKPPDWTALTYTPLSKAIYIPHRLCGT"
FT   gene            801..1784
FT                   /gene="preS"
FT   CDS             801..1784
FT                   /codon_start=1
FT                   /gene="preS"
FT                   /product="surface protein"
FT                   /note="putative"
FT                   /db_xref="GOA:Q67852"
FT                   /db_xref="InterPro:IPR000349"
FT                   /db_xref="UniProtKB/TrEMBL:Q67852"
FT                   /protein_id="AAA45749.1"
FT                   /translation="MGQQPAKSMAERRVEGAELLLQQLAGRMIPKGTVTWSGKYPTMEH
FT                   VMDHVQTMEEINTMQQQGAWPEGAGRRLGLTNPAPPAAPVINWTPEEDAKAREYFRRYQ
FT                   EERPKETTTIPPTSPPKTQWELKPGDPLLSTQPLYRPAEPAEPDIPVIKTPKVPKKMSS
FT                   TFGGILAGLIGLLVGFFLLIKILEILRRLDWWWISLSSPKGNMQCAFQNTGAQISPHYV
FT                   GSCPWGCPGFLWTYLRLFIIFLLILHVAAGLLYLTDNMSTIFAKLQWESVSALFSSISS
FT                   LLPSDPKSLVALMFGLLLIWTTSSSVTQTLVTLTQGATLSALFFKS"
FT   repeat_region   2474..2485
FT                   /rpt_type=DIRECT
FT                   /note="putative"
FT   repeat_region   2531..2542
FT                   /rpt_type=DIRECT
FT                   /note="putative"
XX
SQ   Sequence 3018 BP; 906 A; 686 C; 634 G; 792 T; 0 other;
     catgctcacc tgaaagcata tgcaaaaata aatgaggaat ctttagatag agctaggaga        60
     ttgctttggt ggcattataa ctgtttattg tggggcgagc ctaacgttac caactatatt       120
     tcgagattaa gaacttggtt atccacacct gaaaaataca gaggaaaaga tgccccaacc       180
     attgaagcaa tcactagacc aatccaagtg gcgcagggag gcagaaataa gactcaggga       240
     gttagaaaat ctcgtggact cgaacctagg agaagaagag ttaaaaccac aattgtctat       300
     gggagaagac gttcaaagtc cagggaaagg agagccccta caccccagcg tgcgggctcc       360
     cctctcccgc gtacttctag ggaccaccac agatctccct cgcctaggga ataaagaccc       420
     cgctcgtcat aaactgggga agttatccgg attgtatcaa atgaagggct gtgagtttaa       480
     cccccaatgg aaagtacctg atatttctga tactcatttt aatttagata taattaatga       540
     gtgcccttcc agaaattgga aatacctgac tccagccaaa ttttggccca agagcatttc       600
     ctactttcca gcgcattcag gggttaaacc caagtatccg gatgacgtgg caggacatga       660
     gcaaatagtg ggtcaatatt taaccaagct ctttgaagcg ggaatccttt ataagcgaga       720
     atctaaacat ttggtcactt ttaagggaac cccttatcag tgggaacgac aataccttgt       780
     caatcaacct gctgatttac atggggcagc aaccagcaaa atcaatggca gaaagaagag       840
     tcgaaggagc ggaactcctc cttcaacaat tggccggaag gatgatccca aaagggacgg       900
     tcacatggtc aggaaaatat cctaccatgg aacacgttat ggaccatgtg caaacaatgg       960
     aagagataaa caccatgcaa caacaagggg cttggccgga aggagcaggg aggagactgg      1020
     gactaaccaa cccagctccc cctgccgctc cggtgataaa ttggacaccg gaagaggacg      1080
     ccaaggcccg agaatatttc agaagatatc aagaagagag accaaaggaa accaccacca      1140
     ttcctcccac aagtccaccg aaaactcagt gggagctgaa acccggagat cctctcctca      1200
     gcacgcaacc actgtaccga ccagcagaac cagcggaacc ggatattccg gtcatcaaaa      1260
     caccgaaagt cccgaagaaa atgtcttcta ccttcggggg aatactagct ggcctaatcg      1320
     gattactggt aggatttttc ttgttgataa aaattctaga aatactgcgg aggctagact      1380
     ggtggtggat ttctctcagt tctccaaagg gaaacatgca atgcgctttc caaaatactg      1440
     gagcccaaat ctctccacat tacgtaggat cctgcccgtg gggatgccca ggatttcttt      1500
     ggacctatct caggcttttt atcatcttcc tcttaatcct gcatgtagca gcaggcttgc      1560
     tatatctgac ggacaacatg tctactattt tcgcaaagct ccaatgggag tcggtctcag      1620
     cccttttctc ctccatctct tcactactgc cctcggatcc gaaatcgctc gtcgctttaa      1680
     tgtttggact tttacttata tggacgactt cctcctctgt cacccaaacg ctcgtcacct      1740
     taactcaagg agccacgctg tctgctcttt tcttcaagag ctaggagtga gaataaactt      1800
     cgacaaaaca actccttcac cagtaactga gataaaattc ctcggttacc tgatcgacga      1860
     taaatttatg aaaattgagg atcaacgctg gaatgaacta cgtcaagtaa taaagaagat      1920
     tcaaatcgga aaatggtatg actggaaatg tatccaacga tttattggac atttaaactt      1980
     cattttacct ttcacaaaag gaaatgtaga aatgttaaaa ccaatgtatc atgctgtgac      2040
     tcataaagtg aattttagtt tttctagtag ctataggact ttgttgtata aattaacaat      2100
     gggtgtctgt aaacttagat tgaatccaaa ggtctcttta cctttgccac gtgttgccac      2160
     agatgcaact ctaacacatg gcgcaatatc ccatatcacc ggcgggtgcg cagtgtttac      2220
     cttttcaaag gttagagata tccacattca ggagcttttg atggcatgtt tggctaagtt      2280
     aatgattaaa ccaagatgtt tgctcacaga ttccaccttt gtgatccata aacgttatca      2340
     gacgttgcca tggcattttg ctgtgctagc caaacaactt atgcagaaca tacagttgta      2400
     ctttgtcccc agtaaataca atcctgctga tggcccaacc aggcataaac ctcctgattg      2460
     gacagcactt acatacaccc ctctctcgaa agcaatatat atcccacata ggctatgtgg      2520
     gacttaagat tacacccctc tccattcgga gctgcttgcc aaggtatttt tacgtcgtct      2580
     ttgctgttgt tccttgtgac tgtacctttg gtatgtacca ttgtttatga ttcttgctta      2640
     tgtatggata tcaacgcttc aagagcttta gctaatgtat atgatttgcc agatgatttc      2700
     tttccaaaga ttgatgattt agttagagat gctaaagatg ctttagagcc ttattggaga      2760
     aatgattcaa taaagaaaca tgttttaatt gcaactcact ttgtggatct cattgaggat      2820
     ttctggcaaa ccactcaggg tatgcatgaa atagcagagg cactgagagc tataattcct      2880
     gccactactg ctccagtacc tcagggattt ctggtccaac acgaagaagc tgaagagata      2940
     cctttgggtg aactttttag gtatcaggaa gaaagactaa ctaactttca accagattat      3000
     ccagttaccg ccagaatt                                                    3018
//