ID AY494853; SV 1; circular; genomic DNA; STD; VRL; 3051 BP. XX AC AY494853; XX DT 14-JAN-2004 (Rel. 78, Created) DT 20-FEB-2005 (Rel. 82, Last updated, Version 2) XX DE Sheldgoose hepatitis B virus, complete genome. XX KW . XX OS Sheldgoose hepatitis B virus OC Viruses; Hepadnaviridae; Avihepadnavirus. XX RN [1] RP 1-3051 RX DOI; 10.1128/JVI.79.5.2729-2742.2005. RX PUBMED; 15708992. RA Guo H., Mason W.S., Aldrich C.E., Saputelli J.R., Miller D.S., RA Jilbert A.R., Newbold J.E.; RT "Identification and characterization of avihepadnaviruses isolated from RT exotic anseriformes maintained in captivity"; RL J. Virol. 79(5):2729-2742(2005). XX RN [2] RP 1-3051 RA Guo H., Aldrich C.E., Saputelli J.R., Mason W.S., Newbold J.E.; RT ; RL Submitted (03-DEC-2003) to the INSDC. RL Fox Chase Cancer Center, 333 Cottman Ave., Philadelphia, PA 19111, USA XX DR MD5; ce42a372eb5e78b1f4cdad4f99eaf07e. DR EuropePMC; PMC2620893; 19004940. DR EuropePMC; PMC548436; 15708992. DR RFAM; RF01313; AHBV_epsilon. XX FH Key Location/Qualifiers FH FT source 1..3051 FT /organism="Sheldgoose hepatitis B virus" FT /host="Ashy-headed sheldgoose" FT /isolate="Ashy-headed sheldgoose hepatitis B virus" FT /mol_type="genomic DNA" FT /country="USA" FT /db_xref="taxon:259898" FT CDS join(2548..3051,1..414) FT /codon_start=1 FT /product="precore protein" FT /db_xref="GOA:Q6RSE4" FT /db_xref="InterPro:IPR002006" FT /db_xref="InterPro:IPR036459" FT /db_xref="UniProtKB/TrEMBL:Q6RSE4" FT /protein_id="AAR89946.1" FT /translation="MWNLRITPLSFGAACQGIFTSTLLLFFVTVPLVCTIVYDFCLYMD FT VNASRALANVYDLPDDFFPKIDDLVRDAKDALEPYWRSESIKKHVLIATHFVDLIEDFW FT QTTQGMHEIAEALRAVIPPTTTPVPPGYLIQHEEAEEIPLGDLFKHQEERIVSFQPDYP FT ITARIHAHLKAYAKINEESLDKARRLLWWHYNCLLWGEANVTNYISRLRTWLSTPERYR FT GRDAPTIEAITRPIQVAQGGRNKTQGSRKPRGLQPRRRKVKTTVVYGRRRSKSRDRRAP FT SPQRAGSPLPRPSTSHHRSPSPRK" FT CDS join(2677..3051,1..414) FT /codon_start=1 FT /product="core protein" FT /db_xref="GOA:Q6RSE3" FT /db_xref="InterPro:IPR002006" FT /db_xref="InterPro:IPR036459" FT /db_xref="UniProtKB/TrEMBL:Q6RSE3" FT /protein_id="AAR89947.1" FT /translation="MDVNASRALANVYDLPDDFFPKIDDLVRDAKDALEPYWRSESIKK FT HVLIATHFVDLIEDFWQTTQGMHEIAEALRAVIPPTTTPVPPGYLIQHEEAEEIPLGDL FT FKHQEERIVSFQPDYPITARIHAHLKAYAKINEESLDKARRLLWWHYNCLLWGEANVTN FT YISRLRTWLSTPERYRGRDAPTIEAITRPIQVAQGGRNKTQGSRKPRGLQPRRRKVKTT FT VVYGRRRSKSRDRRAPSPQRAGSPLPRPSTSHHRSPSPRK" FT CDS 170..2560 FT /codon_start=1 FT /product="polymerase protein" FT /db_xref="GOA:Q6RSE2" FT /db_xref="InterPro:IPR000201" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001462" FT /db_xref="UniProtKB/TrEMBL:Q6RSE2" FT /protein_id="AAR89944.1" FT /translation="MPQPLKQSLDQSKWLREAEIKLRVLENLVDCNLEEGKLKPQLSMG FT EDVQSPGIGEPLHPNVRAPLSHVLQLVTTDLPRLGNKEPARHHLGKLSGLYQMKGCEFN FT PAWKVPELSDTHFNIDIKNECPSRNWKYLTPAKFWPKSISYFPVHAGVKPKYPDNVMQH FT EQIVGKYLTRLYEAGILYKRISKHLVTFKGRPYPWEQQYLVNQHLDKNGPNTSKINGCE FT KNRRRRDFIESTSRKNDPKRDCHMVGQISNDRSPIRPCANNGRNKYSSATRCVASRGGK FT EIGIGKSQSSRDSSARLDSRGRSTCTRGFSKISKGKTSRRDSESFEKATRRNKNSTLNS FT SVETATRRFSPGKSILTGDSSVIPESGTSSPSDKNSQTEKEDVWFLRGNTSWPNRITGK FT LFLVDKNSRNTEEARLVVDFSQFSKGKNAMRFPRYWSPNLSTLRRILPVGMPRISLDLS FT QAFYHLPFNPASSSRLAVSDGQRVYYFRKAPMGVGLSPFLLHLFTTALGSEIARRFNVW FT TFTYMDDFLLCHPNARHLHAISNSVCNFLQELGIRINFDKTTPSPVTEIRFLGYQIDSK FT FMRIEDMRWTEIRNVIKKIKVGEWYDWKCIQRFVGHLNFVLPFTKGNTEMLKPMYTAIS FT NQVNFSFSSAYRTLLYKLTMGVCKLSIKPKVSVPLPRVATDATPTHGAISHITGGSAVF FT TFSKVRDIHVQELLMACLAKLMIKPRCLLSDSTFVCHKRYHSLPWSFAMLAKQLLNPIQ FT LYFVPSKYNPADGPSRHKPPDWTALTYTPLSKRIYIPHRLCGT" FT CDS 801..1817 FT /codon_start=1 FT /product="preS protein" FT /db_xref="GOA:Q6RSE1" FT /db_xref="InterPro:IPR000349" FT /db_xref="UniProtKB/TrEMBL:Q6RSE1" FT /protein_id="AAR89945.1" FT /translation="MGQTPAKSMDVRRIEGGEILLNQLAGRMIPKGTVTWSGKYPTIDH FT LLDHVQTMEEINTLQQQGAWPQGAGRRLGLENPNPQEIPQPVWTPEEDQRAREAFQKYQ FT KERPPEEIPSPSKRPPEETKIPPSTPQWKLQPGDSLLGNQSLLETHPLYRNPEPAVPVI FT KTPRLKKKMSGSFEGILAGLIGLLVSFFLLIKILEILRRLDWWWISLSSPKGKMQCAFQ FT DTGAQISPHYVGSCPWGCPGFLWTYLRLFIIFLLILLVAAGLLYLTDNGSTIFGKLQWE FT SALALFSSISSLLPSDPKSLVALTFGLSLIWMTSSSVTQTLVTFTQLATLFAIFFKS" FT CDS 1314..1817 FT /codon_start=1 FT /product="S protein" FT /db_xref="GOA:Q6RSE0" FT /db_xref="InterPro:IPR000349" FT /db_xref="UniProtKB/TrEMBL:Q6RSE0" FT /protein_id="AAR89948.1" FT /translation="MSGSFEGILAGLIGLLVSFFLLIKILEILRRLDWWWISLSSPKGK FT MQCAFQDTGAQISPHYVGSCPWGCPGFLWTYLRLFIIFLLILLVAAGLLYLTDNGSTIF FT GKLQWESALALFSSISSLLPSDPKSLVALTFGLSLIWMTSSSVTQTLVTFTQLATLFAI FT FFKS" XX SQ Sequence 3051 BP; 934 A; 638 C; 615 G; 864 T; 0 other; catgctcatt tgaaagcata cgctaagatt aatgaggaat ccttggataa ggctaggaga 60 ttgctttggt ggcattataa ttgtttactg tggggagaag ctaacgttac taattatatt 120 tctcgtttac gtacttggtt atcaacacct gagagatata gaggtagaga tgccccaacc 180 attgaagcaa tcactagacc aatccaagtg gctcagggag gcagaaataa aactcagggt 240 tctagaaaac ctcgtggatt gcaacctaga agaaggaaag ttaaaaccac agttgtctat 300 gggagaagac gttcaaagtc cagggatagg agagcccctt caccccaacg tgcgggctcc 360 cctctcccac gtccttcaac tagtcaccac cgatctccct cgcctaggaa ataaagaacc 420 tgcaaggcat catttaggta aattgtcagg attatatcaa atgaagggtt gtgaatttaa 480 tcctgcatgg aaagtaccag aactttcgga tactcatttt aatattgata taaaaaatga 540 gtgcccttcc cgaaattgga aatatttgac tccagccaaa ttttggccca agagcatttc 600 ctactttcca gtacatgcag gggttaaacc aaaatatcct gacaatgtga tgcaacatga 660 gcaaattgta ggtaaatatt taaccaggct ctatgaagca ggaatccttt ataagcggat 720 atcaaaacat ttggtaacat ttaaaggtcg gccttatcct tgggaacagc aataccttgt 780 caatcaacat cttgacaaaa atgggccaaa caccagcaaa atcaatggat gtgagaagaa 840 tagaaggagg agagatttta ttgaatcaac tagcaggaag aatgatccca aaagggactg 900 tcacatggtc gggcaaatat ccaacgatag atcacctatt agaccatgtg caaacaatgg 960 aagaaataaa tactcttcag caacaaggtg cgtggcctca aggggcggga aggagattgg 1020 gattggaaaa tcccaatcct caagagattc ctcagcccgt ttggactcca gaggaagatc 1080 aacgtgcacg agaggctttt caaaaatatc aaaaggaaag acctccagaa gagattccga 1140 gtccttcgaa aaggccaccc gaagaaacaa aaattccacc ctcaactcct cagtggaaac 1200 tgcaacccgg agattctctc ctgggaaatc aatccttact ggagactcat ccgttatacc 1260 ggaatccgga accagcagtc ccagtgataa aaactcccag actgaaaaag aagatgtctg 1320 gttccttcga gggaatacta gctggcctaa tcggattact ggtaagcttt ttcttgttga 1380 taaaaattct cgaaatactg aggaggctag attggtggtg gatttctctc agttctccaa 1440 agggaaaaat gcaatgcgct ttccaagata ctggagccca aatctctcca cactacgtag 1500 gatcttgccc gtggggatgc ccaggatttc tttggactta tctcaggctt tttatcatct 1560 tccttttaat cctgctagta gcagccggct tgctgtatct gacggacaac gggtctacta 1620 ttttcggaaa gctccaatgg gagtcggcct tagccctttt ctcctccatc tcttcactac 1680 tgccctcgga tccgaaatcg ctcgtcgctt taacgtttgg actttcactt atatggatga 1740 cttcctcctc tgtcacccaa acgctcgtca ccttcacgca attagcaact ctgtttgcaa 1800 ttttcttcaa gagttaggaa ttcggatcaa ttttgataaa accacacctt ctcctgtaac 1860 tgaaataaga ttcctcggat atcaaattga ttcaaaattt atgagaattg aagacatgag 1920 atggactgaa ataagaaatg tcattaagaa aattaaagtc ggagaatggt atgactggaa 1980 atgtattcaa agatttgttg ggcatttaaa ctttgtatta cctttcacaa aaggaaatac 2040 agaaatgtta aaaccaatgt atactgctat ttccaatcaa gtaaatttta gcttctcttc 2100 ggcttatagg actttgcttt ataaattaac aatgggagta tgtaaattgt caataaaacc 2160 aaaggtctct gttcctttgc caagagtagc cacggatgct acaccaacac atggcgcaat 2220 atcccatatc accggcggga gcgcagtgtt tactttttca aaggtcagag acattcatgt 2280 gcaagaattg ctgatggcat gtttagctaa gctaatgatt aaacctagat gtttgttatc 2340 tgattctact tttgtttgtc acaaaagata tcattcactt ccatggtctt ttgctatgtt 2400 ggcaaagcaa ttgcttaatc ctatacaatt gtactttgta cccagtaaat acaatcctgc 2460 tgatggccca tccaggcata aaccgcctga ttggacagca cttacataca cccctctctc 2520 gaaacgtata tatattccac ataggctatg tggaacttaa gaattacacc cctctccttc 2580 ggagctgctt gccaaggtat ctttacgtct acattgctgt tgttctttgt gactgtacct 2640 ttggtatgta ccattgttta tgatttctgc ttatatatgg atgtcaatgc ttctagagcc 2700 ttagctaatg tgtatgatct gccagatgat ttctttccaa aaattgatga tttagttaga 2760 gatgcaaaag atgctttgga accttattgg agatctgaat caataaagaa acatgtttta 2820 atcgcaactc attttgttga tttgattgaa gacttttggc agactacaca gggtatgcat 2880 gaaattgcag aggcattaag ggctgttatt ccacctacca ctacgcctgt tcccccggga 2940 tatttgattc agcacgaaga ggctgaggaa attcccttgg gagatttatt taaacatcaa 3000 gaagaaagga tagttagttt ccaacctgat tatcctatta ctgcaagaat t 3051 //