ID   AF193863; SV 1; circular; genomic DNA; STD; VRL; 3182 BP.
XX
AC   AF193863;
XX
DT   15-FEB-2000 (Rel. 62, Created)
DT   16-APR-2001 (Rel. 67, Last updated, Version 2)
XX
DE   Orangutan hepadnavirus strain Somad, complete genome.
XX
KW   .
XX
OS   Orangutan hepadnavirus
OC   Viruses; Hepadnaviridae; Orthohepadnavirus.
XX
RN   [1]
RP   1-3182
RX   PUBMED; 11257195.
RA   Verschoor E.J., Warren K.S., Langenhuijzen S., Heriyanto, Swan R.A.,
RA   Heeney J.L.;
RT   "Analysis of two genomic variants of orang-utan hepadnavirus and their
RT   relationship to other primate hepatitis B-like viruses";
RL   J. Gen. Virol. 82(Pt 4):893-897(2001).
XX
RN   [2]
RP   1-3182
RA   Verschoor E.J., Warren K.S., Langenhuijzen S., Heriyanto, Swan R.A.,
RA   Heeney J.L.;
RT   ;
RL   Submitted (12-OCT-1999) to the INSDC.
RL   Virology, Biomedical Primate Research Centre (BPRC), Lange Kleiweg 157,
RL   Rijswijk 2288 GJ, Netherlands
XX
DR   MD5; 51d226933f2323a348386ba87f06b968.
DR   EuropePMC; PMC4336649; 25559671.
DR   EuropePMC; PMC5320732; 28222808.
DR   EuropePMC; PMC6347571; 30268787.
DR   RFAM; RF01047; HBV_epsilon.
XX FH Key Location/Qualifiers FH FT source 1..3182 FT /organism="Orangutan hepadnavirus" FT /host="orangutan" FT /strain="Somad" FT /mol_type="genomic DNA" FT /db_xref="taxon:113194" FT CDS join(2307..3182,1..1623) FT /codon_start=1 FT /gene="P" FT /product="polymerase protein" FT /db_xref="GOA:Q9J5S2" FT /db_xref="InterPro:IPR000201" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001462" FT /db_xref="InterPro:IPR037531" FT /db_xref="UniProtKB/Swiss-Prot:Q9J5S2" FT /protein_id="AAF33117.1" FT /translation="MPLSCQHFRKLLLLDEEAGPLEEELPRLADEGLNHRVAEDLNLQL FT PNVSIPWTHKVGNFTGLYSSTAPVFNPNWQTPSFPDIHLHQDIINKCEQLVGPLTVNEK FT RRLKLIMPARFYPNSTKYFPPDKGIKPYYPEHGVNHYFQARHYLHTLWKAGVLYKRETT FT RSASFCGSPYSWEQELQHGAEPFCHQPFGILPRASIGPAVPSQHKQSRLGLQSQQGHLA FT RSHQGRSGSIWARVHSTSRRSFGVEPAGSGRNHNTASSSSSCLHQSAVRKAAYSHLSTF FT ERHSSSGHAVELHGFPPSSAGSQSKGSVFPCWWLQFRDSEPCSDNCLSHIVNLLEDWGP FT CTEHGEHLIRIPRTPARVTGGVFLVDKNPHNSSESRLVVDFSQFSRGSTRVSWPKFAVP FT NLQSLTNLLSSNLSWLSLDVSAAFYHLPLHPAAMPHLLVGSSGLPRYVARLSSTSRNHH FT HQRGTMQNLHDFCSRNLFVSLMLLYKTFGRKLHLYSHPTIMGFRKIPMGVGLSPFLLAQ FT FTSALCSVVRRAFPHCLAFSYMDDMVLGAKSVQHLESLYTAVTNFLLSLGIHLNPGKTK FT RWGYSLHFMGYVIGSWGTLPQDHIVQKIKQCFRKLPVNRPIDWKVCQRIVGLLGFAAPF FT TQCGYPALMPLYNCIHNRQAFTFSPTYKAFLRTQYLTLYPVARQRPGLCQVFADATPTG FT WGLALGPQRMRGTFVAPLPIHTAELLAACFARSRSGANIIGTDNSVVLSRKYTSFPWLL FT GCAANWILRGTSFVYVPSALNPADDPSRGRLGLYRPLLRLPFRPTTGRTSLYAVSPSVP FT SHLPVRVHFASPLHVAWRPP" FT CDS join(2848..3182,1..835) FT /codon_start=1 FT /gene="S" FT /product="surface protein" FT /db_xref="GOA:Q77NU1" FT /db_xref="InterPro:IPR000349" FT /db_xref="UniProtKB/Swiss-Prot:Q77NU1" FT /protein_id="AAF33120.1" FT /translation="MGQNLSVTNPLGFFPEHQLDPLFRANTNNPDWDFNPNKDTWPEAT FT KVGVGAFGPGFTPPHGGLLGWSPQAQGVTTILPAVPPPASTNRQSGRRPTPISPPLRDT FT HPQAMQWNSTVFHQALQDPRVRGLYFPAGGSSSGTVSPVPTTASPISSTFLKTGDPALN FT MESISSGFLGPLLVLQAGFFLLTKILTIPQSLDSWWTSLNFLGGAPVCPGQNSQSLTSN FT HSPTSCPPICPGYRWMCLRRFIIFLFILLLCLIFLLVLLDYRGMLPVCPLLPGTTTTSV FT 
GPCRTCTISAPGTSLFPSCCCTKPSDGNCTCIPIPPSWAFAKFLWGWASVRFSWLNLLV FT PFVQWFAGLSPTVWLSVIWMIWYWGPSLYNILSPFIPLLPIFFCLWAYI" FT CDS 1374..1838 FT /codon_start=1 FT /gene="X" FT /product="X protein" FT /db_xref="GOA:Q9J5S3" FT /db_xref="InterPro:IPR000236" FT /db_xref="UniProtKB/Swiss-Prot:Q9J5S3" FT /protein_id="AAF33118.1" FT /translation="MAARLCCQLDTARDVLCLRPVGAESRGRPFSGSVGALPPSSPPAV FT PADHGAHLSLRGLPVCAFSSAGPCALRFTSARCMETTVNAPRNLPKVLHKRTLGLSTMS FT TTGIETYFKDCVFKDWEELGEEIRLKVFVLGGCRHKLVCSPAPCNFFTSA" FT CDS 1814..2452 FT /codon_start=1 FT /gene="C" FT /product="core protein" FT /db_xref="GOA:Q77NU2" FT /db_xref="InterPro:IPR002006" FT /db_xref="InterPro:IPR013195" FT /db_xref="InterPro:IPR036459" FT /db_xref="UniProtKB/TrEMBL:Q77NU2" FT /protein_id="AAF33119.1" FT /translation="MQLFHLCLIISCSCPTVQASKLCLGWLLGMDIDPYKEFGATVELL FT SFLPSDFFPSVRDLLDTASALYREALESPEHCSPNHTALRQAVLCWGELMTLASWVGNN FT LEDPASRELVVNYVNNNMGLKIRQLLWFHISCLTFGRETVLEYLVSFGVWIRTPPAYRP FT PNAPILSTLPETTVVRRRGRSPRRRTPSPRRRRSQSPRRRRSQSPASQC" XX SQ Sequence 3182 BP; 713 A; 859 C; 710 G; 900 T; 0 other; ctccacggtt ttccaccaag ctctgcagga tcccagagta aggggtctgt atttccctgc 60 tggtggctcc agttcaggga cagtgagccc tgttccgaca actgcctctc ccatatcgtc 120 aaccttcttg aagactgggg accctgcact gaacatggag agcatctcat caggattcct 180 aggacccctg ctcgtgttac aggcggggtt tttcttgttg acaaaaatcc tcacaattcc 240 tcagagtcta gactcgtggt ggacttctct caattttcta gggggagcac ccgtgtgtcc 300 tggccaaaat tcgcagtccc taacctccaa tcactcacca acctcttgtc ctccaatttg 360 tcctggttat cgctggatgt gtctgcggcg ttttatcatc ttcctcttca tcctgctgct 420 atgcctcatc ttcttgttgg ttcttctgga ttaccgaggt atgttgcccg tttgtcctct 480 acttccagga accaccacca ccagcgtggg accatgcaga acctgcacga tttctgctcc 540 aggaacctct ttgtttccct catgttgctg tacaaaacct tcggacggaa attgcacctg 600 tattcccatc ccaccatcat gggctttcgc aaaattccta tgggggtggg cctcagtccg 660 tttctcctgg ctcaatttac tagtgccctt tgttcagtgg ttcgcagggc tttcccccac 720 tgtttggctt tcagttatat ggatgatatg gtattggggg ccaagtctgt acaacatctt 780 gagtcccttt ataccgctgt 
taccaatttt cttttgtctt tgggcataca tttaaaccct 840 ggcaaaacca aacgatgggg ctattcccta catttcatgg gctatgtgat tggaagttgg 900 ggaaccctac cacaagacca tattgtacaa aaaatcaaac aatgttttcg gaaacttcct 960 gtcaacaggc ctattgattg gaaagtatgt caacgaattg taggactatt gggctttgcc 1020 gctcctttta ctcaatgtgg ctatcctgct ttaatgcctt tgtataactg tatacacaat 1080 cgtcaggctt ttactttctc gccaacttac aaggcctttc tgcgtacaca atatctgacc 1140 ctttaccccg ttgctcggca acgaccggga ctgtgccaag tgtttgctga cgcaaccccc 1200 actggctggg gcttggcgct aggtccccag cgcatgcgtg gaacctttgt ggctcctctg 1260 ccgatccata ctgcggaact cctagccgct tgttttgctc gcagcaggtc tggagcaaac 1320 attatcggta ctgacaactc tgttgtgttg tcgcggaaat atacatcttt tccatggctg 1380 ctaggttgtg ctgccaactg gatactgcgc gggacgtcct ttgtctacgt cccgtcggcg 1440 ctgaatcccg cggacgaccc ttctcggggt cggttggggc tctaccgccc tcttctccgc 1500 ctgccgttcc ggccgaccac ggggcgcacc tctctttacg cggtctcccc gtctgtgcct 1560 tctcatctgc cggtccgtgt gcacttcgct tcacctctgc acgttgcatg gagaccaccg 1620 tgaacgcccc ccggaacttg ccaaaggtct tgcataagag gactcttgga ctgtcaacaa 1680 tgtcaacgac cggaattgag acatacttca aagactgtgt gtttaaagac tgggaggagt 1740 taggggagga gatcaggtta aaggtctttg tattaggagg ctgtaggcat aaattggtct 1800 gttcaccagc accatgcaac tttttcacct ctgcctaatc atctcatgtt catgtcctac 1860 tgttcaagcc tccaagctgt gccttgggtg gcttttgggc atggacattg acccttataa 1920 agaatttgga gctactgtgg agttactctc ttttttgcct tcggatttct ttccgtctgt 1980 cagagatcta ctcgacaccg catcagccct gtatcgggaa gccttagagt ctccagaaca 2040 ttgttcacct aaccacacag cactcaggca agcagttctg tgctggggtg agttaatgac 2100 tctggcttcc tgggtgggta ataatttgga agacccagca tctagggaac tggtagttaa 2160 ttatgtcaac aataatatgg ggctaaaaat cagacaacta ctgtggtttc acatttcctg 2220 tcttactttt ggaagagaaa cagttttaga atatttggtg tcttttggag tgtggattcg 2280 cactcctcct gcgtacagac caccaaatgc ccctatcttg tcaacacttc cggaaactac 2340 tgttgttaga cgaagaggca ggtcccctag aagaagaact ccctcgcctc gcagacgaag 2400 gtctcaatca ccgcgtcgca gaagatctca atctccagct tcccaatgtt agtattcctt 2460 ggactcacaa ggtgggaaac tttacggggc 
tttattcttc taccgcacct gtctttaatc 2520 ctaactggca aactccttct tttcctgaca ttcatttaca ccaggatatc attaacaagt 2580 gtgaacaatt agttggtcca cttacggtaa atgaaaaaag gagattaaag ttaattatgc 2640 ctgctagatt ctatcctaac tctaccaaat atttcccccc cgataaaggt attaaaccct 2700 attatcctga gcatggggtt aatcattatt tccaagccag acactattta catactttgt 2760 ggaaggcggg tgtcttatat aagagagaaa caacacgtag cgcttcattt tgtgggtcac 2820 catattcttg ggaacaagag ctacagcatg gggcagaacc tttctgtcac caaccctttg 2880 ggattcttcc ccgagcatca attggacccg ctgttccgag ccaacacaaa caatccagat 2940 tgggacttca atcccaacaa ggacacttgg ccagaagcca ccaaggtagg agtgggagca 3000 tttgggccag ggttcactcc acctcacgga ggtcttttgg ggtggagccc gcaggctcag 3060 ggcgtaacca caatactgcc agcagttcct cctcctgcct ccaccaatcg gcagtcagga 3120 aggcggccta ctcccatctc tccacctttg agagacactc atcctcaggc catgcagtgg 3180 aa 3182 //