ID   AY535004; SV 1; linear; genomic RNA; STD; VRL; 6654 BP.
XX
AC   AY535004;
XX
DT   01-MAR-2004 (Rel. 79, Created)
DT   09-JAN-2022 (Rel. 144, Last updated, Version 4)
XX
DE   Avian hepatitis E virus, complete genome.
XX
KW   .
XX
OS   Avian hepatitis E virus
OC   Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Alsuviricetes;
OC   Hepelivirales; Hepeviridae; Orthohepevirus.
XX
RN   [1]
RP   1-6654
RX   PUBMED; 15166445.
RA   Huang F.F., Sun Z.F., Emerson S.U., Purcell R.H., Shivaprasad H.L.,
RA   Pierson F.W., Toth T.E., Meng X.J.;
RT   "Determination and analysis of the complete genomic sequence of avian
RT   hepatitis E virus (avian HEV) and attempts to infect rhesus monkeys with
RT   avian HEV";
RL   J. Gen. Virol. 85(PT 6):1609-1618(2004).
XX
RN   [2]
RP   1-6654
RA   Huang F.F., Sun Z.F., Emerson S.U., Purcell R.H., Shivaprasad H.L.,
RA   Pierson W.F., Toth T.E., Meng X.J.;
RT   ;
RL   Submitted (29-JAN-2004) to the INSDC.
RL   Department of Biomedical Sciences and Pathobiology, College of Veterinary
RL   Medicine, Virginia Polytechnic Institute and State University, 1410 Price's
RL   Fork Road, Blacksburg, VA 24060, USA
XX
DR   MD5; 439b3ec031384297e19fc8f9bfcdfbe5.
DR   EuropePMC; PMC2849599; 20107086.
DR   EuropePMC; PMC3006657; 21203540.
DR   EuropePMC; PMC3122850; 21307216.
DR   EuropePMC; PMC3187576; 21972239.
DR   EuropePMC; PMC3310647; 22000397.
DR   EuropePMC; PMC3416139; 22696648.
DR   EuropePMC; PMC3624379; 23388713.
DR   EuropePMC; PMC3884734; 24378180.
DR   EuropePMC; PMC3993473; 24478416.
DR   EuropePMC; PMC4163165; 25187005.
DR   EuropePMC; PMC4225530; 24252365.
DR   EuropePMC; PMC4290951; 25339404.
DR   EuropePMC; PMC4313232; 25187634.
DR   EuropePMC; PMC4442536; 25741007.
DR   EuropePMC; PMC4542097; 26260476.
DR   EuropePMC; PMC4641181; 26605326.
DR   EuropePMC; PMC497618; 15297556.
DR   EuropePMC; PMC525261; 15528746.
DR   EuropePMC; PMC5320732; 28222808.
DR   EuropePMC; PMC5429376; 28110375.
DR   EuropePMC; PMC6199738; 30352598.
DR   EuropePMC; PMC6307819; 30532200.
DR   EuropePMC; PMC6503432; 31060564.
XX FH Key Location/Qualifiers FH FT source 1..6654 FT /organism="Avian hepatitis E virus" FT /mol_type="genomic RNA" FT /country="USA" FT /db_xref="taxon:2907992" FT CDS 25..4620 FT /codon_start=1 FT /product="non-structural polyprotein" FT /note="includes functional domains like methyltransferase, FT helicase and RdRp" FT /db_xref="GOA:Q6QLN1" FT /db_xref="InterPro:IPR001788" FT /db_xref="InterPro:IPR002588" FT /db_xref="InterPro:IPR002589" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR022202" FT /db_xref="InterPro:IPR027351" FT /db_xref="InterPro:IPR027417" FT /db_xref="UniProtKB/Swiss-Prot:Q6QLN1" FT /protein_id="AAS45830.1" FT /translation="MDVSQFAESKGVKTALEAAALAAANTALRNARVVTPYLTQQQTKN FT LLELFRGAQLRFEPRDNWAHPVQRVVHDALEQYVRRAAGPNCLEVGAHPRSINRHQASH FT RCFLPPVGRDEQRWQVAPRRGLCNLIRRALLNGVKVAREFCQLGFGACSHQCEVGIALY FT SLHDMRPADVACAMARHNMRTMYVVLHLPEEAMLPPGSYSNKFYNTVNTADKCIITYAD FT DSCAGYVHKREVLQDWITTTGVSGRHPMLIERVRAIGCHFVLLCTATQPCPMPYTPYPS FT SNTVYVRNVYGPALGAGLFTPKCCVDATFYPVPRRVWQRLMMFGTTLDDDAFCCSRLLT FT YLRGISTKVTVGNIVANEGWQPEEQQLTAVAIAAYLTVCHQRWVRTQGIARGVRRLQAE FT HAQQFWFKVWELFTNTGTVPGYSAGFYRQLATWISGGLTIDFERRVFDKRVKCGCCCVC FT ERRPADPGCLCIDDFPDGANGLVKLKKWPIRAGTKSAVSKWAQVRVRADSTEDLIDLSV FT PKLLTLKELAAAAIRKQPSAPPSLHILDRRPVGDPRRPVNCAPPAVSAGPVPAPPGNPV FT IESVQGSGAGGPEVSESQPGLTPTREVTNMPLPPQRGQEEVLAVLPSGARVIVGNLLDV FT AADWLVNPANRDHQPGGGLCGMFHRRWPHLWPVCGEVQDLPTGPVIFQQGPPKVIHAPG FT PDYRIKPDPDGLRRVYAVVHQAHGTVASPLISAGIYRAPARESFEAWAATARDGDLLVV FT QRSMAQHIRDFVLNEGRHRPRELHVDRAMADMVNYGLATEPEPYNELVKGVEVAPMTVK FT YALIAGVPGSGKSSSVDHRGAVVITPTKTLAREWSARGATAVTPHVAASAAPEGRVIVD FT EAYAIPPHLLVASLRRARDVVMLGDPHQIPALDFDGRCLTSAVDLGLQPTSWRTVSHRC FT PWDVCIFLRTDYPTITTTSRVLRSVVFTGETIGQKIVFTQVAKQSNPGSITVHEAQGST FT FDQTTIIATLDARGLIASSRAHAIVALTRHRERCSVIDVGGVLVEIGVTDAMFNNIEMQ FT LVRPDAAAPAGVLRAPDDTVDGLLDIPPAHTDVAAVLTAEAIGHAPLELAAINPPGPVL FT EQGLLYMPARLDGRDEVVKLQLSDTVHCRLAAPTSRLAVINTLVGRYGKATKLPEVEYD FT LMDTIAQFWHHIGPINPSTLEYAEMCEAMLSKGQDGSLIVHLDLQDADCSRITFFQKDC FT 
AKFTLDDPVAHGKVGQGISAWPKTLCALFGPWFRAIEKHLVAGLPPGYYYGDLYTEADL FT HRSVLCAPAGHLVFENDFSEFDSTQNNVSLDLECELMRRFGMPDWMVALYHLVRSYWLL FT VAPKEALRGCWKKHSGEPGTLLWNTVWNMTVLHHVYEFDRPSVLCFKGDDSVVVCESVR FT ARPEGVSLVADCGLKMKDKTGPCGAFSNLLIFPGAGVVCDLLRQWGRLTDKNWGPDIQR FT MQDLEQACKDFVARVVTQGKEMLTIQLVAGYYGVEVGMVEVVWGALKACAAARETLVTN FT RLPVLNLSKED" FT CDS 4654..4917 FT /codon_start=1 FT /product="cytoskeleton related protein" FT /note="unknown function" FT /db_xref="GOA:Q913Y8" FT /db_xref="UniProtKB/Swiss-Prot:Q913Y8" FT /protein_id="AAS45832.1" FT /translation="MCLSCQFWCLECQESGVGCRCVDCCSCLQCAAGCQGAPKRSQPEA FT GVASAAVTIQPSGALNNAPREPSAPPLSQTLSPRQVLARYQM" FT CDS 4707..6527 FT /codon_start=1 FT /product="capsid protein" FT /note="structural protein" FT /db_xref="GOA:Q913Y7" FT /db_xref="InterPro:IPR004261" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/Swiss-Prot:Q913Y7" FT /protein_id="AAS45831.1" FT /translation="MSLCRLLLMLAMCCGVSRGSQTLPAGGRRGQRRRDNSAQWSTQQR FT PEGAVGPAPLTDVVTAAGTRTVPDVDQAGAVLVRQYNLVTSPLGLATLGSTNALLYAAP FT VSPLMPLQDGTTSNIMSTESSNYAQYRVQGLTVRWRPVVPNAVGGFSISMAYWPQTTST FT PTSIDMNSITSTDVRVVLQPGSAGLLTIPHERLAYKNNGWRSVETVSVPQEDATSGMLM FT VCVHGTPWNSYTNSVYTGPLGMVDFAIKLQLRNLSPGNTNARVTRVKVTAPHTIKADPS FT GATITTAAAARFMADVRWGLGTAEDGEIGHGILGVLFNLADTVLGGLPSTLLRAASGQY FT MYGRPVGNANGEPEVKLYMSVEDAVNDKPIMVPHDIDLGTSTVTCQDYGNQHVDDRPSP FT APAPKRALGTLRSGDVLRITGSMQYVTNAELLPQSVSQGYFGAGSTMMVHNLITGVRAP FT ASSVDWTKATVDGVQVKTVDASSGSNRFAALPAFGKPAVWGPQGAGYFYQYNSTHQEWI FT YFLQNGSSVVWYAYTNMLGQKSDTSILFEVRPIQASDQPWFLAHHTGGDDCTTCLPLGL FT RTCCRQAPEDQSPETRRLLDRLSRTFPSPP" XX SQ Sequence 6654 BP; 1334 A; 1728 C; 1966 G; 1626 T; 0 other; gcatgacccc atgccagggt aagaatggac gtctcgcagt ttgcagagtc caagggggtt 60 aagactgcac tagaagcggc cgctctagct gcagccaata ccgccctgcg aaatgcgcgg 120 gtggtaaccc cctatcttac ccaacagcag actaagaatc tgctggagct gttccgtggt 180 gctcagttgc ggtttgagcc acgcgacaac tgggcgcacc ctgtgcagcg ggtcgtgcat 240 gatgcccttg agcagtatgt acgccgcgca gcggggccca attgcctgga ggtcggcgct 300 catccacgct ccattaatag 
gcatcaagcc tcgcaccgct gcttcttacc cccagttggg 360 cgtgatgagc agcggtggca ggtagcacca cggaggggtc tttgcaacct aatccgtcgc 420 gcgctgctca atggtgtcaa ggtggcgcgt gagttttgtc agttagggtt cggtgcctgc 480 tctcaccagt gtgaggtggg catagctctt tacagcttac acgacatgcg tcctgcggac 540 gtggcgtgtg cgatggcccg tcataacatg cgcactatgt acgtcgtatt gcatctacct 600 gaggaggcaa tgttaccacc agggtcctac tcaaacaagt tctacaacac agtgaacaca 660 gcggataaat gcattataac gtatgctgac gattcttgcg cggggtatgt gcacaagcgt 720 gaggtattac aggactggat aacgacaaca ggcgtttctg gtcggcatcc catgctaatt 780 gagcgggtca gggccatagg ttgccatttt gtgctgttgt gcactgcaac ccagccctgc 840 cccatgccgt atacgcctta cccgtcctca aacaccgtgt acgttaggaa tgtttacggg 900 cccgcgctgg gtgccggtct ttttacaccc aaatgttgtg ttgatgccac gttctaccct 960 gtgcccaggc gtgtgtggca gcggctgatg atgtttggta ccacacttga tgatgacgcc 1020 ttctgttgct cgcgcctgct aacttattta cgtgggattt caaccaaggt gacagtaggt 1080 aacattgtgg ccaatgaggg ctggcaaccg gaggagcagc agcttacggc tgtcgctata 1140 gctgcatacc ttactgtctg tcatcaaagg tgggtgcgca cacagggcat agctcggggt 1200 gtgaggcgct tacaggctga acatgcacag cagttctggt ttaaagtctg ggagttgttc 1260 accaacaccg gcactgtgcc cgggtattcg gccgggtttt atcgccagct cgccacatgg 1320 attagtggcg gccttaccat tgactttgaa cggcgtgttt tcgacaagcg tgtgaagtgt 1380 gggtgctgtt gcgtctgcga gcgccgtcct gctgaccccg ggtgcttgtg cattgacgat 1440 ttccccgatg gtgcgaacgg gctagtgaag ttgaaaaagt ggccgatacg agctggtacg 1500 aagtctgcag ttagcaagtg ggctcaagtc agggtccggg ctgatagcac ggaggatttg 1560 attgatctga gcgtgcccaa actactcaca cttaaagaac tcgccgctgc agcaatacgc 1620 aagcaaccgt ccgccccgcc atcattgcac attttggacc gccgaccggt aggtgatccc 1680 aggcgccccg ttaactgcgc accaccggcc gtctccgctg gccctgtacc ggcacccccg 1740 ggtaatcctg ttattgagtc tgttcagggg tcgggggctg gcggacctga ggtcagtgag 1800 tcccagcctg gcttgactcc gacgcgcgag gttactaata tgcccttgcc gccacagcgt 1860 ggtcaagagg aggtcctcgc cgtgttgcca tcgggggcgc gcgtcatcgt gggtaaccta 1920 ttggacgttg ccgccgactg gcttgttaac ccggctaatc gagaccatca gcccggaggc 1980 gggttgtgtg gcatgttcca ccgtcgctgg ccgcacttgt 
ggcctgtttg cggcgaggtc 2040 caggatttgc ccacgggccc ggtgattttc caacaggggc cacctaaggt tattcacgcc 2100 cctggcccgg actaccgtat taagcctgac cccgacggcc tccgcagagt atacgctgtt 2160 gtgcaccagg cgcatggcac ggtagctagc ccgttaatta gcgccgggat ataccgggca 2220 ccggcccggg aatcatttga ggcctgggcg gccaccgccc gtgatggtga cctactggtt 2280 gtgcagcgtt caatggctca gcacatcagg gactttgtgt tgaatgaggg tcgtcataga 2340 cctagggagc ttcatgttga ccgggctatg gctgacatgg ttaattatgg gctggcaacc 2400 gagcccgagc cgtacaatga gcttgtgaag ggcgttgagg ttgcgcctat gactgtgaaa 2460 tacgcgctta tagctggtgt cccaggcagc ggcaagtcgt cgtctgttga ccatcgtggg 2520 gctgttgtta ttacacccac caagacgttg gcacgtgagt ggtccgcgcg tggggccacg 2580 gctgttacac cccacgttgc agcaagtgca gcccccgagg gcagggtgat tgtggacgag 2640 gcgtatgcta tcccgccgca cctgcttgtg gcgtcccttc gtcgggcgcg cgatgttgtt 2700 atgttggggg atccgcacca gataccagca ttggatttcg atggacgctg tttaacgagc 2760 gccgttgatc ttgggttgca gcctaccagc tggcgcaccg tatcccaccg ttgcccttgg 2820 gacgtttgta tatttttgcg tactgattat ccgactatca ccacaaccag tagggtgctg 2880 cggtctgttg tgtttaccgg tgaaaccatt ggtcagaaga tagtgtttac ccaggtggcc 2940 aagcagtcga accccgggtc cataacggtc catgaggcgc agggcagtac ttttgatcag 3000 actactataa tcgccacgtt agatgctcgt ggccttatag cttcatctcg cgcgcatgcc 3060 atagttgcgc taacccgcca ccgggagcgc tgtagtgtga ttgatgttgg tggggtgctg 3120 gtcgagattg gagttactga tgccatgttt aacaatatcg aaatgcagct tgtgcgacct 3180 gatgctgcag cccctgccgg ggtgctacga gccccagacg acaccgtgga tggcttgttg 3240 gacatacccc cggcccacac tgatgtagcg gcggtgttaa cagctgaggc gattgggcat 3300 gcgccccttg aattggccgc cataaatcca cccgggcctg tattggagca gggcctatta 3360 tacatgccgg ccaggcttga tgggcgtgat gaggttgtta agctccagct gtcggatact 3420 gtacactgcc gcctggctgc acccactagc cgtcttgcgg tgattaacac attggttggg 3480 cggtacggta aagccactaa gctgcctgag gttgaatatg acttaatgga cactattgcg 3540 cagttctggc atcatatcgg accaatcaac ccctcaacac tggagtatgc agagatgtgc 3600 gaggccatgc ttagtaaggg ccaggatggg tccttgattg tacatctgga tttacaggat 3660 gctgattgtt ctcgcataac attcttccag aaggactgcg ctaaatttac 
gctggatgac 3720 cctgttgcac acggtaaagt gggacagggg atatctgcgt ggccgaaaac tttgtgtgca 3780 cttttcggcc cctggttccg ggctatagag aagcaccttg tggctgggtt acccccaggt 3840 tattactatg gggacctgta cacggaagcc gatctgcatc gttctgtgct ttgcgcgcct 3900 gctggtcacc ttgtttttga gaatgatttc tcagagtttg actcaacgca gaataatgtg 3960 tcccttgatc tcgaatgtga attgatgcgc aggtttggga tgcccgattg gatggtagcc 4020 ttgtaccatc ttgttcgatc atactggctc ttggttgccc cgaaagaagc ccttcgtggc 4080 tgttggaaaa aacactctgg tgagccgggc acccttttgt ggaatacagt ttggaacatg 4140 actgtgttgc atcatgttta tgagtttgat cgaccaagtg tgttgtgttt caaaggtgat 4200 gatagtgtcg ttgtctgtga atcggtgcgc gcccgtccag agggcgttag tctcgtggca 4260 gactgcgggc taaaaatgaa ggacaagacc ggcccgtgtg gcgccttttc caacctgctg 4320 atcttcccgg gagctggtgt tgtctgcgac ctgttacggc agtggggccg cttgactgac 4380 aagaactggg ggcccgacat tcagcggatg caggaccttg agcaagcgtg taaggatttt 4440 gttgcacgtg ttgtaactca gggtaaagag atgttgacca tccagcttgt ggcgggttat 4500 tatggtgtgg aagttggtat ggttgaggtg gtttgggggg ctttgaaggc ctgcgccgca 4560 gcccgcgaga ccctagtgac caacaggttg ccggtactaa acttatctaa ggaggactga 4620 acaaataaca atcattatgc agtctgcgcg tccatgtgcc ttagctgcca gttctggtgt 4680 ttggagtgcc aggaaagtgg ggtgggatgt cgctgtgtag attgttgctc atgcttgcaa 4740 tgtgctgcgg ggtgtcaagg ggctcccaaa cgctcccagc cggaggcagg cgtggccagc 4800 gccgccgtga caattcagcc cagtggagca ctcaacaacg ccccgaggga gccgtcggcc 4860 ccgcccctct cacagacgtt gtcaccgcgg caggtactcg cacggtacca gatgtagatc 4920 aagccggtgc cgtgctggtg cgccagtata atctagtgac cagcccgtta ggcctggcca 4980 cccttggtag caccaatgcc ttgctttatg ccgcaccggt gtcaccgtta atgccgcttc 5040 aggacggcac gacgtctaat atcatgagca cggagtctag caactatgct caataccgtg 5100 tacagggcct aactgtccgc tggcgcccag ttgtgccaaa tgcggtgggc ggcttctcta 5160 taagcatggc ctattggccc cagacaacat ccacccctac aagcattgac atgaattcca 5220 tcacgtccac tgacgtccgt gtggtgcttc agccgggctc tgctggtttg ctgactatac 5280 cacatgagcg tttggcgtat aagaacaatg gttggcggtc cgtcgaaacg gtatccgtcc 5340 cacaggagga tgccacgtcc ggcatgctca tggtttgtgt ccacgggacc ccctggaata 
5400 gttataccaa tagtgtttac accgggccgc ttggtatggt tgattttgcc ataaagttac 5460 agctaaggaa cttgtcgccc ggtaatacaa atgccagggt cacccgtgtg aaggtgacgg 5520 ccccacatac catcaaggct gacccatctg gtgctaccat aacaacagca gctgcggcca 5580 ggtttatggc ggatgtgcgt tggggcttgg gcactgctga ggatggcgaa attggtcacg 5640 gcatccttgg tgttctgttt aacctggcgg acacagtttt aggtggcttg ccctcgacac 5700 tgctgcgggc ggcgagtggt cagtacatgt acggccggcc tgtggggaac gcgaacggcg 5760 agcctgaggt gaaactgtat atgtcggttg aggatgccgt taacgataaa cctattatgg 5820 tcccccatga catcgacctc gggaccagca ctgtcacctg ccaggactat gggaatcagc 5880 atgtggatga ccgcccatcc ccggccccgg cccctaagcg agctttgggc accctaaggt 5940 caggggatgt gttgcgtatt actggctcca tgcagtatgt gactaacgcc gagttgttac 6000 cgcagagtgt gtcacagggg tactttgggg ccggcagcac catgatggtg cataatttga 6060 tcactggtgt gcgcgccccc gccagttcag tcgactggac gaaggcaaca gtggatgggg 6120 tccaggtgaa gactgtcgat gctagttctg ggagtaatag gtttgcagcg ttacctgcat 6180 ttggaaagcc agctgtgtgg gggccccagg gcgctgggta tttctaccag tataacagca 6240 cccaccagga gtggatttat tttcttcaga atggtagctc cgtggtttgg tatgcatata 6300 ctaatatgtt gggccagaag tcagatacat ccattctttt tgaggtccgg ccaatccaag 6360 ctagtgatca gccttggttt ttggcacacc acactggcgg cgatgactgt accacctgtc 6420 tgcctctggg gttaagaaca tgttgccgcc aggcgccaga agaccagtca cctgagacgc 6480 gccggctcct agaccggctt agtaggacat tcccctcacc accctaatgt cgtggttttg 6540 gggttttagg ttgattttct gtatctgggc gtaattgccc ctatgtttaa tttattgtga 6600 tttttataac tgttcatttg attatttatg aaatcctccc atctcgggca tagt 6654 //