Dbfetch

ID   AY535004; SV 1; linear; genomic RNA; STD; VRL; 6654 BP.
XX
AC   AY535004;
XX
DT   01-MAR-2004 (Rel. 79, Created)
DT   09-JAN-2022 (Rel. 144, Last updated, Version 4)
XX
DE   Avian hepatitis E virus, complete genome.
XX
KW   .
XX
OS   Avian hepatitis E virus
OC   Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Alsuviricetes;
OC   Hepelivirales; Hepeviridae; Orthohepevirus.
XX
RN   [1]
RP   1-6654
RX   PUBMED; 15166445.
RA   Huang F.F., Sun Z.F., Emerson S.U., Purcell R.H., Shivaprasad H.L.,
RA   Pierson F.W., Toth T.E., Meng X.J.;
RT   "Determination and analysis of the complete genomic sequence of avian
RT   hepatitis E virus (avian HEV) and attempts to infect rhesus monkeys with
RT   avian HEV";
RL   J. Gen. Virol. 85(PT 6):1609-1618(2004).
XX
RN   [2]
RP   1-6654
RA   Huang F.F., Sun Z.F., Emerson S.U., Purcell R.H., Shivaprasad H.L.,
RA   Pierson W.F., Toth T.E., Meng X.J.;
RT   ;
RL   Submitted (29-JAN-2004) to the INSDC.
RL   Department of Biomedical Sciences and Pathobiology, College of Veterinary
RL   Medicine, Virginia Polytechnic Institute and State University, 1410 Price's
RL   Fork Road, Blacksburg, VA 24060, USA
XX
DR   MD5; 439b3ec031384297e19fc8f9bfcdfbe5.
DR   EuropePMC; PMC2849599; 20107086.
DR   EuropePMC; PMC3006657; 21203540.
DR   EuropePMC; PMC3122850; 21307216.
DR   EuropePMC; PMC3187576; 21972239.
DR   EuropePMC; PMC3310647; 22000397.
DR   EuropePMC; PMC3416139; 22696648.
DR   EuropePMC; PMC3624379; 23388713.
DR   EuropePMC; PMC3884734; 24378180.
DR   EuropePMC; PMC3993473; 24478416.
DR   EuropePMC; PMC4163165; 25187005.
DR   EuropePMC; PMC4225530; 24252365.
DR   EuropePMC; PMC4290951; 25339404.
DR   EuropePMC; PMC4313232; 25187634.
DR   EuropePMC; PMC4442536; 25741007.
DR   EuropePMC; PMC4542097; 26260476.
DR   EuropePMC; PMC4641181; 26605326.
DR   EuropePMC; PMC497618; 15297556.
DR   EuropePMC; PMC525261; 15528746.
DR   EuropePMC; PMC5320732; 28222808.
DR   EuropePMC; PMC5429376; 28110375.
DR   EuropePMC; PMC6199738; 30352598.
DR   EuropePMC; PMC6307819; 30532200.
DR   EuropePMC; PMC6503432; 31060564.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..6654
FT                   /organism="Avian hepatitis E virus"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /db_xref="taxon:2907992"
FT   CDS             25..4620
FT                   /codon_start=1
FT                   /product="non-structural polyprotein"
FT                   /note="includes functional domains like methyltransferase,
FT                   helicase and RdRp"
FT                   /db_xref="GOA:Q6QLN1"
FT                   /db_xref="InterPro:IPR001788"
FT                   /db_xref="InterPro:IPR002588"
FT                   /db_xref="InterPro:IPR002589"
FT                   /db_xref="InterPro:IPR007094"
FT                   /db_xref="InterPro:IPR022202"
FT                   /db_xref="InterPro:IPR027351"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q6QLN1"
FT                   /protein_id="AAS45830.1"
FT                   /translation="MDVSQFAESKGVKTALEAAALAAANTALRNARVVTPYLTQQQTKN
FT                   LLELFRGAQLRFEPRDNWAHPVQRVVHDALEQYVRRAAGPNCLEVGAHPRSINRHQASH
FT                   RCFLPPVGRDEQRWQVAPRRGLCNLIRRALLNGVKVAREFCQLGFGACSHQCEVGIALY
FT                   SLHDMRPADVACAMARHNMRTMYVVLHLPEEAMLPPGSYSNKFYNTVNTADKCIITYAD
FT                   DSCAGYVHKREVLQDWITTTGVSGRHPMLIERVRAIGCHFVLLCTATQPCPMPYTPYPS
FT                   SNTVYVRNVYGPALGAGLFTPKCCVDATFYPVPRRVWQRLMMFGTTLDDDAFCCSRLLT
FT                   YLRGISTKVTVGNIVANEGWQPEEQQLTAVAIAAYLTVCHQRWVRTQGIARGVRRLQAE
FT                   HAQQFWFKVWELFTNTGTVPGYSAGFYRQLATWISGGLTIDFERRVFDKRVKCGCCCVC
FT                   ERRPADPGCLCIDDFPDGANGLVKLKKWPIRAGTKSAVSKWAQVRVRADSTEDLIDLSV
FT                   PKLLTLKELAAAAIRKQPSAPPSLHILDRRPVGDPRRPVNCAPPAVSAGPVPAPPGNPV
FT                   IESVQGSGAGGPEVSESQPGLTPTREVTNMPLPPQRGQEEVLAVLPSGARVIVGNLLDV
FT                   AADWLVNPANRDHQPGGGLCGMFHRRWPHLWPVCGEVQDLPTGPVIFQQGPPKVIHAPG
FT                   PDYRIKPDPDGLRRVYAVVHQAHGTVASPLISAGIYRAPARESFEAWAATARDGDLLVV
FT                   QRSMAQHIRDFVLNEGRHRPRELHVDRAMADMVNYGLATEPEPYNELVKGVEVAPMTVK
FT                   YALIAGVPGSGKSSSVDHRGAVVITPTKTLAREWSARGATAVTPHVAASAAPEGRVIVD
FT                   EAYAIPPHLLVASLRRARDVVMLGDPHQIPALDFDGRCLTSAVDLGLQPTSWRTVSHRC
FT                   PWDVCIFLRTDYPTITTTSRVLRSVVFTGETIGQKIVFTQVAKQSNPGSITVHEAQGST
FT                   FDQTTIIATLDARGLIASSRAHAIVALTRHRERCSVIDVGGVLVEIGVTDAMFNNIEMQ
FT                   LVRPDAAAPAGVLRAPDDTVDGLLDIPPAHTDVAAVLTAEAIGHAPLELAAINPPGPVL
FT                   EQGLLYMPARLDGRDEVVKLQLSDTVHCRLAAPTSRLAVINTLVGRYGKATKLPEVEYD
FT                   LMDTIAQFWHHIGPINPSTLEYAEMCEAMLSKGQDGSLIVHLDLQDADCSRITFFQKDC
FT                   AKFTLDDPVAHGKVGQGISAWPKTLCALFGPWFRAIEKHLVAGLPPGYYYGDLYTEADL
FT                   HRSVLCAPAGHLVFENDFSEFDSTQNNVSLDLECELMRRFGMPDWMVALYHLVRSYWLL
FT                   VAPKEALRGCWKKHSGEPGTLLWNTVWNMTVLHHVYEFDRPSVLCFKGDDSVVVCESVR
FT                   ARPEGVSLVADCGLKMKDKTGPCGAFSNLLIFPGAGVVCDLLRQWGRLTDKNWGPDIQR
FT                   MQDLEQACKDFVARVVTQGKEMLTIQLVAGYYGVEVGMVEVVWGALKACAAARETLVTN
FT                   RLPVLNLSKED"
FT   CDS             4654..4917
FT                   /codon_start=1
FT                   /product="cytoskeleton related protein"
FT                   /note="unknown function"
FT                   /db_xref="GOA:Q913Y8"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q913Y8"
FT                   /protein_id="AAS45832.1"
FT                   /translation="MCLSCQFWCLECQESGVGCRCVDCCSCLQCAAGCQGAPKRSQPEA
FT                   GVASAAVTIQPSGALNNAPREPSAPPLSQTLSPRQVLARYQM"
FT   CDS             4707..6527
FT                   /codon_start=1
FT                   /product="capsid protein"
FT                   /note="structural protein"
FT                   /db_xref="GOA:Q913Y7"
FT                   /db_xref="InterPro:IPR004261"
FT                   /db_xref="InterPro:IPR029053"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q913Y7"
FT                   /protein_id="AAS45831.1"
FT                   /translation="MSLCRLLLMLAMCCGVSRGSQTLPAGGRRGQRRRDNSAQWSTQQR
FT                   PEGAVGPAPLTDVVTAAGTRTVPDVDQAGAVLVRQYNLVTSPLGLATLGSTNALLYAAP
FT                   VSPLMPLQDGTTSNIMSTESSNYAQYRVQGLTVRWRPVVPNAVGGFSISMAYWPQTTST
FT                   PTSIDMNSITSTDVRVVLQPGSAGLLTIPHERLAYKNNGWRSVETVSVPQEDATSGMLM
FT                   VCVHGTPWNSYTNSVYTGPLGMVDFAIKLQLRNLSPGNTNARVTRVKVTAPHTIKADPS
FT                   GATITTAAAARFMADVRWGLGTAEDGEIGHGILGVLFNLADTVLGGLPSTLLRAASGQY
FT                   MYGRPVGNANGEPEVKLYMSVEDAVNDKPIMVPHDIDLGTSTVTCQDYGNQHVDDRPSP
FT                   APAPKRALGTLRSGDVLRITGSMQYVTNAELLPQSVSQGYFGAGSTMMVHNLITGVRAP
FT                   ASSVDWTKATVDGVQVKTVDASSGSNRFAALPAFGKPAVWGPQGAGYFYQYNSTHQEWI
FT                   YFLQNGSSVVWYAYTNMLGQKSDTSILFEVRPIQASDQPWFLAHHTGGDDCTTCLPLGL
FT                   RTCCRQAPEDQSPETRRLLDRLSRTFPSPP"
XX
SQ   Sequence 6654 BP; 1334 A; 1728 C; 1966 G; 1626 T; 0 other;
     gcatgacccc atgccagggt aagaatggac gtctcgcagt ttgcagagtc caagggggtt        60
     aagactgcac tagaagcggc cgctctagct gcagccaata ccgccctgcg aaatgcgcgg       120
     gtggtaaccc cctatcttac ccaacagcag actaagaatc tgctggagct gttccgtggt       180
     gctcagttgc ggtttgagcc acgcgacaac tgggcgcacc ctgtgcagcg ggtcgtgcat       240
     gatgcccttg agcagtatgt acgccgcgca gcggggccca attgcctgga ggtcggcgct       300
     catccacgct ccattaatag gcatcaagcc tcgcaccgct gcttcttacc cccagttggg       360
     cgtgatgagc agcggtggca ggtagcacca cggaggggtc tttgcaacct aatccgtcgc       420
     gcgctgctca atggtgtcaa ggtggcgcgt gagttttgtc agttagggtt cggtgcctgc       480
     tctcaccagt gtgaggtggg catagctctt tacagcttac acgacatgcg tcctgcggac       540
     gtggcgtgtg cgatggcccg tcataacatg cgcactatgt acgtcgtatt gcatctacct       600
     gaggaggcaa tgttaccacc agggtcctac tcaaacaagt tctacaacac agtgaacaca       660
     gcggataaat gcattataac gtatgctgac gattcttgcg cggggtatgt gcacaagcgt       720
     gaggtattac aggactggat aacgacaaca ggcgtttctg gtcggcatcc catgctaatt       780
     gagcgggtca gggccatagg ttgccatttt gtgctgttgt gcactgcaac ccagccctgc       840
     cccatgccgt atacgcctta cccgtcctca aacaccgtgt acgttaggaa tgtttacggg       900
     cccgcgctgg gtgccggtct ttttacaccc aaatgttgtg ttgatgccac gttctaccct       960
     gtgcccaggc gtgtgtggca gcggctgatg atgtttggta ccacacttga tgatgacgcc      1020
     ttctgttgct cgcgcctgct aacttattta cgtgggattt caaccaaggt gacagtaggt      1080
     aacattgtgg ccaatgaggg ctggcaaccg gaggagcagc agcttacggc tgtcgctata      1140
     gctgcatacc ttactgtctg tcatcaaagg tgggtgcgca cacagggcat agctcggggt      1200
     gtgaggcgct tacaggctga acatgcacag cagttctggt ttaaagtctg ggagttgttc      1260
     accaacaccg gcactgtgcc cgggtattcg gccgggtttt atcgccagct cgccacatgg      1320
     attagtggcg gccttaccat tgactttgaa cggcgtgttt tcgacaagcg tgtgaagtgt      1380
     gggtgctgtt gcgtctgcga gcgccgtcct gctgaccccg ggtgcttgtg cattgacgat      1440
     ttccccgatg gtgcgaacgg gctagtgaag ttgaaaaagt ggccgatacg agctggtacg      1500
     aagtctgcag ttagcaagtg ggctcaagtc agggtccggg ctgatagcac ggaggatttg      1560
     attgatctga gcgtgcccaa actactcaca cttaaagaac tcgccgctgc agcaatacgc      1620
     aagcaaccgt ccgccccgcc atcattgcac attttggacc gccgaccggt aggtgatccc      1680
     aggcgccccg ttaactgcgc accaccggcc gtctccgctg gccctgtacc ggcacccccg      1740
     ggtaatcctg ttattgagtc tgttcagggg tcgggggctg gcggacctga ggtcagtgag      1800
     tcccagcctg gcttgactcc gacgcgcgag gttactaata tgcccttgcc gccacagcgt      1860
     ggtcaagagg aggtcctcgc cgtgttgcca tcgggggcgc gcgtcatcgt gggtaaccta      1920
     ttggacgttg ccgccgactg gcttgttaac ccggctaatc gagaccatca gcccggaggc      1980
     gggttgtgtg gcatgttcca ccgtcgctgg ccgcacttgt ggcctgtttg cggcgaggtc      2040
     caggatttgc ccacgggccc ggtgattttc caacaggggc cacctaaggt tattcacgcc      2100
     cctggcccgg actaccgtat taagcctgac cccgacggcc tccgcagagt atacgctgtt      2160
     gtgcaccagg cgcatggcac ggtagctagc ccgttaatta gcgccgggat ataccgggca      2220
     ccggcccggg aatcatttga ggcctgggcg gccaccgccc gtgatggtga cctactggtt      2280
     gtgcagcgtt caatggctca gcacatcagg gactttgtgt tgaatgaggg tcgtcataga      2340
     cctagggagc ttcatgttga ccgggctatg gctgacatgg ttaattatgg gctggcaacc      2400
     gagcccgagc cgtacaatga gcttgtgaag ggcgttgagg ttgcgcctat gactgtgaaa      2460
     tacgcgctta tagctggtgt cccaggcagc ggcaagtcgt cgtctgttga ccatcgtggg      2520
     gctgttgtta ttacacccac caagacgttg gcacgtgagt ggtccgcgcg tggggccacg      2580
     gctgttacac cccacgttgc agcaagtgca gcccccgagg gcagggtgat tgtggacgag      2640
     gcgtatgcta tcccgccgca cctgcttgtg gcgtcccttc gtcgggcgcg cgatgttgtt      2700
     atgttggggg atccgcacca gataccagca ttggatttcg atggacgctg tttaacgagc      2760
     gccgttgatc ttgggttgca gcctaccagc tggcgcaccg tatcccaccg ttgcccttgg      2820
     gacgtttgta tatttttgcg tactgattat ccgactatca ccacaaccag tagggtgctg      2880
     cggtctgttg tgtttaccgg tgaaaccatt ggtcagaaga tagtgtttac ccaggtggcc      2940
     aagcagtcga accccgggtc cataacggtc catgaggcgc agggcagtac ttttgatcag      3000
     actactataa tcgccacgtt agatgctcgt ggccttatag cttcatctcg cgcgcatgcc      3060
     atagttgcgc taacccgcca ccgggagcgc tgtagtgtga ttgatgttgg tggggtgctg      3120
     gtcgagattg gagttactga tgccatgttt aacaatatcg aaatgcagct tgtgcgacct      3180
     gatgctgcag cccctgccgg ggtgctacga gccccagacg acaccgtgga tggcttgttg      3240
     gacatacccc cggcccacac tgatgtagcg gcggtgttaa cagctgaggc gattgggcat      3300
     gcgccccttg aattggccgc cataaatcca cccgggcctg tattggagca gggcctatta      3360
     tacatgccgg ccaggcttga tgggcgtgat gaggttgtta agctccagct gtcggatact      3420
     gtacactgcc gcctggctgc acccactagc cgtcttgcgg tgattaacac attggttggg      3480
     cggtacggta aagccactaa gctgcctgag gttgaatatg acttaatgga cactattgcg      3540
     cagttctggc atcatatcgg accaatcaac ccctcaacac tggagtatgc agagatgtgc      3600
     gaggccatgc ttagtaaggg ccaggatggg tccttgattg tacatctgga tttacaggat      3660
     gctgattgtt ctcgcataac attcttccag aaggactgcg ctaaatttac gctggatgac      3720
     cctgttgcac acggtaaagt gggacagggg atatctgcgt ggccgaaaac tttgtgtgca      3780
     cttttcggcc cctggttccg ggctatagag aagcaccttg tggctgggtt acccccaggt      3840
     tattactatg gggacctgta cacggaagcc gatctgcatc gttctgtgct ttgcgcgcct      3900
     gctggtcacc ttgtttttga gaatgatttc tcagagtttg actcaacgca gaataatgtg      3960
     tcccttgatc tcgaatgtga attgatgcgc aggtttggga tgcccgattg gatggtagcc      4020
     ttgtaccatc ttgttcgatc atactggctc ttggttgccc cgaaagaagc ccttcgtggc      4080
     tgttggaaaa aacactctgg tgagccgggc acccttttgt ggaatacagt ttggaacatg      4140
     actgtgttgc atcatgttta tgagtttgat cgaccaagtg tgttgtgttt caaaggtgat      4200
     gatagtgtcg ttgtctgtga atcggtgcgc gcccgtccag agggcgttag tctcgtggca      4260
     gactgcgggc taaaaatgaa ggacaagacc ggcccgtgtg gcgccttttc caacctgctg      4320
     atcttcccgg gagctggtgt tgtctgcgac ctgttacggc agtggggccg cttgactgac      4380
     aagaactggg ggcccgacat tcagcggatg caggaccttg agcaagcgtg taaggatttt      4440
     gttgcacgtg ttgtaactca gggtaaagag atgttgacca tccagcttgt ggcgggttat      4500
     tatggtgtgg aagttggtat ggttgaggtg gtttgggggg ctttgaaggc ctgcgccgca      4560
     gcccgcgaga ccctagtgac caacaggttg ccggtactaa acttatctaa ggaggactga      4620
     acaaataaca atcattatgc agtctgcgcg tccatgtgcc ttagctgcca gttctggtgt      4680
     ttggagtgcc aggaaagtgg ggtgggatgt cgctgtgtag attgttgctc atgcttgcaa      4740
     tgtgctgcgg ggtgtcaagg ggctcccaaa cgctcccagc cggaggcagg cgtggccagc      4800
     gccgccgtga caattcagcc cagtggagca ctcaacaacg ccccgaggga gccgtcggcc      4860
     ccgcccctct cacagacgtt gtcaccgcgg caggtactcg cacggtacca gatgtagatc      4920
     aagccggtgc cgtgctggtg cgccagtata atctagtgac cagcccgtta ggcctggcca      4980
     cccttggtag caccaatgcc ttgctttatg ccgcaccggt gtcaccgtta atgccgcttc      5040
     aggacggcac gacgtctaat atcatgagca cggagtctag caactatgct caataccgtg      5100
     tacagggcct aactgtccgc tggcgcccag ttgtgccaaa tgcggtgggc ggcttctcta      5160
     taagcatggc ctattggccc cagacaacat ccacccctac aagcattgac atgaattcca      5220
     tcacgtccac tgacgtccgt gtggtgcttc agccgggctc tgctggtttg ctgactatac      5280
     cacatgagcg tttggcgtat aagaacaatg gttggcggtc cgtcgaaacg gtatccgtcc      5340
     cacaggagga tgccacgtcc ggcatgctca tggtttgtgt ccacgggacc ccctggaata      5400
     gttataccaa tagtgtttac accgggccgc ttggtatggt tgattttgcc ataaagttac      5460
     agctaaggaa cttgtcgccc ggtaatacaa atgccagggt cacccgtgtg aaggtgacgg      5520
     ccccacatac catcaaggct gacccatctg gtgctaccat aacaacagca gctgcggcca      5580
     ggtttatggc ggatgtgcgt tggggcttgg gcactgctga ggatggcgaa attggtcacg      5640
     gcatccttgg tgttctgttt aacctggcgg acacagtttt aggtggcttg ccctcgacac      5700
     tgctgcgggc ggcgagtggt cagtacatgt acggccggcc tgtggggaac gcgaacggcg      5760
     agcctgaggt gaaactgtat atgtcggttg aggatgccgt taacgataaa cctattatgg      5820
     tcccccatga catcgacctc gggaccagca ctgtcacctg ccaggactat gggaatcagc      5880
     atgtggatga ccgcccatcc ccggccccgg cccctaagcg agctttgggc accctaaggt      5940
     caggggatgt gttgcgtatt actggctcca tgcagtatgt gactaacgcc gagttgttac      6000
     cgcagagtgt gtcacagggg tactttgggg ccggcagcac catgatggtg cataatttga      6060
     tcactggtgt gcgcgccccc gccagttcag tcgactggac gaaggcaaca gtggatgggg      6120
     tccaggtgaa gactgtcgat gctagttctg ggagtaatag gtttgcagcg ttacctgcat      6180
     ttggaaagcc agctgtgtgg gggccccagg gcgctgggta tttctaccag tataacagca      6240
     cccaccagga gtggatttat tttcttcaga atggtagctc cgtggtttgg tatgcatata      6300
     ctaatatgtt gggccagaag tcagatacat ccattctttt tgaggtccgg ccaatccaag      6360
     ctagtgatca gccttggttt ttggcacacc acactggcgg cgatgactgt accacctgtc      6420
     tgcctctggg gttaagaaca tgttgccgcc aggcgccaga agaccagtca cctgagacgc      6480
     gccggctcct agaccggctt agtaggacat tcccctcacc accctaatgt cgtggttttg      6540
     gggttttagg ttgattttct gtatctgggc gtaattgccc ctatgtttaa tttattgtga      6600
     tttttataac tgttcatttg attatttatg aaatcctccc atctcgggca tagt            6654
//