ID JX045650; SV 1; linear; genomic RNA; STD; PHG; 3562 BP. XX AC JX045650; XX DT 15-JUL-2012 (Rel. 113, Created) DT 03-SEP-2012 (Rel. 114, Last updated, Version 2) XX DE Enterobacteria phage Hgal1, complete sequence. XX KW . XX OS Enterobacteria phage Hgal1 OC Viruses; Riboviria; Leviviridae; Levivirus. XX RN [1] RP 1-3562 RX PUBMED; 22821966. RA Kannoly S., Shao Y., Wang I.N.; RT "Rethinking the Evolution of Single-Stranded RNA (ssRNA) Bacteriophages RT Based on Genomic Sequences and Characterizations of Two R-Plasmid-Dependent RT ssRNA Phages, C-1 and Hgal1"; RL J. Bacteriol. 194(18):5073-5079(2012). XX RN [2] RP 1-3562 RA Kannoly S., Shao Y., Wang I.-N.; RT ; RL Submitted (10-MAY-2012) to the INSDC. RL Biological Sciences, University at Albany, 1400 Washington Avenue, Albany, RL NY 12222, USA XX DR MD5; 6364c86f728c4bb6ad1cb4bbe4687e2e. DR EuropePMC; PMC3430324; 22821966. XX FH Key Location/Qualifiers FH FT source 1..3562 FT /organism="Enterobacteria phage Hgal1" FT /host="Enterobacteriaceae" FT /mol_type="genomic RNA" FT /db_xref="taxon:1206300" FT CDS 174..1397 FT /codon_start=1 FT /transl_table=11 FT /product="putative maturation/attachment protein" FT /db_xref="GOA:I6XDB9" FT /db_xref="InterPro:IPR005563" FT /db_xref="UniProtKB/TrEMBL:I6XDB9" FT /protein_id="AFN37816.1" FT /translation="MQVAAPVKVGTVYKGNAKYRTVVVDRNGNPISDKTTVNTKFEQWI FT VERDNVRKSTKTRTKWRPPLAYRKTILSLSDIRGECSQTVNSSGRTTSISGDVPATNFN FT WNLKKILHGQSDMVMWDLNELARAETEVLVKLKNSNINLGEGLAEAKSTYEHLAKTTKT FT LTQTIRAVRHGDWKGALNALKVDPNRKWSTKDPAGRWLELQFGWTPLINDISALMDYKK FT TFDVWDAGKRRNYFSVTRALKSQDSGPYNTALSGFPANTSRGNIELGTTVKLYCTVANP FT KLALLNQIGLLAPQQVAWALMPWSFAIDWFLPIGTFLEAITAPLGFELVGGSRTRYAQG FT GGTLVAQAWRNSASSSQNGLFERDFEINAMRREPITSFPIPRPYIKSPFSTSHSISALA FT LIRQLTKR" FT CDS 1403..1801 FT /codon_start=1 FT /transl_table=11 FT /product="capsid protein" FT /note="putative coat protein" FT /db_xref="GOA:I6X3K8" FT /db_xref="InterPro:IPR002703" FT /db_xref="InterPro:IPR015954" FT /db_xref="UniProtKB/TrEMBL:I6X3K8" FT /protein_id="AFN37817.1" FT /translation="MPQLQNLVLKDRAATPKSHTFTPRNVEQNVGTVVETTGVPVGEPR FT FSISLRQTADNYKAELRLMVPVVQDQIINNVSSPVAVRQAIASATFTFAKTSTEAERND FT IVGMFADALATDKTLVNDCVVKLGGIYG" FT CDS 1794..1991 FT /codon_start=1 FT /transl_table=11 FT /product="lysis protein" FT /note="empirically verified" FT /db_xref="GOA:I6WIU6" FT /db_xref="UniProtKB/TrEMBL:I6WIU6" FT /protein_id="AFN37818.1" FT /translation="MANPRRETLSVHRFIVTKSQAQFSIRIIRCIRNLLAIAGMLFLLS FT PIYLDKFIQIYLNAYVVPMT" FT CDS 1880..3451 FT /codon_start=1 FT /transl_table=11 FT /product="replicase" FT /db_xref="GOA:I6XI25" FT /db_xref="InterPro:IPR005093" FT /db_xref="InterPro:IPR007096" FT /db_xref="UniProtKB/TrEMBL:I6XI25" FT /protein_id="AFN37819.1" FT /translation="MHKKSPSNSRNVISSFPDLSRQIHSDLSECVRRSDDIDDFRLSYL FT CDVFLSKHPSLEGGVSAEAREKLAFEKMQASEKRNSATNCRLLPSDGILPSDVSVVFHY FT ARRIMAGILGPLSADVVVNATFSNGASTSKKRSQGDAFFKIVGKSDVTQDAYGFAVAAV FT RAYPLYEELLTEQYGSSDNWFNVVQGNVAFTVPKNSSIDRAACKEPDLNMFLQAGVGSH FT IRRRLRRHGIDLNDQTQNQRLAREGSVDGRLATIDLSSASDSVTRALVMQLLPYQWYSY FT LDAIRSKTGLISGKRHRWEMFSSMGNGFTFELESAIFYSLARACLIHLGIDDVVGVYGD FT DIILPCAAYNLLEVILSFAGFTVNSKKSFARGFFRESCGKHFYEGVDVTPFFVKKPITD FT TTRVIWFLNQLRSWCSVNEFCDSRFYPIWKKYSRLVPKLLHGGKDLQSIFALVTPGQPN FT RILRPLNKGTLISGKAALIRYGLAPTVNSSTKLDEERLKHVIEGRYRLVPNNAWWTPIP FT EFQEEL" XX SQ Sequence 3562 BP; 899 A; 847 C; 860 G; 956 T; 0 other; ggtggagagc cctttcgggc tctctgtctc acatctctgc aagactgacc attaaggtct 60 ctccaaagag ttgccccatc actggggctc cgccatgtca atagatttag gcggctgtga 120 ctcgatatta tcgtcacggt atggcccctt agttcactta ttaagggaac accatgcaag 180 tagcggcacc tgtcaaggtg ggcactgtct acaaaggcaa tgcaaaatat cgaaccgtag 240 tggtagatcg caacggtaat cctatctcag ataagactac cgtaaacact aaatttgagc 300 aatggatcgt cgaacgtgat aacgttcgta aatcgaccaa aactcgaact aagtggcgac 360 cgccattggc ttaccgtaag accatattat cactgagtga tatacgtggg gagtgttccc 420 agaccgttaa ctctagcggt aggacgacat ccatttctgg tgacgttcct gctactaatt 480 ttaactggaa tctgaagaaa attctccacg gtcaatcaga tatggtaatg tgggatctta 540 atgagttagc gagagctgaa actgaggttc tcgtaaagct caagaactct aatatcaact 600 taggcgaagg cctagctgaa gctaagagta cttacgagca tttagccaaa acaaccaaaa 660 cattgacgca gacgatccgc gccgtgagac acggcgattg gaaaggtgcg ctcaatgctt 720 tgaaggtcga tcctaaccgt aagtggtcca ctaaggaccc tgccggccga tggctcgaac 780 ttcagtttgg ttggactccg ttgattaacg acatctccgc attaatggac tataaaaaga 840 cctttgatgt atgggatgcc gggaagcgac ggaattactt ctcagttact cgggcactga 900 agtcccaaga tagtggtccg tataatacgg ccctatccgg gttcccagcc aatacctcca 960 gaggaaatat tgaactgggg acaacagtta agttatattg tacggtagct aatccaaaac 1020 tggctctatt gaaccagatt gggctactgg cgcctcagca agtggcttgg gcattaatgc 1080 cttggtcatt tgctatcgat tggttcctac ctataggaac ttttcttgag gctataactg 1140 cgcccttagg ctttgagttg gtcggcggta gtcgcacccg ctatgcgcaa ggagggggaa 1200 ccctcgttgc acaggcttgg cgcaactccg cttcatcatc tcaaaacggg cttttcgaac 1260 gtgatttcga gattaatgcg atgaggcgag agcctatcac atctttccca attccacgtc 1320 cgtatatcaa aagccctttt tcaacttctc actcgataag cgcattagcg ctgattcgtc 1380 aacttaccaa aaggtaatct gcatgccaca gctgcaaaac ctcgtcctaa aggaccgagc 1440 tgcaactccc aagagccata catttactcc acgtaacgtg gaacagaatg ttggtactgt 1500 agtagaaact accggcgtgc ctgttggaga gccgcggttt agcatctctc ttcgccagac 1560 ggctgacaac tataaagctg agttacgtct gatggttccg gtcgttcaag accagatcat 1620 caataacgta tcttcgccag ttgccgttcg tcaagcgatt gcgtctgcca cctttacgtt 1680 cgcgaagacc tctactgagg ctgagcgaaa cgatattgtt ggtatgttcg ctgatgcttt 1740 agctactgat aagaccctgg ttaacgactg tgtcgttaaa ctgggtggta tctatggcta 1800 acccgcgccg ggagacgctt agcgtccacc ggtttattgt gactaagtca caagctcaat 1860 tctccatcag gataatccga tgcataagaa atctcctagc aatagcagga atgttatttc 1920 ttctttcccc gatctatctc gacaaattca ttcagattta tctgaatgcg tacgtcgttc 1980 cgatgacata gacgatttcc gtctaagtta tctttgcgac gtcttcttgt cgaagcatcc 2040 gtctttagaa ggcggtgttt cggcagaagc gagagagaaa cttgcatttg agaagatgca 2100 agcctcggag aagcgcaact cagcgacaaa ttgtcgtcta cttccaagcg atggcatact 2160 gccatccgac gttagcgtag tatttcacta tgcacggcgc attatggctg gaatattggg 2220 acctctttcc gctgatgtgg tagttaacgc tacattctca aatggcgcct caacttcaaa 2280 gaagaggagc cagggagatg cgtttttcaa gattgtcgga aaaagcgacg ttactcagga 2340 tgcgtacggc tttgccgttg cagccgttag ggcctaccct ctatatgagg agctcctaac 2400 agaacaatat ggttccagtg ataactggtt caatgttgtt cagggtaacg tggccttcac 2460 cgtgccgaaa aattcgtcaa tagatcgtgc tgcctgtaag gagcccgatt tgaacatgtt 2520 tttgcaggct ggcgtaggat ctcatatcag acgacgtctt cgtcgccatg gtatcgatct 2580 taatgaccag acgcaaaacc agcgcttagc acgagaagga tctgtagacg gccgcctggc 2640 cactatagat ctctcgagcg cgagtgactc agtcacccgt gcgcttgtta tgcagctgct 2700 tccttatcaa tggtattcgt accttgatgc aatccggtct aagaccggtt tgatctctgg 2760 caaacgccac agatgggaaa tgttctcctc tatgggtaat ggtttcacgt tcgagcttga 2820 gagtgcgata ttttactctc tcgcgcgtgc gtgccttatc catcttggga ttgacgacgt 2880 cgtaggcgta tatggcgatg atataatctt gccatgtgct gcctataacc ttctcgaagt 2940 tatcttgagt ttcgcaggtt ttactgtgaa ctctaagaag tcatttgctc gtggattctt 3000 cagagaatcc tgtggtaaac acttctatga gggtgtagac gtaacccctt tcttcgttaa 3060 gaagccaatt acagacacga ctcgcgttat ttggttttta aaccaattac gctcgtggtg 3120 ttctgtaaat gaattctgtg atagtcgctt ctaccctatt tggaagaaat attccagatt 3180 ggttccgaag cttcttcacg gaggaaagga cctgcaatct atctttgctt tagtaacccc 3240 agggcagccc aaccgcattc ttcgtcccct taacaaaggg acgttgattt cgggtaaggc 3300 tgctctaatt cgctatgggc ttgcgcctac agtgaattcg tcaactaagt tagacgagga 3360 gcgacttaag catgtgatag agggccgtta taggctagtg cctaataacg catggtggac 3420 gccgatacct gagttccaag aggaactcta gtatccgacg tcgccccagc ctggagggtt 3480 ggggttcctt agttagcaac taaggtggtg gtgtcgggag acactatagg ttaaggaact 3540 cgctttgctc cttaacccac ca 3562 //