ID K01689; SV 1; linear; genomic RNA; STD; VRL; 2071 BP. XX AC K01689; XX DT 07-NOV-1985 (Rel. 07, Created) DT 17-APR-2005 (Rel. 83, Last updated, Version 7) XX DE Influenza C/California/78, hemagglutinin (seg 4), cDNA. XX KW glycoprotein; haemagglutinin. XX OS Influenza C virus OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Gammainfluenzavirus. XX RN [1] RP 1-2071 RX PUBMED; 6699942. RA Nakada S., Creager R.S., Krystal M., Aaronson R.P., Palese P.; RT "Influenza C virus hemagglutinin: comparison with influenza A and B virus RT hemagglutinins"; RL J. Virol. 50(1):118-124(1984). XX DR MD5; d48cafdf8d6713d7d42cbe8bc2052096. XX CC The location of the hemagglutinin gene was deduced by computer CC analysis of influenza C/Cal/78 and comparison with the CC organizations of influenzas B/Lee/40 and A/PR/8/43. The alignment CC of the amino acid sequence of the C virus HA remains tentative, CC since it is based solely on structural homologies. Plus strand is CC shown. XX FH Key Location/Qualifiers FH FT source 1..2071 FT /organism="Influenza C virus" FT /mol_type="genomic RNA" FT /db_xref="taxon:11552" FT CDS 22..1986 FT /codon_start=1 FT /note="hemagglutinin precursor (putative); putative" FT /db_xref="GOA:A0A2C9NIC3" FT /db_xref="InterPro:IPR001364" FT /db_xref="InterPro:IPR003860" FT /db_xref="InterPro:IPR007142" FT /db_xref="InterPro:IPR008980" FT /db_xref="InterPro:IPR014831" FT /db_xref="UniProtKB/TrEMBL:A0A2C9NIC3" FT /protein_id="AAA43791.1" FT /translation="MFFSLLLMLGLTEAEKIKICLQKQVNSSFSLHNGFGGNLYATEEK FT RMFELVKPKAGASVLNQSTWIGFGDSRTDQSNSAFPRSLMSAKTADKFRSLSGGSLMLS FT MFGPPGKVDYLYQGCGKHKVFYEGVNWSPHAAIDCYRKNWTDIKLNFQKSIYELASQSH FT CMSLVNALDKTIPLQVTKGVAKNCNNSFLKNPALYTQEVKPLEQICGEENLAFFTLPTQ FT FGTYECKLHLVASCYFIYDSKEVYNKRGCGNYFQVIYDSSGKVVGGLDNRVSPYTGNSG FT DTPTMQCDMLQLKPGRYSVRSSPRFLLMPERSYCFDMKEKGPVTAVQSIWGKGRKSDYA FT VDQACLSTPGCMLIQKQKPYIGEADDHHGDQEMRELLSGLDYEARCISQSGWVNETSPF FT TEEYLLPPKFGRCPLAAKEESIPKIPDGLLIPTSGTDTTVTKPKSRIFGIDDLIIGLLF FT VAIVEAGIGGYLLGSRKESGGGVTKESAEKGFEKIGNDIQILRSSTNIAIEKLNDRISH FT DEQAIRDLTLEIENARSEALLGELGIIRALLVGNISIGLQESLWELASEITNRAGDLAV FT EVSPGCWIIDNNICDQSCQNFIFKFNETAPVPTIPPLDTKIDLQSDPFYWGSSLGLAIT FT AANLMAALVISGIAICRTK" FT sig_peptide 22..63 FT /note="hemagglutinin signal peptide (putative); putative" FT mat_peptide 64..1356 FT /note="hemagglutinin HA1 chain (putative); putative" FT mat_peptide 1357..1983 FT /note="hemagglutinin HA2 chain (putative); putative" XX SQ Sequence 2071 BP; 696 A; 381 C; 440 G; 554 T; 0 other; agcaaaagca ggggtttaat aatgtttttc tcattactct tgatgttggg cctcacagag 60 gctgaaaaaa taaagatatg ccttcaaaag caagtgaaca gtagcttcag cctacacaat 120 ggcttcggag gaaatttgta tgccacagaa gaaaaaagaa tgtttgagct tgttaagccc 180 aaagctggag cctctgtctt gaatcaaagc acatggattg gctttggaga ttcaagaact 240 gaccaaagca attcagcttt tcctaggtcg ctgatgtcag caaaaactgc tgataaattt 300 cgttctttgt ctggtggatc cttgatgttg agtatgtttg gcccacctgg gaaggtagat 360 tacctttacc aaggatgtgg aaagcataaa gttttttatg aaggagtcaa ctggagtcca 420 catgctgcta tagattgtta cagaaaaaat tggactgaca tcaaactgaa tttccagaaa 480 agcatttatg aattggcttc acaatcacat tgcatgagct tggtgaatgc cttggacaaa 540 actattcctt tacaagtgac taaaggagtt gcaaaaaatt gcaacaacag cttcttaaaa 600 aatccagcat tgtacacaca agaagtcaaa cctttagagc aaatatgtgg ggaagaaaat 660 cttgcttttt tcacacttcc aacccaattt ggaacctatg agtgcaaact gcatcttgtg 720 gcttcttgct atttcatcta tgatagcaaa gaagtgtaca ataaaagagg atgtggcaac 780 tactttcaag tgatctatga ttcatctgga aaagttgttg gagggctaga taacagggta 840 tcaccttaca cagggaattc tggagacact ccaacaatgc aatgtgacat gctccagctg 900 aaacctggaa gatattcagt aagaagctct ccaagattcc ttttaatgcc tgaaaggagt 960 tattgctttg acatgaaaga aaaaggacca gtcactgctg tccaatccat ctggggaaaa 1020 ggcagaaaat ctgactatgc agtagatcag gcttgcttga gcactccagg gtgcatgttg 1080 atccaaaagc aaaagccata cattggagag gctgatgatc accatggaga tcaagaaatg 1140 agggagttgc tgtcaggact ggactatgaa gctagatgca tatcacaatc agggtgggtg 1200 aatgaaacca gtccttttac ggaagaatac ctccttcctc ccaaatttgg aagatgtccc 1260 ttggccgcaa aggaagaatc cattccaaaa atcccagatg gacttctaat tcccaccagt 1320 ggaactgata ccactgtaac caaacctaaa agcagaattt ttggaatcga tgaccttatt 1380 attggtctac tatttgttgc aattgttgaa gcaggaattg gaggctatct gcttggaagt 1440 agaaaagaat caggaggagg tgtgacaaaa gaatcagctg aaaaagggtt tgaaaaaatt 1500 ggaaatgaca tacaaatctt aagatcttct acaaatattg caatagaaaa actgaacgac 1560 agaatttctc atgatgagca agccatcaga gatctaactt tagaaattga aaatgcaaga 1620 tctgaagctc tattaggaga attgggaata ataagagcct tgctggtagg aaatataagc 1680 ataggattac aagaatcttt atgggaacta gcttcagaaa taacaaatag agcaggagac 1740 ctggcagtcg aagtctctcc aggttgctgg ataatcgaca ataacatttg tgatcaaagt 1800 tgtcaaaact ttattttcaa gttcaacgaa actgcgcctg ttccaaccat tccccctctt 1860 gacacaaaaa ttgatctgca atcagatcct ttttactggg gaagcagctt gggcttagca 1920 ataactgctg ctaatctaat ggcagctttg gtgatctctg ggatcgccat ctgcagaact 1980 aaatgatcag gacaattttg aaaaatggat aatatattag tcaatatttt gtacagcttt 2040 ataaaaaaac aaaaaacccc ttgctactgc t 2071 //