Here is the original phylogeny article. A revisited and extended view is presented here.
If you use items coming from this site, please quote the original article.
Abstract. An extensive phylogenetic analysis of the nicotinic acetylcholine receptor subunit genes family has been performed by cladistic and phenetic methods. The conserved parts of amino acid sequences have been analyzed by CLUSTAL V and PHYLIP software. The structure of the genes was also taken in consideration. The results show that a first gene duplication may have occurred before the appearance of Bilateria. Three subfamilies then appeared: I-the neuronal a-bungarotoxin binding site subunits (a7, a8), III-the neuronal nicotinic subunits (a2-a6, b2-b4) which also contains the muscle acetylcholine binding subunit (a1), and IV-the muscle non-a subunits (b1, g, d, e). The Insecta subunits (subfamily II) could be orthologous to family III and IV. Several tissular switches of expression from neuron to muscle and the converse can be inferred from the extant expression of subunits and the reconstructed trees. The diversification of the neuronal nicotinic subfamily begins in the stem lineage of chordates, the last duplications occurring shortly before the onset of the mammalian lineage. Such evolution parallels the increase in complexity of the cholinergic systems.
key words: Nicotinic receptor - Ligand-gated channel - Multigene family - gene phylogeny
Acetylcholine (ACh) has long been recognized as a neurotransmitter active in Bilateria nervous system and muscle. Two distinct categories of receptors are engaged in the biological effects of ACh: the muscarinic and nicotinic receptors. Muscarinic receptors belong to the super-family of G-protein coupled receptors; they consist of single integral proteins with seven transmembrane segments and interact, on their cytoplasmic face with heterotrimeric G-proteins. Nicotinic receptors (nAChR) belong to the super-family of ligand-gated ion channels; they are hetero-oligomers composed of five subunits, each with four transmembrane domains (Devillers-Thiéry et al. 1993, Galzi and Changeux 1994). ACh binding causes an ionic channel, most often cationic, to open, resulting in a rapid change in the electrical, and secondarily metabolic, state of the target cell (Greenberg et al. 1986, rev. Bertrand et al. 1993).
The nAChR of striated muscle is the best characterized member of the ligand gated ion channel super-family (rev. Changeux 1990, Karlin 1993): it is an hetero-pentamer (with the stoichiometry a2bgd). According to current models (rev. Bertrand et al. 1993), the ion channel forms along the axis of pseudosymmetry perpendicular to the cell membrane. The subunits share a similar hydropathic profile with 4 short hydrophobic domains (MI-MIV) and 2 long hydrophilic domains. The largest, relatively conserved, hydrophilic domain is located at the amino-terminal side of the subunit polypeptide, and the other, highly variable, joins hydrophobic domains MIII and MIV. The amino-terminal hydrophilic domain, carries the ACh binding site (Devillers-Thiéry 1993) and faces the synaptic cleft, whereas the other hydrophilic domain is exposed to the cytoplasm.
Molecular cloning and sequencing studies have revealed the existence of a family of genes, expressed in neurons, which code for nAChR subunits homologous (see Appendix 1 for definition of boldface terms) to those of muscle nAChR (Sargent 1993). In the jawed vertebrate nervous systems, several subunits (named a2-a8) have been identified which share with the muscle-type receptor a1 subunit the pair of cysteines, shown to contribute to the ACh binding site (Wada et al. 1988, Shoepfer et al. 1990, rev. Cockcroft et al. 1992, Karlin 1993). Other homologous chains, lacking the cystein pair, have been characterized and named non-a or b2-b4. As for muscle nAChR, the functional neuronal nAChR is an hetero-pentamer made up by the assembly of a and b subunits, with a putative stoichiometry in vitro of a2b3 (Anand et al. 1991, Cooper et al. 1991). The recent evidence that a5 is coprecipitated with another a and b subunit in some neuronal nAChRs indicates that more than 2 different subunits may assemble together to form a receptor molecule (Conroy et al 1992, Vernallis et al. 1993). In contrast, in reconstituted systems, the a7 or a8 subunits can form functional homo-oligomers (Couturier et al. 1990, Revah et al. 1991, Anand et al. 1993). The autoradiographic studies in the brain revealed that 3H-nicotine labels receptors formed by subunit a2-a6 and b2-b4 but not receptors formed by subunit a7 and a8 (Clarcke et al. 1985), which are labelled by a-Bungarotoxin (a-Bgt).
The combinatorial diversity resulting from the assembly of the multiple neuronal subunits results in a wide spectrum of structurally and functionally distinct nAChRs, with different pharmacological specificities and ion channel properties (Role 1992). Such differences have been directly demonstrated in Xenopus oocytes and mouse fibroblasts after heterologous expression (Luetje and Patrick 1991, Whiting et al. 1991). Furthermore, multiple functionally distinct types of nAChRs have been detected in different brain areas and subcellular compartments (see Mulle et al. 1991).
The nAChR is present in the whole phylum of Bilateria, from nematodes to humans (Gershenfeld 1973, Darlison et al. 1993, Fleming et al. 1993, Leech and Sattelle 1993). Several nAChR neuronal subunits have been cloned in Drosophila, locust and nematode (Gundelfinger 1992, Fleming et al. 1993). In the insect nervous system, ACh is the major excitatory neurotransmitter, in contrast to vertebrates, where glutamate predominates. At the neuromuscular junction, glutamate is the excitatory transmitter in arthropods, whereas it is ACh in vertebrates. Since some lines of evidence suggest that nAChR is also responsible for neuromuscular transmission in nematodes, molluscs and annelids (Gershenfeld 1973, Segerberg and Stretton 1993), it is of interest to assess whether the original form of nAChR appeared in muscle or in neurons.
The neuronal nAChRs provide a good example of multigene family differentially expressed in the nervous system. Its evolution deserves comparison with the increase in complexity of vertebrate nervous system and, in particular, cholinergic systems. Some partial trees have already been constructed, but without comparative methods and without statistical support (Brehm et al 1991, Cockcroft et al 1992). Here we provide a molecular phylogenetic study of the whole family of nAChR genes.
The programs used in this work were runned on a Sun computer in a UNIX environment. The sequences were loaded from Genbank and EMBL databases (Table 1) by the Sequence Analysis Software Package 7.1 from the Genetic Computer Group.
| Gene | Species | Genbank-Embl Acc. No. | Ref. |
|---|---|---|---|
| ba1 | Bos taurus | X02509 | Noda et al. Nature 305:818 (1983) |
| bb1 | Bos taurus | X00962 | Tanabe et al. Eur J Biochem 144:11 (1984) |
| bd | Bos taurus | X02473 | Kubo et al. Eur J Biochem 149:5 (1985) |
| be | Bos taurus | X02597 | Takai et al. Nature 315:761 (1985) |
| bg | Bos taurus | M28307 | Takai et al. Eur J Biochem 143:109 (1984) |
| ca2 | Gallus domesticus | M07339-44 | Nef et al. EMBO J 7:595 (1988) |
| ca3 | Gallus gallus | M37336 | Couturier et al. JBC 265:17560 (1990) |
| ca4 | Gallus domesticus | X07348-53,99 | Nef et al. EMBO J 7:595 (1988) |
| ca5 | Gallus gallus | J05642 | Couturier et al. JBC 265:17560 (1990) |
| ca7 | Gallus gallus | X68586 | Couturier et al. Neuron 5:847 (1990) |
| ca8 | Gallus gallus | X52296 | Schoepfer et al. Neuron 5:35 (1990) |
| cb2 | Gallus domesticus | X53092 | Schoepfer et al. Neuron 1:241 (1988) |
| cb4 | Gallus gallus | J05643 | Couturier et al. JBC 265:17560 (1990) |
| cd | Gallus gallus | K02903 | Nef et al. PNAS 81:7975 (1984) |
| cg | Gallus gallus | K02904 | Nef et al. PNAS 81:7975 (1984) |
| daLi | Drosophila melanogaster | X07194 | Bossy et al. EMBO J 7:611 (1988) |
| da2 | Drosophila melanogaster | X53583 | Sawruk et al. EMBO J 9:2671 (1990) |
| db2 | Drosophila melanogaster | X55676 | Sawruk et al. FEBS Lett 273:177 (1990) |
| dARD | Drosophila melanogaster | X04016 | Hermans-Borgmeyer et al. EMBO J 5:1503 (1986) |
| gfa3 | Carassius auratus | X54051 | Hieber et al. NAR 18:5293 (1990) |
| gfb2 | Carassius auratus | X54052 | Hieber et al. NAR 18:5307 (1990) |
| gfna2 | Carassius auratus | X14786 | Cauley et al. J Cell Biol 108:637 (1989) |
| gfna3 | Carassius auratus | M29529 | Cauley et al. J Neurosci 10:670 (1990) |
| ha1 | Homo sapiens | Y00762 | Schoepfer et al. FEBS Lett 226:235 (1988) |
| ha3 | Homo sapiens | M37981 | Mihovilovic et al. J Exp Neurol 111:175 (1991) |
| ha5 | Homo sapiens | M83712 | Chini et al. PNAS 89:1572 (1992) |
| ha7 | Homo sapiens | X70297 | Peng et al. Mol Pharmacol 45:546 (1994) |
| hb1 | Homo sapiens | X14830 | Beeson et al. NAR 17:4391 (1989) |
| hb2 | Homo sapiens | X53179 | Anand et al. NAR 18:4272 (1990) |
| hb3 | Homo sapiens | Willoughby et al. Neurosci Lett 155:136 (1993) | |
| hb4 | Homo sapiens | X68275 | Tarroni et al. FEBS Lett 312:66 (1992) |
| hd | Homo sapiens | X55019 | Luther et al. J Neurosci 9:1082 (1989) |
| he | Homo sapiens | X66403 | Beeson et al. unpublished |
| mser | Mus musculus | M74425 | Maricq et al. Science 254:432 (1991) |
| onachr | Onchocerca volvulus | L20465 | Ajuh and Egwang. Brain Res (1994) |
| ra2 | Rattus norvegicus | L10077 | Wada et al. Science 240:330 (1988) |
| ra3 | Rattus norvegicus | X03440 | Boulter et al. Nature 319:368 (1986) |
| ra4 | Rattus norvegicus | M15681-82 | Goldman et al. Cell 48:965 (1987) |
| ra5 | Rattus norvegicus | J05231 | Boulter et al. J Biol Chem 265:4472 (1990) |
| ra6 | Rattus norvegicus | L08227 | Boulter unpublished (1988) |
| ra7 | Rattus norvegicus | M85273 | Seguela et al. J Neurosci 13:596 (1993) |
| rb2 | Rattus norvegicus | Deneris et al. Neuron 1:45 (1988) | |
| rb3 | Rattus norvegicus | J04636 | Deneris et al. J Biol Chem 264:6268 (1989) |
| rb4 | Rattus norvegicus | J05232,M89971,M33951-3,M89989 | Boulter et al. J Biol Chem 265:4472 (1990) |
| rd | Rattus norvegicus | X74835 | Witzemann et al. Eur J Biochem 194:437 (1990) |
| re | Rattus norvegicus | X13252 | Criado et al. NAR 16:10920 (1988) |
| rg | Rattus norvegicus | X74834 | Witzemann et al. Eur J Biochem 194:437 (1990) |
| rglya3 | Rattus norvegicus | M55250 | Kuhse et al. J Biol Chem 265:22317 (1990) |
| saL1 | Schistocerca gregaria | X55439 | Marshall et al. EMBO J 9:4391 (1991) |
| ta1 | Torpedo californica | J00963 | Noda et al. Nature 299:793 (1982) |
| tb1 | Torpedo californica | J00964 | Noda et al. Nature 301:251 (1983) |
| td | Torpedo californica | J00965 | Noda et al. Nature 301:251 (1983) |
| tg | Torpedo californica | J00966 | Ballivet et al. PNAS 79:4466 (1982) |
| xa1a | Xenopus laevis | X17244 | Hartman et al. Nature 343:372 (1990) |
| xa1b | Xenopus laevis | X07067 | Baldwin et al. J Cell Biol. 106:469 (1988) |
| xb1 | Xenopus laevis | U04618 | Kullberg et al. Rec Chan (1994) in press |
| xd | Xenopus laevis | X07069 | Baldwin et al. J Cell Biol. 106:469 (1988) |
| xe | Xenopus laevis | U19612 | Murray et al. Neuron (1995) |
| xg | Xenopus laevis | X07068 | Baldwin et al. J Cell Biol. 106:469 (1988) |
Alignments were performed using the CLUSTAL V software (Higgins and Sharp 1988). This program compares the sequences in pairs according to Wilbur and Lipman (1983) (gap penalty=3) and builds a preliminary tree by an unweighted pair-group method of arithmetic averages (UPGMA)(Sneath and Sokal 1973). Then the program aligns all sequences in order of decreased similarity according to Feng and Doolittle (1987) (fixed and floated gap penalty=10). The use of different values of gap cost changed neither topology nor the ratio of branch lengths but did result in a homothetic transformation of the trees. The similarity have been determined by the Dayhoff PAM 250 matrix. The protein sequences were aligned after the following modifications:
Deletion of the signal peptide (corresponding to ta1 aa 1-27), the small non conserved part in amino-terminal part (corresponding to ta1 R188), the highly variable cytoplasmic region (corresponding to ta1 aa 356-393) and the carboxy-terminal part (corresponding to ta1 aa 452-461). The alignment obtained with 48 sequences shows 394 sites with 357 informative sites (Appendix 2).
To determine the branching of the nematode sequence onachr (Fig 4) which amino-terminal part is not known, a further deletion (corresponding to the 38 amino-terminal aa of ca8 from the Appendix 2 alignment) was performed on twelve sequences. the alignment of the 13 sequences show 351 sites with 263 informative sites (Appendix 2).
Inferences on gene evolution were obtained with the PHYLIP 3.5c software of Felsenstein (1993). The cladistic method was the maximum of parsimony (MP) (Fitch 1971, program PROTPARS). The mouse 5-HT3 subunit and the rat glycine a3 subunit were used as outgroups. The use of the rat GABA a1 subunit instead of the glycine a3 subunit did not change the results (data not shown). The phenetic method was the Neighbor-joining (NJ) (Saitou and Nei 1987, program NEIGHBOR). The distance matrix was provided by the Dayhoff PAM matrix (Dayhoff 1979, program PROTDIST). The statistical test used to determine the strength of the trees was bootstrap resampling (Felsenstein 1985) with the SEQBOOT (seed: 5) and CONSENSE programs.
The mixed parsimony algorithm with the Wagner method (Eck and Dayhoff 1966, program MIX) and the compatibility method (Le Quesne 1969, Estabrook et al 1976, program CLIQUE) were used to analyze the genomic structure.
The majority-rule consensus trees were constructed by the program DRAWGRAM. The results of CONSENSE analysis after the bootstrap resamplings are written in ovals on the node considered.
The determination of approximated time divergence beetween subunits (figure 5) is based on the NJ analysis of the Appendix 2 protein alignment. The external branches of the resulting tree showed an approximative molecular clock for each group. We are then able to determine the dates of the last duplications. However, the rates of evolution vary greatly between the subgroups, and the precocious duplications can't be calculate in this way. To determine the date of divergence between two subunits, we averaged the branch lengths and the evolution rate between all the orthologs. The estimated rate of evolution is obtained by dividing the branch length by the duration. The dates used are: Torpedo/osteichthyans 450 MYA, goldfish/Tetrapoda 405 MYA, Xenopus/Amniota 365 MYA, chicken/mammals 310 MYA, mouse/human 110 MYA, (Benton 1990). For instance each external branch inside the d,e,g group provides an estimated rate of evolution of between 3.2 10-10 and 7.3 10-10 substitution per site and per year (M=5.3 s=1.2), which is not far from a molecular clock.
The number of exons identified in the gene of nAChR subunits varies largely in the family although some common features can be recognized (Fig 1.A). Four subfamilies can be identified on the basis of the genomic structure. These are I-the neuronal a-bungarotoxin binding site subunits subfamily, II-the arthropoda neuronal subunits subfamily, and III-the vertebrate neuronal nicotinic subunits and IV-the muscle subunits subfamily.
figure 1A:Structure of the subunit genes. Only the exons at least partialy coding are represented. The gray level grossly reflects the conservation of the exon through the family (i.e. the presence of an exonic frontier at this place in different subunits, but not the sequence similitude between exons). A, B, C: binding site loops. M1, M2, M3, M4: transmembrane segments. The arrowheads mark the informative limits (in a cladistic acception). Adapted from Jonas et al. 1990, with the help of Alain Bessis.
high resolution
The genes of the subfamily IV possess 11 or 12 exons of which ten are conserved. In the subfamily III, the main part of the coding sequence is distributed within a single exon. The structure of a1 and a7 genes differs from the two "holotypes" (III or IV). However, it is difficult to determine if the structure of these two genes is mainly plesiomorph or contains autapomorphies. In order to extract information from genomic structure, we made a cladistic analysis of the frontiers between introns and exons. Only the ten informative sites were considered (Fig 1). A parsimony analysis of two state character (i.e. presence or absence of the frontier) gave three 12 step cladograms (summarized in Fig 1.B with an arbitrary root corresponding to sequence analyses, see below).
figure 1B:Cladogram constructed with the exonic structure of the genes from a MP analysis which gave three equivalent trees. The informative limits of each gene is coded at the right of its name (0: absence; 1: presence). open box: loss of a limit; Filled box: gain of a limit. The dendrogram is arbitrarily rooted. The branch lengths make no sense.
high resolution
A compatibility analysis gave the same results (although automatically rooted at a different point, i.e. between muscle and neuronal genes). One explanation can be that the genic structure of type I subunits is mainly made of autapomorphies, whereas that of a1 is plesiomorphic in the subfamily IV. On the basis of gene structure analysis, this latter subunit would be a sister group of all other subunits of subfamily IV. However, sequence analyses (see below) place a1 as a sister group of subfamily III. A translocation (e.g. between exon 4 and 5) could mask the real onset of the a1 subunit. Further studies of sequence homology between exons in paralogs will help to clarify this issue.
The sequence analysis revealed the existence of the same four subfamilies of nAChR subunits as did analysis of the gene structure (Figs 2 and 3). We obtained successive divergences of subfamilies I, then II and, at last III and IV. The position of subfamily II as a sister group of subfamily IV is weakly supported by NJ (Fig 3) analysis and not by MP analysis (Fig 2). These subunits appeared polyphyletic with MP analysis and monophyletic with NJ analysis. However their position was supported by very weak bootstrap score. The b2 and b4 subunits were branched with the subfamily IV with a weak bootstrap score. This position may be an artifact resulting from the precocious appearance and the weak divergence of these two subunits.
figure 2:Bootstrap Majority-Rule consensus tree obtained from 150 MP replicates (SEQBOOT, PROTPARS and CONSENSE programs) with the alignment shown in appendix 1. The nodes indicated by an arrowhead are uncertain.
high resolution
figure 3:Bootstrap Majority-Rule consensus tree obtained from 300 NJ replicates (SEQBOOT, PROTDIST, NEIGHBOR and CONSENSE programs) with the alignment shown in appendix 1. The nodes indicated by an arrowhead are uncertain.
high resolution
In subfamily IV, the b1 subunit diverged first followed by the d. A subsequent duplication resulted in the g and e subunits. This latter duplication seems to have occurred shortly before the divergence of Torpedo subunits (i.e before the divergence of the elasmobranch lineage).
In subfamily III, several groupings were present in more than 99% of trials: (b2, b4), (b3, a5), (a2, a4), (a3, a6). A first duplication gave b2 and b4. The position of b2 and b4 was unstable, jumping between subfamily III and subfamily IV according to the sequences sampled. However, if we consider the gene structure, the neuronal localisation and the pharmacological characteristics of these subunits, b2 and b4 have to be placed in subfamily III. Then, we observe the separation of b3 and a5. At last, a monophyletic group formed by the two pairs (a4, a2) and (a3, a6) is present in MP and NJ analyses. The position of a1 does not match the gene structure analysis. This strange position of a1 inside the neuronal subgroup is, however, weakly supported by bootstrap scores. The last clear duplications arose in the lineage of teleosteans which possess two homologs of b3 (appeared about 280 MYA), and in Tetrapoda which have two homologs of the goldfish b2, b2 and b4 subunits. Moreover, goldfish a3 is not clearly homolog to tetrapod a3 (NJ Fig 3) or a6 (MP Fig 2), which could then appear only after the divergence of teleosteans.
As the nAChR is an oligomer formed by subunits coded by paralogs, it is reasonable to assume that the primitive receptor resulted from the assembly of just one subunit. Then, the ability for a subunit to form functional homo-oligomers could reflect a plesiomorphic ("primitive") mode of functioning. a-Bgt sensitive homooligomers from Locusta migratoria (Breer et al. 1985) have been purified and reconstituted in vitro (Hanke and Breer 1986). The daL1 subunit of Schistocerca gregaria forms functional homo-oligomeric channels (Marshall et al. 1990) blocked by a-Bgt in vitro. da2 (also called SAD for second alpha subunit), the putative Drosophila homolog of locust saL1, forms functional receptors alone in Xenopus oocytes (Sawruk et al. 1990) though these receptors display an atypical pharmacology. In the same way, a7 from chicken is able to form homo-oligomers in Xenopus oocytes (Couturier et al. 1990, Revah et al. 1991, Anand et al. 1993). In contrast, vertebrate nAChR subunits from subfamilies III and IV, expressed in Xenopus oocyte, cannot form functionnal homo-oligomeric channels.
Although a small number of mutations sometimes suffice to dramatically change the properties of a receptor (e.g. Galzi et al. 1992), the pharmacological properties of the families of ligand-gated ion channels seem to diverge slowly. Ascaris muscle (Walker et al 1992), Aplysia (Ono and Salvaterra 1981), aL1 (insect class2, Marshall et al. 1990), dnAChR (insect class1, Schloss et al. 1988), chicken a7 (Couturier et al. 1990, Anand et al. 1993) and vertebrate striated muscle (Lee and Chang 1966, Changeux et al. 1970) receptors are a-Bgt sensitive. Though a1 belongs to the subfamily III, the functional a-Bgt sites of the vertebrate muscle receptor is formed partially by the subunits of subfamily IV. Moreover, if a1 is placed as a sister group of all other subunit of subfamily III (as indicated by the gene structure), the loss of a-Bgt sensitivity in the neuronal nicotinic subfamily is a synapomorphy . In addition, Ascaris muscle receptor (Walker et al. 1992), Aplysia neuronal receptors (Ono and Salvaterra 1981) and the receptor formed by saL1 of Schistocerca (Marshall et al. 1990) are sensitive to strychnine, an antagonist of the glycine receptor, and to bicuculline, an antagonist of the GABAA receptor. a7 is also sensitive to strychnine (Anand et al. 1993). The members of a multigene family can then share pharmacological properties, even after a long divergence (probably more than 1000 MYA here). Receptors of the subfamily III do not seem to be blocked by these antagonists (Clément Léna, personal communication). Overall, the evidence from pharmacological studies, further supports the notion of the monophyly of the subfamily III.
The analyses presented in this paper lead to the reconstruction of a global history of nAChR evolution. Although several nodes have not been perfectly resolved, the major relationships between subunits were clarified. Except for a1 all the analyses performed were congruent.
Subfamily I diverged before the split insects/vertebrates and this subfamily could be present in insects (the cloned insect subunits are orthologous to the subfamily III and IV).
Neuromuscular transmission via nAChRs is known to occur in nematodes (Gerschenfeld 1973, Walker et al 1992), annelids, molluscs (Gerschenfeld 1973) and vertebrates but not in insects and crustaceans. The chemical excitation of muscle in the bilateria non-vertebrates/non-arthropods has to be mediated by subunits which do not belong to the subfamily IV (figure 2). Thus, neuromuscular transmission in vertebrates is not homologous to that occuring in other phyla.
The e subunit seems to be present in the whole gnathostomata phylum. This is in consistent with the reported presence of an e subunit in Xenopus, yet this subunit has not been cloned in chicken.
The bootstrap confirms that a7 and a8 diverged prior to the separation of Sauropsida and Theropsida. Thus, an a8 subunit may be present in mammals.
Based on present and previous results, the history of nAChR subunit gene family can be reconstructed as follows (Fig 5):
We can plausibly assume (still without proof) that in the primitive metazoans (e.g. coelenterates) nAChR was made of a single subunit, able to form homo-oligomers. The coelenterates have no true muscle cells but have already multipolar neurons, of ectodermal origin. This first nAChR presumably had a neuronal localisation. With the appearance of a third embryonic sheet, the nAChR acquired a novel role in neuromuscular transmission. However, if the nematodes and the molluscs have a muscle nAChR, it is not homologous to the vertebrate subfamily IV. Indeed this latter plausibly appeared after the differentiation of Deuterostomata. The subunit cloned in Onchocerca does not possess the third loop of the ACh binding site (Devillers-Thiéry et al 1993) and might be a non-a subunit. The NJ analysis of the alignment shown in Appendix 3 (Fig 4), placed this subunit as an extra group of three Drosophila subunits, containing two a and one non-a subunits. The idea of a precocious emergence of the insect subunits is supported by the ability of these subunits to form homo-oligomers in vitro. Assuming that aL1 of Schistocerca and a2 of Drosophila are orthologs, the duplication between them and daLi is older than the divergence of Orthoptera and Diptera - i.e., older than 300 MYA (Labandeira and Sepkoski 1993). In Deuterostomata, several duplications occurred to give the extant subfamily IV, which was complete in vertebrate phylum before the appearence of chondrichthyes (450 MYA) and the extant subfamily III, one of the paralogs being expressed in muscle.
figure 4:Bootstrap Majority-Rule consensus tree obtained from 1000 NJ replicates with the alignment shown in Appendix 3, presenting the possible emergence of the nematode subunit. The nodes indicated by an arrowhead are uncertain.
high resolution
Several tissular switches of expression from neuron to muscle or from muscle to neuron can be hypothetized (Fig 5). Between the divergence of subfamily II and the split subfamily III/subfamily IV (in the chordate lineage), one switch of expression might have given a muscle receptor, possibly homopentameric. After the first duplication between the ancestor of subfamily IV and a1, (a duplication which is responsible of the heteromeric muscle receptor), a further duplication from a1 provided a new gene, which expression became neuronal. The evolution of the promoters (and of the transcription regulators) may thus have played a role as important as gene duplication in the diversification of the nAChR family.
The neuronal non-a subunit group is likely to be polyphyletic, whereas the neuronal a subunits (the "binding subunits") would form a monophyletic group. a5, which lacks some important aromatic amino acids in the third loop of ACh binding site, cannot form functional receptors in vitro with any b subunit (Boulter et al. 1990) but is coprecipitated from endogeneous material with other a subunits (Conroy et al. 1992, Vernallis et al. 1993), thus may represent a new type of "structural" subunit and should be given another name. a5 and b3 could be called g2 and g3. Then the three types of subfamily III subunits could form monophyletic groups - the tribes a, b, and g.
figure 5:Summary tree, integrating the results of the whole study. The dates of the last divergences have been calculated from the protein alignment of Appendix 2 (cf Materiels and Methods). a7/a8: 380 MYA; e/g: 508 MYA; e,g/d: 711 MYA; e,g,d/b1: 926 MYA; a3/a6: 529 MYA; a2/a4: 669 MYA; a5/b3: 770 MYA. The age of the precocious divergences have been approximatively infered from the divergence of nematodes (1000 MYA: Vanfleteren et al. 1994). M: muscle subunit; N: neuronal subunit. The grey branches represent subunits putatively expressed in neurons. The black branches represent subunits possibly expressed in muscle.
high resolution
The multiple duplications in subfamily III parallel the progressive increased complexity of the chordate nervous system, in particular, of the cholinergic system. At the beginning of the evolution of this phylum, one subunit was plausibly present in the nervous system, resulting from the duplication of a1 (In addition to the ancestor of a7 and a8). The diversity of the group increased during the first 400 MY, until the appearance of Tetrapoda. The whole evolution of the subfamily occurred in the first half of Deuterostomata history (from 600-800 MYA to about 300 MYA). In the first prechordate fossils, we find only two ganglia, the peripheral and the cerebroid ganglia. Branchiostoma has only one pseudovesicle in the head. Spinal chord and cholinergic peripheral nervous system were present early in the vertebrate lineage (although the complete autonomous system was reached only in mammals). Lamprey has five vesicles but the main development of the brain and particularly of the forebrain occurred in Gnathostomata.
In situ hybridization (Deneris et al. 1989, Wada et al. 1989;1990, Zoli et al. 1994) as well as immunohistochemical (Britto et al. 1992, Hill et al. 1993) studies have shown that, in rat and chicken brain, a4 and b2 mRNA distribution is diffuse, whereas a2, a3, a5, a6, b3 and b4 are mainly restricted to a few major cholinergic or cholinoceptive pathways, which, however, also express a4 and b2. b2 has diverged early in the neuronal subfamily history (Figs 2, 3 and results). a4 and b2 could represent a "fossil" expression, which was present in most areas of the ancestral brain. When a duplication occurred, one of the paralogs kept the specific role of the "father gene", whereas the other paralog has to acquire a new role. This role can be defined by a new domain of expression (like the switches muscle/neuron developped before) or by a modified function. The hypothesis that a4 maintained its previous role while another paralog acquired a new role is supported by some evidence on the a2 subunit. In chick brain, the a2 subunit is restricted to the lateral spiriform nucleus (Daubas et al 1990) but, in rat brain it is restricted to the interpeduncular nucleus (Wada et al 1989) - a nonhomolog structure (the interpeduncular nucleus also exists in the chick brain). Moreover, there is no homolog of the lateral spiriform nucleus in rat brain, a fact that points to the genesis of this structure after the divergence of the bird lineage. It is attractive to suppose that a gene duplication occurred a short time before the branching of Theropsida and Sauropsida (i.e. before 310 MYA, Benton 1990). This time would have been too short to define the specificity of a2 (in contrast to a4 which maintained its ancient role). Then two independent specificities of expression took place in the two phyla. Accordingly, a transgene with the avian a2 gene (including the promoter) is expressed throughout the rat brain, mostly in cholinergic structures (motor nuclei and basal telencephalon) (Daubas et al. 1993) (nevertheless, this distribution corresponds neither to the distribution of endogenous a2 nor to that of a4).
In the same way, the duplications a3/a6 and b2/b4 occurred a little before, or a little after, the split between the teleost and the tetrapod lineages. In the rat brain, a3 and b4 are mainly expressed in the medial habenula, a cholinergic and cholinoceptive structure. However, in a teleost fish (Phoxinus phoxinus), an immunocytochemical study did not find any cholinergic cell and found only a small number of cholinergic fibers in the habenula (Ekström 1987). If these characteristics are plesiomorph, there could be again a correlation between gene duplications and a further change of function. a4 and b2 could thus have kept the ancestor role of the neuronal nAChR whereas other paralogs could have found new functional specificities in the evolving cholinergic systems.
We have shown, on the basis of cladograms and phenograms, that the first duplications in the nAChR occurred before the divergence of nematodes. Several nAChR subfamilies were identified. There is congruence between sequence and gene structure analyses and the three subfamilies present in vertebrates correspond closely to the functional subgroups (described from anatomical, pharmacological and structural considerations). Two phenomena seem to have generated the wealth of the family. First, several switches of expression seem to have occured from neuron to muscle and the opposite. Second, multiple gene duplications gave the extant number of paralogs. The neuronal nicotinic subunit subfamily (type III subunits) appeared at the beginning of the chordate phylum, and grew until the separation of Sauropsida and Theropsida lineages. This diversification, both quantitative and functional, paralleled the increase in complexity of the cholinergic systems. A link between an increased combinatorial complexity of subunit combinations and a larger plasticity in the functioning of these pathways is plausible.
We thank Clément Léna, Pascal Tassy and Fredj Tékaia for helpful advice. We also thank Richard Miles, Michele Zoli and the anonymous referee for critical reading of the manuscript. This work was supported by the Centre National de la Recherche Scientifique, the Institut National de la Santé et de la Recherche Médicale (contract no. 872004), the Collège de France and the Direction des Recherches et Etudes Techniques (contract no. 90/142).