back to Receptors and Cognition homepage (JP Changeux) back to Pasteur Institute

Here is the original phylogeny article. A revisited and extended view is presented here.
If you use items coming from this site, please quote the original article.


Molecular evolution of the nicotinic acetylcholine receptor: an example of multigene family in excitable cells.

Nicolas Le Novère and Jean-Pierre Changeux (1995). J Mol Evol 40 : 155-172

Abstract. An extensive phylogenetic analysis of the nicotinic acetylcholine receptor subunit genes family has been performed by cladistic and phenetic methods. The conserved parts of amino acid sequences have been analyzed by CLUSTAL V and PHYLIP software. The structure of the genes was also taken in consideration. The results show that a first gene duplication may have occurred before the appearance of Bilateria. Three subfamilies then appeared: I-the neuronal a-bungarotoxin binding site subunits (a7, a8), III-the neuronal nicotinic subunits (a2-a6, b2-b4) which also contains the muscle acetylcholine binding subunit (a1), and IV-the muscle non-a subunits (b1, g, d, e). The Insecta subunits (subfamily II) could be orthologous to family III and IV. Several tissular switches of expression from neuron to muscle and the converse can be inferred from the extant expression of subunits and the reconstructed trees. The diversification of the neuronal nicotinic subfamily begins in the stem lineage of chordates, the last duplications occurring shortly before the onset of the mammalian lineage. Such evolution parallels the increase in complexity of the cholinergic systems.

key words: Nicotinic receptor - Ligand-gated channel - Multigene family - gene phylogeny

Introduction

Acetylcholine (ACh) has long been recognized as a neurotransmitter active in Bilateria nervous system and muscle. Two distinct categories of receptors are engaged in the biological effects of ACh: the muscarinic and nicotinic receptors. Muscarinic receptors belong to the super-family of G-protein coupled receptors; they consist of single integral proteins with seven transmembrane segments and interact, on their cytoplasmic face with heterotrimeric G-proteins. Nicotinic receptors (nAChR) belong to the super-family of ligand-gated ion channels; they are hetero-oligomers composed of five subunits, each with four transmembrane domains (Devillers-Thiéry et al. 1993, Galzi and Changeux 1994). ACh binding causes an ionic channel, most often cationic, to open, resulting in a rapid change in the electrical, and secondarily metabolic, state of the target cell (Greenberg et al. 1986, rev. Bertrand et al. 1993).

The nAChR of striated muscle is the best characterized member of the ligand gated ion channel super-family (rev. Changeux 1990, Karlin 1993): it is an hetero-pentamer (with the stoichiometry a2bgd). According to current models (rev. Bertrand et al. 1993), the ion channel forms along the axis of pseudosymmetry perpendicular to the cell membrane. The subunits share a similar hydropathic profile with 4 short hydrophobic domains (MI-MIV) and 2 long hydrophilic domains. The largest, relatively conserved, hydrophilic domain is located at the amino-terminal side of the subunit polypeptide, and the other, highly variable, joins hydrophobic domains MIII and MIV. The amino-terminal hydrophilic domain, carries the ACh binding site (Devillers-Thiéry 1993) and faces the synaptic cleft, whereas the other hydrophilic domain is exposed to the cytoplasm.

Molecular cloning and sequencing studies have revealed the existence of a family of genes, expressed in neurons, which code for nAChR subunits homologous (see Appendix 1 for definition of boldface terms) to those of muscle nAChR (Sargent 1993). In the jawed vertebrate nervous systems, several subunits (named a2-a8) have been identified which share with the muscle-type receptor a1 subunit the pair of cysteines, shown to contribute to the ACh binding site (Wada et al. 1988, Shoepfer et al. 1990, rev. Cockcroft et al. 1992, Karlin 1993). Other homologous chains, lacking the cystein pair, have been characterized and named non-a or b2-b4. As for muscle nAChR, the functional neuronal nAChR is an hetero-pentamer made up by the assembly of a and b subunits, with a putative stoichiometry in vitro of a2b3 (Anand et al. 1991, Cooper et al. 1991). The recent evidence that a5 is coprecipitated with another a and b subunit in some neuronal nAChRs indicates that more than 2 different subunits may assemble together to form a receptor molecule (Conroy et al 1992, Vernallis et al. 1993). In contrast, in reconstituted systems, the a7 or a8 subunits can form functional homo-oligomers (Couturier et al. 1990, Revah et al. 1991, Anand et al. 1993). The autoradiographic studies in the brain revealed that 3H-nicotine labels receptors formed by subunit a2-a6 and b2-b4 but not receptors formed by subunit a7 and a8 (Clarcke et al. 1985), which are labelled by a-Bungarotoxin (a-Bgt).

The combinatorial diversity resulting from the assembly of the multiple neuronal subunits results in a wide spectrum of structurally and functionally distinct nAChRs, with different pharmacological specificities and ion channel properties (Role 1992). Such differences have been directly demonstrated in Xenopus oocytes and mouse fibroblasts after heterologous expression (Luetje and Patrick 1991, Whiting et al. 1991). Furthermore, multiple functionally distinct types of nAChRs have been detected in different brain areas and subcellular compartments (see Mulle et al. 1991).

The nAChR is present in the whole phylum of Bilateria, from nematodes to humans (Gershenfeld 1973, Darlison et al. 1993, Fleming et al. 1993, Leech and Sattelle 1993). Several nAChR neuronal subunits have been cloned in Drosophila, locust and nematode (Gundelfinger 1992, Fleming et al. 1993). In the insect nervous system, ACh is the major excitatory neurotransmitter, in contrast to vertebrates, where glutamate predominates. At the neuromuscular junction, glutamate is the excitatory transmitter in arthropods, whereas it is ACh in vertebrates. Since some lines of evidence suggest that nAChR is also responsible for neuromuscular transmission in nematodes, molluscs and annelids (Gershenfeld 1973, Segerberg and Stretton 1993), it is of interest to assess whether the original form of nAChR appeared in muscle or in neurons.

The neuronal nAChRs provide a good example of multigene family differentially expressed in the nervous system. Its evolution deserves comparison with the increase in complexity of vertebrate nervous system and, in particular, cholinergic systems. Some partial trees have already been constructed, but without comparative methods and without statistical support (Brehm et al 1991, Cockcroft et al 1992). Here we provide a molecular phylogenetic study of the whole family of nAChR genes.

Material and Methods

The programs used in this work were runned on a Sun computer in a UNIX environment. The sequences were loaded from Genbank and EMBL databases (Table 1) by the Sequence Analysis Software Package 7.1 from the Genetic Computer Group.

Alignments of the Sequences

Genes used in this study. Abbreviations are those used in the text and trees. The first letters represent the species, followed by the name of the subunit. The six sign codes are the accession numbers of the Genbank-Embl databases. The rat a4 is the isoform 4-2.
GeneSpeciesGenbank-Embl Acc. No.Ref.
ba1Bos taurusX02509Noda et al. Nature 305:818 (1983)
bb1Bos taurusX00962Tanabe et al. Eur J Biochem 144:11 (1984)
bdBos taurusX02473Kubo et al. Eur J Biochem 149:5 (1985)
beBos taurusX02597Takai et al. Nature 315:761 (1985)
bgBos taurusM28307Takai et al. Eur J Biochem 143:109 (1984)
ca2Gallus domesticusM07339-44Nef et al. EMBO J 7:595 (1988)
ca3Gallus gallusM37336Couturier et al. JBC 265:17560 (1990)
ca4Gallus domesticusX07348-53,99Nef et al. EMBO J 7:595 (1988)
ca5Gallus gallusJ05642Couturier et al. JBC 265:17560 (1990)
ca7Gallus gallusX68586Couturier et al. Neuron 5:847 (1990)
ca8Gallus gallusX52296Schoepfer et al. Neuron 5:35 (1990)
cb2Gallus domesticusX53092Schoepfer et al. Neuron 1:241 (1988)
cb4Gallus gallusJ05643Couturier et al. JBC 265:17560 (1990)
cdGallus gallusK02903Nef et al. PNAS 81:7975 (1984)
cgGallus gallusK02904Nef et al. PNAS 81:7975 (1984)
daLiDrosophila melanogasterX07194Bossy et al. EMBO J 7:611 (1988)
da2Drosophila melanogasterX53583Sawruk et al. EMBO J 9:2671 (1990)
db2Drosophila melanogasterX55676Sawruk et al. FEBS Lett 273:177 (1990)
dARDDrosophila melanogasterX04016Hermans-Borgmeyer et al. EMBO J 5:1503 (1986)
gfa3Carassius auratusX54051Hieber et al. NAR 18:5293 (1990)
gfb2Carassius auratusX54052Hieber et al. NAR 18:5307 (1990)
gfna2Carassius auratusX14786Cauley et al. J Cell Biol 108:637 (1989)
gfna3Carassius auratusM29529Cauley et al. J Neurosci 10:670 (1990)
ha1Homo sapiensY00762Schoepfer et al. FEBS Lett 226:235 (1988)
ha3Homo sapiensM37981Mihovilovic et al. J Exp Neurol 111:175 (1991)
ha5Homo sapiensM83712Chini et al. PNAS 89:1572 (1992)
ha7Homo sapiensX70297Peng et al. Mol Pharmacol 45:546 (1994)
hb1Homo sapiensX14830Beeson et al. NAR 17:4391 (1989)
hb2Homo sapiensX53179Anand et al. NAR 18:4272 (1990)
hb3Homo sapiens Willoughby et al. Neurosci Lett 155:136 (1993)
hb4Homo sapiensX68275Tarroni et al. FEBS Lett 312:66 (1992)
hdHomo sapiensX55019Luther et al. J Neurosci 9:1082 (1989)
heHomo sapiensX66403Beeson et al. unpublished
mserMus musculusM74425Maricq et al. Science 254:432 (1991)
onachrOnchocerca volvulusL20465Ajuh and Egwang. Brain Res (1994)
ra2Rattus norvegicusL10077Wada et al. Science 240:330 (1988)
ra3Rattus norvegicusX03440Boulter et al. Nature 319:368 (1986)
ra4Rattus norvegicusM15681-82Goldman et al. Cell 48:965 (1987)
ra5Rattus norvegicusJ05231Boulter et al. J Biol Chem 265:4472 (1990)
ra6Rattus norvegicusL08227Boulter unpublished (1988)
ra7Rattus norvegicusM85273Seguela et al. J Neurosci 13:596 (1993)
rb2Rattus norvegicus Deneris et al. Neuron 1:45 (1988)
rb3Rattus norvegicusJ04636Deneris et al. J Biol Chem 264:6268 (1989)
rb4Rattus norvegicusJ05232,M89971,M33951-3,M89989 Boulter et al. J Biol Chem 265:4472 (1990)
rdRattus norvegicusX74835Witzemann et al. Eur J Biochem 194:437 (1990)
reRattus norvegicusX13252Criado et al. NAR 16:10920 (1988)
rgRattus norvegicusX74834Witzemann et al. Eur J Biochem 194:437 (1990)
rglya3Rattus norvegicusM55250Kuhse et al. J Biol Chem 265:22317 (1990)
saL1Schistocerca gregariaX55439Marshall et al. EMBO J 9:4391 (1991)
ta1Torpedo californicaJ00963Noda et al. Nature 299:793 (1982)
tb1Torpedo californicaJ00964Noda et al. Nature 301:251 (1983)
tdTorpedo californicaJ00965Noda et al. Nature 301:251 (1983)
tgTorpedo californicaJ00966Ballivet et al. PNAS 79:4466 (1982)
xa1aXenopus laevisX17244Hartman et al. Nature 343:372 (1990)
xa1bXenopus laevisX07067Baldwin et al. J Cell Biol. 106:469 (1988)
xb1Xenopus laevisU04618Kullberg et al. Rec Chan (1994) in press
xdXenopus laevisX07069Baldwin et al. J Cell Biol. 106:469 (1988)
xeXenopus laevisU19612Murray et al. Neuron (1995)
xgXenopus laevisX07068Baldwin et al. J Cell Biol. 106:469 (1988)

Alignments were performed using the CLUSTAL V software (Higgins and Sharp 1988). This program compares the sequences in pairs according to Wilbur and Lipman (1983) (gap penalty=3) and builds a preliminary tree by an unweighted pair-group method of arithmetic averages (UPGMA)(Sneath and Sokal 1973). Then the program aligns all sequences in order of decreased similarity according to Feng and Doolittle (1987) (fixed and floated gap penalty=10). The use of different values of gap cost changed neither topology nor the ratio of branch lengths but did result in a homothetic transformation of the trees. The similarity have been determined by the Dayhoff PAM 250 matrix. The protein sequences were aligned after the following modifications:

Deletion of the signal peptide (corresponding to ta1 aa 1-27), the small non conserved part in amino-terminal part (corresponding to ta1 R188), the highly variable cytoplasmic region (corresponding to ta1 aa 356-393) and the carboxy-terminal part (corresponding to ta1 aa 452-461). The alignment obtained with 48 sequences shows 394 sites with 357 informative sites (Appendix 2).

To determine the branching of the nematode sequence onachr (Fig 4) which amino-terminal part is not known, a further deletion (corresponding to the 38 amino-terminal aa of ca8 from the Appendix 2 alignment) was performed on twelve sequences. the alignment of the 13 sequences show 351 sites with 263 informative sites (Appendix 2).

Sequences Analyses

Inferences on gene evolution were obtained with the PHYLIP 3.5c software of Felsenstein (1993). The cladistic method was the maximum of parsimony (MP) (Fitch 1971, program PROTPARS). The mouse 5-HT3 subunit and the rat glycine a3 subunit were used as outgroups. The use of the rat GABA a1 subunit instead of the glycine a3 subunit did not change the results (data not shown). The phenetic method was the Neighbor-joining (NJ) (Saitou and Nei 1987, program NEIGHBOR). The distance matrix was provided by the Dayhoff PAM matrix (Dayhoff 1979, program PROTDIST). The statistical test used to determine the strength of the trees was bootstrap resampling (Felsenstein 1985) with the SEQBOOT (seed: 5) and CONSENSE programs.

Analyses of Gene Structure

The mixed parsimony algorithm with the Wagner method (Eck and Dayhoff 1966, program MIX) and the compatibility method (Le Quesne 1969, Estabrook et al 1976, program CLIQUE) were used to analyze the genomic structure.

Construction of Figures

The majority-rule consensus trees were constructed by the program DRAWGRAM. The results of CONSENSE analysis after the bootstrap resamplings are written in ovals on the node considered.

The determination of approximated time divergence beetween subunits (figure 5) is based on the NJ analysis of the Appendix 2 protein alignment. The external branches of the resulting tree showed an approximative molecular clock for each group. We are then able to determine the dates of the last duplications. However, the rates of evolution vary greatly between the subgroups, and the precocious duplications can't be calculate in this way. To determine the date of divergence between two subunits, we averaged the branch lengths and the evolution rate between all the orthologs. The estimated rate of evolution is obtained by dividing the branch length by the duration. The dates used are: Torpedo/osteichthyans 450 MYA, goldfish/Tetrapoda 405 MYA, Xenopus/Amniota 365 MYA, chicken/mammals 310 MYA, mouse/human 110 MYA, (Benton 1990). For instance each external branch inside the d,e,g group provides an estimated rate of evolution of between 3.2 10-10 and 7.3 10-10 substitution per site and per year (M=5.3 s=1.2), which is not far from a molecular clock.

Results

Evolution of the Gene Structure

The number of exons identified in the gene of nAChR subunits varies largely in the family although some common features can be recognized (Fig 1.A). Four subfamilies can be identified on the basis of the genomic structure. These are I-the neuronal a-bungarotoxin binding site subunits subfamily, II-the arthropoda neuronal subunits subfamily, and III-the vertebrate neuronal nicotinic subunits and IV-the muscle subunits subfamily.

Figure 1A, low resolution

figure 1A:Structure of the subunit genes. Only the exons at least partialy coding are represented. The gray level grossly reflects the conservation of the exon through the family (i.e. the presence of an exonic frontier at this place in different subunits, but not the sequence similitude between exons). A, B, C: binding site loops. M1, M2, M3, M4: transmembrane segments. The arrowheads mark the informative limits (in a cladistic acception). Adapted from Jonas et al. 1990, with the help of Alain Bessis.
high resolution

The genes of the subfamily IV possess 11 or 12 exons of which ten are conserved. In the subfamily III, the main part of the coding sequence is distributed within a single exon. The structure of a1 and a7 genes differs from the two "holotypes" (III or IV). However, it is difficult to determine if the structure of these two genes is mainly plesiomorph or contains autapomorphies. In order to extract information from genomic structure, we made a cladistic analysis of the frontiers between introns and exons. Only the ten informative sites were considered (Fig 1). A parsimony analysis of two state character (i.e. presence or absence of the frontier) gave three 12 step cladograms (summarized in Fig 1.B with an arbitrary root corresponding to sequence analyses, see below).

Figure 1B, low resolution

figure 1B:Cladogram constructed with the exonic structure of the genes from a MP analysis which gave three equivalent trees. The informative limits of each gene is coded at the right of its name (0: absence; 1: presence). open box: loss of a limit; Filled box: gain of a limit. The dendrogram is arbitrarily rooted. The branch lengths make no sense.
high resolution

A compatibility analysis gave the same results (although automatically rooted at a different point, i.e. between muscle and neuronal genes). One explanation can be that the genic structure of type I subunits is mainly made of autapomorphies, whereas that of a1 is plesiomorphic in the subfamily IV. On the basis of gene structure analysis, this latter subunit would be a sister group of all other subunits of subfamily IV. However, sequence analyses (see below) place a1 as a sister group of subfamily III. A translocation (e.g. between exon 4 and 5) could mask the real onset of the a1 subunit. Further studies of sequence homology between exons in paralogs will help to clarify this issue.

Sequences Analysis

The sequence analysis revealed the existence of the same four subfamilies of nAChR subunits as did analysis of the gene structure (Figs 2 and 3). We obtained successive divergences of subfamilies I, then II and, at last III and IV. The position of subfamily II as a sister group of subfamily IV is weakly supported by NJ (Fig 3) analysis and not by MP analysis (Fig 2). These subunits appeared polyphyletic with MP analysis and monophyletic with NJ analysis. However their position was supported by very weak bootstrap score. The b2 and b4 subunits were branched with the subfamily IV with a weak bootstrap score. This position may be an artifact resulting from the precocious appearance and the weak divergence of these two subunits.

Figure 2, low resolution

figure 2:Bootstrap Majority-Rule consensus tree obtained from 150 MP replicates (SEQBOOT, PROTPARS and CONSENSE programs) with the alignment shown in appendix 1. The nodes indicated by an arrowhead are uncertain.
high resolution

Figure 3, low resolution

figure 3:Bootstrap Majority-Rule consensus tree obtained from 300 NJ replicates (SEQBOOT, PROTDIST, NEIGHBOR and CONSENSE programs) with the alignment shown in appendix 1. The nodes indicated by an arrowhead are uncertain.
high resolution

In subfamily IV, the b1 subunit diverged first followed by the d. A subsequent duplication resulted in the g and e subunits. This latter duplication seems to have occurred shortly before the divergence of Torpedo subunits (i.e before the divergence of the elasmobranch lineage).

In subfamily III, several groupings were present in more than 99% of trials: (b2, b4), (b3, a5), (a2, a4), (a3, a6). A first duplication gave b2 and b4. The position of b2 and b4 was unstable, jumping between subfamily III and subfamily IV according to the sequences sampled. However, if we consider the gene structure, the neuronal localisation and the pharmacological characteristics of these subunits, b2 and b4 have to be placed in subfamily III. Then, we observe the separation of b3 and a5. At last, a monophyletic group formed by the two pairs (a4, a2) and (a3, a6) is present in MP and NJ analyses. The position of a1 does not match the gene structure analysis. This strange position of a1 inside the neuronal subgroup is, however, weakly supported by bootstrap scores. The last clear duplications arose in the lineage of teleosteans which possess two homologs of b3 (appeared about 280 MYA), and in Tetrapoda which have two homologs of the goldfish b2, b2 and b4 subunits. Moreover, goldfish a3 is not clearly homolog to tetrapod a3 (NJ Fig 3) or a6 (MP Fig 2), which could then appear only after the divergence of teleosteans.

Evolution of the Stoichiometry

As the nAChR is an oligomer formed by subunits coded by paralogs, it is reasonable to assume that the primitive receptor resulted from the assembly of just one subunit. Then, the ability for a subunit to form functional homo-oligomers could reflect a plesiomorphic ("primitive") mode of functioning. a-Bgt sensitive homooligomers from Locusta migratoria (Breer et al. 1985) have been purified and reconstituted in vitro (Hanke and Breer 1986). The daL1 subunit of Schistocerca gregaria forms functional homo-oligomeric channels (Marshall et al. 1990) blocked by a-Bgt in vitro. da2 (also called SAD for second alpha subunit), the putative Drosophila homolog of locust saL1, forms functional receptors alone in Xenopus oocytes (Sawruk et al. 1990) though these receptors display an atypical pharmacology. In the same way, a7 from chicken is able to form homo-oligomers in Xenopus oocytes (Couturier et al. 1990, Revah et al. 1991, Anand et al. 1993). In contrast, vertebrate nAChR subunits from subfamilies III and IV, expressed in Xenopus oocyte, cannot form functionnal homo-oligomeric channels.

Pharmacological Argument for Monophyly of Neuronal Nicotinic Subfamily.

Although a small number of mutations sometimes suffice to dramatically change the properties of a receptor (e.g. Galzi et al. 1992), the pharmacological properties of the families of ligand-gated ion channels seem to diverge slowly. Ascaris muscle (Walker et al 1992), Aplysia (Ono and Salvaterra 1981), aL1 (insect class2, Marshall et al. 1990), dnAChR (insect class1, Schloss et al. 1988), chicken a7 (Couturier et al. 1990, Anand et al. 1993) and vertebrate striated muscle (Lee and Chang 1966, Changeux et al. 1970) receptors are a-Bgt sensitive. Though a1 belongs to the subfamily III, the functional a-Bgt sites of the vertebrate muscle receptor is formed partially by the subunits of subfamily IV. Moreover, if a1 is placed as a sister group of all other subunit of subfamily III (as indicated by the gene structure), the loss of a-Bgt sensitivity in the neuronal nicotinic subfamily is a synapomorphy . In addition, Ascaris muscle receptor (Walker et al. 1992), Aplysia neuronal receptors (Ono and Salvaterra 1981) and the receptor formed by saL1 of Schistocerca (Marshall et al. 1990) are sensitive to strychnine, an antagonist of the glycine receptor, and to bicuculline, an antagonist of the GABAA receptor. a7 is also sensitive to strychnine (Anand et al. 1993). The members of a multigene family can then share pharmacological properties, even after a long divergence (probably more than 1000 MYA here). Receptors of the subfamily III do not seem to be blocked by these antagonists (Clément Léna, personal communication). Overall, the evidence from pharmacological studies, further supports the notion of the monophyly of the subfamily III.

Discussion

The analyses presented in this paper lead to the reconstruction of a global history of nAChR evolution. Although several nodes have not been perfectly resolved, the major relationships between subunits were clarified. Except for a1 all the analyses performed were congruent.

Hypothetically Missing Genes

Subfamily I diverged before the split insects/vertebrates and this subfamily could be present in insects (the cloned insect subunits are orthologous to the subfamily III and IV).

Neuromuscular transmission via nAChRs is known to occur in nematodes (Gerschenfeld 1973, Walker et al 1992), annelids, molluscs (Gerschenfeld 1973) and vertebrates but not in insects and crustaceans. The chemical excitation of muscle in the bilateria non-vertebrates/non-arthropods has to be mediated by subunits which do not belong to the subfamily IV (figure 2). Thus, neuromuscular transmission in vertebrates is not homologous to that occuring in other phyla.

The e subunit seems to be present in the whole gnathostomata phylum. This is in consistent with the reported presence of an e subunit in Xenopus, yet this subunit has not been cloned in chicken.

The bootstrap confirms that a7 and a8 diverged prior to the separation of Sauropsida and Theropsida. Thus, an a8 subunit may be present in mammals.

Reconstructed History of the nAChR Subunit Gene Family

Based on present and previous results, the history of nAChR subunit gene family can be reconstructed as follows (Fig 5):
We can plausibly assume (still without proof) that in the primitive metazoans (e.g. coelenterates) nAChR was made of a single subunit, able to form homo-oligomers. The coelenterates have no true muscle cells but have already multipolar neurons, of ectodermal origin. This first nAChR presumably had a neuronal localisation. With the appearance of a third embryonic sheet, the nAChR acquired a novel role in neuromuscular transmission. However, if the nematodes and the molluscs have a muscle nAChR, it is not homologous to the vertebrate subfamily IV. Indeed this latter plausibly appeared after the differentiation of Deuterostomata. The subunit cloned in Onchocerca does not possess the third loop of the ACh binding site (Devillers-Thiéry et al 1993) and might be a non-a subunit. The NJ analysis of the alignment shown in Appendix 3 (Fig 4), placed this subunit as an extra group of three Drosophila subunits, containing two a and one non-a subunits. The idea of a precocious emergence of the insect subunits is supported by the ability of these subunits to form homo-oligomers in vitro. Assuming that aL1 of Schistocerca and a2 of Drosophila are orthologs, the duplication between them and daLi is older than the divergence of Orthoptera and Diptera - i.e., older than 300 MYA (Labandeira and Sepkoski 1993). In Deuterostomata, several duplications occurred to give the extant subfamily IV, which was complete in vertebrate phylum before the appearence of chondrichthyes (450 MYA) and the extant subfamily III, one of the paralogs being expressed in muscle.

Figure 4, low resolution

figure 4:Bootstrap Majority-Rule consensus tree obtained from 1000 NJ replicates with the alignment shown in Appendix 3, presenting the possible emergence of the nematode subunit. The nodes indicated by an arrowhead are uncertain.
high resolution

Several tissular switches of expression from neuron to muscle or from muscle to neuron can be hypothetized (Fig 5). Between the divergence of subfamily II and the split subfamily III/subfamily IV (in the chordate lineage), one switch of expression might have given a muscle receptor, possibly homopentameric. After the first duplication between the ancestor of subfamily IV and a1, (a duplication which is responsible of the heteromeric muscle receptor), a further duplication from a1 provided a new gene, which expression became neuronal. The evolution of the promoters (and of the transcription regulators) may thus have played a role as important as gene duplication in the diversification of the nAChR family.

The neuronal non-a subunit group is likely to be polyphyletic, whereas the neuronal a subunits (the "binding subunits") would form a monophyletic group. a5, which lacks some important aromatic amino acids in the third loop of ACh binding site, cannot form functional receptors in vitro with any b subunit (Boulter et al. 1990) but is coprecipitated from endogeneous material with other a subunits (Conroy et al. 1992, Vernallis et al. 1993), thus may represent a new type of "structural" subunit and should be given another name. a5 and b3 could be called g2 and g3. Then the three types of subfamily III subunits could form monophyletic groups - the tribes a, b, and g.

Figure 5, low resolution

figure 5:Summary tree, integrating the results of the whole study. The dates of the last divergences have been calculated from the protein alignment of Appendix 2 (cf Materiels and Methods). a7/a8: 380 MYA; e/g: 508 MYA; e,g/d: 711 MYA; e,g,d/b1: 926 MYA; a3/a6: 529 MYA; a2/a4: 669 MYA; a5/b3: 770 MYA. The age of the precocious divergences have been approximatively infered from the divergence of nematodes (1000 MYA: Vanfleteren et al. 1994). M: muscle subunit; N: neuronal subunit. The grey branches represent subunits putatively expressed in neurons. The black branches represent subunits possibly expressed in muscle.
high resolution

Growth of the neuronal nicotinic subfamily and increase in complexity of the cholinergic system.

The multiple duplications in subfamily III parallel the progressive increased complexity of the chordate nervous system, in particular, of the cholinergic system. At the beginning of the evolution of this phylum, one subunit was plausibly present in the nervous system, resulting from the duplication of a1 (In addition to the ancestor of a7 and a8). The diversity of the group increased during the first 400 MY, until the appearance of Tetrapoda. The whole evolution of the subfamily occurred in the first half of Deuterostomata history (from 600-800 MYA to about 300 MYA). In the first prechordate fossils, we find only two ganglia, the peripheral and the cerebroid ganglia. Branchiostoma has only one pseudovesicle in the head. Spinal chord and cholinergic peripheral nervous system were present early in the vertebrate lineage (although the complete autonomous system was reached only in mammals). Lamprey has five vesicles but the main development of the brain and particularly of the forebrain occurred in Gnathostomata.

In situ hybridization (Deneris et al. 1989, Wada et al. 1989;1990, Zoli et al. 1994) as well as immunohistochemical (Britto et al. 1992, Hill et al. 1993) studies have shown that, in rat and chicken brain, a4 and b2 mRNA distribution is diffuse, whereas a2, a3, a5, a6, b3 and b4 are mainly restricted to a few major cholinergic or cholinoceptive pathways, which, however, also express a4 and b2. b2 has diverged early in the neuronal subfamily history (Figs 2, 3 and results). a4 and b2 could represent a "fossil" expression, which was present in most areas of the ancestral brain. When a duplication occurred, one of the paralogs kept the specific role of the "father gene", whereas the other paralog has to acquire a new role. This role can be defined by a new domain of expression (like the switches muscle/neuron developped before) or by a modified function. The hypothesis that a4 maintained its previous role while another paralog acquired a new role is supported by some evidence on the a2 subunit. In chick brain, the a2 subunit is restricted to the lateral spiriform nucleus (Daubas et al 1990) but, in rat brain it is restricted to the interpeduncular nucleus (Wada et al 1989) - a nonhomolog structure (the interpeduncular nucleus also exists in the chick brain). Moreover, there is no homolog of the lateral spiriform nucleus in rat brain, a fact that points to the genesis of this structure after the divergence of the bird lineage. It is attractive to suppose that a gene duplication occurred a short time before the branching of Theropsida and Sauropsida (i.e. before 310 MYA, Benton 1990). This time would have been too short to define the specificity of a2 (in contrast to a4 which maintained its ancient role). Then two independent specificities of expression took place in the two phyla. Accordingly, a transgene with the avian a2 gene (including the promoter) is expressed throughout the rat brain, mostly in cholinergic structures (motor nuclei and basal telencephalon) (Daubas et al. 1993) (nevertheless, this distribution corresponds neither to the distribution of endogenous a2 nor to that of a4).

In the same way, the duplications a3/a6 and b2/b4 occurred a little before, or a little after, the split between the teleost and the tetrapod lineages. In the rat brain, a3 and b4 are mainly expressed in the medial habenula, a cholinergic and cholinoceptive structure. However, in a teleost fish (Phoxinus phoxinus), an immunocytochemical study did not find any cholinergic cell and found only a small number of cholinergic fibers in the habenula (Ekström 1987). If these characteristics are plesiomorph, there could be again a correlation between gene duplications and a further change of function. a4 and b2 could thus have kept the ancestor role of the neuronal nAChR whereas other paralogs could have found new functional specificities in the evolving cholinergic systems.

We have shown, on the basis of cladograms and phenograms, that the first duplications in the nAChR occurred before the divergence of nematodes. Several nAChR subfamilies were identified. There is congruence between sequence and gene structure analyses and the three subfamilies present in vertebrates correspond closely to the functional subgroups (described from anatomical, pharmacological and structural considerations). Two phenomena seem to have generated the wealth of the family. First, several switches of expression seem to have occured from neuron to muscle and the opposite. Second, multiple gene duplications gave the extant number of paralogs. The neuronal nicotinic subunit subfamily (type III subunits) appeared at the beginning of the chordate phylum, and grew until the separation of Sauropsida and Theropsida lineages. This diversification, both quantitative and functional, paralleled the increase in complexity of the cholinergic systems. A link between an increased combinatorial complexity of subunit combinations and a larger plasticity in the functioning of these pathways is plausible.

Aknowledgments:

We thank Clément Léna, Pascal Tassy and Fredj Tékaia for helpful advice. We also thank Richard Miles, Michele Zoli and the anonymous referee for critical reading of the manuscript. This work was supported by the Centre National de la Recherche Scientifique, the Institut National de la Santé et de la Recherche Médicale (contract no. 872004), the Collège de France and the Direction des Recherches et Etudes Techniques (contract no. 90/142).

references

Appendix 1: Glossary of the cladistic terms present in the text.

Homology
Two similar characters are homologous when they come from a common ancestor. There is homology between two genes if they arose by duplication. Characters are homologous if 1)-They look like each other, 2)-they do not coexist in the same organism (the arms and the wings of angels) and 3)-if they provide the same phylogenies than other homologous characters.
Orthology
Two orthologs are homologs arisen by speciation (the "same" gene in two different species).
Paralogy
Two paralogs are homologs arisen by duplication in the same organism.
Monophyly
A monophyletic group is formed by one ancestor and all its descendants.
Polyphyly
A polyphyletic group is formed by different subgroups, which do not share any common ancestor belonging to the group.
Plesiomorphy
A plesiomorphy is an ancestor character. It is not useful to reconstruct phylogenies
Autapomorphy
An autapomorphy is a derived character, present in one descendant only. It is not useful to reconstruct phylogenies.
Synapomorphy
A synapomorphy is a derived character, shared by a monophyletic group of several descendants. The synapomorphies are the only useful caracteres to infer phylogenies.
Informative site
A character is informative if it exists under at least two state present twice. A character presenting everytime the same state as well as a character different in every compared object are not informative.

Last modification November 19th 2002.
Nicolas Le Novère.
back to Receptors and Cognition homepage (JP Changeux) back to Pasteur Institute