Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (26)
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Floriano, W. B.
Right arrow Articles by Goddard, W. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Floriano, W. B.
Right arrow Articles by Goddard, W. A., III
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Chem. Senses 29: 269-290, 2004
© Oxford University Press 2004

Making Sense of Olfaction through Predictions of the 3-D Structure and Function of Olfactory Receptors

Wely B. Floriano, Nagarajan Vaidehi and William A. Goddard, III

Materials and Process Simulation Center, MC 139-74, Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125, USA

Correspondence to be sent to: William A. Goddard III, Materials and Process Simulation Center, MC 139-74, Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125, USA. e-mail: wag{at}wag.caltech.edu


    Abstract
 Top
 Abstract
 Introduction
 Materials and methods
 Results and discussion
 Screening odorant libraries to...
 Other ORs sharing similar...
 Summary and conclusions
 Acknowledgements
 References
 
We used the MembStruk first principles computational technique to predict the three-dimensional (3-D) structure of six mouse olfactory receptors (S6, S18, S19, S25, S46 and S50) for which experimental odorant recognition profiles are available for a set of 24 odorants (4–9 carbons aliphatic alcohols, acids, bromo-acids and diacids). We used the HierDock method to scan each predicted OR structure for potential odorant binding site(s) and to calculate binding energies of each odorant in these binding sites. The calculated binding affinity profiles are in good agreement with experimental activation profiles, validating the predicted 3-D structures and the predicted binding sites. For each of the six ORs, the binding site is located between trans-membrane domains (TMs) 3–6, with contributions from extracellular loops 2 and 3. In particular, we find six residue positions in TM3 and TM6 to be consistently involved in the binding modes of the odorants. Indeed, the differences in the experimental recognition profiles can be explained on the basis of these critical residues alone. These predictions are also consistent with mutation data on ligand binding for catecholamine receptors and sequence hypervariability studies for ORs. Based on this analysis, we defined amino acid patterns associated with the recognition of short aliphatic alcohols and mono-acids. Using these two sequence fingerprints to probe the alignment of 869 OR sequences from the mouse genome, we identified 34 OR sequences matching the fingerprint for aliphatic mono-acids and 36 corresponding to the recognition pattern for aliphatic alcohols. We suggest that these two sets of ORs might function as basic arrays for uniquely recognizing aliphatic alcohols and acids. We screened a library of 89 additional molecules against the six ORs and found that this set of ORs is likely to respond to aldehydes and esters with longer carbon chains than their currently known agonists. We also find that compounds associated with the flavor in foods are often among the best calculated binding affinities. This suggests that physiologic ligands for these ORs may be found among aldehydes and esters associated with flavor.

Key words: biosensors, docking, function prediction, GPCR, G protein coupled receptor, HierDock, mice, olfaction, olfactory receptor, structure prediction, taste, virtual screening


    Introduction
 Top
 Abstract
 Introduction
 Materials and methods
 Results and discussion
 Screening odorant libraries to...
 Other ORs sharing similar...
 Summary and conclusions
 Acknowledgements
 References
 
Recent advances in the study of the olfactory system have provided important information about the mechanisms underlying odor recognition. We now know that the early stage in odorant detection involves binding of odor molecules to olfactory receptors (ORs), which are expressed in olfactory sensory neurons (OSNs) in the nose (Buck and Axel, 1991Go; Lancet and Ben-Airie, 1993Go). Each OSN expresses only one OR type but a particular OR can respond to multiple odorants, while a particular ligand can elicit response from multiple ORs, leading to unique combinations of ORs for each odorant (Krautwurst et al., 1998Go; Malnic et al., 1999Go). ORs are members of the rhodopsin-like class of G protein coupled receptors (GPCR), which also includes catecholamine receptors, and their three-dimensional structure contains the seven helical transmembrane (TM) motif characteristic of GPCRs (Mombaerts et al., 1996Go). The ORs share sequence fingerprints not seen in other GPCRs and their higher sequence variability is consistent with the ability to recognize structurally diverse odorants (Buck and Axel, 1991Go). Each OR interacts with a variety of structurally diverse odor molecules, in contrast to other members of the GPCR family such as the receptors for dopamine, histamine and serotonin, which are narrowly tuned to one or a few closely related agonists (Bozza and Kauer, 1998Go; Krautwurst et al., 1998Go; Zhao et al., 1998Go; Duchamp-Viret et al., 1999Go; Malnic et al., 1999Go; Mori et al., 1999Go; Rubin and Katz, 1999Go).

Bioinformatic analyses of genome sequence databases indicate that mice have ~1200 different ORs (Zhang and Firestein, 2002Go), while humans have ~350 (Glusman et al., 2000Go; Zozulya et al., 2001Go). The combinatorial nature of odor recognition explains how humans can detect and distinguish an immense number and variety of odorants using only 350 distinct ORs. ‘Families’ of ORs, whose members have >40% protein sequence identity appear largely to be common to both species (Zhang and Firestein, 2002Go).

In order to learn more about the function of GPCRs, we developed the MembStruk computational technique to predict the three-dimensional (3-D) structure of these proteins and we developed the HierDock computational technique to predict the binding site and binding energy of ligands to GPCRs (Floriano et al., 2000Go; Vaidehi et al., 2002Go). These methods have previously been applied to predict the binding site for epinephrine to adrenergic receptor (Vaidehi et al., 2002Go), for dopamine to dopamine receptor (Vaidehi et al., 2002Go) and for alcohols to the S25 mouse OR (Floriano et al., 2000Go). Particularly for epinephrine and dopamine, detailed mutation and binding studies provide strong evidence for the accuracy of the predicted 3-D structure.

In an earlier work (Floriano et al., 2000Go), we used computational modeling to build a 3-D structure for OR S25 shown experimentally by Malnic et al. (1999Go) to bind hexanol and heptanol. The calculated binding affinity profile for a C4–C9 linear alcohol series suggests that it is the pattern of structural features rather than the specific molecular structure that triggers recognition. We found that the binding site of hexanol and heptanol in OR S25 involves the helical domains TM3, TM4, TM5 and TM6, which is consistent with earlier computational work on rat I7 receptor (Singer, 2000Go) and with mutation studies for binding of dopamine and epinephrine in biogenic amine receptors (Strader et al., 1994Go; Shi and Javitch, 2002Go; Bissantz et al., 2003Go). Residues we found to be involved in binding in S25 (Floriano et al., 2000Go) were previously proposed on the basis of high variability among ORs to be involved in odorant binding (Malnic et al., 1999Go; Pilpel and Lancet, 1999Go).

In this paper we report the predicted structures of six ORs studied by Malnic et al. (1999Go). We also report the predicted binding sites of the 24 aliphatic odorants they studied experimentally and calculated their relative binding energies to each OR. We find quite good agreement between the predicted binding profiles and the experimental activation profiles.

We also screened a library of 89 additional molecules against the six ORs and found that compounds associated with the flavor in mouse foods appear often among the best binders. This suggests that ORs function partly to identify nutrients-rich foods.


    Materials and methods
 Top
 Abstract
 Introduction
 Materials and methods
 Results and discussion
 Screening odorant libraries to...
 Other ORs sharing similar...
 Summary and conclusions
 Acknowledgements
 References
 
We built 3-D structures for six mice (S6, S18, S19, S25, S46 and S50) olfactory receptors (ORs) studied experimentally by Malnic et al. (1999Go) who measured recognition profiles for a systematic library of 24 aliphatic alcohols and acids (each with 4–9 carbons). Of the 14 ORs studied by Malnic et al. (1999Go), we selected six with complete amino acid sequences.

After predicting each OR structure, we used the HierDock procedure to first scan the receptors for potential binding site(s) and then to dock the ligands into the putative binding site and predict their binding energies. The steps of 3-D structure prediction, scanning for possible binding sites and binding of ligands were performed independently to each OR, with no assumptions or constraints being carried from one OR to another. We compared the predicted affinity profiles for the set of 24 odorants to the known experimental response and found good correlation. We then screened a library of 89 additional molecules against the six ORs.

Ligand/odorant structure building

We considered the two libraries of ligands shown in Table 1. List A consists of the 24 aliphatic acids, alcohols, diacids and bromoacids (each with 4–9 carbons) tested experimentally by Malnic et al. (1999Go). List B contains an additional 89 molecules including common odorants, molecules that are odorless to humans, tastants, glucose and some molecules of potential biodefense interest. We included tastants in this list to examine how tastants might affect ORs. Comparing the binding affinities of such tastants as sugars (which are natural agonists of the functionally related taste receptors, also members of the GPCR family) against ORs could provide insights into how function diversity evolved among the chemosensory receptors and more generally in the GPCR family.


View this table:
[in this window]
[in a new window]
 
Table 1 Ligands for which we have calculated binding sites and binding energies to the six OR’s in the present study
 
Three-dimensional structures for all molecules in Table 1 were built as follows.

  1. 1. Chemical structures were drawn using the program ISISDRAW (MDL Information Systems Inc., 2001) and saved in the two-dimensional (2-D) ‘mol’ format.
  2. 2. The program Stereoplex 1.2 (Tripos Inc., 2001) was used to generate all stereoisomers, whenever applicable.
  3. 3. We used the 2-D structures from (1) and (2) and the program Concord version 1.2 (Tripos Inc. 2001) to generate 3-D structures, add hydrogens and assign Gasteiger atomic charges to each molecule. Acids, diacids and bromoacids were assigned a formal charge of –1.0 to each carboxyl group.
  4. 4. The starting structures from (3) were optimized by conjugate gradient minimization of the potential energy using the Dreiding force field (Mayo et al., 1990Go) with Gasteiger (Gasteiger and Marsili, 1980Go) charges. The structures were minimized (to an RMS force of 0.2 kcal/mol/Å) and used as starting conformations in HierDock for locating the putative binding site of odorants in the 3-D structure of each OR and to determine their binding configuration and energy. The solvation energies for the odorants were calculated using the analytical volume generalized born (AVGB) continuum solvation approach (Zamanakos, 2002Go).

Building the 3-D structure of ORs

The 3-D structure of each ORs was predicted using the MembStruk 1.0 procedure as detailed in Floriano et al. (2000Go). The main steps in the MembStruk procedure are as follows.

  1. 1. Predict the TM regions using hydropathicity analysis (Donnelly, 1993Go) combined with input from multi-sequence profiles calculated using the Eisenberg hydrophobicity scale (Eisenberg et al., 1984Go). The sequence alignment profile is obtained from the sequence analysis for the set of ORs described in Floriano et al. (2000Go). The TM regions are assigned using the hydropathicity profiles and capping rules based on known helix breaker residues (Pro, Gly and charged residues). The average hydrophobicity at each sequence position is calculated over window sizes from 12 to 20, until seven peaks can clearly be identified in the profile. The sequence alignment showing the predicted TMs as well as the residues we find to be involved in odorant binding is given in Figure 4 and will be discussed in the results section.
  2. 2. Construct canonical right-hand {alpha}-helices for the predicted TM segments and add counterions Cl and Na+ to neutralize charges in Lys, Arg, Glu and Asp residues. Optimize these individual helices using conjugate gradients energy minimization followed by 200 ps of torsional dynamics using the NEIMO (Newton–Euler inverse mass operator) torsional molecular dynamics method (Mathiowetz et al., 1994Go; Vaidehi et al., 1996Go) at 300K, which fixes bonds and angles. This allows the helix to distort and bend in response to helix breakers such as proline and also optimize the side-chain conformations.
  3. 3. Identify lipid-accessible residues from analysis of the periodicity of the hydrophobic residues (Donnelly, 1993Go) in the sequence.
  4. 4. Assemble the helix bundle as follows. The helical axes were oriented according to the 7.5 Å electron density map of frog rhodopsin (Schertler, 1998Go). Although there are high resolution structures available for rhodopsin (Okada et al., 2002Go; Teller et al., 2001Go; Palczewski et al., 2000Go), we used this low resolution one because we only wanted the orientations of the helical axes (not atomic coordinates) to build an initial template bundle. The helices were then rotated to orient their hydrophobic moments to the outside of the helix barrel. We then added an explicit lipid bilayer of 52 molecules of dilauroylphosphatidyl choline to simulate the membrane. This lipid–TM bundle was minimized to an RMS force of 0.2 kcal/mol/Å.
  5. 5. Further optimize the lipid-TMs bundle for 100 ps using constant temperature rigid body molecular dynamics (RBMD) at 300K.
  6. 6. Construct disulfide bridges as described below and add intracellular and extracellular connecting loops using the software WHATIF (Vriend, 1990Go).
  7. 7. Energy minimize the entire lipid-TM bundle using the surface generalized born (SGB; Ghosh et al., 1998Go) continuum solvent method (we used dielectric constant of 60.0 to simulate the dielectric region surrounding the membrane).



View larger version (72K):
[in this window]
[in a new window]
 
Figure 4 Sequence alignment for mouse olfactory receptors S6, S50, S18, S19, S25, S46 and bovine rhodopsin. Predicted trans-membrane regions are highlighted. Residues found within 3.5 Å of an experimental agonist are in bold underlined font. Cys involved in disulfide bonds are in lower case letter. Blue letters represent the amino acid differences in the 94% homologous pair S6 and S50. Hypervariable positions are highlighted in the secondary structure assignment rows. There have been suggestions in the literature that hypervarient positions are involved in odorant recognition (Malnic et al., 1999; Pilpel and Lancet, 1999). Trans-membrane helices (TM) 1–7, extracellular loops (EC) 1–3 and intracellular loops (IC) 1–3 are identified in the alignment. For bovine rhodopsin, the secondary structure assignment was from the pdb file 1F88. Some of the residues found to be involved in binding in our models are also hypervariable positions. Trans-membrane domains 3, 5 and 6 have positions consistently involved in binding for most ORs studied here.

 
All calculations were carried out with Dreiding force field (Mayo et al., 1990Go) using CHARMM22 (MacKerell et al., 1998Go) charges for the protein.

ORs have four highly conserved cysteines believed to form disulfide bridges (Floriano et al., 2000Go). One of the conserved Cys is located in the extracellular loop 1 (EC1) and the other three are located in the extracellular loop 2 (EC2). Mutation analysis (Singer et al., 1996Go) and truncation experiments (Gimelbrant et al., 1999Go) suggest that the EC1 Cys is paired to the second Cys in EC2 and that the first and third Cys in EC2 are bound together. These correspond to Cys127–Cys209 and Cys199–Cys219 in the amino acid sequence of S25. Disulfide bridges in the loop region increase the confidence in the modeled loop structure since they restrict the loops conformational space. However, because of their high flexibility, loops are still the part of the structure with the least degree of confidence in our structure predictions.

There is one additional highly conserved Cys located in TM6 that is unpaired in all six predicted structures (in S25 this is Cys 241). S25 and S19 have additional disulfide bridges (S25: Cys132–Cys192 and Cys157–Cys171; S19: Cys60–Cys102, Cys92–Cys274 and Cys155–Cys176), all of which were assigned by mutation analysis (Singer et al., 1996Go) and assumed in our calculations. S18 (Cys105–Cys187 and Cys177–Cys197) and S46 (Cys99–Cys181 and Cys171–Cys191) have only the two highly conserved disulfide bonds. S50 and S6 have the two conserved linkages (Cys101–Cys183 and Cys173–Cys193) plus an extra disulfide bond between Cys116 and Cys289. S6 and S50 have three Cys (Cys76, Cys116, and Cys289) located about half-way into the barrel on TMs 2, 3 and 7 that could potentially pair in any combination, consequently pulling two of the three helices closer to each other. Our predicted structures for S50 and S6 place the pair Cys116–Cys289 close enough to form an SS bond in the final structures and hence we made this bond.

The 3-D structure for S25 used in our previous publication (Floriano et al., 2000Go) was improved by correcting some chirality problems not detected in the original structure. The binding profile of the odorants for S25 was re-calculated using the HierDock protocol reported here, which is considerably improved from the previous procedure (Floriano et al., 2000Go).

Predicting function for the ORs

Since there is no experimental information on the location of the binding site of the odorants, we used ScanBindSite (a variation of HierDock) to scan the entire receptor structure to locate the binding site of each odorant to each OR. No assumption was made on the nature or the location of the binding site in these receptors. Once the binding site was determined, we applied HierDock to this putative binding site to predict the best binding configuration and energy for each odorant in this site.

Scanning for a putative binding site (ScanBindSite)
Scanning of each OR for an energetically favorable binding region was performed as follows.

We partitioned the empty volume suitable for docking in each OR structure into 4–13 (depending on the size of the empty volume) small overlapping regions. A modified version of HierDock (called ScanBindSite) was used to independently dock the 24 ligands into each region. Rigid docking of each ligand into each docking region was performed using the program Dock 4.0 (Ewing and Kuntz, 1997Go). The maximum number of docked orientations was set to 100. Each configuration was torsion minimized. We used the option for bump filter (four bumps), energy scoring and automated matching with no anchor search in the Dock 4.0 code.

The configurations with the 10 best Dock 4.0 scores were selected for each ligand in each region. Each of these configurations (240 per region) was then energy minimized for 80 steps while keeping the protein coordinates fixed. We then eliminated any configuration whose molecular surface was <70% buried into the receptor and selected the lowest energy configuration for each ligand in each region. For each OR there was one region where most of the selected configurations cluster together. This region was then selected as the binding site. No presumption was made on the nature or the location of the binding site in these receptors.

Calculating binding energy profiles using the HierDock protocol
We used the HierDock protocol to predict the structures of the odorant/OR bound complexes and to estimate their binding affinities. HierDock has been validated for other ORs (Floriano et al., 2000Go), other GPCRs (Vaidehi et al., 2002Go) and for globular proteins (Wang et al., 2002Go; Datta et al., 2003Go; Kekenes-Huskey et al., 2003Go). The main steps of the HierDock protocol are as follows.

  1. 1. Define docking region. We run HierDock twice for each OR. In the first pass, the docking region for each target protein is defined as the location found in the ScanBindSite step. For the second HierDock pass, the docking region is redefined using the best known positive from the first pass. The new binding region is defined as the 1 Å around the selected known positive and all the ligands are re-docked into this narrower binding region.
  2. 2. Protein grid calculation. The docking step uses an energy grid for the protein contribution to the interaction energy. This grid is calculated only once per target, using the program Grid provided as part of the Dock 4.0 (Ewing and Kuntz, 1997Go) package.
  3. 3. Level 0. Generation of docked configurations of each ligand into the binding site of the target protein using the program Dock 4.0 (Ewing and Kuntz, 1997Go). We used the options for flexible docking, torsion drive on, energy scoring, 75% reduced van der Waals parameters for ligands, minimization before scoring and 1000 maximum scored configurations. The 500 best configurations by Dock 4.0 scores for each ligand in the library were saved and used in subsequent steps.
  4. 4. Filter. A combined criterion of differential solvation and level 0 scores (Dock 4.0 energies) was applied to the 500 configurations per ligand from level 0. We eliminated configurations with <75% of buried surface and those having energy >100 kcal/mol. The best 50 configurations per ligand were carried to next step.
  5. 5. Level 1. For each of the 50 configurations per ligand surviving filter step 4, we used the Dreiding FF with CHARMM22 charges for protein and Gasteiger charges for ligand to minimize the structure of each ligand (gas phase) for (100 steps) using fixed protein coordinates. The best five configurations by energy were selected for each ligand.
  6. 6. Level 2. For each of the five configurations per ligand from level 1, we used the Dreiding FF to minimize the structure of the ligand–protein complex for 100 steps, allowing all atoms of the protein to move. The best configuration by energy was selected for each ligand.
  7. 7. Ranking. The complexes from level 2 were scored by single point energy calculation including solvation using the AVGB (Zamanakos, 2002Go) continuum solvation method with fixed protein (this gives the energy of the ligand in the field of the protein) using a dielectric constant of 2.5. In addition, we calculated the energy including solvation of the free ligand in its docked conformation. The dielectric constant used for non-bonded interactions was 1.0, while the dielectric constant for the continuum solvation calculation was set to 78.2 for exterior regions and 1.3 for the interior region. The binding energies were calculated as the difference in the energies of the ligand in protein and ligand in water [BindE = E(bound_ligand_in_protein) – E(free_ligand_in_water)]. The ligand list was then grouped according to the main chemical functional group and sorted by binding energies.

Refining the binding site location and the binding affinities
Each of the six ORs studied here has one or more experimentally known agonist ligand (Malnic et al., 1999Go), that is, an odorant that causes response from cells found to express that OR. Hence we used the putative binding mode of the known odorants having the most favorable binding energy in a first HierDock run to refine the location of the binding site in each OR. Note that this procedure does not change the general location of the binding region. This only reduces its size, making the conformational sampling step more efficient. For the first HierDock run on each OR, we used the binding site from ScanBindSite (described above). We then used the bound configuration of the best (by binding energy) known ligand to define a smaller docking region surrounding the ligand with a margin of 1 Å for a second HierDock run. All ligands, including the ones used for refining the binding regions, were re-docked and re-scored in the new binding region using steps 2–7 described above. This procedure ensures that we calculate binding affinities for ligand configurations that are in similar binding modes. It also allows for a better sampling of the binding site. This is important because ligands that do not elicit experimental response may have better affinity for regions other than the one where binding of the ligand leads to activation of the receptor. However, the affinity at the active site is what should differentiate experimental positives from negatives.

Structural analysis of the complexes generated in the first HierDock pass for S18, S19 and S46 ORs, pointed to a strong interaction of the carboxyl groups in the ligands with histidine in the receptors. These histidines (His113/S18, His133/S19 and His107/S46) were treated as charge neutral during ScanBindSite and for the first HierDock pass. However, since they were found to interact with carboxyl groups in the ligands, we assumed that the ring nitrogens will be protonated and assigned a formal charge of +1.0 to these histidines, which was used for the second HierDock pass. These histidines are conserved in S18, S19 and S46 (all three recognize acids and bromo acids, but not diacids), mutated to valine in S25 (which does not recognize acids, bromo acids, or diacids) and mutated to tyrosine in S6 and S50 (both recognizes diacids, but not acids or bromo acids). The binding energies of the ligands calculated using the refined receptor structure for the second HierDock run have been used for binding energy comparisons across ligands and ORs.

Using perturbation to further enhance the conformational search
The differences in binding energies for members of the same chemical class are often in the order of a few kilocalories. Since small differences in the binding mode of a ligand can cause differences in binding energy of similar magnitude, we need to ensure that the ligands being compared are scored in equivalent binding modes. To do this we start with the best binding conformation of the most energetically favorable ligand of a chemical class and generate all other ligands in the class (differing only in the carbon chain length) by adding or removing methyl groups. We then minimize (with movable protein) and score the perturbed complexes as described in level 2 of HierDock (steps 6 and 7, described above). This guarantees that different ligands in the same chemical class are scored consistently so that their relative binding energies can be used as a measurement of relative biological affinities.

This perturbation procedure was used when members of a chemical class were found to have different binding modes after HierDock. However, it was not used when the members of a chemical class were found by HierDock to have equivalent binding modes. The need for perturbation comes from the non-exhaustive conformational sampling in level 0 of HierDock.

Among the six receptors reported here, this perturbation approach was found to be necessary only for the alcohol series bound to S25. In this case, all other members of the alcohol series were perturbed from the bound configuration of hexanol in S25. For the other five receptors, we found that HierDock led to equivalent binding modes for all members of each chemical class and therefore perturbations were not required.


    Results and discussion
 Top
 Abstract
 Introduction
 Materials and methods
 Results and discussion
 Screening odorant libraries to...
 Other ORs sharing similar...
 Summary and conclusions
 Acknowledgements
 References
 
In this section we discuss the predicted 3-D structures of the six mouse ORs S6, S18, S19, S25 and S50 bound to their known and potential agonists. The TrEMBL identification numbers for these OR sequences are given in Table 6.


View this table:
[in this window]
[in a new window]
 
Table 6 TrEMBL codes for the sequences extracted from the OR genome database set (comprised of 869 mouse OR amino acid sequences) by matching the recognition pattern deduced for 6–9 carbon aliphatic alcohols and aliphatic acids (highlighted in Table 4 based on predicted binding sites for mouse olfactory receptors: S18, S19, S46 and S25
 
Comparison of the predicted 3-D structure of the six ORs

The calculated percentage of sequence similarity and the corresponding root mean square deviation in (carbon alpha) coordinates (CRMS) difference between predicted structures for the ORs studied here are given in Table 2. The sequence comparison considered all amino acids in each aligned OR, while the structure comparison included only the TM regions. Members of the same protein family with high sequence similarity are expected to have a high degree of structural homology. For the family of six ORs studied here, sequence similarity ranges from 20 to 94%. The CRMS differences between the TM regions of the predicted OR structures range from 1.5 Å (between S6 and S50 with a sequence similarity of 94%) to 6.3 Å (between S19 and S25 with a sequence similarity of 25%; as discussed below, the bottom part of S19 had a distorted chain that led to larger CRMS errors).


View this table:
[in this window]
[in a new window]
 
Table 2 Sequence and structural relationship among the six mouse olfactory receptors studied here
 
The sequence similarity between these six ORs and bovine rhodopsin (pdb code 1F88) ranges from 11% (S6) to 18% (S46) leading to CRMS deviations of 4.0 Å (S46) to 7.3 Å (S19), with five of the six receptors in the 4.0–4.4 Å range.

Our predicted structure for OR S19 differs dramatically from the other structures, with a CRMS of 6.3 Å. This arises from TM 4 in S19, which is unraveled in the bottom one-third (C terminus). This probably resulted from an early bad contact in the helix optimization steps, but we did not go back to re-examine it because the intracellular C terminus is not expected to be involved in direct binding of the ligand.

The CRMS differences found for the six ORs can be compared with that found in sets of homologous proteins including globins, serine proteases and immunoglobulins (Chothia and Lesk, 1986Go). Sequence identities in the 20–50% range translate to structural differences between 1 and 2 Å. This agrees with the accuracy found for structures of globular proteins predicted using homology methods for cases with 50% sequence similarity (Marti-Renom et al., 2002Go).

The MembStruk structure prediction algorithm for GPCRs uses only sequence information combined to physical principles to predict the 3-D structure. This leads to independent unbiased predictions for each of the six ORs. The advantage of this approach is that mistakes made on a model will not pass to models generated from it and the problems that come with aligning low homology sequences are avoided. However, this brings the disadvantage that structures of the same family may differ from each other more that they should because of the statistical nature of our approach. We plan to apply homology methods to predict the other five ORs from each of our predicted structures. Such methods might be used to obtain several predicted structures that could be averaged in some way to provide more accurate structures than can be derived solely from independent MembStruk runs.

Predicted odorant recognition profiles for the six ORs

Table 3 and Figure 1a–f show the binding energies predicted for the series of 24 odorants for which odorant activation profiles were determined experimentally (Malnic et al., 1999Go). It is important to note that the calculated binding energy is a necessary but not sufficient condition for activation of the receptor. Some odorants may bind to the receptor but may not activate it. In Figure 1a–f odorants that activate experimentally cells from where the ORs were isolated are fully shaded. Odorants not tested experimentally against cells expressing a particular OR are half shaded. Odorants tested and found not to cause activation are not shaded. Each odorant family is ordered with C4 at the left and C9 at the right.


View this table:
[in this window]
[in a new window]
 
Table 3 Calculated (HierDock) binding energies (kcal/mol) of the 24 odorants in Table 1for the six mouse olfactory receptors
 


View larger version (55K):
[in this window]
[in a new window]
 
Figure 1 Calculated (HierDock) binding profiles for mouse olfactory receptors (a) S50, (b) S6, (c) S18, (d) S19, (e) S46 and (f) S46. Odorants found to elicit experimental response (Malnic et al., 1999) are filled in black. Odorants that were not tested experimentally are filled in grey. Odorants that do not elicit response experimentally are represented as non-shaded bars. The experimental concentration thresholds (Malnic et al., 1999) are provided to show their correlation with the calculated binding energies.

 
The calculated recognition profiles for S50 (Figure 1a) and S6 (Figure 1b) clearly identify aliphatic diacids as the best odorant binding class for both receptors, which is amply confirmed by the experiments. Within the diacids class, the ligand with the most favorable calculated binding affinity for S50, azelaic acid (C9 diacid), is the only odorant observed to elicit response experimentally out of the 24 odorants. The predicted odorant binding energy profile for S6 (Figure 1b) is also in good agreement with the experimental profile in terms of the chemical class identified by this OR. Also, the predicted best ligand (by binding energy), azelaic acid (C9 diacid), is one of the two odorant agonists observed experimentally. However, the theory predicts that the second experimental positive (the C8 diacid suberic acid) is the third highest (more negative) binding energy while the second best calculated binding energy (the C7 diacid pimelic acid) was not observed experimentally. However, suberic acid elicited only a weak response in the experiments (Malnic et al., 1999Go) and it is possible that pimelic acid could elicit response at higher concentrations or if tested with longer residence time.

The ranking of ligands by predicted binding energy for S18 (Figure 1c), S19 (Figure 1d) and S46 (Figure 1e) correlates well to the experimental recognition profiles within the same chemical class. Thus ligands found to elicit experimental response have more favorable binding energies than ligands that do not elicit response. However, the energy differences cannot clearly separate them apart. For example, the calculated binding energies of the alcohols that activate the ORs S18 and S19 are not comparable to the binding energies of acids. This discrepancy in binding energies across chemical classes could arise from many factors such as dominance of the electrostatic contribution to the binding energy of the charged ligands with insufficient desolvation contribution, or underestimation of the hydrogen bond contribution to the energy of the receptor–ligand complex. Moreover, the binding energies are calculated at 0K temperature and do not include entropic effects.

The predicted binding profile for S25 is in good agreement with the experimental one. The alcohols are predicted to bind much more strongly than any of the other three classes. Within the alcohol series, heptanol and hexanol, which were found to elicit response experimentally (Malnic et al., 1999Go), have the best (more negative) binding energies, although butanol (not tested experimentally), pentanol and octanol are very close in energy to the experimental positives.

Overall, all three receptors give good correlation for each of the four chemical classes between the predicted binding energies and the observed activation profile. However, there seem to be differences in the absolute values for the activation threshold for different odorant classes. Also, the absolute binding energies for different receptors vary significantly between the various cases. This discrepancy in binding energies for different chemical classes could arise from

  1. 1. slight errors in the 3-D structure (rotation or translation of one or more helices by an Ångstrom or so that might have differential effects on different classes);
  2. 2. approximations in the solvation of the active site where we use a non-variable dielectric constant and ignore the possible role of water (and metals such as Zn) buried in the active site; or
  3. 3. differential charging of the bound ligand with respect to the ligand isolated in solvent (we assume that all acids have the same net charge in the protein and in solution.

The calculated odorant binding profiles also correlate with the experimental concentration threshold. Thus, nonanoic acid (which elicits experimental responses at 10 µM for S18 and S46 and at 1 µM for S19) has higher binding affinity than the acids that elicit experimental response at 10 µM (octanoic and heptanoic acids in S19) and 100 µM (octanoic acid in S46; octanoic and heptanoic acids in S18; hexanoic acid in S19). Correlation between concentration threshold and binding energy is also found for S19 binding to alcohols where nonanol (1 µM) has better (more negative) binding energy than octanol (10 µM), which is better than heptanol (100 µM). The same holds true for S19 and bromo acids, with 8-bromo octanoic acid (1 µM) ranking better than 6-bromo hexanoic acid (100 µM). In all these cases, the calculated binding decreases with the decrease in carbon chain length. Three odorants not tested experimentally show better binding energy than known ligands and are consistent with the trend described above: 9-bromo nonanoic acid for S18; 9-bromononanoic acid and 7-bromoheptanoic acid for S19. We predict that these ligands should bind and likely serve as agonists.

Location and general features of the binding sites

Figure 2 shows that the binding sites (side view and the extracellular top view) of odorants in all six ORs are located between TMs 3, 4, 5 and 6 and lie ~10 Å below the extracellular loops of the helical barrel. Figure 3 superimposes the predicted binding site for azelaic acid in S6 (receptor and ligand in blue) and compares it to the binding site of cis-retinal in bovine rhodopsin crystal structure (pdb code: 1F88) colored in red. This shows that the binding sites for ORs are in the same general location as retinal in rhodopsin, even though all potential binding sites were determined by independent scanning of the receptors. Figure 3 also shows the differences between the crystal structure of bovine rhodopsin and our predicted structure for OR S6.



View larger version (53K):
[in this window]
[in a new window]
 
Figure 2 Front and top (extracellular end) view of the 3-D structures of mouse olfactory receptors S6, S50, S18, S19, S25 and S46 with their respective agonists bound to the HierDock predicted binding sites. The TMs are, counter-clockwise from the back: TM 1 (red-brown), TM2 (ochre), TM3 (moss green), TM4 (lime green), TM5 (green), TM6 (light blue) and TM7 (blue). Figures were generated using VMD (Humphrey et al., 1998).

 


View larger version (58K):
[in this window]
[in a new window]
 
Figure 3 Structural superposition of mouse olfactory receptor S6 (with azelaic acid bound to predicted binding site) to bovine rhodopsin bound to retinal (pdb code 1F88) showing the differences in the protein structures and the similarity in the binding sites location. S6 and azelaic acid are blue; rhodopsin and retinal are red. The OR structures and the locations of their binding sites were each derived independently. No structural information about the binding site location, length of the trans-membrane domains, tilt of the helices or loop configuration from the rhodopsin structure was used in predicting the structure of the ORs.

 
Residues within 3.5Å of the experimental agonists for all six olfactory receptors are shown in Figure 5 and Table 4. The same set of residues is also marked in the sequence alignment of these ORs shown in Figure 4. Residues that form hydrogen bonds with the ligand are shown in bold font in Table 4. Amino acid positions in the sequence alignment (Figure 4) found to be involved in odorant binding for four ORs (S18, S19, S25 and S46) are highlighted in Table 4. These are positions 6, 9, 10 and 13 in TM3, and position 19 in TM6. Consensus positions 6, 9 and 13 in TM3 are in the same face of that helix. These four ORs recognize odorants from the same chemical classes, while the other two (S6 and S50) recognize a class of odorants (diacids) not recognized by any of the other four ORs.



View larger version (38K):
[in this window]
[in a new window]
 
Figure 5 Details of the binding modes for the six mouse olfactory receptors: azelaic acid/S6, azelaic acid/S50, 8-bromo-octanoic acid/S18, (nonanol and octanoic acid)/S19, nonanoic acid/ S46 and heptanal/ S25. Hydrogen bonds to anchor residues in the binding site are traced in the figures. Different ORs use different binding modes to recognize the same ligands. Different anchor points within the same binding site can also be used for different functional groups.

 

View this table:
[in this window]
[in a new window]
 
Table 4 Residues involved in binding of experimental agonists for the six mouse olfactory receptors
 
We assume that these amino acid positions found to be consistently involved in binding of odorants in S18, S19, S46 and S25 are indeed responsible for odorant recognition. We will now examine whether this information on the binding site can be used to explain the differences in the experimental odorant activation profiles for these ORs.

The ORs that recognize acids and bromo acids (S18, S19 and S46) have a His in position TM3-6 (this notation means sixth residue on TM3 starting from the amino end of the helix 3), a residue with a short hydrophobic side chain (Val, Ala or Gly) at position TM3-10 and a Pro in position TM6-19.

The only OR among the six considered here that does not recognize acids (S25) has:

  • a Val in TM3-6;
  • a polar Thr at position TM3-10;
  • another Thr in TM6-19.

The Thr284 at position TM6-19 in S25 is hydrogen-bonded to Lys302 (EC3), which is the anchor residue for the hydroxyl group.

The two ORs that recognize both alcohols and acids (S18 and S19) have Phe at position TM3-9, which is occupied by a Thr in the OR S46, that recognizes only acids and by a Gly in the OR S25 that is activated only by alcohols. Figure 6 shows a comparison between these positions in S18 (recognizes alcohols and mono-acids) compared with S25 (alcohols but not mono-acids) and S46 (mono-acids but not alcohols). Together, these four positions seem to explain the differences in chemical class preference for S18, S19, S25 and S46. Three of them (TM3-6, TM3-9 and TM3-10) are considered hypervariable positions in OR sequence alignments and have been suggested to be involved in differentiating among odorants (Malnic et al., 1999Go; Pilpel and Lancet, 1999Go). The fifth consensus position TM3-13 (also a hypervariable position according to the literature; Malnic et al., 1999Go; Pilpel and Lancet, 1999Go) does not seem to account for differences in preference among the Malnic 24 odorant set and may be involved in the binding of some other odorant not included in that set.



View larger version (29K):
[in this window]
[in a new window]
 
Figure 6 Schematic diagram showing that the residues found to be involved in odorant binding are in conserved alignment positions across ORs S18, S19, S25 and S46. These four sequence alignment positions in TM3 and TM6 can explain the differences in the experimental response profiles. We compare S18 (recognizes long alcohols and mono-acids) to S25 (medium alcohols but not mono-acids) and S46 (long mono-acids but not alcohols). The ORs that recognize mono-acids (S18, S19 and S46) have a His in position TM3-6, a short hydrophobic side chain residue (V, A or G) at position TM3-10 and a Pro in position TM6-19, while the only OR in the set that does not recognize acids (S25) has a Val in TM3-6, a polar Thr at position TM3-10 and another Thr in TM6-19. The Thr284 at position TM6-19 in S25 is hydrogen-bonded to Lys302 (EC3), which is the anchor residue for the hydroxyl group. The two ORs that recognize both alcohols and mono-acids (S18 and S19) have Phe at position TM3-9, which is occupied by a Thr in the acids-only S46 and by a Gly in the alcohols-only S25.

 
Comparing the binding sites for diacids in the homologues S6 and S50

The olfactory receptors S6 and S50 share 94% sequence identity. The structural homology of the MembStruk predicted structures for S6 and S50 is high showing a CRMS difference of only 1.5Å in the TM domain. The main deviations are in the loop conformations, which are not significant since the loops are conformationally flexible. S50 recognizes only azelaic acid (C9 diacid), while S6 recognizes both azelaic acid and suberic acid (C8 diacid). The mutations from S50 to S6 responsible for changing the activity are in TM2 (Leu82–Val), TM3 (Val112–Ile, Ser113–Val, Met114–Phe) and TM5 (Ala202–Val, Ile205–Thr, Ile206–Val, Thr208–Ile, Thr220–Ala).

Figures 4 and 5 show that of the 13 residues found within 3.5 Å of agonist ligands, only four residues (Tyr109, Leu204, Arg207 and Ser261) are common to both S6 and S50. We find that a single mutation at Ser113 (TM3)–Val, in S50 to S6 is directly involved in odorant binding. However, 8 out of 16 mutations between S50 and S6 are in TMs 3 and 5 and are involved in forming the shape of the active odorant binding site in S50 and S6. Another structural difference is that S6 has significant contributions from residues in EC2, which in S6 is somewhat folded towards the opening of the helix barrel, whereas it is in an ‘open’ conformation for S50. Since loops are the least accurate parts of our predicted 3-D structures, loop residues identified with binding should be taken with some reservation. It should be noted, however, that EC2 is folded into the binding site in the crystal structure of rhodopsin with bound cis-retinal.

The binding site for alcohols in S25 compared with previous work

The calculated binding energy profile of the alcohol binding receptor S25 shows good agreement with the experimental profile as described in above. The binding energies reported here for odorants in S25 use the refined HierDock protocol described in this paper. The absolute binding energies are different from the previous published work (Floriano et al., 2000Go), but the relative binding energy profile remains identical. Both studies lead to very good agreement between calculated and experimental profiles in terms of the chemical class recognized by the OR and, within that chemical class, the ranking of ligands by their likelihood of activating the receptor. We also found the same residues to be involved in binding and the same critical binding feature in the hydrogen bond (HB) to Lys302.

Comparing the binding modes of acids and alcohols with ORs S18, S19 and S46

The binding site of mono-acids, bromoacids and alcohols in this family involves residues in TMs 3, 4, 5 and 6, and EC2. Odorants bind with their long axis generally parallel to the TMs, with their carboxyl group pointing toward the extracellular side of the helical barrel. Moreover their orientation is consistent with the idea that the N-terminal region of an alpha helix favors the binding of negatively charged ligands because of stabilization due to the positive charge of the helix dipole (Hol et al., 1978Go). The binding mode (Figure 4) of alcohols to S19 involves a HB to Ser140 (TM3), while the binding mode for acids has His133 (TM3) as the anchor point. The hydrophobic tails of alcohols and acids are bound similarly, with Leu190 (TM4), Ile191 (TM4), Leu194 (EC2), Ile 281 (TM6), Pro282 (TM6) and Phe285 (TM6) providing a hydrophobic pocket around it. The hydroxyl and carboxyl groups point in opposite directions in order to reach their anchors. This shows that the same OR can bind to two different odorants in two different binding modes.

Comparing critical residues for binding to available data for other GPCRs

Above, we listed common positions in the ORs sequence alignment that we identify as critical for odor recognition and discussed their agreement with sequence hypervariability intrinsic to OR function. Now the question is how well these positions compare to experimental data for ligand binding to other GPCRs. For this comparison we chose a set of family A type (rhodopsin-like) GPCRs that have abundant experimental data because of their pharmaceutical relevance: muscarinic acetylcholine receptor, ß2-adrenergic receptor, dopamine receptor, type-1B angiotensin II receptor and purinergic receptor. We mapped reported experimental data (Strader et al., 1994Go; Wieland et al., 1995Go; Bourdon et al., 1997Go; Alberts et al., 1998Go; Allman et al., 2000Go; Klabunde and Hessler, 2002Go; Bissantz et al., 2003Go) on residue positions involved in binding for those receptors and then compared their important positions for binding to the ones identified here for ORs. These are shown in Figure 7.



View larger version (30K):
[in this window]
[in a new window]
 
Figure 7 Comparison between residues predicted as critical for odorant recognition in ORs and residues experimentally found to be involved in agonist binding for other GPCRs. Transmembrane domains for rhodopsin (1F88), muscarinic acetylcholine receptor (acm1_human), ß2-adrenergic receptor (b2ar_human), dopamine receptor (d3dr_human), type-1B angiotensin II receptor (ag2s_rat) and purinergic receptor (p2yr_human) correspond to the regions assigned in the swiss-prot files. For each OR, positions found to be involved in binding are in underlined font with consensus positions (i.e. common to all six cases presented in this paper) bold underlined. TM residues found within 3.5 Å of retinal in the rhodopsin structure 1F88 are underlined in the rhodopsin sequence. Residues experimentally found to be involved in binding (Strader et al., 1994; Wieland et al., 1995; Bourdon et al., 1997; Alberts et al., 1998; Allman et al., 2000; Klabunde and Hessler, 2002; Bissantz et al., 2003) for muscarinic acetylcholine (Allman et al., 2000), ß2-adrenergic receptor, dopamine, type-1B angiotensin II (Klabunde and Hessler, 2002) and purinergic receptors (Klabunde and Hessler, 2002) are also bold underlined in their respective sequences. The Trp residue half way through TM4 that is conserved in most GPCR (Rhee et al., 2000) is not conserved in ORs. In fact, for the six ORs studied here, only S18 has this Trp. The binding sites in the ORs were predicted based on interaction energies between odorants and ORs with no use of experimental information. All four OR consensus positions in TM3 have been implicated in ligand binding for other GPCRs. The OR consensus position in TM6 is in the same region of the helix as in the other receptors; however, not exactly in the same position. One of the two residues in TM5 that we found to be involved in binding for some but not all six ORs studied here is also in good agreement with the data for the other GPCRs.

 
The sequences in Figure 7 were aligned as follows (no gaps were allowed in the TMs):

  1. 1. TM3 sequences were aligned based on the conserved pattern (D, E, or H)RY;
  2. 2. TM4 was aligned based on the conserved Trp;
  3. 3. TMs 5 and 6 were aligned using clustalW (Thompson et al., 1994Go).

The Trp residue half way through TM4 that is conserved in most GPCR (Rhee et al., 2000Go) is not conserved in ORs. In fact, out of the six ORs studied here, only S18 has this Trp, so we used S18 for aligning TM4 of the ORs to the other GPCRs. The alignment for ORs and rhodopsin shown in Figure 4 was based on the complete amino acid sequences and hence it differs from the TM-based alignment for rhodopsin TMs in Figure 7.

All four OR consensus positions in TM3 have been implicated in ligand binding for other GPCRs. The OR consensus position in TM6 is in the same region of the helix as for the other receptors; however, it is not in exactly the same position. One of the two residues in TM5 that we found to be involved in binding for some but not all six ORs studied here is also in good agreement with the data for the other GPCRs. The good agreement between our predictions and the available data for these other GPCRs is one indication that our methods provide accurate structures and binding site, since the binding sites in the ORs were predicted based on interaction energies between odorants and ORs with no use of experimental information.


    Screening odorant libraries to find potential agonists
 Top
 Abstract
 Introduction
 Materials and methods
 Results and discussion
 Screening odorant libraries to...
 Other ORs sharing similar...
 Summary and conclusions
 Acknowledgements
 References
 
We docked all ligands shown in Table 1B to all six OR structures studied here and sorted them in order of decreasing binding affinity. The top 10 potential ligands by binding affinity for each OR are shown in Table 5. These ligands are candidates to serve as agonists or antagonists to these ORs. We included sugars in our screening set, even though they are generally odorless, in order to provide data for comparing the binding affinities to ORs of odorants versus tastants (such as glucose, a natural agonist for taste receptors, which are functionally related members of the GPCR family). This may provide insights into how function diversity evolved in the GPCR family, and more specifically, among the chemosensory receptors.


View this table:
[in this window]
[in a new window]
 
Table 5 Suggested ligands for the six olfactory receptors
 
Binding of a ligand to an OR is a necessary but not sufficient condition for activation and hence calculated binding energies alone cannot predict which molecules will elicit response in vivo. A ligand could bind strongly and not activate the G protein. Also, the calculated binding energies do not take into account many physical–chemical properties that may differentiate structurally similar ligands in vivo. However, we have shown in the previous sections that there is correlation between calculated binding energies and the chemical classes recognized by each OR. The binding energies also indicate very well odorants most likely to activate the receptor within each of the four odorant classes of Malnic, even though the energy distribution may not clearly separate active and inactive ligands. In addition, there is some correlation when comparing ligands of different classes. Thus we assume that ligands with better (more negative) calculated binding energies than the best experimental positives are likely to activate the ORs, while ligands with less favorable binding energy than the best experimental positives are not likely to activate the ORs. These ligands can be tested experimentally.

The experimental procedure used in identifying the six ORs studied here may lead to functional relationships. Experimentally, calcium imaging was used (Malnic et al., 1999Go) to select from 647 mouse olfactory neurons from the dorsal nasal septum those that responded to one or more of the odorants in Table 1A. Then, RT–PCR was used to identify the OR genes expressed by the responsive neurons. Odorants were first tested at 100 µM, then at 10 and 1 µM, depending on response. Consequently, only ORs sharing affinity to those odorants were selected and identified, making them likely to share affinity for other odorants.

We find that some ligands have favorable binding energy to more than one of the six OR studied here. This may arise because this set of ORs was selected by their common affinity to aliphatic alcohols and acids with 6–9 carbons. This may explain their common preference for the aldehydes and esters in Table 5.

The most frequent hits are (number of ORs for which they rank among the top 10 binders/total number of ORs considered between parentheses):

  • furaneol glucoside (6/6);
  • ethyl vanillin (6/6);
  • glucose (5/6);
  • furaneol (2,5-dimethyl-4-hydroxy-3(2H)-furanone) (4/6);
  • mesifurane (4/6);
  • eugenol (4/6);
  • decanol (4/6);
  • tridecanal (3/6);
  • sec-butyl-3-methoxypyrazine (3/6).

Interestingly, the ligands showing the best calculated binding affinities to the ORs studied in our work are important aroma and/or flavor components of nutrient sources mice are known to like, such as fruits, vegetables and cheese (Bramwell et al., 1969Go; Buttery et al., 1969Go, 1990; Murray et al., 1970Go; Maga and Sizer, 1973Go; Cronin and Stanton, 1976Go; Ohloff, 1978Go; Wu et al., 1991Go; Duke, 1992Go; Belay and Poole, 1993Go). This is also true for the Malnic compounds (Table 1A) that elicit experimental response from these ORs (Murray et al., 1970Go; Pabst et al., 1991Go; Wu et al., 1991Go; Teranishi et al., 1992Go; Krammer et al., 1994Go; Sanz et al., 1995Go; Forney, 2001Go; Jordan et al., 2001Go; Suriyaphan et al., 2001Go; Jackson and Linskens, 2002Go; Lavid et al., 2002Go). In particular, furaneol and its methoxy derivative (mesifurane) are among the most important volatile compounds in the aroma of strawberries (Sanz et al., 1995Go; Forney, 2001Go; Lavid et al., 2002Go). The concentration of these compounds along with the furaneol glucoside derivative, sharply increase during fruit ripening, with maximum values at the ripe stage (Forney, 2001Go; Lavid et al., 2002Go). Furaneol is also an important component in the aroma and flavor of many fruits including pineapples (Wu et al., 1991Go), tomatoes (Krammer et al., 1994Go), mangos (Lavid et al., 2002Go), grapes (Gunata et al., 1985Go) and raspberries (Pabst et al., 1991Go).

Our results support the idea that natural agonists for ORs are likely to be found among environmentally related substances.

The high affinity of the glycosidically bound furaneol to all six ORs studied here may indicate the complex role ORs play in flavor perception. Glycosidically bound aroma compounds are considered important flavor precursors (Gunata et al., 1985Go; Pabst et al., 1991Go; Wu et al., 1991Go; Krammer et al., 1994Go; Jackson and Linskens, 2002Go), and binding of those compounds to ORs may not lead to specific odor descriptor but instead to an overall ‘pleasant’ sensation.


    Other ORs sharing similar odorant preferences
 Top
 Abstract
 Introduction
 Materials and methods
 Results and discussion
 Screening odorant libraries to...
 Other ORs sharing similar...
 Summary and conclusions
 Acknowledgements
 References
 
One question that arises from the theory of combinatorial coding for odors is how many receptors are used to uniquely identify an odorant. We cannot answer this question until we finish predicting the structure and odorant binding of all ORs in the mouse or in the human genomes. However, we can provide a rough estimate of how many ORs could potentially recognize the same class of chemical compounds as the six ORs studied here. To do this we derived simplified recognition patterns for identification of aliphatic alcohol and aliphatic acid from the amino acids we found to be directly involved in the binding pocket of these odorants in the six ORs we studied. We found that these binding sites lead to specific positions in the ORs sequence alignment (see Table 4). We will now use these patterns to identify related ORs by pattern matching. In order to do this, we searched for all sequences in the TrEMBL database (http://ca.expasy.org/sprot/sprot-search.html) matching ‘mouse’ in the organism and ‘MOR[number]’ in the descriptor fields. The sequences resulting from this query were extracted from the database. Incomplete and redundant sequences (homology >97%) were eliminated from the set. Sequence alignment was performed using ClustalW (Thompson et al., 1994Go) and the TM assignment for S25 was used to predict the TM domains in the alignment. Sequence alignment was also used to identify the ‘S’ olfactory receptors in the TrEMBL set. The final set had 869 mouse OR sequences.

The pattern (shown in Table 4) of important amino acids derived for binding of aliphatic acids with 6–9 carbons to S18, S19 and S46, is:

  • His at position 6 in TM3;
  • any non-polar or Gly residues at TM3/10;
  • Ser, Tyr or Thr at TM3/13, Pro at TM6/19;
  • Phe at TM6/22.

We found that 32 out of 869 OR sequences match this pattern making them potential receptors for similar aliphatic acids. The TrEMBL codes for these sequences are given in Table 6.

The pattern for recognition of aliphatic alcohols with 6–9 carbons (shown in Table 4) was derived as:

  • non-polar residue or Gly at TM3/9;
  • Ser at TM3/13;
  • polar or Pro residues at TM6/19;
  • Phe at TM6/22.

This pattern is satisfied by 36 sequences from the 869 set. The TrEMBL codes for these sequences are given in Table 6. Of these 36, 12 were in common with the 32 recognizing acids.


    Summary and conclusions
 Top
 Abstract
 Introduction
 Materials and methods
 Results and discussion
 Screening odorant libraries to...