Show item record

dc.contributor.authorKleinman, Claudia
dc.contributor.authorRodrigue, Nicolas
dc.contributor.authorBonnard, Cécile
dc.contributor.authorPhilippe, Hervé
dc.contributor.authorLartillot, Nicolas
dc.date.accessioned2007-01-05T21:56:42Z
dc.date.available2007-01-05T21:56:42Z
dc.date.issued2006
dc.identifier.urihttp://hdl.handle.net/1866/659
dc.identifier.urihttp://www.biomedcentral.com/1471-2105/7/326
dc.format.extent1126890 bytes
dc.format.mimetypeapplication/pdf
dc.rightsCeci est un article en accès libre diffusé sous une licence Creative Commons Paternité laquelle permet une libre utilisation, diffusion et reproduction de l'article sous toutes formes, à la condition de l'attribuer à l'auteur en citant son nom. This is an open access article distributed under the terms of the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
dc.rights.urihttp://creativecommons.org/licenses/by/2.0
dc.titleA maximum likelihood framework for protein design
dc.typeArticle
dc.contributor.affiliationUniversité de Montréal. Faculté de médecine. Département de biochimie et médecine moléculairefr
dc.identifier.doi10.1186/1471-2105-7-326
dcterms.abstractBACKGROUND:The aim of protein design is to predict amino-acid sequences compatible with a given target structure. Traditionally envisioned as a purely thermodynamic question, this problem can also be understood in a wider context, where additional constraints are captured by learning the sequence patterns displayed by natural proteins of known conformation. In this latter perspective, however, we still need a theoretical formalization of the question, leading to general and efficient learning methods, and allowing for the selection of fast and accurate objective functions quantifying sequence/structure compatibility.RESULTS:We propose a formulation of the protein design problem in terms of model-based statistical inference. Our framework uses the maximum likelihood principle to optimize the unknown parameters of a statistical potential, which we call an inverse potential to contrast with classical potentials used for structure prediction. We propose an implementation based on Markov chain Monte Carlo, in which the likelihood is maximized by gradient descent and is numerically estimated by thermodynamic integration. The fit of the models is evaluated by cross-validation. We apply this to a simple pairwise contact potential, supplemented with a solvent-accessibility term, and show that the resulting models have a better predictive power than currently available pairwise potentials. Furthermore, the model comparison method presented here allows one to measure the relative contribution of each component of the potential, and to choose the optimal number of accessibility classes, which turns out to be much higher than classically considered.CONCLUSION:Altogether, this reformulation makes it possible to test a wide diversity of models, using different forms of potentials, or accounting for other factors than just the constraint of thermodynamic stability. Ultimately, such model-based statistical analyses may help to understand the forces shaping protein sequences, and driving their evolution.en
dcterms.descriptionAffiliation: Claudia Kleinman, Nicolas Rodrigue & Hervé Philippe : Département de biochimie, Faculté de médecine, Université de Montréal
dcterms.isPartOfurn:ISSN:1471-2105
UdeM.VersionRioxxVersion acceptée / Accepted Manuscript
oaire.citationTitleBMC bioinformatics
oaire.citationVolume7


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show item record

Ceci est un article en accès libre diffusé sous une licence Creative Commons Paternité laquelle permet une libre utilisation, diffusion et reproduction de l'article sous toutes formes, à la condition de l'attribuer à l'auteur en citant son nom. This is an open access article distributed under the terms of the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Usage rights : Ceci est un article en accès libre diffusé sous une licence Creative Commons Paternité laquelle permet une libre utilisation, diffusion et reproduction de l'article sous toutes formes, à la condition de l'attribuer à l'auteur en citant son nom. This is an open access article distributed under the terms of the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.