Three-dimensional integral equation theory for predicting protein-ligand binding affinities
Biophysics
Final Report Abstract
Understanding and modeling the interaction between a protein and a ligand is an essential early step in drug discovery and development. In order to reproduce and predict both structural data such as binding modes and thermodynamic quantities like binding affinities, a computational model has to account for the complexity of the binding process. In practice this means that we have to consider the following issues: knowledge about structures and protonation as well as tautomerization states, definition of interaction energy terms between the complex partners and between solutes and water/electrolyte as a solvent, adequate and sufficient reproduction of naturally fluctuating molecular geometries (sampling problem), conversion of sampled structures into a thermodynamic affinity expression (binding free energy), and, finally, benchmarking of model data with respect to appropriately chosen experimental reference quantities. Despite many well-known conceptual difficulties, molecular dynamics (MD) simulations with specific extensions to determine the binding free energy are de facto the most reliable theoretical method presently available for studying binding processes in solution, though at a high price to pay for the computational effort. We therefore proposed to develop and apply an alternative theoretical platform for answering all question related to complex formation, socalled liquid state theory in the form of the three-dimensional (3D) “reference interaction site model” (RISM) integral equation formalism. Briefly, 3D RISM theory represents one specific class of approximations within the family of liquid state theories that yields solute-solvent (uv) site distribution functions on a 3D grid for a given structural solute model and solute-solvent interaction potential. Such an approach can in principle compete with MD simulations in that thermodynamic quantities can be computed at a fraction of computational cost. However, a number of conceptual and technical problems needed to be addressed in the present project. We specifically focused on (a) the numerical problem to solve the integral equations efficiently, (b) the 3D RISM-based calculation of (local) binding free energy data in the form of the socalled “potential of mean force” (PMF) that defines the energy surfaces governing the binding process, and (c) the complete physicochemical characterization of small molecules with pharmaceutical relevance. Subproject (a) was tackled by developing and implementing novel theory and numerical approximations with particular emphasis on the problem of long-range (Coulomb) interactions which are problematic to handle on relatively small, finite 3D grids. While it was not yet possible to solve the ultimate goal, the molecular solute-solute equation in a numerically stable and physically plausible manner, we succeeded to lay the theoretical groundwork that, at the same time, improved the computational speed and numerical precision of 3D RISM calculations considerably. Part (b) was therefore restricted to benchmarking calculations on model cases comprising molecular hosts binding atomic, possibly charged, guest partners. It could be shown which kind of RISM-based theory agrees best with reference MD simulation. As an added value, it was possible to define and test so-called binding “free energy derivatives” (FED) with respect to complex interaction parameters, thus paving the way for rational design in “chemical property space”. Problem area (c) was addressed by coupling electronic structure calculations to integral equation theory in the form of the “embedded cluster” (EC-RISM) approach. We contributed to the full range of relevant questions, such as the prediction of acidity, tautomer preference, distribution coefficients, and conformational equilibria, in aqueous and non-aqueous phases as well as within bound configurations in a complex. The performance of the methodology was critically assessed by contributions to several blind prediction contests where a number of groups were asked to test their methods on unknown compounds, revealing concrete problem areas for further improvements. In summary, we have now established the conceptual and computational basis for addressing the next major step, molecular complexes on the basis of accurate small molecule properties.
Publications
- Prediction of tautomer ratios by embedded-cluster integral equation theory. J. Comput.-Aided Molec. Des. 2010, 24, 343-353
Kast, S. M.; Heil, J.; Güssregen, S.; Schmidt, K. F.
(See online at https://dx.doi.org/10.1007/s10822-010-9340-x) - Acidity in DMSO from the embedded cluster integral equation quantum solvation model. J. Mol. Model. 2014, 20, 2161
Heil, J.; Tomazic, D.; Egbers, S.; Kast, S. M.
(See online at https://doi.org/10.1007/s00894-014-2161-4) - 3D RISM theory with fast reciprocal-space electrostatics. J. Chem. Phys. 2015, 142, 114107
Heil, J.; Kast, S. M.
(See online at https://doi.org/10.1063/1.4914321) - Integral equation theory as a solvation model for classical and quantum solute systems. In Computational Trends in Solvation and Transport in Liquids; Sutmann G.; Grotendorst, J.; Gompper, G.; Marx, D., Eds. IAS Series Vol. 28: Jülich, Deutschland, 2015, pp. 419-434
Kast, S. M.; Heil, J.; Hoffgaard, F.
- Targeting drug resistance in EGFR with covalent inhibitors: a structure-based design approach. J. Med. Chem. 2015, 58, 6844-6863
Engel, J.; Richters, A.; Getlik, M., Tomassi, S.; Keul, M.; Termathe, M.; Lategahn, J.; Becker, C.; Mayer-Wrangowski, S.; Grütter, C.; Uhlenbrock, N.; Krüll, J.; Schaumann, N.; Eppmann, S.; Kibies, P.; Hoffgaard, F.; Heil, J.; Menninger, S.; Ortiz-Cuaran, S.; Heuckmann, J. M.; Tinnefeld, V.; Zahedi, R. P.; Sos, M. L.; Schultz-Fademrecht, C.; Thomas, R. K.; Kast, S. M.; Rauh, D.
(See online at https://doi.org/10.1021/acs.jmedchem.5b01082) - Designing molecular complexes by free energy derivatives from liquid state integral equation theory. J. Phys.: Cond. Matter 2016, 28, 344004
Mrugalla, F.; Kast, S. M.
(See online at https://doi.org/10.1088/0953-8984/28/34/344004) - The SAMPL5 challenge for embedded-cluster integral equation theory: solvation free energies, aqueous pKa, and cyclohexane-water log D. J. Comput.-Aided Molec. Des. 2016
Tielker, N.; Tomazic, D.; Heil, J.; Kloss, T.; Ehrhart, S.; Güssregen, S.; Schmidt, K. F.; Kast, S. M.
(See online at https://doi.org/10.1007/s10822-016-9939-7)