Sampling Protocols

Introduction

Much of the work in IMP involves trying to find conformations which minimize a scoring function. The scoring function is composed of scoring terms and the space of possible conformations is called the conformation or sampling space.

Most of the scoring functions used are on points or spheres in 3D and have terms which limit the packing density (excluded volume, VDW, sterics etc).

Protocols details

Most sampling protocols involve a combination of

Sampling pitfalls

  1. instability due to excessively high values of various terms of the scoring function
  2. rough scoring functions have many local minima in which protocols can get stuck

Representation manipulations

  1. use rigid bodies
    • reduces the dimensionality
  2. use a coarser grained representation of the entities, eg, represent a protein with 10 spheres rather than 1000 atoms
    • reduces the dimensionality
    • smooths the scoring function

Scoring function manipulations

  1. removing "detail" terms from the scoring function. Example include, removing excluded volume interactions or force fields.
    • this smooths the scoring function
    • it can also bring minima closer to the current conformation (by increasing the set of conformations which score at the minima)
  2. remove long range terms to allow the optimizer to find a conformation in which the short range ones are satisfied before (re)adding the long range ones
    • trying to satisfy both short range and long range terms simulatenously can be too much
    • satisfying the long range ones first can lead to a system which is too packed for the conformation to sort out the short range terms
    • perhaps best to do "after" finding conformations which satisfy the long range ones (just a guess)
  3. capping terms, such as replacing Harmonic with TruncatedHarmonic terms.

    • improves numerical and algorithmic stability of the optimizers
    • it is probably better to remove the terms entirely and add systematically
  4. scaling parameters of the system. For example, it is useful to scale the radius of particles down when there are many steric clashes
    • this removes many terms from the scoring function (since when using HarmonicLowerBound-style terms

    • reduces the magnitude of others to remove numerical instabilities and keep terms in balance
  5. remove high resolution terms from the scoring function. For example, remove all terms that refer to atoms and just deal with the residues. This is best accompanied by coarsening the representation.
    • doing it smooths the scoring function

Adding noise to the system

Adding noise to the system is a good way to expand the set of conformations searched and escape from local minima. Ways to add noise include

  1. randomized starting conformations
  2. Monte Carlo steps
  3. high temperature MD simulation

Optimization support in IMP

Currently supported methods

Future methods

IMP: optimization_protocols (last edited 2010-11-17 20:55:41 by DanielRussel)