| Question: | Find a subset of ligand atoms that is isomorphous with a subset of binding site descriptors. Isomorphism: in terms of internal distances and/or chemical nature |
Answer: Clique detection - part of graph theory
Graph Theory Terminology:
| graph | set of nodes and edges |
| nodes | objects ug = { ui } |
| edges | pairs of nodes eg = {(ui,uj)
| ui,uj edge indicates a relationship between the nodes |
| adjacency | presence of edge between nodes |
| subgraph | subset of nodes and edges of a graph |
| completely connected subgraph | subgraph with edges between all node pairs |
| clique | maximal completely connected subgraph |

Docking by clique detection:
Example:

.
Crippen algorithm: single docking graph (implemented
in DOCK 4)
|
node = pairing of ligand atom with protein
descriptor
edge = distance between 2 atoms equals distance between 2 descriptors |4 3| = |A B| or A4 is adjacent to B3 |4 3| = |B A| or B4 is adjacent to A3 |1 3| = |E B| = |B E| |1 4| = |E A| = |A E| |
.
|
number of possibilities |
= nCNP x nPNL |
| = NP! [n! (NP-n)]-1 x NL! (NL-n)-1 | |
| with: | NP = protein descriptor nodes NL = ligand nodes n = required matchings xCy = combinations (all groups of size x out of population y; order irrelevant) xPy = permutations (all groups of size x out of population y; order matters) |
e.g. if NP=50; NL=20; n=4 then 1011 possibilities
graph is a sparse matrix (typically 1 % filled) determined by distance tolerance and distance minimum
typically only 103-105 possibilities
allow only chemically compatible matches
e.g. exclude matching of H bond donor descriptor with carbonyl O