Skoltech CDISE assistant professor Petr Popov and PhD student Igor Kozlovskii developed a new computational approach for spatio-temporal detection of binding sites in proteins by applying deep learning algorithms and computer vision to protein structures treated as 3D images. With this new technology, one can detect even elusive sites: for instance, scientists managed to detect binding sites concealed in experimental atomic structures or formed by several protein molecules for the ion channel, G protein-coupled receptor, and the epithelial growth factor, one of the most important drug targets.
Petr Popov, the study lead and assistant professor at Skoltech, comments: “The human genome consists of nearly 20,000 proteins, and very few among them get associated with a pharmacological target. Our approach allows searching the protein for binding sites for drug-like compounds, thus expanding the array of possible pharmacological targets. Besides, initial structure-based drug discovery strongly depends on the choice of the protein’s atomic structure.
Working on a structure with the binding site barred for the drug or missing altogether can fail. Our method enables analyzing a large number of structures in a protein and finding the most suitable one for a specific stage.”
According to Igor Kozlovskii, the first author of the paper, BiteNet outperforms its counterparts both in speed and accuracy: “BiteNet is based on the computer vision, we treat protein structures as images, and binding sites as objects to detect on this images. It takes about 0.1 seconds to analyze one spatial structure and 1.5 minutes to evaluate 1,000 protein structures of about 2,000 atoms.”
Identification of novel protein binding sites expands druggable genome and opens new opportunities for drug discovery. Generally, presence or absence of a binding site depends on the three-dimensional conformation of a protein, making binding site identification resemble the object detection problem in computer vision.
Here we introduce a computational approach for the large-scale detection of protein binding sites, that considers protein conformations as 3D-images, binding sites as objects on these images to detect, and conformational ensembles of proteins as 3D-videos to analyze.
BiteNet is suitable for spatiotemporal detection of hard-to-spot allosteric binding sites, as we showed for conformation-specific binding site of the epidermal growth factor receptor, oligomer-specific binding site of the ion channel, and binding site in G protein-coupled receptor. BiteNet outperforms state-of-the-art methods both in terms of accuracy and speed, taking about 1.5 minutes to analyze 1000 conformations of a protein with ~2000 atoms.