Venn diagram showing the scope of the human druggable genome.
Druggability is a term used in drug discovery to describe a biological target (such as a protein) that is known or predicted to bind with high affinity to a drug. Importantly, binding of the drug to the target must result in a functional change that provides a therapeutic benefit to the patient. In other words, the target must be disease-modifying. The concept of druggability is most commonly applied to the ability of drug targets to bind small molecules—low molecular weight organic compounds.[1] However, the term has also been extended to encompass biologic medical products, such as therapeutic monoclonal antibodies.
The term “druggable genome” was originally coined by Hopkins et al. to describe proteins with genetic sequences similar to those of known drug targets and capable of binding "rule of five"-compliant small molecules.[2] Related concepts include “ligandability”, “bindability”, and “(chemical) tractability”.[3][4]
Drug discovery involves a series of stages that progress from a biological hypothesis to an approved drug. The process typically begins with target identification. Candidate targets may be selected based on various experimental criteria, including disease linkage (e.g. mutations in the protein are known to cause disease), mechanistic rationale (e.g. the protein is part of a pathway implicated in disease), or evidence from genetic screens in model organisms.[5] However, disease relevance alone is not sufficient for a protein to serve as a drug target, the target must also be druggable.
Prediction of druggability
If a drug has already been identified for a target, that target is by definition druggable. If no known drugs bind to a target, then druggability is implied or predicted using different methods that rely on evolutionary relationships, 3D-structural properties or other descriptors.[6]
Precedence-based
A protein is predicted to be "druggable" if it is a member of a protein family[7] for which other members of the family are known to be targeted by drugs (i.e., "guilt" by association). While this is a useful approximation of druggability, this definition has limitations for two main reasons: (1) it highlights only historically successful proteins, ignoring the possibility of a perfectly druggable, but yet undrugged protein family; and (2) assumes that all protein family members are equally druggable.[citation needed]
Structure-based
This relies on the availability of experimentally determined 3D structures or high quality homology models. A number of methods exist for this assessment of druggability but all of them consist of three main components:[8][9][10][11]
Identifying cavities or pockets on the structure
Calculating physicochemical and geometric properties of the pocket
Assessing how these properties fit a training set of known druggable targets, typically using machine learning algorithms
Early work on introducing some of the parameters of structure-based druggability came from Abagyan and coworkers[12] and then Fesik and coworkers,[13] the latter by assessing the correlation of certain physicochemical parameters with hits from an NMR-based fragment screen. There has since been a number of publications reporting related methodologies.[8][14][15]
There are several commercial tools and databases for structure-based druggability assessment. A publicly available database of pre-calculated druggability assessments for all structural domains within the Protein Data Bank (PDB) is provided through the ChEMBL's DrugEBIlity portal.[16]
Structure-based druggability is usually used to identify suitable binding pocket for a small molecule; however, some studies have assessed 3D structures for the availability of grooves suitable for binding helical mimetics.[17] This is an increasingly popular approach in addressing the druggability of protein-protein interactions.[18]
Predictions based on other properties
As well as using 3D structure and family precedence, it is possible to estimate druggability using other properties of a protein such as features derived from the amino-acid sequence (feature-based druggability)[6] which is applicable to assessing small-molecule based druggability or biotherapeutic-based druggability or the properties of ligands or compounds known to bind the protein (Ligand-based druggability).[19][20]
The importance of training sets
All methods for assessing druggability are highly dependent on the training sets used to develop them. This highlights an important caveat in all the methods discussed above: which is that they have learned from the successes so far. The training sets are typically either databases of curated drug targets;[21][22] screened targets databases (ChEMBL, BindingDB, PubChem etc.); or on manually compiled sets of 3D structure known by the developers to be druggable. As training sets improve and expand, the boundaries of druggability may also be expanded.
Undruggable targets
About 3% of human proteins are known to be "mode of action" drug targets, i.e., proteins through which approved drugs act.[23] Another 7% of the human proteins interact with small molecule chemicals.[23] Based on DrugCentral, 1795 human proteins annotated to interact with 2455 approved drugs.[24]
Furthermore, it is estimated that only 10-15% of human proteins are disease modifying while only 10-15% are druggable (there is no correlation between the two), meaning that only between 1 and 2.25% of disease modifying proteins are likely to be druggable. Hence it appears that the number of new undiscovered drug targets is very limited.[25][26][27]
A potentially much larger percentage of proteins could be made druggable if protein–protein interactions could be disrupted by small molecules. However the majority of these interactions occur between relatively flat surfaces of the interacting protein partners and it is very difficult for small molecules to bind with high affinity to these surfaces.[28][29] Hence these types of binding sites on proteins are generally thought to be undruggable but there has been some progress (by 2009) targeting these sites.[30][31]
Chemoproteomics techniques have recently expanded the scope of what is deemed a druggable target through the identification of covalently modifiable sites across the proteome.[32]
^Hopkins AL, Groom CR (2002). "The druggable genome". Nature Reviews Drug Discovery. 1 (9): 727–730. doi:10.1038/nrd892. PMID12209152.
^Edfeldt FN, Folmer RH, Breeze AL (April 2011). "Fragment screening to predict druggability (ligandability) and lead discovery success". Drug Discovery Today. 16 (7–8): 284–287. doi:10.1016/j.drudis.2011.02.002. PMID21315179.
^Abi Hussein H, Geneix C, Petitjean M, Borrel A, Flatters D, Camproux AC (1 February 2017). "Global vision of druggability issues: applications and perspectives". Drug Discovery Today. 22 (2): 404–415. doi:10.1016/j.drudis.2016.11.021. ISSN1359-6446. PMID27939283.
^ abAl-Lazikani B, Gaulton A, Paolini G, Lanfear J, Overington J, Hopkins A (2007). "The Molecular Basis of Predicting Druggability". In Wess G, Schreiber SL, Kapoor TM (eds.). Chemical Biology: From Small Molecules to Systems Biology and Drug Design. Vol. 1–3. Weinheim: Wiley-VCH. pp. 804–823. ISBN978-3-527-31150-7.
^ abHalgren TA (February 2009). "Identifying and characterizing binding sites and assessing druggability". Journal of Chemical Information and Modeling. 49 (2): 377–389. doi:10.1021/ci800324m. PMID19434839.
^Nayal M, Honig B (June 2006). "On the nature of cavities on protein surfaces: application to the identification of drug-binding sites". Proteins. 63 (4): 892–906. doi:10.1002/prot.20897. PMID16477622. S2CID23887061.
^Seco J, Luque FJ, Barril X (April 2009). "Binding site detection and druggability index from first principles". Journal of Medicinal Chemistry. 52 (8): 2363–2371. doi:10.1021/jm801385d. PMID19296650.
^An J, Totrov M, Abagyan R (2004). "Comprehensive identification of "druggable" protein ligand binding sites". Genome Informatics. International Conference on Genome Informatics. 15 (2): 31–41. PMID15706489.
^Hajduk PJ, Huth JR, Fesik SW (April 2005). "Druggability indices for protein targets derived from NMR-based screening data". Journal of Medicinal Chemistry. 48 (7): 2518–2525. doi:10.1021/jm049131r. PMID15801841.
^Schmidtke P, Barril X (August 2010). "Understanding and predicting druggability. A high-throughput method for detection of drug binding sites". Journal of Medicinal Chemistry. 53 (15): 5858–5867. doi:10.1021/jm100574m. PMID20684613.
^Barelier S, Krimm I (August 2011). "Ligand specificity, privileged substructures and protein druggability from fragment-based screening". Current Opinion in Chemical Biology. 15 (4): 469–474. doi:10.1016/j.cbpa.2011.02.020. PMID21411360.
^Overington JP, Al-Lazikani B, Hopkins AL (December 2006). "How many drug targets are there?". Nature Reviews. Drug Discovery. 5 (12): 993–996. doi:10.1038/nrd2199. PMID17139284. S2CID11979420.
^Buchwald P (October 2010). "Small-molecule protein-protein interaction inhibitors: therapeutic potential in light of molecular size, chemical space, and ligand binding efficiency considerations". IUBMB Life. 62 (10): 724–731. doi:10.1002/iub.383. PMID20979208. S2CID205970009.
^Morelli X, Bourgeas R, Roche P (August 2011). "Chemical and structural lessons from recent successes in protein-protein interaction inhibition (2P2I)". Current Opinion in Chemical Biology. 15 (4): 475–481. doi:10.1016/j.cbpa.2011.05.024. PMID21684802.
^Verdine GL, Walensky LD (December 2007). "The challenge of drugging undruggable targets in cancer: lessons learned from targeting BCL-2 family members". Clinical Cancer Research. 13 (24): 7264–7270. doi:10.1158/1078-0432.CCR-07-2184. PMID18094406. S2CID7918779.
^Arkin MR, Whitty A (June 2009). "The road less traveled: modulating signal transduction enzymes by inhibiting their protein-protein interactions". Current Opinion in Chemical Biology. 13 (3): 284–290. doi:10.1016/j.cbpa.2009.05.125. PMID19553156.