Exploring chemical space for epigenetic drug discovery using AI

Gul, Sheraz; Tegin Sahin, Alp

Posted: 17 December 2019 | Alp Tegin Sahin (Fraunhofer-IME), Sheraz Gul (Fraunhofer Institute)

It is said that, on average, it takes a new drug 12 years to go from research lab to patient, with many thousands of candidates discarded along the way. Can artificial intelligence (AI) help to speed things up? Sheraz Gul and Alp Sahin provide an overview of an AI approach to accelerate epigenetic drug discovery.

Artificial intelligence (AI) is making inroads in drug discovery. One aim for its application has been to shrink the pre-clinical phase in drug discovery from a typical five years to less than one year. This would involve designing compounds with optimised physico-chemical, off-target liability, ADMET, pharmacokinetic and pharmacodynamic properties, together with novel intellectual property rights. The designed compound should be suitable for in vivo studies and would necessitate the exploitation of all available knowledge of the target. This article relates to the application of this approach to accelerate epigenetic drug discovery.

Figure 1: Typical in silico and in vitro workflow to identify a lead compound.

Epigenetic drug targets are particularly suited to AI studies because they have been implicated in diseases such as cancer and in complex biological processes such as ageing, where a multi-targeting approach may be required. Within the epigenetic drug target class, there are three main categories: writers, readers and erasers. The writers ‘mark’ histones and DNA by adding chemical groups including acetyl, phosphoryl and methyl. The readers recognise and act upon these modifications and the erasers remove them.¹ In some cases, drugs that modulate the activities of specific epigenetic targets have been approved for clinical use, eg, Vorinostat and Romidepsin, which inhibit the histone deacetylase class of enzymes.² Other members of this class of protein are being explored as potential therapeutic targets to treat metabolic and cardiovascular disease and cancer.³

To improve the chances of AI delivering compounds that are likely to be successful in vivo, maximum use should be made of existing in vitro (assays, screening campaigns, design and evaluation of compounds), in silico (structural studies to identify compounds that bind the target protein) and in vivo (animal and human) studies. A typical workflow for identifying a lead compound using in silico and in vitro methods is shown in Figure 1.

The use of AI in pre-clinical drug discovery offers the potential to reduce the cycle time for lead identification. The activities involved in the AI-driven workflow are as follows:

1. Analysis of databases

Interrogate all publicly available compound databases (eg, ChEMBL, PubChem, ZINC and DrugBank) and prepare an independent database of all available drug-like compounds. This will form the starting point for applying AI to design new compounds against the relevant drug targets.
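One way to picture the 'drug-like' filtering step is as a rule-based pass over the merged records. The sketch below is a minimal illustration rather than the authors' pipeline: it assumes each record already carries pre-computed descriptors and applies Lipinski's rule of five, tolerating at most one violation. The record fields, compound identifiers and descriptor values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Compound:
    """One merged database record; all field names are illustrative assumptions."""
    compound_id: str
    source: str          # e.g. "ChEMBL", "PubChem", "ZINC", "DrugBank"
    mol_weight: float    # Da
    logp: float
    h_donors: int
    h_acceptors: int


def lipinski_violations(c: Compound) -> int:
    """Count how many of Lipinski's four rule-of-five criteria are broken."""
    return sum([
        c.mol_weight > 500,
        c.logp > 5,
        c.h_donors > 5,
        c.h_acceptors > 10,
    ])


def drug_like(compounds, max_violations=1):
    """Keep unique compounds with at most `max_violations` rule-of-five violations."""
    seen = set()
    kept = []
    for c in compounds:
        if c.compound_id in seen:
            continue  # same compound reported by more than one database
        seen.add(c.compound_id)
        if lipinski_violations(c) <= max_violations:
            kept.append(c)
    return kept


# Hypothetical records with made-up descriptor values.
records = [
    Compound("CPD-001", "ChEMBL", 264.3, 1.9, 3, 3),
    Compound("CPD-002", "ZINC", 712.9, 6.4, 7, 12),
    Compound("CPD-001", "PubChem", 264.3, 1.9, 3, 3),   # duplicate entry
    Compound("CPD-003", "DrugBank", 540.7, 3.2, 2, 8),  # one violation
]

library = drug_like(records)
print([c.compound_id for c in library])
```

In practice, duplicates across databases would be matched on a canonical structure key rather than on an identifier string, and the descriptors would come from a cheminformatics toolkit; both are left out here to keep the sketch self-contained.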

2. Prioritisation of drug targets

The drug targets should be prioritised based upon the literature and databases with a focus on a) existing approved drugs (DrugBank), b) agents in clinical trials, c) structural information and d) druggability.
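The four criteria above lend themselves to a simple weighted score. The following sketch assumes a hypothetical evidence table with one boolean flag per criterion; the weights, target names and flags are placeholders chosen for illustration, not values taken from the literature or from this article.

```python
# Weights are illustrative assumptions, not values from the article.
WEIGHTS = {
    "approved_drug": 4.0,        # a) an approved drug exists (e.g. listed in DrugBank)
    "clinical_trial": 3.0,       # b) agents in clinical trials
    "structure_available": 2.0,  # c) structural information
    "druggable": 1.0,            # d) druggability
}


def priority_score(evidence: dict) -> float:
    """Sum the weights of every criterion the target satisfies."""
    return sum(w for key, w in WEIGHTS.items() if evidence.get(key, False))


def rank_targets(targets: dict) -> list:
    """Return (target, score) pairs, highest score first; ties broken by name."""
    scored = [(name, priority_score(ev)) for name, ev in targets.items()]
    return sorted(scored, key=lambda item: (-item[1], item[0]))


# Hypothetical evidence table; the flags are placeholders, not curated data.
targets = {
    "TARGET_A": {"approved_drug": True, "clinical_trial": True,
                 "structure_available": True, "druggable": True},
    "TARGET_B": {"clinical_trial": True, "structure_available": True},
    "TARGET_C": {"druggable": True},
}

for name, score in rank_targets(targets):
    print(f"{name}: {score}")
```

A flat weighted sum is only one possible choice; a real prioritisation might weight the strength of each line of evidence rather than its mere presence.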

3. Applying AI methods

This activity should draw on the AI techniques needed to design novel compounds that are potential modulators of the epigenetic drug targets. Applying AI at exascale – ie, at the computational rates achieved by supercomputers – will allow a) preparation of a virtual library of chemically accessible compounds to increase the probability of technical success of the project, b) parallel docking to defined targets of interest for which structural information is available, together with off-targets, and c) the AI-driven analysis of the results and design of novel compounds.
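The three steps a) to c) can be sketched at toy scale as follows. This is a structural illustration only: the building blocks are hypothetical labels, `mock_dock` is a stand-in that returns a hash-derived pseudo-score with no chemical meaning, and the selectivity margin is an arbitrary assumption. A real workflow would replace each piece with a docking engine and a trained model running on a cluster.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import product
import hashlib

# Hypothetical building blocks; real enumeration would use reaction-aware chemistry.
SCAFFOLDS = ["scaffoldA", "scaffoldB"]
R_GROUPS = ["Me", "OMe", "Cl"]
TARGETS = ["on_target", "off_target"]


def enumerate_library(scaffolds, r_groups):
    """a) Build a toy virtual library as every scaffold/R-group combination."""
    return [f"{s}-{r}" for s, r in product(scaffolds, r_groups)]


def mock_dock(job):
    """b) Placeholder for a docking run: returns a deterministic pseudo-score.

    The score is derived from a hash and carries no chemical meaning.
    """
    compound, target = job
    digest = hashlib.sha256(f"{compound}|{target}".encode()).digest()
    return compound, target, -(digest[0] / 255.0) * 12.0  # more negative = better


def screen(library, targets, workers=4):
    """Dock every compound against every target in parallel."""
    jobs = list(product(library, targets))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(mock_dock, jobs))


def select_hits(results, margin=2.0):
    """c) Keep compounds scoring better on-target than off-target by `margin`."""
    scores = {}
    for compound, target, score in results:
        scores.setdefault(compound, {})[target] = score
    hits = [c for c, s in scores.items()
            if s["on_target"] <= s["off_target"] - margin]
    return sorted(hits)


library = enumerate_library(SCAFFOLDS, R_GROUPS)
results = screen(library, TARGETS)
print(len(library), "compounds,", len(results), "docking jobs")
print("selective hits:", select_hits(results))
```

A thread pool keeps the example portable; genuinely compute-bound docking would more plausibly be distributed across processes or nodes, which is where exascale resources come in.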

4. Applying in vitro methods

Once novel compounds have been designed using the AI methods and synthesised, screening can be undertaken to confirm their activities. The focus at this stage should be on assay development and screening, particularly the development of primary bioassays, compound profiling, and automated data analysis and visualisation. In the case of epigenetic drug targets, many assays have been reported, which can facilitate the completion of this in vitro work (see Table 1).
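As an illustration of what automated data analysis for a primary bioassay might involve, the sketch below computes two widely used plate-level quantities: the Z'-factor, a measure of the separation between positive and negative controls, and percent inhibition for each test well. These metrics are not named in the article; they are shown here as a plausible example, and the plate readings and compound identifiers are made up.

```python
from statistics import mean, stdev


def z_prime(pos_controls, neg_controls):
    """Z'-factor: 1 - 3*(sd_pos + sd_neg) / |mean_pos - mean_neg|."""
    separation = abs(mean(pos_controls) - mean(neg_controls))
    if separation == 0:
        raise ValueError("control means are identical; Z' is undefined")
    return 1 - 3 * (stdev(pos_controls) + stdev(neg_controls)) / separation


def percent_inhibition(signal, neg_mean, pos_mean):
    """Scale a raw well signal so neg control = 0 % and pos control = 100 %."""
    return 100 * (neg_mean - signal) / (neg_mean - pos_mean)


# Made-up plate readings (arbitrary units).
neg = [1000, 1020, 980, 1010, 990]   # uninhibited enzyme (high signal)
pos = [100, 110, 90, 105, 95]        # fully inhibited enzyme (low signal)
wells = {"CPD-001": 550, "CPD-002": 960, "CPD-003": 140}

zp = z_prime(pos, neg)
print(f"Z' = {zp:.2f}")

neg_mean, pos_mean = mean(neg), mean(pos)
for cpd, signal in wells.items():
    print(cpd, f"{percent_inhibition(signal, neg_mean, pos_mean):.1f} % inhibition")
```

A Z' above roughly 0.5 is commonly taken to indicate an assay robust enough for screening, although the acceptable threshold depends on the assay format.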

Table 1: Assay types for epigenetic drug targets. MBT: malignant brain tumour; HAT: histone acetyltransferases; DNMT: DNA methyltransferases; PARP: poly (ADP-ribose) polymerase; PRMT: protein arginine methyltransferase; HDAC: histone deacetylase; HDM: histone demethylase; SIRT: sirtuin; PTP: protein tyrosine phosphatase.

5. Further development of hit compounds

The most promising compounds identified above should possess acceptable physico-chemical, off-target liability, ADME-toxicity, pharmacokinetic and pharmacodynamic properties together with novel intellectual property rights (Table 2).

The use of AI to quickly design compounds that are suitable for in vivo validation is now becoming a reality.²⁰⁻²⁴ Although the outputs of AI require confirmation using in vitro methods, it is anticipated that in the near future compounds will be designed almost entirely using AI methods, significantly shortening the length of time a project spends in the pre-clinical phase of drug discovery.

Table 2: Typical properties of a lead compound designed by AI.

About the authors

Sheraz Gul is an expert in drug discovery with experience gained in academia (University of London), industry (GlaxoSmithKline Pharmaceuticals) and the largest applied research organisation in Europe (Fraunhofer Institute). He is also an adjunct lecturer at NUI-Galway, Ireland and scientific co-founder of Transcriptogen Ltd. He has coordinated work packages in drug discovery projects that have attracted more than €7 million in funding. Since 2011, he has organised 42 drug discovery workshops across the globe and trained 780 scientists.

Alp Tegin Sahin is currently a visiting scientist at the Fraunhofer-IME, Germany. He is also studying Bioinformatics and Genetics at the Kadir Has University, Turkey and has a special interest in in silico and in vitro screening for epigenetic targets.

References

Prachayasittikul V, et al. Exploring the epigenetic drug discovery landscape. Exp Opin Drug Discov. 2017, 12, 345–362.
Bates SE, Robey RW, Piekarz RL. CCR 20th Anniversary Commentary: Expanding the Epigenetic Therapeutic Portfolio. Clin Can Res. 2015, 21, 2195–2197.
Gul S. Epigenetic assays for chemical biology and drug discovery. Clin Epigen. 2017, 9, 41.
Gillette TG, Hill JA. Readers, Writers, and Erasers. Circ Research. 2015, 116, 1245–1253.
Wagner JM, et al. Histone deacetylase (HDAC) inhibitors in recent clinical trials for cancer therapy. Clin Epigen. 2010, 1, 117–136.
Murga C, et al. G Protein-Coupled Receptor Kinase 2 (GRK2) as a Potential Therapeutic Target in Cardiovascular and Metabolic Diseases. Front Pharm. 2019, 10.
Zhan Y, et al. Development of novel cellular histone-binding and chromatin displacement assays for bromodomain drug discovery. Epigen Chrom. 2015, 8, 37.
Xue X, et al. Discovery of Benzo[cd]indol-2(1H)-ones as Potent and Specific BET Bromodomain Inhibitors: Structure-Based Virtual Screening, Optimization, and Biological Evaluation. J Med Chem. 2016, 59, 1565–1579.
Wong WR, et al. Autism-associated missense genetic variants impact locomotion and neuro development in Caenorhabditis elegans. Human Mol Gen. 2019, 28, 2271–2281.
Mielke JG, et al. Biochemical and functional characterization of diet-induced brain insulin resistance. J Neurochem. 2015, 93, 1568–1578.
Kim M, et al. Tudor Domain Containing Protein TDRD12 Expresses at the Acrosome of Spermatids in Mouse Testis. Asian-Australasian J Ani Sci. 2015, 29, 944–951.
Jha PK, et al. HAT2 mediates histone H4K4 acetylation and affects micrococcal nuclease sensitivity of chromatin in Leishmania donovani. PLoS One. 2017, 12:e0177372.
Mao SQ, et al. DNA G-quadruplex structures mold the DNA methylome. Nat Struct Mol Biol. 2018, 25, 951–957.
Wang JC, et al. Loss of Sfrp2 contributes to the neurological disorders related with morphine withdrawal via Wnt/β-catenin signaling. Behav Brain Res. 2019, 359, 609–618.
Zhao Y, et al. PRMT1 regulates the tumour-initiating properties of esophageal squamous cell carcinoma through histone H4 arginine methylation coupled with transcriptional activation. Cell Death Dis. 2019, 10, 359.
Chriett S, et al. Prominent action of butyrate over β-hydroxybutyrate as histone deacetylase inhibitor, transcriptional modulator and anti-inflammatory molecule. Sci Rep. 2019, 9, 742.
Galoian K, et al. Effect of cytostatic proline rich polypeptide-1 on tumor suppressors of inflammation pathway signaling in chondrosarcoma. Mol Clin Oncol. 2016, 5, 618–624.
Chen Y, et al. Hydroquinone-induced malignant transformation of TK6 cells by facilitating SIRT1-mediated p53 degradation and up-regulating KRAS. Toxi Lett. 2016, 259, 133–142.
Lorenz U. Protein Tyrosine Phosphatase Assays. Current Protocols in Immunology. John Wiley & Sons Inc. 2011, 93, 1–11.
Strang BL, et al. Identification of lead anti-human cytomegalovirus compounds targeting MAP4K4 via machine learning analysis of kinase inhibitor screening data. PLoS One. 2018, 13, e0201321.
Bajorath J, et al. The Future Is Now: Artificial Intelligence in Drug Discovery. J Med Chem. 2019, 62, 5249.
Ekins S, et al. Exploiting machine learning for end-to-end drug discovery and development. Nat Mater. 2019, 18, 435–441.
Bhhatarai B, et al. Opportunities and challenges using artificial intelligence in ADME/Tox. Nat Mater. 2019, 18, 418–422.
Engkvist O, et al. Computational prediction of chemical reactions: current status and outlook. Drug Discov Today. 2018, 6, 1203–1218.
