Hit-to-lead in drug discovery

Kitchen, Douglas B.; Wolf, Mark

Hit-to-lead in drug discovery

126

SHARES

Share via

Posted: 4 September 2016 | Douglas B. Kitchen (Albany Molecular Research Inc.), Mark Wolf (Albany Molecular Research Inc.) | No comments yet

Starting in the 1970s, the drug discovery research process changed dramatically. Our understanding of biological targets and pathways grew and the largely empirical and target agnostic in vivo pharmacology approach was replaced with target-centric methods…

Use of in vitro biochemical assays grew in popularity during the 1980s and high-content screening (HTS) of large compound libraries became the norm. One of the advantages of in vivo pharmacology starting points was that in vivo activity was assured. In contrast, HTS of large libraries for single-target activity produces the semblance of success in that large numbers of diverse chemical series are identified as starting points (hits) for further work towards identifying clinical candidates. However, these hits are typically weakly active in a primary screen and do not necessarily possess drug-like characteristics. Therefore, a new process was developed in drug discovery termed hit-to-lead (H2L), or more fully, hit series-to-lead optimisation.

H2L, which explores the chemistry and biology of hits to eliminate dead-end structures and provides improvements to the remaining hit series so that development time and costs are saved, is now a key process in drug discovery organisations. This process explores the chemical space around each hit-series of compounds and narrows it down to more ‘clinic-ready’ lead structures.

The primary goal of H2L research is to identify a few hit series that each demonstrate the promise of producing a drug candidate after focused lead-optimisation efforts. While every programme will have its own idiosyncratic needs, there are a number of general concerns with identifying orally active drug candidates. Measurements such as potency, selectivity, solubility, permeability, metabolic stability, low Cytochrome P450 (CYP) inhibition and good pharmacokinetic (PK) properties tend to apply across most small-molecule discovery programmes.

The H2L process has improved over the past decade. At first, compounds were optimised primarily on the basis of their primary biological activity, with little attention given to their fate in later in vivo assays. Screening libraries were often not very ‘drug-like’ and the H2L starting points reflected that fact. Rather than improving success in the clinic, the screening focus had decreased success. In his seminal paper¹, Lipinski was among the first to formalise the effects of target-activity focus and the resulting decreased oral bioavailability due to limited aqueous solubility and permeability, giving birth to the ‘rule-of-5’. Lipinski recognised that the invention of larger and larger molecules limited their continued development.

Most H2L processes have increased the use of calculations to confine lipophilicity (logP), molecular weight and a growing number of other molecular descriptors. Even though the pitfalls of chasing potency were clear, many scaffolds continued to fail due to poor drug-like characteristics. Consequently, various new guideposts such as ligand efficiency² (LE), lipophilic efficiency³ (LipE), and lipophilic ligand efficiency⁴ (LLE) were introduced. Each of these parameters attempt to penalise compounds that improve potency with unnecessary increases in molecular size and/or lipophilicity.

In the late 1990s and early 2000s, assays were introduced into H2L programmes that attempted to model absorption, distribution, metabolism, excretion and toxicity (ADMET). Due to their relatively low cost, high speed and reliable predictions of in vivo experiments these assays are now used routinely to profile key factors that influence oral bioavailability.

Current approaches in H2L

To illustrate H2L, we will discuss some sample output from an internal HTS. Approximately 200 weak hits were obtained by screening ~110,000 compounds at a single concentration screen, a fairly typical result. Additional concentration response curves showed that 125 hits possessed activities ranging from 62nM – 75M. The relevant question is: what does a good starting point look like? And how do you choose among 100 or more compounds? This selection step is commonly referred to as hit triage⁵.

The computer-aided drug design (CADD) scientists analysed the confirmed hits and worked with the medicinal chemists to group them into similar scaffolds. Often, the total number of hits from an HTS effort will reduce to groups of 5-10 similar compounds, each containing a common scaffold.

The ‘most active’ compound is not always the best starting point because many parameters need to be optimised to create a successful drug. To help take the focus off any single parameter the ‘Traffic Light’ (TL) approach, as described by Lobell et al.⁶, can be used. While the original paper focused on computed parameters, this approach can be extended to include experimental results. The basic premise behind the TL approach is to assign three ranges for each parameter and define them as good (0), warning (+1) and bad scores (+2). The scores for each compound are added across all the scoring parameters to give a final TL score, which can be used for ranking prospects. Since a lower score is more desirable, this is often affectionately termed a ‘golf score.’

Though we have not done so, if desired, the categories can be weighted to reflect their importance. Table 1 below shows the TL analysis of two hits that came out of the example screen. The categories used to help rank hits include calculated parameters such as TPSA and cLogP, experimentally-derived primary assay data, secondary assay data and kinetic solubility. While compound two is the more active of the two compounds, the TL analysis highlights poor ligand efficiency and high cLogP as drawbacks for this starting point. While solubility was not obtained for compound two, it is not expected to exceed the high solubility found in compound one.

Table 1: Traffic light comparison of two HTS hits.

The flexibility of the TL approach is demonstrated by addition of kinetic solubility and a selectivity assay parameter. Any assay category can be added to the table as H2L proceeds. We have used PAMPA assay data in our TL filter, because the costs and time associated with caco-2 evaluation of hundreds of compounds made it impractical to include. Additionally, the analysis can be performed as aggregate properties of each scaffold. For example, means, medians and standard deviations for each quantity can be summarised.

Once the TL analysis narrows the field to five-10 possible hit series, the rankings of each series can be used to prioritise the resources assigned to their progression. Several critical activities are needed. The activities of the original hits are confirmed independently by purchase or by independent synthesis and rigorous structural analysis. Spurious hits and artifacts are often eliminated by testing in an orthogonal assay that measures binding. Also, mechanism of action studies and experimental assessment of undesirable off-target activities are explored. If possible, new co-crystal structures are obtained. A common tactic in hit triage is to expand potential hits to find similar compounds available for purchase, sometimes referred to as ‘SAR by catalogue.’ Typically, 30-50 compounds are identified, purchased and screened for activity. This approach can be helpful in identifying ‘flat’ SAR where no improvements in activity are achieved, as structures are changed from the original hit or where all changes result in elimination of activity.

H2L begins in earnest as the project team agrees on a set of early ADMET assays to identify weaknesses within a series and to help compare with other potential starting points. Teams commonly monitor solubility, permeability, metabolic stability and CYP inhibition measurements. Early on throughput is often prioritised, but as the programme gets closer to lead optimisation the assays will change. For instance, a single time point for microsomal stability studies may suffice early on, to rank order compounds; later, full determinations of intrinsic clearance in microsomes and/or stability in hepatocytes will be added. In addition to the ADMET assays, proper selectivity assays of related targets need to be evaluated. The team will need to determine whether these counter screens must be routine or if intermittent checks of compounds are sufficient. The key is that the team regularly evaluates their screening funnel to ensure they are collecting the required data to make sound decisions.

With the proper screening funnel in place, it is left to the medicinal chemist to execute a synthetic strategy for each potential hit series to determine the viability of each series. While the primary task of synthesis lies with the medicinal chemist, design and prioritisation of targets is greatly aided using computational resources. These calculations include physicochemical properties such as cLogP, TPSA, rotatable bonds etc., as well as docking models derived from crystal structures of the target, or homologous proteins. The CADD team often provides any secondary activity calculations such as LE, LipE, or LLE to aid the medicinal chemist’s understanding of physicochemical changes that accompany activity gains.

A very important step is to identify existing prior art in order to develop new intellectual property. Early efforts are primarily focused on improving activity at the primary target and finding those changes in structure which eliminate activity. As the potency of a series improves, the chemist considers the physicochemical parameters of new targets as well as their performance in the ADMET and selectivity assays. Even early in the H2L chemistry, the chemist is mindful of creating new intellectual property, as it is important that the drug candidate be protected by patents. Rapid development of diverse synthetic schemes is critical for the exploration of new chemical IP around each series. Frequently, the chemistry team needs to develop multiple convergent synthesis methods in a narrow time window.

Before moving a hit series into lead optimisation, where resources and expenses increase, compounds from that series will need to have demonstrated favourable characteristics as shown in Table 2. During the H2L exploration it is unlikely that all of these qualities will be found in one compound. However, within the series the team is looking for examples that meet the predetermined requirements in these areas. Once those criteria are met, the team can feel confident proceeding into lead optimisation with the series.

Table 2: Sample evaluation criteria.

The H2L phase of the discovery process is a challenging one. However, with good and regular communication among biology, chemistry, CADD and ADMET team members, the chances of success increase greatly. Tools such as the TL system allow customisation of each HTS effort and are valuable aids in narrowing down the possible starting points.

Biographies

MARK WOLF has 20 years of drug discovery and development experience at AMRI and has held positions of increasing responsibility during that time. He has led interdisciplinary scientific teams to support large pharma, biotech collaborations and AMRI R&D programmes. His research has spanned several therapeutic areas including oncology, CNS, pain/inflammation and respiratory. Mark has experience establishing practical screening strategies to identify drug candidate compounds with a first-in-class or best-in-class product profile. He has a PhD in organic chemistry.

DOUG KITCHEN obtained his PhD from Princeton University (Chemistry). After a postdoctoral fellowship at Rutgers University studying biomolecular simulations, he joined Lederle Labs in 1992 in the computer-aided drug discovery group. Doug joined AMRI in 1997 to begin the computational chemistry group. The group has worked on well over 100 projects spanning all aspects of drug discovery. Doug’s interests are in compound library design, structure-guided drug design and lead optimisation. He is a co-inventor on more than 10 issued patents and author on more than forty refereed articles and book chapters.

References

Lipinski CA, Lombardo F, Dominy BW, Feeny PJ. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev. 1997; 23: 3-25
Hopkins AL, Groom CR, Alex A. Ligand efficiency: a useful metric for lead selection. Drug Disc Today. 2004; 9 (10): 430-431
Ryckmans T, Edwards MP, Horne VA, Correia AM, Owen DR, Thompson LR, Tran I, Tutt MF, Young T. Rapid assessment of a novel series of selective CB₂ agonists using parallel synthesis protocols: A Lipophilic Efficiency (LipE) analysis. Bioorg Med Chem Letters. 2009; 19: 4406-4409
Leeson PD, Springthorpe B. The influence of drug-like concepts on decision-making in medicinal chemistry. Nat Rev Drug Disc. 2007; 6:881-890
Duffy BC, Zhu L, Decornez H, Kitchen DB. Early phase drug discovery: Cheminformatics and computational techniques in identifying lead series. Bioorg Med Chem. 2012, 20: 5324-5342
Lobell M, Hendrix M, Hinzen B, Keldenich J, Meier H, Schmeck C, Schohe-Loop R, Wunberg T, Hillisch A. In Silico ADMET Traffic Lights as a Tool for the Prioritization of HTS Hits. Chem Med Chem. 2006; 1: 1229-1236

Cookie	Type	Duration	Description
cookielawinfo-checkbox-advertising-targeting	persistent	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertising & Targeting".
cookielawinfo-checkbox-analytics	persistent	1 year	This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Analytics".
cookielawinfo-checkbox-necessary	persistent	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	persistent	1 year	This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Performance".
PHPSESSID	session	1 year	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy	persistent	1 year	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
zmember_logged	session	1 year	This session cookie is served by our membership/subscription system and controls whether you are able to see content which is only available to logged in users.

Cookie	Type	Duration	Description
advanced_ads_browser_width	persistent	1 month	This cookie is set by Advanced Ads and measures the browser width.
advanced_ads_page_impressions	persistent	2 years	This cookie is set by Advanced Ads and measures the number of previous page impressions.
advanced_ads_pro_server_info	persistent	1 month	This cookie is set by Advanced Ads and sets geo-location, user role and user capabilities. It is used by cache busting in Advanced Ads Pro when the appropriate visitor conditions are used.
advanced_ads_pro_visitor_referrer	persistent	1 year	This cookie is set by Advanced Ads and sets the referrer URL.
bscookie	persistent	2 years	This cookie is a browser ID cookie set by LinkedIn share Buttons and ad tags.
IDE	persistent	2 years	This cookie is set by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
li_sugr	persistent	3 months	This cookie is set by LinkedIn and is used for tracking.
UserMatchHistory	persistent	1 month	This cookie is set by Linkedin and is used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.
VISITOR_INFO1_LIVE	persistent	5 months	This cookie is set by YouTube. Used to track the information of the embedded YouTube videos on a website.

Cookie	Type	Duration	Description
bcookie	persistent	2 years	This cookie is set by LinkedIn. The purpose of the cookie is to enable LinkedIn functionalities on the page.
GPS	persistent	30 minutes	This cookie is set by YouTube and registers a unique ID for tracking users based on their geographical location
lang	session	1 year	This cookie is set by LinkedIn and is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc	persistent	1 day	This cookie is set by LinkedIn and used for routing.
lissc	persistent	11 months	This cookie is set by LinkedIn share Buttons and ad tags.
vuid	persistent	2 years	We embed videos from our official Vimeo channel. When you press play, Vimeo will drop third party cookies to enable the video to play and to see how long a viewer has watched the video. This cookie does not track individuals.
wow.anonymousId	persistent	2 years	This cookie is set by Spotler and tracks an anonymous visitor ID.
wow.schedule	persistent	20 minutes	This cookie is set by Spotler and enables it to track the Load Balance Session Queue.
wow.session	persistent	20 minutes	This cookie is set by Spotler to track the Internet Information Services (IIS) session state.
wow.utmvalues	persistent	20 minutes	This cookie is set by Spotler and stores the UTM values for the session. UTM values are specific text strings that are appended to URLs that allow Communigator to track the URLs and the UTM values when they get clicked on.
_ga	persistent	2 years	This cookie is set by Google Analytics and is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. It stores information anonymously and assign a randomly generated number to identify unique visitors.
_gat	persistent	1 minute	This cookies is set by Google Universal Analytics to throttle the request rate to limit the collection of data on high traffic sites.
_gid	persistent	1 day	This cookie is set by Google Analytics and is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visited in an anonymous form.

Cookie	Type	Duration	Description
cf_ob_info	persistent	1 minute	This cookie is set by Cloudflare content delivery network and, in conjunction with the cookie 'cf_use_ob', is used to determine whether it should continue serving “Always Online” until the cookie expires.
cf_use_ob	persistent	1 minute	This cookie is set by Cloudflare content delivery network and is used to determine whether it should continue serving “Always Online” until the cookie expires.
free_subscription_only	session	1 year	This session cookie is served by our membership/subscription system and controls which types of content you are able to access.
ls_smartpush	persistent	1 month	This cookie is set by Litespeed Server and allows the server to store settings to help improve performance of the site.
one_signal_sdk_db	persistent	Until cleared	This cookie is set by OneSignal push notifications and is used for storing user preferences in connection with their notification permission status.
YSC	session	1 year	This cookie is set by Youtube and is used to track the views of embedded videos.

Recommended

Hit-to-lead in drug discovery

Current approaches in H2L

Biographies

References

Leave a Reply Cancel reply

Recommended

Hit-to-lead in drug discovery

Current approaches in H2L

Biographies

References

Rapid functional cell-based activity profiling of multi-receptor targeted anti-obesity therapies

Development of a new and promising antimalarial agent

Translating ‘nature’s cues’ into breakthrough immunotherapies

Part three: pragmatic guidelines to getting the best out of LLMs

Women in STEM with Juliet Williams

Leave a Reply Cancel reply