Future-proofing drug development with GenAI

Share via

Posted: 3 July 2025 | Greg Lever (Director - AI Solutions Delivery - IQVIA) | No comments yet

Using GenAI and expert reasoning, drug developers can now explore an asset’s long-term potential as early as the preclinical stage. This shift is helping to reshape pipeline planning and refine therapeutic strategy.

Artificial Intelligence in Life Sciences

It is becoming increasingly evident that generative artificial intelligence (GenAI) is a resourceful tool for helping pharmaceutical companies reduce manual tasks required by clinical trials. However, R&D stakeholders are learning that GenAI underpinned by domain expertise also enables deep dives into the broader long-term potential of their investigational asset(s) as early as the preclinical phase. This is especially relevant with today’s heavier focus on enhancing personalised medicine via broader emerging scientific findings.

Given macro healthcare influences (eg, economic uncertainty, environmental changes) and the numerous available treatments for major diseases, drug developers may need to reassess their therapeutic strategies. Long-term sustainability may benefit from identifying priority indications, exploring emerging mechanisms of action and refining development priorities.

The multi-agent framework of large language models (LLMs), reasoning models and advanced algorithms is revealing the critical unknowns within R&D strategies that impact an asset’s chance of meeting its potential.

Reserve your FREE place

AI-powered drug discovery: Accelerating the development of life-saving therapies

18 September 2025 | 14:00PM BST | FREE Webinar

Join this webinar to learn how AI is accelerating early-stage drug discovery and improving target identification, practical strategies for applying AI effectively within your organisation and to ask your questions to our industry expert! Dr Remco Jan Geukes Foppen will share practical insights into how AI is being applied across the pharmaceutical sector, helping teams move faster and make better-informed decisions. With experience spanning data management, image analysis, bioinformatics, and machine learning in clinical research, he brings both deep technical expertise and strategic understanding of real-world challenges.

Register Now – It’s Free!

Given the full potential of future assets can’t be directly observed or measured, drug developers are often unaware of certain opportunities or risks. Thus, early-stage uncovering of asset value and direction via AI may help them prioritise pipelines and business goals.

Unintentionally limiting asset potential

There is a tremendous level of untapped chemical space that could further the development of drug molecules, yet traditional drug discovery and development has been hindered by its resource-heavy manual approaches to exploring strategic pathways. This has led drug developers to unintentionally limit their potential within chosen therapeutic spaces. With up to 90 percent of assets never making it to market when competition is at an all-time high, it is worth discussing how AI-driven approaches might help set a stronger foundation for possibilities downstream.¹

Seeing the bigger picture of strategic options

Because LLMs are trained on extensive, internet-scale datasets, they can learn to identify contexts linking words and language. When grounding an LLM with scientific datasets, the model can learn context that helps identify entities (eg, diseases, symptoms, molecules, etc) across multiple data sources. Whether targeting the same drug, disease, protein, etc, a well-designed LLM grounded in trusted data can understand context between scientific literature, clinical trial results and real-world evidence sources, such as electronic health records or omics datasets.

Because LLMs are trained on extensive, internet-scale datasets, they can learn to identify contexts linking words and language.

Leveraging the extensive breadth of available data to identify entities and relationships across data sources, clinical research experts, therapeutic specialists, machine learning (ML) engineers and others can collectively evaluate areas of interest that may create new opportunities for the asset and a broader clinical strategy.

However, drug developers should know the various ways LLMs and other AI-based methodologies can shed light on asset profile, strategy and potential therapeutic and commercial promise before a trial starts. AI-driven solutions allow drug developers to gauge forward-looking questions, including:

What may be achieved with our asset over the next 15 years? Could we expand beyond the initial approved indication?
For our therapeutic focus, what future trends should be expected? Any growth areas?
Where do we focus our resources and capabilities for pipeline and/or asset direction over the long-term? Which indications have the highest commercial potential?
What is the full scope of therapeutic focus(es) and patient subpopulations our asset may reach?
How differentiated is our asset from existing and emerging competitors?

New approach methodologies data

In April, the US Food and Drug Administration (FDA) announced a plan to replace animal testing in the development of monoclonal antibodies and other therapies with validated “human-relevant” methods, including AI-based computational models evaluating toxicity, cellular lines and organoid toxicity.²

This plan further encourages the strategic use of AI modelling and real-world human data, which are considered new approach methodologies (NAM) data. Encouraging inclusion of NAMs data in investigational new drug (IND) applications allows drug developers to use AI to produce predictive outcomes regarding asset profiles, including:

Creating virtual cohorts (PBPK/PD digital twins) to explore absorption, distribution and metabolism, mitigating risk
Using deep learning on chemical structures and historic toxicity data, gauging organ-specific safety issues
Mapping on- and off-target interactions across thousands of proteins to prioritise molecules before wet lab screens
Exploring precision dosing by combining and reviewing genomic, transcriptomic and exposome data to model response variability.

When looking to analyse safety and efficacy across the development pipeline, it is also possible to integrate imaging, multiomics and clinical endpoints into end-to-end predictive models. These models can help uncover subtle patterns and correlations that may not otherwise be evident. By combining diverse data modalities, drug developers can generate more holistic insights to better anticipate adverse events, stratify patient populations and optimise trial design. This integrated approach also supports earlier go/no-go decisions and more targeted therapeutic development.

Creating a strategic head start with expert reasoning

These models must align with what R&D stakeholders aim to achieve and be based on curated, connected data to provide thorough, accurate and useful outputs. Upon using an LLM to pull, define and organise the context between data, expert human oversight is necessary to offer clinical reasoning and logic to derive meaningful insights made possible from these models. Therapeutic and clinical trial experts with deep understanding of emerging medicine trends and developments can recognise nuanced context and decipher the R&D possibilities.

The ability to extract and analyse layered, connected insights proffers evidence-driven answers about how assets or portfolios can realise their potential. Drug developers may discover:

Mechanistic flexibility. The compound’s mechanism of action (MOA) may be used in a slightly different patient population or adjacent therapeutic focus, enabling label expansion or pursuit of novel indications.
Indication prioritisation. Indications of interest may be ranked according to potential for technical success, depth of the unmet need and likelihood of commercial success for more targeted and informed investments.
Molecular innovation. With slight modifications to an antibody class, it is possible to explore new therapeutic areas or enhance impact within an existing one. Some single antibody scaffolds are already being applied across different disease areas or are demonstrating improved outcomes for patients within the same indication due to improved targeting or delivery mechanisms.
Preclinical advantage. Compared to the standard of care at the same stage of development, a preclinical asset may demonstrate promising activity against a known target but with a more favourable toxicity profile. This early level of insight can help prioritise assets with differentiated potential and reduce risk of downstream attrition.
Biomarker and patient stratification insights. Language and reasoning models can help identify predictive biomarkers or patient subgroups more likely to respond to treatment, enabling more precise trial design and a higher likelihood of success.
Lifecycle planning and repurposing. By mapping the scientific and clinical landscape over time, developers can spot opportunities to reposition shelved assets, repurpose for rare diseases or explore combinations with synergistic therapies.

Instead of focusing on one-off indications, making tweaks as findings are extracted allows developers to build comprehensive portfolio strategies.

A hindering factor is that knowledge about failed assets and why they failed is often unpublished and limited. It can also be difficult to secure all necessary data to adequately analyse a drug’s potential benefits and alternative uses due to study design, lack of endpoints or the small number of patients enrolled. Such information gaps emphasise the need for clinical data scientists to supervise methodologies.

Connecting the context: what does it technically take?

It is important to recognize that extracting broad insights during preclinical stages is not straightforward. LLMs are trained on internet-scale datasets, which is similar to walking into a library filled with books. While plenty of data is available, vast amounts are not relevant to your search.

Grounding LLMs with particular datasets and engineering prompts helps you access the appropriate data. A knowledge graph can help to visualise meaningful connections across entities.

By creating new algorithms and embedding the context of relationships among them, the LLM can act like a librarian who knows where to search among the library’s stacks, as well as how the books you are interested in relate to one another and why.

A graphical representation of connections in knowledge graphs can provide therapeutic and clinical experts with insight to drum up questions. For example, if the aim is to map out a long-term disease strategy in immunology, it would be helpful to gauge what, if any, new biologic pathways or MOAs might spark a paradigm shift in the space or which indications may have stronger commercial potential or interest.

Figure 1. GenAI-driven knowledge graph to visualise meaningful connections across entities of interest.

By collaborating with ML scientists and engineers, experts can dive deeper into extracted insights and train LLMs on different sets of tasks and/or datasets to build out scenarios for moving forward and identifying what to prioritise.

Fine-tuning strategic pathways forward

Several years ago, it would have been difficult to imagine drug developers in preclinical stages being able to realistically look a decade ahead with evidence-driven insight into the fuller potential of their assets and pipelines. But just as emerging medicine is personalising care for patients, drug development and commercialisation strategies are also being fine-tuned through advances in LLM frameworks.

Since nothing related to AI sits still for long, we know its promise for helping curate and connect meaningful data insights to show downstream value and potential of assets and pipelines will only grow.

References:

[1] Sun D, Gao W, Hu H, Zhou S. Why 90% of clinical drug development fails and how to improve it? Acta Pharm Sin B. 2022 Jul;12(7):3049-3062.

[2] U.S. Food and Drug Administration. FDA news release, “FDA announces plan to phase out animal testing requirement for monoclonal antibodies and other drugs.” April 10, 2025.

Meet the author

Greg Lever, Director, AI Solutions Delivery, IQVIA

With more than 15 years of life sciences and technology experience, Greg currently helps clients discover innovative ways to bring life-changing therapies to patients faster within IQVIA’s Applied Data Science Center’s consulting sales team. Previously, he led a team of ML engineers within IQVIA’s Analytics Center of Excellence.

Greg has worked with several technology startup companies in London and helped see Genomics England’s 100,000 Genomes Project through project completion. He earned his PhD at the University of Cambridge, combining quantum physics and ML to develop new approaches for small-molecule drug discovery, and has worked as a postdoctoral associate at MIT.

Related organisations
IQVIA

Cookie	Type	Duration	Description
cookielawinfo-checkbox-advertising-targeting	persistent	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertising & Targeting".
cookielawinfo-checkbox-analytics	persistent	1 year	This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Analytics".
cookielawinfo-checkbox-necessary	persistent	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	persistent	1 year	This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Performance".
PHPSESSID	session	1 year	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy	persistent	1 year	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
zmember_logged	session	1 year	This session cookie is served by our membership/subscription system and controls whether you are able to see content which is only available to logged in users.

Cookie	Type	Duration	Description
advanced_ads_browser_width	persistent	1 month	This cookie is set by Advanced Ads and measures the browser width.
advanced_ads_page_impressions	persistent	2 years	This cookie is set by Advanced Ads and measures the number of previous page impressions.
advanced_ads_pro_server_info	persistent	1 month	This cookie is set by Advanced Ads and sets geo-location, user role and user capabilities. It is used by cache busting in Advanced Ads Pro when the appropriate visitor conditions are used.
advanced_ads_pro_visitor_referrer	persistent	1 year	This cookie is set by Advanced Ads and sets the referrer URL.
bscookie	persistent	2 years	This cookie is a browser ID cookie set by LinkedIn share Buttons and ad tags.
IDE	persistent	2 years	This cookie is set by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
li_sugr	persistent	3 months	This cookie is set by LinkedIn and is used for tracking.
UserMatchHistory	persistent	1 month	This cookie is set by Linkedin and is used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.
VISITOR_INFO1_LIVE	persistent	5 months	This cookie is set by YouTube. Used to track the information of the embedded YouTube videos on a website.

Cookie	Type	Duration	Description
bcookie	persistent	2 years	This cookie is set by LinkedIn. The purpose of the cookie is to enable LinkedIn functionalities on the page.
GPS	persistent	30 minutes	This cookie is set by YouTube and registers a unique ID for tracking users based on their geographical location
lang	session	1 year	This cookie is set by LinkedIn and is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc	persistent	1 day	This cookie is set by LinkedIn and used for routing.
lissc	persistent	11 months	This cookie is set by LinkedIn share Buttons and ad tags.
vuid	persistent	2 years	We embed videos from our official Vimeo channel. When you press play, Vimeo will drop third party cookies to enable the video to play and to see how long a viewer has watched the video. This cookie does not track individuals.
wow.anonymousId	persistent	2 years	This cookie is set by Spotler and tracks an anonymous visitor ID.
wow.schedule	persistent	20 minutes	This cookie is set by Spotler and enables it to track the Load Balance Session Queue.
wow.session	persistent	20 minutes	This cookie is set by Spotler to track the Internet Information Services (IIS) session state.
wow.utmvalues	persistent	20 minutes	This cookie is set by Spotler and stores the UTM values for the session. UTM values are specific text strings that are appended to URLs that allow Communigator to track the URLs and the UTM values when they get clicked on.
_ga	persistent	2 years	This cookie is set by Google Analytics and is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. It stores information anonymously and assign a randomly generated number to identify unique visitors.
_gat	persistent	1 minute	This cookies is set by Google Universal Analytics to throttle the request rate to limit the collection of data on high traffic sites.
_gid	persistent	1 day	This cookie is set by Google Analytics and is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visited in an anonymous form.

Cookie	Type	Duration	Description
cf_ob_info	persistent	1 minute	This cookie is set by Cloudflare content delivery network and, in conjunction with the cookie 'cf_use_ob', is used to determine whether it should continue serving “Always Online” until the cookie expires.
cf_use_ob	persistent	1 minute	This cookie is set by Cloudflare content delivery network and is used to determine whether it should continue serving “Always Online” until the cookie expires.
free_subscription_only	session	1 year	This session cookie is served by our membership/subscription system and controls which types of content you are able to access.
ls_smartpush	persistent	1 month	This cookie is set by Litespeed Server and allows the server to store settings to help improve performance of the site.
one_signal_sdk_db	persistent	Until cleared	This cookie is set by OneSignal push notifications and is used for storing user preferences in connection with their notification permission status.
YSC	session	1 year	This cookie is set by Youtube and is used to track the views of embedded videos.

Recommended

Future-proofing drug development with GenAI

AI-powered drug discovery: Accelerating the development of life-saving therapies

Register Now – It’s Free!

Unintentionally limiting asset potential

Seeing the bigger picture of strategic options

New approach methodologies data

Creating a strategic head start with expert reasoning

Connecting the context: what does it technically take?

Fine-tuning strategic pathways forward

Leave a Reply Cancel reply

Recommended

Future-proofing drug development with GenAI

AI-powered drug discovery: Accelerating the development of life-saving therapies

Register Now – It’s Free!

Unintentionally limiting asset potential

Seeing the bigger picture of strategic options

New approach methodologies data

Creating a strategic head start with expert reasoning

Connecting the context: what does it technically take?

Fine-tuning strategic pathways forward

Cancer drug discovery breakthroughs: research that’s changing lives

Inside the search-and-develop model tackling 1,000 untreated skin diseases

Chronic neuron overactivation drives Parkinson’s cell death

New AI method maps how tuberculosis drugs destroy bacteria

Scientists chart ovarian reserve to help advance new infertility treatments

Leave a Reply Cancel reply