Drug side effects: an open source tool to predict adverse reactions

Posted: 14 July 2020 | Victoria Rees (Drug Target Review)

Researchers who developed a machine learning algorithm to predict the adverse effects of new drug compounds have released it as an open source tool.

A multi-institutional group of researchers led by Harvard Medical School (HMS), US, and the Novartis Institutes for BioMedical Research (NIBR) has created an open-source machine learning tool that identifies proteins associated with drug side effects.

The researchers say the work, published in the journal EBioMedicine, offers a new method for developing safer medicines by identifying potential adverse reactions before drug candidates reach human clinical trials or enter the market as approved medicines. Scientists can use, improve and build upon the model, which is freely available online.

The findings also offer insights into how the human body responds to drug compounds at the molecular level in both desired and unintended ways.

“Machine learning is not a silver bullet for drug discovery, but I do believe it can accelerate many different aspects in the difficult and long process of developing new medicines,” said the paper’s co-first author Robert Ietswaart, from HMS. “Although it cannot predict all possible adverse effects, we hope that our work will help researchers spot potential trouble early on and develop safer drugs in the future.”

Drug side effects can range from mild to fatal. They may occur either when a drug is taken as prescribed or as a result of incorrect dosages, interactions between multiple medicines or off-label use. Adverse drug reactions are responsible for two million US hospitalisations each year, according to the Department of Health and Human Services, and occur during 10 to 20 percent of hospitalisations, according to the Merck Manuals.

Researchers and health care providers have applied many tactics over the decades to avoid or at least minimise adverse drug reactions. However, because a single drug often interacts with multiple proteins in the body – not always limited to the intended targets – it can be hard to predict what side effects a medicine may generate. Furthermore, if a drug does end up causing an adverse reaction, it can be hard to identify which of its protein targets could be responsible.

In the new study, the researchers took one existing database of reported adverse drug reactions and another database of 184 proteins that specific drugs are known to often interact with. Then they constructed a computer algorithm to connect the dots.

Learning from the data, the algorithm unearthed 221 associations between individual proteins and specific adverse drug reactions. Some were known and some were new. The associations indicated which proteins likely represent drug targets that contribute to particular side effects and which others may be innocent bystanders.
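To illustrate the idea of "connecting the dots" between the two databases, here is a minimal sketch in Python. It is not the researchers' model: it swaps in a plain odds ratio over a 2x2 table of drug counts as a simple stand-in, and every drug name, reaction label, function name and the threshold of 2.0 is a hypothetical chosen for illustration.

```python
from itertools import product

# Hypothetical toy data, invented for illustration (not from the study):
# the proteins each drug interacts with, and the reactions reported for it.
DRUG_TARGETS = {
    "drug_a": {"hERG", "PDE3"},
    "drug_b": {"hERG"},
    "drug_c": {"PDE3"},
    "drug_d": set(),
    "drug_e": {"hERG"},
    "drug_f": set(),
}
DRUG_REACTIONS = {
    "drug_a": {"arrhythmia", "low_platelets"},
    "drug_b": {"arrhythmia"},
    "drug_c": {"low_platelets"},
    "drug_d": set(),
    "drug_e": {"arrhythmia"},
    "drug_f": {"low_platelets"},
}


def odds_ratio(protein, reaction, targets=DRUG_TARGETS, reactions=DRUG_REACTIONS):
    """Odds ratio from a 2x2 table of drug counts, with a 0.5 continuity correction."""
    hit_react = hit_only = react_only = neither = 0
    for drug, proteins in targets.items():
        hits = protein in proteins
        reacts = reaction in reactions.get(drug, set())
        if hits and reacts:
            hit_react += 1
        elif hits:
            hit_only += 1
        elif reacts:
            react_only += 1
        else:
            neither += 1
    # Adding 0.5 to every cell avoids division by zero in small tables.
    return ((hit_react + 0.5) * (neither + 0.5)) / ((hit_only + 0.5) * (react_only + 0.5))


def find_associations(threshold=2.0):
    """Score every protein-reaction pair and keep those above the threshold."""
    proteins = sorted(set().union(*DRUG_TARGETS.values()))
    reactions = sorted(set().union(*DRUG_REACTIONS.values()))
    scored = [(p, r, odds_ratio(p, r)) for p, r in product(proteins, reactions)]
    kept = [s for s in scored if s[2] > threshold]
    return sorted(kept, key=lambda s: -s[2])  # strongest association first


if __name__ == "__main__":
    for protein, reaction, score in find_associations():
        print(f"{protein} -> {reaction}: odds ratio {score:.1f}")
```

In this toy data set, only the hERG-arrhythmia and PDE3-low platelets pairs clear the threshold, while the other two pairs look like the "innocent bystanders" described above. A real analysis would need far more data and proper statistical testing.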

Based on what it has already learned, and strengthened by any new data that researchers feed it, the team say the programme may help doctors and scientists predict whether a new drug candidate is likely to cause a certain side effect on its own or when combined with particular medicines. The algorithm can help with these predictions before a drug is tested in humans, based on lab experiments that reveal which proteins the drug interacts with.

The hope is to raise the likelihood that a drug candidate will prove safe for patients before and after it reaches the market.

“This could reduce the risks that study participants face during the first in-human clinical trials and minimise risks for patients if a drug gains US Food and Drug Administration (FDA) approval and enters clinical use,” said Ietswaart.

Testing the model

Laszlo Urban, global head of pre-clinical secondary pharmacology at NIBR, and his team constructed their machine learning algorithm and applied it to two large data sets: one from Novartis with information about the proteins that each of 2,000 drugs interacts with and one from the FDA with 600,000 physician reports of adverse drug reactions in patients.

The algorithm generated statistically robust information about how individual proteins contribute to documented adverse reactions, said Ietswaart. “It suggests the physiological response to perturbing a particular protein – or the gene that makes it – at the molecular level.”

Many of the results supported previous observations, such as that binding to the protein hERG can cause cardiac arrhythmias. Findings like this strengthened the researchers’ confidence that the algorithm was performing well.

Other results, they say, were unexpected. For instance, the algorithm suggested that the protein PDE3 is associated with over 40 adverse drug reactions. Doctors and researchers have known for years that PDE3 inhibitors – common anti-clotting treatments for acute heart failure, stroke prevention and a heart attack complication known as cardiogenic shock – can cause arrhythmias, low platelet counts and elevated levels of enzymes called transaminases, a possible indicator of liver damage. However, it was not known that targeting PDE3 might raise the risk of so many other side effects, including some related to the muscles, bones, connective tissue, kidneys, urinary tract and ear.

Into the future

The algorithm also offered predictions on the likelihood that a particular drug would cause a certain adverse reaction.

To find out how accurate the predictions are, the researchers fed their algorithm updated information. Until then, the programme had learned from adverse drug reactions reported through 2014. The team added reports gathered from 2014 through 2019, some of which revealed side effects that had not been observed before from particular drugs.

They found that many of the algorithm’s previously unproven predictions matched the recent real-world reports.

“What seemed like false-positive predictions proved not to be false at all when the new reports became available,” said Ietswaart.
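This kind of check, in which predictions made from older data are compared with reports that arrived later, can be sketched in a few lines of Python. The example below is only an illustration under stated assumptions: the drug-reaction pairs are invented, and the function name and set-based approach are hypothetical rather than taken from the study.

```python
def later_confirmed(predicted, known_at_training, reported_later):
    """Return predictions that looked unconfirmed at training time but were
    later reported, plus the fraction of unconfirmed predictions they represent."""
    unconfirmed = predicted - known_at_training   # apparent false positives
    confirmed = unconfirmed & reported_later      # supported by newer reports
    fraction = len(confirmed) / len(unconfirmed) if unconfirmed else 0.0
    return confirmed, fraction


# Hypothetical (drug, reaction) pairs, invented for illustration only.
predicted = {
    ("drug_a", "arrhythmia"),
    ("drug_b", "tinnitus"),
    ("drug_c", "myalgia"),
    ("drug_d", "rash"),
}
reported_through_2014 = {("drug_a", "arrhythmia")}
reported_2014_to_2019 = {
    ("drug_b", "tinnitus"),
    ("drug_c", "myalgia"),
    ("drug_e", "nausea"),
}

if __name__ == "__main__":
    confirmed, fraction = later_confirmed(
        predicted, reported_through_2014, reported_2014_to_2019
    )
    print(sorted(confirmed))
    print(f"{fraction:.0%} of apparent false positives were later reported")
```

Here two of the three predictions that had no supporting report by 2014 turn up in the later reports, which mirrors the pattern Ietswaart describes. A prediction that remains unconfirmed, such as the hypothetical drug_d pair, is not necessarily wrong, because it may simply not have been reported yet.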

To make extra certain that the algorithm is reliable, the team compared its results to drug labels, conducted text mining of the scientific literature and used other validation techniques.

However, the team emphasise that although they strengthened the model as much as they could, it still assesses less than one percent of the roughly 20,000 genes in the human genome.

“Our work is by no means a complete understanding of adverse drug events because many other genes and proteins might contribute for which no assay is available or no drugs have been tested,” said Ietswaart.
