Artificial intelligence analyses how viruses evade the immune system

Share via

Posted: 15 January 2021 | Hannah Balfour (Drug Target Review) | No comments yet

The natural language processing model trained using viral protein sequence data was able to predict promising targets for vaccines against HIV, influenza and coronaviruses.

blur viral particle surrounded by antibodies

Researchers have developed a computer model that can predict which sections of viral surface proteins are more or less likely to mutate in a way that would disguise the virus from the immune system. So far, they have used the system to study and suggest potential vaccine targets for HIV, influenza and SARS-CoV-2 (which causes COVID-19).

One of the problems that has confounded the development of an effective HIV or universal flu vaccine is that these viruses mutate their surface protein very rapidly. As a result, the antibodies produced in response to a vaccine quickly become unable to bind to their intended target and so the immunity provided by the vaccine becomes useless. The process by which viruses adapt their surface proteins to avoid recognition by the immune system is known as viral escape.

To make predictions about which mutations would allow viral escape, the team from MIT, US, trained a natural language processing (NLP) model, which were originally developed to analyse patterns and make suggestions in language, to analyse patterns found in genetic sequences. According to the team, the NLP model was ideally suited to this purpose because some of the rules governing language are analogous to those governing protein structure and function.

When used for linguistic analysis, models are trained to analyse patterns in language, specifically, the frequency with which certain words occur together. The models then make predictions of which words could be used to complete a sentence. The chosen word must be both grammatically correct and have the right meaning.

In the new system, grammar is analogous to the rules that determine whether the protein encoded by a particular sequence is functional or not and semantic meaning is analogous to whether the protein can take on a new shape that helps it evade antibodies. Therefore, training an NLP with genetic sequences allows the model to predict new sequences, which still follow the rules biological rules of protein structure but have a different appearance.

The researchers said, some of the benefits of using NLP models for this application included that they can be trained using only genetic sequence information, which is much easier to obtain than protein structures, and that this training requires a relatively small amount of information – in their study, the researchers used 60,000 HIV sequences, 45,000 influenza sequences and 4,000 coronavirus sequences.

Predicting promising vaccine targets

Once the model was trained, the researchers used it to predict sequences of the coronavirus Spike (S) protein, HIV envelope protein and influenza hemagglutinin (HA) protein that would be more or less likely to generate escape mutations.

The model suggested:

The sequences least likely to mutate in influenza were in the stalk of the HA protein. Unfortunately, most people infected with the flu or vaccinated against it do not develop antibodies against the HA stalk.
For coronaviruses, a part of the S protein called the S2 subunit is least likely to generate escape mutations.
In their studies of HIV, the researchers found that the V1-V2 hypervariable region of the envelope protein has many possible escape mutations, as well as identifying some sequences that would have a lower probability of escape.

The researchers are now working with others to use their model to identify possible targets for cancer vaccines that stimulate the immune system to destroy tumours. They said it could also be used to design small-molecule drugs that might be less likely to provoke treatment resistance, for diseases such as tuberculosis.

“There are so many opportunities and the beautiful thing is all we need is sequence data, which is easy to produce,” concluded Bryan Bryson, one of the senior authors of the paper published in Science, an assistant professor of biological engineering at MIT and a member of the Ragon Institute of MGH, MIT and Harvard.

Epidemiological data on SARS-CoV-2 uncovers insights into mutations…

Related conditions
Coronavirus, Covid-19, HIV, Influenza

Related organisations
Massachusetts Institute of Technology (MIT)

Related people
Bryan Bryson

Cookie	Type	Duration	Description
cookielawinfo-checkbox-advertising-targeting	persistent	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertising & Targeting".
cookielawinfo-checkbox-analytics	persistent	1 year	This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Analytics".
cookielawinfo-checkbox-necessary	persistent	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	persistent	1 year	This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Performance".
PHPSESSID	session	1 year	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy	persistent	1 year	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
zmember_logged	session	1 year	This session cookie is served by our membership/subscription system and controls whether you are able to see content which is only available to logged in users.

Cookie	Type	Duration	Description
advanced_ads_browser_width	persistent	1 month	This cookie is set by Advanced Ads and measures the browser width.
advanced_ads_page_impressions	persistent	2 years	This cookie is set by Advanced Ads and measures the number of previous page impressions.
advanced_ads_pro_server_info	persistent	1 month	This cookie is set by Advanced Ads and sets geo-location, user role and user capabilities. It is used by cache busting in Advanced Ads Pro when the appropriate visitor conditions are used.
advanced_ads_pro_visitor_referrer	persistent	1 year	This cookie is set by Advanced Ads and sets the referrer URL.
bscookie	persistent	2 years	This cookie is a browser ID cookie set by LinkedIn share Buttons and ad tags.
IDE	persistent	2 years	This cookie is set by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
li_sugr	persistent	3 months	This cookie is set by LinkedIn and is used for tracking.
UserMatchHistory	persistent	1 month	This cookie is set by Linkedin and is used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.
VISITOR_INFO1_LIVE	persistent	5 months	This cookie is set by YouTube. Used to track the information of the embedded YouTube videos on a website.

Cookie	Type	Duration	Description
bcookie	persistent	2 years	This cookie is set by LinkedIn. The purpose of the cookie is to enable LinkedIn functionalities on the page.
GPS	persistent	30 minutes	This cookie is set by YouTube and registers a unique ID for tracking users based on their geographical location
lang	session	1 year	This cookie is set by LinkedIn and is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc	persistent	1 day	This cookie is set by LinkedIn and used for routing.
lissc	persistent	11 months	This cookie is set by LinkedIn share Buttons and ad tags.
vuid	persistent	2 years	We embed videos from our official Vimeo channel. When you press play, Vimeo will drop third party cookies to enable the video to play and to see how long a viewer has watched the video. This cookie does not track individuals.
wow.anonymousId	persistent	2 years	This cookie is set by Spotler and tracks an anonymous visitor ID.
wow.schedule	persistent	20 minutes	This cookie is set by Spotler and enables it to track the Load Balance Session Queue.
wow.session	persistent	20 minutes	This cookie is set by Spotler to track the Internet Information Services (IIS) session state.
wow.utmvalues	persistent	20 minutes	This cookie is set by Spotler and stores the UTM values for the session. UTM values are specific text strings that are appended to URLs that allow Communigator to track the URLs and the UTM values when they get clicked on.
_ga	persistent	2 years	This cookie is set by Google Analytics and is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. It stores information anonymously and assign a randomly generated number to identify unique visitors.
_gat	persistent	1 minute	This cookies is set by Google Universal Analytics to throttle the request rate to limit the collection of data on high traffic sites.
_gid	persistent	1 day	This cookie is set by Google Analytics and is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visited in an anonymous form.

Cookie	Type	Duration	Description
cf_ob_info	persistent	1 minute	This cookie is set by Cloudflare content delivery network and, in conjunction with the cookie 'cf_use_ob', is used to determine whether it should continue serving “Always Online” until the cookie expires.
cf_use_ob	persistent	1 minute	This cookie is set by Cloudflare content delivery network and is used to determine whether it should continue serving “Always Online” until the cookie expires.
free_subscription_only	session	1 year	This session cookie is served by our membership/subscription system and controls which types of content you are able to access.
ls_smartpush	persistent	1 month	This cookie is set by Litespeed Server and allows the server to store settings to help improve performance of the site.
one_signal_sdk_db	persistent	Until cleared	This cookie is set by OneSignal push notifications and is used for storing user preferences in connection with their notification permission status.
YSC	session	1 year	This cookie is set by Youtube and is used to track the views of embedded videos.

Recommended

Artificial intelligence analyses how viruses evade the immune system

Predicting promising vaccine targets

Leave a Reply Cancel reply

Recommended

Artificial intelligence analyses how viruses evade the immune system

Predicting promising vaccine targets

Developing next generation non-replicative HSV-1 vectors for sustainable and more precise gene therapies

Whitepaper: Targeting kinases in the innate immune response

CAR-NK cells: promising for cancer therapy

The IKAROS protein is crucial for B cell development

Drug Target Review Proteomics eBook 2023

Leave a Reply Cancel reply