Integrating genomic data from different ancestries reduces bias in predicting disease risk

Share via

Posted: 6 May 2022 | Ria Kakkad (Drug Target Review) | No comments yet

Researchers have developed a promising new tool that accurately uses genomic data to predict disease risk across diverse populations.

Big genomic data visualization. DNA test, genome map.

Polygenic risk scores (PRS) are promising tools for predicting disease risk, but current versions have built-in bias that can affect their accuracy in some populations and result in health disparities. Researchers from Massachusetts General Hospital (MGH), the Broad Institute of Massachusetts Institute of Technology (MIT) and Harvard University, all US, and Shanghai Jiao Tong University, China, have designed a new method for generating PRS that integrates genomic data from different ancestries, therefore, more accurately predict disease risk across populations. The new study was recently published in Nature Genetics.

Alterations in a gene’s DNA sequence can produce a genetic variant that increases the risk for disease. Some genetic variants are closely linked to certain diseases, such as the BRCA1 mutation and breast cancer. However, most common human diseases are influenced by hundreds or thousands of genetic variants across the genome. PRS aggregate the effects of genetic variants across the genome and have shown promise for one day being used to predict individual patients’ chances of developing diseases. This would allow clinicians to recommend preventive measures and monitor patients closely for early diagnosis and intervention.

A PRS must be trained to predict disease risk using data from studies in which genomic information is collected from large groups of individuals. While many disease-causing variants are shared there are important differences in the genetic basis of a disease between individuals of different ancestries.

“A major problem with existing methods for PRS calculation is that, to date, most of the genomic studies used data collected from individuals of European ancestry,” said Dr Tian Ge, a co-senior author of the study. This creates a Eurocentric bias in existing PRS, producing substantially less-accurate predictions and raising the possibility that they could over- or underestimate disease risk in non-European populations.

Recently, researchers have increased efforts to collect genomic data from underrepresented populations. Leveraging these resources, the team created a new tool called PRS-CSx that can integrate data from multiple populations and account for genetic similarities and differences between them. While there is s still significantly more genomic data on individuals of European ancestry, the investigators used computational methods that allowed them to maximise the value of non-European data and improve prediction accuracy in ancestrally diverse individuals.

In the study, the investigators used genomic data from individuals in several different populations to predict a wide range of physical measures (such as height, body mass index, and blood pressure), blood biomarkers (such as glucose and cholesterol), and the risk for schizophrenia. Then they compared the predicted trait or disease risk with actual measures or reported disease status to measure PRS-CSx’s prediction accuracy. The study’s results demonstrated that PRS-CSx is significantly more accurate than existing PRS tools in non-European populations.

ARTICLE: Closing the diversity gap in genomics
READ MORE

PRS-CSx could also have a role in basic research. It could be used, for example, to explore gene-environment interactions, such as how the effect of genetic risk would depend on the level of environmental risk factors in global populations.

Even with PRS-CSx, the gap in prediction accuracy between European and non-European populations remains considerable. Broadening the sample diversity across global populations is crucial to further improve the prediction accuracy of PRS in diverse populations.

“The expansion of non-European genomic resources, coupled with advanced analytic methods like PRS-CSx, will accelerate the equitable deployment of PRS in clinical settings,” concluded Dr Hailiang Huang, a co-senior author of the paper.

Related conditions
Breast cancer

Related people
Dr Hailiang Huang, Dr Tian Ge

Cookie	Type	Duration	Description
cookielawinfo-checkbox-advertising-targeting	persistent	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertising & Targeting".
cookielawinfo-checkbox-analytics	persistent	1 year	This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Analytics".
cookielawinfo-checkbox-necessary	persistent	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	persistent	1 year	This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Performance".
PHPSESSID	session	1 year	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy	persistent	1 year	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
zmember_logged	session	1 year	This session cookie is served by our membership/subscription system and controls whether you are able to see content which is only available to logged in users.

Cookie	Type	Duration	Description
advanced_ads_browser_width	persistent	1 month	This cookie is set by Advanced Ads and measures the browser width.
advanced_ads_page_impressions	persistent	2 years	This cookie is set by Advanced Ads and measures the number of previous page impressions.
advanced_ads_pro_server_info	persistent	1 month	This cookie is set by Advanced Ads and sets geo-location, user role and user capabilities. It is used by cache busting in Advanced Ads Pro when the appropriate visitor conditions are used.
advanced_ads_pro_visitor_referrer	persistent	1 year	This cookie is set by Advanced Ads and sets the referrer URL.
bscookie	persistent	2 years	This cookie is a browser ID cookie set by LinkedIn share Buttons and ad tags.
IDE	persistent	2 years	This cookie is set by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
li_sugr	persistent	3 months	This cookie is set by LinkedIn and is used for tracking.
UserMatchHistory	persistent	1 month	This cookie is set by Linkedin and is used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.
VISITOR_INFO1_LIVE	persistent	5 months	This cookie is set by YouTube. Used to track the information of the embedded YouTube videos on a website.

Cookie	Type	Duration	Description
bcookie	persistent	2 years	This cookie is set by LinkedIn. The purpose of the cookie is to enable LinkedIn functionalities on the page.
GPS	persistent	30 minutes	This cookie is set by YouTube and registers a unique ID for tracking users based on their geographical location
lang	session	1 year	This cookie is set by LinkedIn and is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc	persistent	1 day	This cookie is set by LinkedIn and used for routing.
lissc	persistent	11 months	This cookie is set by LinkedIn share Buttons and ad tags.
vuid	persistent	2 years	We embed videos from our official Vimeo channel. When you press play, Vimeo will drop third party cookies to enable the video to play and to see how long a viewer has watched the video. This cookie does not track individuals.
wow.anonymousId	persistent	2 years	This cookie is set by Spotler and tracks an anonymous visitor ID.
wow.schedule	persistent	20 minutes	This cookie is set by Spotler and enables it to track the Load Balance Session Queue.
wow.session	persistent	20 minutes	This cookie is set by Spotler to track the Internet Information Services (IIS) session state.
wow.utmvalues	persistent	20 minutes	This cookie is set by Spotler and stores the UTM values for the session. UTM values are specific text strings that are appended to URLs that allow Communigator to track the URLs and the UTM values when they get clicked on.
_ga	persistent	2 years	This cookie is set by Google Analytics and is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. It stores information anonymously and assign a randomly generated number to identify unique visitors.
_gat	persistent	1 minute	This cookies is set by Google Universal Analytics to throttle the request rate to limit the collection of data on high traffic sites.
_gid	persistent	1 day	This cookie is set by Google Analytics and is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visited in an anonymous form.

Cookie	Type	Duration	Description
cf_ob_info	persistent	1 minute	This cookie is set by Cloudflare content delivery network and, in conjunction with the cookie 'cf_use_ob', is used to determine whether it should continue serving “Always Online” until the cookie expires.
cf_use_ob	persistent	1 minute	This cookie is set by Cloudflare content delivery network and is used to determine whether it should continue serving “Always Online” until the cookie expires.
free_subscription_only	session	1 year	This session cookie is served by our membership/subscription system and controls which types of content you are able to access.
ls_smartpush	persistent	1 month	This cookie is set by Litespeed Server and allows the server to store settings to help improve performance of the site.
one_signal_sdk_db	persistent	Until cleared	This cookie is set by OneSignal push notifications and is used for storing user preferences in connection with their notification permission status.
YSC	session	1 year	This cookie is set by Youtube and is used to track the views of embedded videos.

Recommended

Integrating genomic data from different ancestries reduces bias in predicting disease risk

Leave a Reply Cancel reply

Recommended

Integrating genomic data from different ancestries reduces bias in predicting disease risk

Developing next generation non-replicative HSV-1 vectors for sustainable and more precise gene therapies

Bile acids and the microbiome: revolutionising disease approaches

Determining the cellular architecture of multiple sclerosis lesions

RBM5: potential drug target for acute myeloid leukaemia

The genetic mutation causing the development of psoriatic arthritis

Leave a Reply Cancel reply