1000 Genomes Project team create largest catalogue of genomic differences

Posted: 2 October 2015 | Victoria White

Understanding how genomic variants contribute to disease may help clinicians develop improved diagnostics and treatments, in addition to new methods of prevention…

An international team of scientists from the 1000 Genomes Project Consortium has created the world’s largest catalogue of genomic differences among humans, providing researchers with powerful clues to help them establish why some people are susceptible to various diseases.

In two studies, investigators examined the genomes of 2,504 people from 26 populations across Africa, East and South Asia, Europe and the Americas.

In the main study, investigators identified about 88 million sites in the human genome that vary among people, establishing a database available to researchers as a standard reference for how the genomic make-up of people varies in populations and around the world. The catalogue more than doubles the number of known variant sites in the human genome, and can now be used in a wide range of studies of human biology and medicine.

“The 1000 Genomes Project was an ambitious, historically significant effort that has produced a valuable resource about human genomic variation,” said Eric Green, M.D., Ph.D., director of the US National Human Genome Research Institute (NHGRI). “The latest data and insights add to a growing understanding of the patterns of variation in individuals’ genomes, and provide a foundation for gaining greater insights into the genomics of human disease.”

“Some 88 million sites in the genome differ among people. About one-quarter of these variants are common and occur in many or all populations, while about three-quarters occur in only 1% of people or are even more rare,” said Lisa Brooks, Ph.D., programme director in the NHGRI Genomic Variation Programme. “The 1000 Genomes Project data are a resource for any study in which scientists are looking for genomic contributions to disease, including the study of both common and rare variants.”

Scientists can use the 1000 Genomes Project data to home in on regions affecting disease

One of the more immediate uses of 1000 Genomes Project data is for genome-wide association studies (GWAS), which compare the genomes of people with and without a disease to search for regions of the genome that contain genomic variants associated with that disease. Such studies generally find several genomic regions associated with a disease and many variants in each of those regions. Scientists can now combine GWAS data with the more detailed 1000 Genomes Project data to home in on regions affecting disease more precisely. Instead of sequencing the genomes of all the people in a study, which remains expensive, researchers can use the 1000 Genomes Project data to find most of the variants in those regions that are associated with the disease.

In the second study, scientists examined differences in the structure of the genome in the 2,504 samples. They found nearly 69,000 differences, known as structural variants. The researchers created a map of eight classes of structural variants that potentially contribute to disease.

“Structural variation is responsible for a large percentage of differences in the DNA among human genomes,” said Jan Korbel, Ph.D., group leader and European Research Council Investigator in the Genome Biology Unit of the European Molecular Biology Laboratory in Heidelberg, Germany. “No study has ever looked at genomic structural variation with this kind of broad representation of populations around the world.”

Dr Korbel and colleagues discovered that structural variants were often more complicated than they originally thought. For example, most inversions, in which a DNA sequence changes its orientation in the genome, occur alongside other structural changes.

The 1000 Genomes Project team developed new methods for large-scale DNA sequencing

To Gonçalo Abecasis, Ph.D., chair of biostatistics at the University of Michigan in Ann Arbor and co-principal investigator for the main study, the value of the 1000 Genomes Project extends far beyond the data. Advances in DNA sequencing and bioinformatics were vital to completing the project.

“We’ve learned a great deal about how to do genomics on a large scale,” said Dr Abecasis. “Over the course of the 1000 Genomes Project, we developed new, improved methods for large-scale DNA sequencing, analysis and interpretation of genomic information, in addition to how to store this much data. We learned how to do quality genomic studies in different contexts and parts of the world.”
