Suitability of Some Data Quality Controls Thresholds for Genetic Association Studies of Admixed Population
|The safety and scientific validity of this study is the responsibility of the study sponsor and investigators. Listing a study does not mean it has been evaluated by the U.S. Federal Government. Read our disclaimer for details.|
|ClinicalTrials.gov Identifier: NCT02770001|
Recruitment Status : Withdrawn
First Posted : May 12, 2016
Last Update Posted : September 18, 2018
Background: In genetic studies, the quality of DNA samples is tested first. Samples that are low-quality are not used. Some studies involve minority ethnic groups. And example is admixed African American. These studies often have small sample sizes. It is important to make sure samples are not discarded unnecessarily. This may happen by using quality control (QC) thresholds for homogenous groups. These may not be appropriate for an admixed group. Researchers want to study samples that failed certain QC tests. They want to see if this has to do with the ancestry of the outliers or the quality of the samples.
To study samples that fail heterozygosity and sample genotype call rate QC. To see if the failing rates have to do with the ancestry composition of the outliers or the quality of the samples.
No new participants. Researchers will review data that has already been collected.
Researchers will study DNA samples in a lab.
The samples will not include data that can identify the person the sample came from.
|Condition or disease|
|African American Genetics|
The purpose of this IRB proposal is to gain access to genetic data generated from participants of publically-funded genomic studies and deposited into dbGaP. It is our intention to use dbGaP data to conduct secondary analysis of the influence of admixture on the outcome of data quality control (QC) in genetic association studies to inform future studies of the optimal QC metric for the genetic association analysis of admixed population. The data we intend to use is deposited in dbGaP and is from the Michigan University Health and Retirement Study (HRS). This protocol is being sent to you because of a dbGaP requirement for this specific de-identified dataset to be reviewed by an IRB as that was not in the protocol.
This work will require the use of statistical analyses tools to estimate the genetic ancestry make-up of each sample, from the genotype data, and determine how that ancestry relates to QC outcomes (i.e. whether or not the sample might be excluded from an analysis due to its ancestral genetic composition rather that the sample genetic quality).
Objectives and specific aims: This work aims to investigate samples failing heterozygosity and sample genotype call rate quality control (QC) to determine whether or not the samples call rate and heterozygosity rate have to do with the ancestry composition of the outliers rather than the quality of the samples and inform future studies of potential loss if general QC is applied to genetic data of admixed sample sets.
Rationale and Background: In genetic association studies DNA sample quality can vary largely across study participants and such variation has an impact on genotype call rate and genotype accuracy; samples of low DNA quality tend to have lower genotype call rate and genotype accuracy. Heterozygosity rate (proportion of heterozygous loci per individual) and genotype failure rate (proportion of missing genotypes per individual) are jointly and routinely used to identify samples with low DNA quality at the data quality control (QC) stage of genetic association studies. Excessive heterozygosity rate may indicate sample contamination whilst a reduced heterozygosity rate could indicate inbreeding . Samples with 3-7% [2,3] genotype call-rate and heterozygosity > 2-3 standard deviations from the mean heterozygosity are usually excluded from genetic case-control studies.
Generally, the sample size of genetic association studies involving minority ethnic groups such as admixed African American tends to be small. It is hence important to ensure samples are not discarded unnecessarily, resulting into reduced statistical power, by using QC thresholds applied to homogenous groups which might not be appropriate for an admixed sample set. The aim of this analysis is to investigate samples failing heterozygosity and sample genotype call rate QC to determine whether or not the samples call rate and heterozygosity rate have to do with the ancestry composition of the outliers rather than the quality of the samples. The motivation is to inform future studies of potential loss if general
|Study Type :||Observational|
|Actual Enrollment :||0 participants|
|Official Title:||Investigating the Suitability of Some Data Quality Controls Thresholds for Genetic Association Studies of Admixed Population|
|Study Start Date :||May 5, 2016|
|Actual Primary Completion Date :||May 15, 2017|
|Actual Study Completion Date :||May 15, 2017|
- Continental genetic ancestry fraction [ Time Frame: Study Completion ]
- Heterozygosity rate [ Time Frame: Study Completion ]
To learn more about this study, you or your doctor may contact the study research staff using the contact information provided by the sponsor.
Please refer to this study by its ClinicalTrials.gov identifier (NCT number): NCT02770001
|Principal Investigator:||Sharon K Davis||National Human Genome Research Institute (NHGRI)|