We examined the potential gene gene relationships and gene smoking relationships

We examined the potential gene gene relationships and gene smoking relationships in rheumatoid arthritis (RA) using the candidate gene data units provided by Genetic Analysis Workshop 15 Problem 2. race (Native American), woman gender, obesity, old age, and smoking [1,2]. However, like most complex diseases, few studies of gene gene connection and gene environmental connection have been performed because a large sample size is required to identify such effects in traditional statistical paradigms. Logistic regression is commonly used in detecting interactive effects between genes or environmental factors in epidemiologic studies. However, the guidelines cannot be accurately estimated when there are many self-employed variables while the sample size is not large enough [3]. Recently, Ritchie et al. [4] launched a multifactor dimensionality reduction (MDR) method for identifying gene gene connection or gene environmental connection to conquer this limitation of traditional logistic regression [3-5]. This approach enumerates all possible mixtures of genotype or environmental factors associated with high risk and low risk of disease, and it may enable us to find relationships between genes in the absence of main effects [3-5]. To detect potential epistasis in RA, we evaluated 1) disease associations using solitary SNPs (single-nucleotide polymorphisms) from 15 candidate genes and haplotypes of the PTPN22 gene, 2) gene gene relationships among the candidate genes using the MDR method and logistic regression, and 3) gene environmental (smoking) relationships using a case-only study design. Methods Materials The data units for the candidate gene studies of RA were provided by Genetic Analysis Workshop 15 (GAW15) Problem 2. There were two case-control data units. The 1st one included 855 unrelated settings and 839 instances, as well as genotype data on 20 SNPs from 15 candidate genes, which were selected from previously published associations with RA or additional autoimmune disorders by Plenge et al. [6]. The second data arranged included 1519 unrelated settings and 1393 instances, and genotype data on 14 SNPs from your PTPN22 gene. Additional phenotype data, including smoking history, age of onset, sex, and body mass index, were available for instances only GSK1904529A in both data sets. There were 408 and 720 affected sibling pairs among instances in the two data units, respectively. Statistical analysis Solitary SNP and haplotype (PTPN22 only) associations with disease status were first evaluated. To account for the dependency among family members, the generalized estimating equations methods GSK1904529A (GEE1) [7] as implemented in the GENMOD process of SAS 9.0 was utilized in the association analysis by using family as the cluster element, i.e., users from your same family were assumed to be correlated and those from different family members were assumed to be self-employed. The haplotype block structure GSK1904529A of PTPN22 was evaluated by Haploview [8]. Individual haplotypes were reconstructed using the PHASE 2.0 by assigning each haplotype with maximum probability [9]. Seventy-four percent of haplotype GATA1 projects experienced probabilities of 100% and 93% experienced probabilities of 80% or better. Individuals whose haplotype task had probability below 80% were excluded from subsequent analysis. Association analysis was carried out for each common haplotype in turn. For each haplotype, a dominating model was assumed, i.e., service providers of the particular haplotype versus non-carriers were compared for his or her RA status. To test gene gene relationships, MDR was used to determine the genetic model that could most successfully predict the disease status or phenotype from several loci. SNP rs2240340 within the PADI4 gene was excluded from analysis due to its large amount of missing data. One thousand three hundred and thirty case-control samples with completed marker data on 19 GSK1904529A SNPs from 14 candidate genes were utilized in the MDR analysis. Cross-validation (CV) regularity and balanced accuracy estimates were determined for each combination of a pool of genetic polymorphisms. The model with the highest accuracy and maximal CV was considered to be the best [5]. We identified statistical significance by comparing the accuracy of the observed data with the distribution of accuracy under the null hypothesis of no associations derived empirically from 1000 replicates of permutations [10]. The null hypothesis was declined when the p-value derived from the permutation test was 0.05 or less. Like a follow-up, logistic regression analysis was carried out if there was suggestive connection. We also examined the connection between SNPs and smoking history in RA instances. The logistic function in the GENMOD process was used to quantify departure from multiplicativity..

Posted in My Blog

Tags: ,


Comments are closed.