Logo

Publikacije (174)

Nazad
E. Makalic, D. Schmidt

Bayesian penalized regression techniques, such as the Bayesian lasso and the Bayesian horseshoe estimator, have recently received a significant amount of attention in the statistics literature. However, software implementing state-of-the-art Bayesian penalized regression, outside of general purpose Markov chain Monte Carlo platforms such as STAN, is relatively rare. This paper introduces bayesreg, a new toolbox for fitting Bayesian penalized regression models with continuous shrinkage prior densities. The toolbox features Bayesian linear regression with Gaussian or heavy-tailed error models and Bayesian logistic regression with ridge, lasso, horseshoe and horseshoe$+$ estimators. The toolbox is free, open-source and available for use with the MATLAB and R numerical platforms.

Nicola Barban, R. Jansen, R. Vlaming, Ahmad Vaez, Ahmad Vaez, Jornt J. Mandemakers, Felix C. Tropf, Xia Shen et al.

G. Fehringer, P. Kraft, P. Pharoah, R. Eeles, N. Chatterjee, F. Schumacher, J. Schildkraut, S. Lindström et al.

Identifying genetic variants with pleiotropic associations can uncover common pathways influencing multiple cancers. We took a two-stage approach to conduct genome-wide association studies for lung, ovary, breast, prostate, and colorectal cancer from the GAME-ON/GECCO Network (61,851 cases, 61,820 controls) to identify pleiotropic loci. Findings were replicated in independent association studies (55,789 cases, 330,490 controls). We identified a novel pleiotropic association at 1q22 involving breast and lung squamous cell carcinoma, with eQTL analysis showing an association with ADAM15/THBS3 gene expression in lung. We also identified a known breast cancer locus CASP8/ALS2CR12 associated with prostate cancer, a known cancer locus at CDKN2B-AS1 with different variants associated with lung adenocarcinoma and prostate cancer, and confirmed the associations of a breast BRCA2 locus with lung and serous ovarian cancer. This is the largest study to date examining pleiotropy across multiple cancer-associated loci, identifying common mechanisms of cancer development and progression. Cancer Res; 76(17); 5103-14. ©2016 AACR.

R. MacInnis, D. Schmidt, E. Makalic, G. Severi, L. FitzGerald, M. Reumann, M. Kapuscinski, A. Kowalczyk et al.

Background: We have developed a genome-wide association study analysis method called DEPTH (DEPendency of association on the number of Top Hits) to identify genomic regions potentially associated with disease by considering overlapping groups of contiguous markers (e.g., SNPs) across the genome. DEPTH is a machine learning algorithm for feature ranking of ultra-high dimensional datasets, built from well-established statistical tools such as bootstrapping, penalized regression, and decision trees. Unlike marginal regression, which considers each SNP individually, the key idea behind DEPTH is to rank groups of SNPs in terms of their joint strength of association with the outcome. Our aim was to compare the performance of DEPTH with that of standard logistic regression analysis. Methods: We selected 1,854 prostate cancer cases and 1,894 controls from the UK for whom 541,129 SNPs were measured using the Illumina Infinium HumanHap550 array. Confirmation was sought using 4,152 cases and 2,874 controls, ascertained from the UK and Australia, for whom 211,155 SNPs were measured using the iCOGS Illumina Infinium array. Results: From the DEPTH analysis, we identified 14 regions associated with prostate cancer risk that had been reported previously, five of which would not have been identified by conventional logistic regression. We also identified 112 novel putative susceptibility regions. Conclusions: DEPTH can reveal new risk-associated regions that would not have been identified using a conventional logistic regression analysis of individual SNPs. Impact: This study demonstrates that the DEPTH algorithm could identify additional genetic susceptibility regions that merit further investigation. Cancer Epidemiol Biomarkers Prev; 25(12); 1619–24. ©2016 AACR.

F. Couch, K. Kuchenbaecker, K. Michailidou, Gustavo A. Mendoza-Fandino, Silje Nord, Janna Lilyquist, Curtis L. Olswold, Emily J. Hallberg et al.

J. Hopper, T. Nguyen, J. Stone, K. Aujard, M. Matheson, M. Abramson, J. Burgess, E. Walters et al.

M. Jenkins, E. Makalic, J. Dowty, D. Schmidt, G. Dite, R. MacInnis, D. Ait Ouakrim, M. Clendenning et al.

AIM To determine whether single nucleotide polymorphisms (SNPs) can be used to identify people who should be screened for colorectal cancer. METHODS We simulated one million people with and without colorectal cancer based on published SNP allele frequencies and strengths of colorectal cancer association. We estimated 5-year risks of colorectal cancer by number of risk alleles. RESULTS We identified 45 SNPs with an average 1.14-fold increase colorectal cancer risk per allele (range: 1.05-1.53). The colorectal cancer risk for people in the highest quintile of risk alleles was 1.81-times that for the average person. CONCLUSION We have quantified the extent to which known susceptibility SNPs can stratify the population into clinically useful colorectal cancer risk categories.

Louise B. Thingholm, L. Andersen, E. Makalic, M. Southey, M. Thomassen, Lise Lotte Hansen

The development and progression of cancer, a collection of diseases with complex genetic architectures, is facilitated by the interplay of multiple etiological factors. This complexity challenges the traditional single-platform study design and calls for an integrated approach to data analysis. However, integration of heterogeneous measurements of biological variation is a non-trivial exercise due to the diversity of the human genome and the variety of output data formats and genome coverage obtained from the commonly used molecular platforms. This review article will provide an introduction to integration strategies used for analyzing genetic risk factors for cancer. We critically examine the ability of these strategies to handle the complexity of the human genome and also accommodate information about the biological and functional interactions between the elements that have been measured—making the assessment of disease risk against a composite genomic factor possible. The focus of this review is to provide an overview and introduction to the main strategies and to discuss where there is a need for further development.

R. MacInnis, D. Schmidt, E. Makalic, G. Severi, M. Liesel, Fitzgerald, M. Reumann, M. Kapuscinski et al.

Background : We have developed a GWAS analysis method called DEPTH (DEPendency of association on the number of Top Hits) to identify genomic regions potentially associated with disease by considering overlapping groups of contiguous markers (e.g. single nucleotide polymorphisms, SNPs) across the genome. DEPTH is a machine learning algorithm for feature ranking of ultra-high dimensional datasets, built from well-established statistical tools such as bootstrapping, penalised regression and decision trees. Unlike marginal regression, which considers each SNP individually, the key idea behind DEPTH is to rank groups of SNPs in terms of their joint strength of association with the outcome. Our aim was to compare the performance of DEPTH with that of standard logistic regression analysis. Methods : We selected 1,854 prostate cancer cases and 1,894 controls from the UK for whom 541,129 SNPs were measured using the Illumina Infinium HumanHap550 array. Confirmation was sought using 4,152 cases and 2,874 controls, ascertained from the UK and Australia, for whom 211,155 SNPs were measured using the iCOGS Illumina Infinium array. Results : From the DEPTH analysis we identified 14 regions associated with prostate cancer risk that had been reported previously; five of which would not have been identified by conventional logistic regression. We also identified 112 novel putative susceptibility regions. Conclusions : DEPTH can reveal new risk-associated regions that would not have been identified using a conventional logistic regression analysis of individual SNPs. Impact : This study demonstrates that the DEPTH algorithm could identify additional genetic susceptibility regions that merit further investigation.

A. Cox, S. Cross, J. Cunningham, K. Czene, M. Daly, F. Damiola, Hatef Darabi, M. Hoya et al.

Common variants in 94 loci have been associated with breast cancer including 15 loci with genome-wide significant associations ( P o 5 (cid:2) 10 (cid:3) 8 ) with oestrogen receptor (ER)-negative breast cancer and BRCA1- associated breast cancer risk. In this study, to identify new ER-negative susceptibility loci, we performed a meta-analysis of 11 genome-wide association studies (GWAS) consisting of 4,939 ER-negative cases and 14,352 controls, combined with 7,333 ER-negative cases and 42,468 controls and 15,252 BRCA1 mutation carriers genotyped on the iCOGS array. We identify four previously unidentified loci including two loci at 13q22 near KLF5 , a 2p23.2 locus near WDR43 and a 2q33 locus near PPIL3 that display genome-wide significant associations with ER-negative breast cancer. In addition, 19 known breast cancer risk loci have genome-wide significant associations and 40 had moderate associations ( P o 0.05) with ER-negative disease. Using functional and eQTL studies we implicate TRMT61B and WDR43 at 2p23.2 and PPIL3 at 2q33 in ER-negative breast cancer aetiology. All ER-negative loci combined account for B 11% of familial relative risk for ER-negative disease and may contribute to improved ER-negative and BRCA1 breast cancer risk prediction.

Nema pronađenih rezultata, molimo da izmjenite uslove pretrage i pokušate ponovo!

Pretplatite se na novosti o BH Akademskom Imeniku

Ova stranica koristi kolačiće da bi vam pružila najbolje iskustvo

Saznaj više