AUC-RF: A New Strategy for Genomic Profiling with Random Forest Medizin - Open Access LMU - Teil 17/22

    • Educación

Objective: Genomic profiling, the use of genetic variants at multiple loci simultaneously for the prediction of disease risk, requires the selection of a set of genetic variants that best predicts disease status. The goal of this work was to provide a new selection algorithm for genomic profiling. Methods: We propose a new algorithm for genomic profiling based on optimizing the area under the receiver operating characteristic curve (AUC) of the random forest (RF). The proposed strategy implements a backward elimination process based on the initial ranking of variables. Results and Conclusions: We demonstrate the advantage of using the AUC instead of the classification error as a measure of predictive accuracy of RF. In particular, we show that the use of the classification error is especially inappropriate when dealing with unbalanced data sets. The new procedure for variable selection and prediction, namely AUC-RF, is illustrated with data from a bladder cancer study and also with simulated data. The algorithm is publicly available as an R package, named AUCRF, at http://cran.r-project.org/. Copyright (C) 2011 S. Karger AG, Basel

Objective: Genomic profiling, the use of genetic variants at multiple loci simultaneously for the prediction of disease risk, requires the selection of a set of genetic variants that best predicts disease status. The goal of this work was to provide a new selection algorithm for genomic profiling. Methods: We propose a new algorithm for genomic profiling based on optimizing the area under the receiver operating characteristic curve (AUC) of the random forest (RF). The proposed strategy implements a backward elimination process based on the initial ranking of variables. Results and Conclusions: We demonstrate the advantage of using the AUC instead of the classification error as a measure of predictive accuracy of RF. In particular, we show that the use of the classification error is especially inappropriate when dealing with unbalanced data sets. The new procedure for variable selection and prediction, namely AUC-RF, is illustrated with data from a bladder cancer study and also with simulated data. The algorithm is publicly available as an R package, named AUCRF, at http://cran.r-project.org/. Copyright (C) 2011 S. Karger AG, Basel

Top podcasts de Educación

Dr. Mario Alonso Puig
Mario Alonso Puig
kaizen con Jaime Rodríguez de Santiago
Jaime Rodríguez de Santiago
BBVA Aprendemos juntos 2030
BBVA Podcast
Black Mango Podcast
Black Mango
Inglés desde cero
Daniel
Learning English Conversations
BBC Radio

Más de Ludwig-Maximilians-Universität München

MCMP
MCMP Team
LMU Grundkurs Strafrecht I (L-Z) WS 2014/15
Prof. Dr. jur. Helmut Satzger
GK Strafrecht II (A-K) SoSe 2020 Satzger
Helmut Satzger
Podcast Jüdische Geschichte
Abteilung für Jüdische Geschichte und Kultur, LMU München
Forum Kunstgeschichte Italiens (LMU)
Prof. Dr. Ulrich Pfisterer, Dr. Matteo Burioni
LMU Fakultät für Philosophie, Wissenschaftstheorie und Religionswissenschaft - Vorlesungen und Vorträge
Professoren der Fakultät für Philosophie, Wissenschaftstheorie und Religionswissenschaft