Gewählte Publikation:
SHR
Neuro
Krebs
Kardio
Lipid
Stoffw
Microb
Pfeifer, B; Holzinger, A; Schimek, MG.
Robust Random Forest-Based All-Relevant Feature Ranks for Trustworthy AI.
Stud Health Technol Inform. 2022; 294: 137-138.
Doi: 10.3233/SHTI220418
PubMed
FullText
FullText_MUG
- Führende Autor*innen der Med Uni Graz
-
Pfeifer Bastian
- Co-Autor*innen der Med Uni Graz
-
Holzinger Andreas
-
Schimek Michael
- Altmetrics:
- Dimensions Citations:
- Plum Analytics:
- Scite (citation analytics):
- Abstract:
- Feature selection is a fundamental challenge in machine learning. For instance in bioinformatics, it is essential when one wishes to detect biomarkers. Tree-based methods are predominantly used for this purpose. In this paper, we study the stability of the feature selection methods BORUTA, VITA, and RRF (regularized random forest). In particular, we investigate the feature ranking instability of the associated stochastic algorithms. For stabilization of the feature ranks, we propose to compute consensus values from multiple feature selection runs, applying rank aggregation techniques. Our results show that these consolidated features are more accurate and robust, which helps to make practical machine learning applications more trustworthy.
- Find related publications in this database (using NLM MeSH Indexing)
-
Algorithms - administration & dosage
-
Biomarkers - administration & dosage
-
Computational Biology - methods
-
Machine Learning - administration & dosage