ChatGPT as a Tool for Biostatisticians: A Tutorial on Applications, Opportunities, and Limitations
Statistics in Medicine 44.23-24. Oct. 2025
#p_boulesteix
WHN+25a
Rethinking the Handling of Method Failure in Comparison Studies
Statistics in Medicine. Oct. 2025
#p_boulesteix
BCH+25
Bridging the Gap Between Methodological Research and Statistical Practice: Toward Translational Simulation Research
Preprint (Oct. 2025)
#p_boulesteix
GMR+25
Comparing Supervised Machine Learning Algorithms for the Prediction of Partial Arterial Pressure of Oxygen During Craniotomy
BMC Medical Informatics and Decision Making 25.326. Sep. 2025
#p_boulesteix
Man25
Addressing Researcher Degrees of Freedom in Applications, Methodological Research, and Teaching
Dissertation LMU München. Jul. 2025
#p_boulesteix
MBB+25
Outlier Detection in Mendelian Randomization
Statistics in Medicine 44.15-17. Jul. 2025
#p_boulesteix
SLT+25
Statistical Parametric Simulation Studies Based on Real Data
Preprint (Apr. 2025)
#p_boulesteix
HNS+25
Evaluating Machine Learning Models in Non-Standard Settings: An Overview and New Findings
Statistical Science. Mar. 2025. To be published. Preprint available
#p_bischl#p_boulesteix
LWH+25
On 'Confirmatory' Methodological Research in Statistics and Related Fields
Preprint (Mar. 2025)
#p_boulesteix
MWW+25
The Impact of the Storytelling Fallacy on Real Data Examples in Methodological Research
Preprint (Mar. 2025)
#p_boulesteix
RED+25
Addressing Complex Structures of Measurement Error Arising in the Exposure Assessment in Occupational Epidemiology Using a Bayesian Hierarchical Approach
Preprint (Mar. 2025)
#p_boulesteix#p_rueckert
WSH+25
To Tweak or Not to Tweak. How Exploiting Flexibilities in Gene Set Analysis Leads to Over-Optimism
Biometrical Journal 67.1. Feb. 2025
#p_boulesteix
ABB+25
Data-Driven Simulations to Assess the Impact of Study Imperfections in Time-to-Event Analyses
American Journal of Epidemiology 194.1. Jan. 2025
#p_boulesteix
SFN+25
Constructing Confidence Intervals for 'The' Generalization Error – A Comprehensive Benchmark Study
Journal of Data-Centric Machine Learning Research 2.6. Jan. 2025. To be published. Preprint available
#p_bischl#p_boulesteix#p_nagler
SBH+24a
Beyond Algorithm Hyperparameters: On Preprocessing Hyperparameters and Associated Pitfalls in Machine Learning Applications
Preprint (Dec. 2024)
#p_boulesteix
WPM+24a
Point-of-Care Breath Sample Analysis by Semiconductor-Based E-Nose Technology Discriminates Non-Infected Subjects From SARS-CoV-2 Pneumonia Patients: A Multi-Analyst Experiment
MedComm 5.11. Nov. 2024
#p_boulesteix
BDT+24
Understanding Overfitting in Random Forest for Probability Estimation: A Visualization and Simulation Study
Diagnostic and Prognostic Research 8.14. Sep. 2024
#p_boulesteix
LHM+24a
Does Combining Numerous Data Types in Multi-Omics Data Improve or Hinder Performance in Survival Prediction? Insights From a Large-Scale Benchmark Study
BMC Medical Informatics and Decision Making 24.244. Sep. 2024
#p_boulesteix
HH24
Multi Forests: Variable Importance for Multi-Class Outcomes
Preprint (Sep. 2024)
#p_boulesteix
Her24
Dimensionality and Distance: Curse or Blessing? Geometrical Aspects of Nearest Neighbor Computation in High-Dimensional Data
Statistical Computing 2024
#p_boulesteix
HLE+24
Position: Why We Must Rethink Empirical Research in Machine Learning
DCSI–An Improved Measure of Cluster Separability Based on Separation and Connectedness
Preprint (Oct. 2023)
#p_boulesteix#p_scheipl
HSB23
Reproduzierbare und replizierbare Forschung
Moderne Verfahren der Angewandten Statistik. Sep. 2023
#p_boulesteix#p_scheipl
MBD+23
A White Paper on Good Research Practices in Benchmarking: The Case of Cluster Analysis
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 13.6. Jul. 2023
#p_boulesteix
HPS23
A Geometric Framework for Outlier Detection in High-Dimensional Data
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery e1491. Apr. 2023
#p_bischl#p_boulesteix#p_scheipl
UBH+23
Over-Optimistic Evaluation and Reporting of Novel Cluster Algorithms: An Illustrative Study
Advances in Data Analysis and Classification 17. Mar. 2023
#p_boulesteix#p_seidl
BBL+23
Hyperparameter Optimization: Foundations, Algorithms, Best Practices, and Open Challenges
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 13.2. Mar. 2023
#p_bischl#p_boulesteix
UPF+23
Over-Optimism in Unsupervised Microbiome Analysis: Insights From Network Learning and Clustering
PLOS Computational Biology 19.1. Jan. 2023
#p_boulesteix#p_mueller
Ull22
Evaluation of Clustering Results and Novel Cluster Algorithms: A Metascientific Perspective
Dissertation LMU München. Dec. 2022
#p_boulesteix
Her22
Towards More Reliable Machine Learning: Conceptual Insights and Practical Approaches for Unsupervised Manifold Learning and Supervised Benchmark Studies
Dissertation LMU München. Oct. 2022
#p_boulesteix
SHC+22
Critical Appraisal of Artificial Intelligence-Based Prediction Models for Cardiovascular Disease
European Heart Journal 43.31. Aug. 2022
#p_boulesteix
UHB22
Validation of Cluster Analysis Results on Validation Data: A Systematic Framework
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 12.3. May 2022
#p_boulesteix
NHW+22
Over-Optimism in Benchmark Studies and the Multiplicity of Design and Analysis Options When Interpreting Their Results
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 12.2. Mar. 2022
#p_bischl#p_boulesteix
HS21
A Geometric Perspective on Functional Outlier Detection
Stats 4.4. Nov. 2021
#p_boulesteix#p_scheipl
SCB+21
Statisticians, Roll Up Your Sleeves! There's a Crisis to Be Solved
Significance 18.4. Aug. 2021
#p_boulesteix
SCD+21
A Computational Reproducibility Study of PLOS ONE Articles Featuring Longitudinal Data Analyses
PLOS ONE 16.6. Jun. 2021
#p_bischl#p_boulesteix#p_mueller
KHP+21
Examining the Robustness of Observational Associations to Model, Measurement and Sampling Uncertainty With the Vibration of Effects Framework
International Journal of Epidemiology 50.1. Feb. 2021
#p_boulesteix
HS20
Unsupervised Functional Data Analysis via Nonlinear Dimension Reduction
Preprint (Dec. 2020)
#p_boulesteix#p_scheipl
BHC+20
A Replication Crisis in Methodological Research?
Significance 17.5. Oct. 2020
#p_boulesteix
HPH+20
Large-Scale Benchmark Study of Survival Prediction Methods Using Multi-Omics Data
Briefings in Bioinformatics. Aug. 2020
#p_boulesteix
EBB+20
Improved Outcome Prediction Across Data Sources Through Robust Parameter Tuning
Journal of Classification 38. Jul. 2020
#p_bischl#p_boulesteix
SAS+20
Predicting Personality From Patterns of Behavior Collected With Smartphones
Proceedings of the National Academy of Sciences 117.30. Jul. 2020
#p_bischl#p_boulesteix
KMB+20a
Sampling Uncertainty Versus Method Uncertainty: A General Framework With Applications to Omics Biomarker Selection
Biometrical Journal 62.3. May. 2020
#p_boulesteix
Her20b
fda-ndr: Unsupervised Functional Data Analysis via Nonlinear Dimension Reduction. R Package
2020
#p_boulesteix
Her20a
manifun: Collection of Functions to Work With Embeddings and Functional Data. R Package
2020
#p_boulesteix
WSC+19
Essential Guidelines for Computational Method Benchmarking
Genome Biology 20.125. Jun. 2019
#p_boulesteix
PBB19
Tunability: Importance of Hyperparameters of Machine Learning Algorithms
Journal of Machine Learning Research 20. Mar. 2019
#p_bischl#p_boulesteix
PWB19
Hyperparameters and Tuning Strategies for Random Forest
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 9.3. Jan. 2019