#p_boulesteix

DC25

Efficient Computation of Image Persistence

Discrete and Computational Geometry. Oct. 2025

#p_boulesteix

DBB+25

ChatGPT as a Tool for Biostatisticians: A Tutorial on Applications, Opportunities, and Limitations

Statistics in Medicine 44.23-24. Oct. 2025

#p_boulesteix

WHN+25a

Rethinking the Handling of Method Failure in Comparison Studies

Statistics in Medicine. Oct. 2025

#p_boulesteix

BCH+25

Bridging the Gap Between Methodological Research and Statistical Practice: Toward Translational Simulation Research

Preprint (Oct. 2025)

#p_boulesteix

GMR+25

Comparing Supervised Machine Learning Algorithms for the Prediction of Partial Arterial Pressure of Oxygen During Craniotomy

BMC Medical Informatics and Decision Making 25.326. Sep. 2025

#p_boulesteix

Man25

Addressing Researcher Degrees of Freedom in Applications, Methodological Research, and Teaching

Dissertation LMU München. Jul. 2025

#p_boulesteix

MBB+25

Outlier Detection in Mendelian Randomization

Statistics in Medicine 44.15-17. Jul. 2025

#p_boulesteix

SLT+25

Statistical Parametric Simulation Studies Based on Real Data

Preprint (Apr. 2025)

#p_boulesteix

HNS+25

Evaluating Machine Learning Models in Non-Standard Settings: An Overview and New Findings

Statistical Science. Mar. 2025. To be published. Preprint available.

#p_bischl #p_boulesteix

LWH+25

On 'Confirmatory' Methodological Research in Statistics and Related Fields

Preprint (Mar. 2025)

#p_boulesteix

MWW+25

The Impact of the Storytelling Fallacy on Real Data Examples in Methodological Research

Preprint (Mar. 2025)

#p_boulesteix

RED+25

Addressing Complex Structures of Measurement Error Arising in the Exposure Assessment in Occupational Epidemiology Using a Bayesian Hierarchical Approach

Preprint (Mar. 2025)

#p_boulesteix #p_rueckert

WSH+25

To Tweak or Not to Tweak. How Exploiting Flexibilities in Gene Set Analysis Leads to Over-Optimism

Biometrical Journal 67.1. Feb. 2025

#p_boulesteix

ABB+25

Data-Driven Simulations to Assess the Impact of Study Imperfections in Time-to-Event Analyses

American Journal of Epidemiology 194.1. Jan. 2025

#p_boulesteix

SFN+25

Constructing Confidence Intervals for 'The' Generalization Error – A Comprehensive Benchmark Study

Journal of Data-Centric Machine Learning Research 2.6. Jan. 2025. To be published. Preprint available.

#p_bischl #p_boulesteix #p_nagler

SBH+24a

Beyond Algorithm Hyperparameters: On Preprocessing Hyperparameters and Associated Pitfalls in Machine Learning Applications

Preprint (Dec. 2024)

#p_boulesteix

WPM+24a

Point-of-Care Breath Sample Analysis by Semiconductor-Based E-Nose Technology Discriminates Non-Infected Subjects From SARS-CoV-2 Pneumonia Patients: A Multi-Analyst Experiment

MedComm 5.11. Nov. 2024

#p_boulesteix

BDT+24

Understanding Overfitting in Random Forest for Probability Estimation: A Visualization and Simulation Study

Diagnostic and Prognostic Research 8.14. Sep. 2024

#p_boulesteix

LHM+24a

Does Combining Numerous Data Types in Multi-Omics Data Improve or Hinder Performance in Survival Prediction? Insights From a Large-Scale Benchmark Study

BMC Medical Informatics and Decision Making 24.244. Sep. 2024

#p_boulesteix

HH24

Multi Forests: Variable Importance for Multi-Class Outcomes

Preprint (Sep. 2024)

#p_boulesteix

Her24

Dimensionality and Distance: Curse or Blessing? Geometrical Aspects of Nearest Neighbor Computation in High-Dimensional Data

Statistical Computing 2024

#p_boulesteix

HLE+24

Position: Why We Must Rethink Empirical Research in Machine Learning

ICML 2024

#p_bischl #p_boulesteix #p_feurer #p_huellermeier #p_ruegamer

MBH+24

Addressing Researcher Degrees of Freedom Through MinP Adjustment

BMC Medical Research Methodology 24.152. Jul. 2024

#p_boulesteix

HKS+24a

Enhancing Cluster Analysis via Topological Manifold Learning

Data Mining and Knowledge Discovery 38. Apr. 2024

#p_boulesteix #p_scheipl

CMD+24

TRIPOD+AI Statement: Updated Guidance for Reporting Clinical Prediction Models That Use Regression or Machine Learning Methods

The BMJ 385.e078378. Apr. 2024

#p_boulesteix

WHN+24

On the Handling of Method Failure in Comparison Studies

Preprint (Apr. 2024)

#p_boulesteix

MHB+24

Raising Awareness of Uncertain Choices in Empirical Data Analysis: A Teaching Concept Toward Replicable Research Practices

PLOS Computational Biology 20.3. Mar. 2024

#p_boulesteix

NHU+24

Explaining the Optimistic Performance Evaluation of Newly Proposed Methods: A Cross-Design Validation Experiment

Biometrical Journal 66.1. Jan. 2024

#p_boulesteix

SBM+24

Simulation Studies for Methodological Research in Psychology: A Standardized Template for Planning, Preregistration, and Reporting

Psychological Methods. Advance online publication. Jan. 2024

#p_boulesteix

DVT+24

A Comparison of Hyperparameter Tuning Procedures for Clinical Prediction Models: A Simulation Study

Statistics in Medicine. Jan. 2024

#p_boulesteix

HLH+24

Prediction Approaches for Partly Missing Multi-Omics Covariate Data: A Literature Review and an Empirical Comparison Study

Wiley Interdisciplinary Reviews: Computational Statistics 16.1. Jan. 2024

#p_boulesteix

WSC+24

From RNA Sequencing Measurements to the Final Results: A Practical Guide to Navigating the Choices and Uncertainties of Gene Set Analysis

Wiley Interdisciplinary Reviews: Computational Statistics 16.1. Jan. 2024

#p_boulesteix

GSH24

DCSI – An Improved Measure of Cluster Separability Based on Separation and Connectedness

Preprint (Oct. 2023)

#p_boulesteix #p_scheipl

HSB23

Reproduzierbare Und Replizierbare Forschung

Moderne Verfahren Der Angewandten Statistik. Sep. 2023

#p_boulesteix #p_scheipl

MBD+23

A White Paper on Good Research Practices in Benchmarking: The Case of Cluster Analysis

Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 13.6. Jul. 2023

#p_boulesteix

HPS23

A Geometric Framework for Outlier Detection in High-Dimensional Data

Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery E1491. Apr. 2023

#p_bischl #p_boulesteix #p_scheipl

UBH+23

Over-Optimistic Evaluation and Reporting of Novel Cluster Algorithms: An Illustrative Study

Advances in Data Analysis and Classification 17. Mar. 2023

#p_boulesteix #p_seidl

BBL+23

Hyperparameter Optimization: Foundations, Algorithms, Best Practices, and Open Challenges

Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 13.2. Mar. 2023

#p_bischl #p_boulesteix

UPF+23

Over-Optimism in Unsupervised Microbiome Analysis: Insights From Network Learning and Clustering

PLOS Computational Biology 19.1. Jan. 2023

#p_boulesteix #p_mueller

Ull22

Evaluation of Clustering Results and Novel Cluster Algorithms: A Metascientific Perspective

Dissertation LMU München. Dec. 2022

#p_boulesteix

Her22

Towards More Reliable Machine Learning: Conceptual Insights and Practical Approaches for Unsupervised Manifold Learning and Supervised Benchmark Studies

Dissertation LMU München. Oct. 2022

#p_boulesteix

SHC+22

Critical Appraisal of Artificial Intelligence-Based Prediction Models for Cardiovascular Disease

European Heart Journal 43.31. Aug. 2022

#p_boulesteix

UHB22

Validation of Cluster Analysis Results on Validation Data: A Systematic Framework

Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 12.3. May 2022

#p_boulesteix

NHW+22

Over-Optimism in Benchmark Studies and the Multiplicity of Design and Analysis Options When Interpreting Their Results

Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 12.2. Mar. 2022

#p_bischl #p_boulesteix

HS21

A Geometric Perspective on Functional Outlier Detection

Stats 4.4. Nov. 2021

#p_boulesteix #p_scheipl

SCB+21

Statisticians, Roll Up Your Sleeves! There's a Crisis to Be Solved

Significance 18.4. Aug. 2021

#p_boulesteix

SCD+21

A Computational Reproducibility Study of PLOS ONE Articles Featuring Longitudinal Data Analyses

PLOS One 16.6. Jun. 2021

#p_bischl #p_boulesteix #p_mueller

KHP+21

Examining the Robustness of Observational Associations to Model, Measurement and Sampling Uncertainty With the Vibration of Effects Framework

International Journal of Epidemiology 50.1. Feb. 2021

#p_boulesteix

HS20

Unsupervised Functional Data Analysis via Nonlinear Dimension Reduction

Preprint (Dec. 2020)

#p_boulesteix #p_scheipl

BHC+20

A Replication Crisis in Methodological Research?

Significance 17.5. Oct. 2020

#p_boulesteix

HPH+20

Large-Scale Benchmark Study of Survival Prediction Methods Using Multi-Omics Data

Briefings in Bioinformatics. Aug. 2020

#p_boulesteix

EBB+20

Improved Outcome Prediction Across Data Sources Through Robust Parameter Tuning

Journal of Classification 38. Jul. 2020

#p_bischl #p_boulesteix

SAS+20

Predicting Personality From Patterns of Behavior Collected With Smartphones

Proceedings of the National Academy of Sciences 117.30. Jul. 2020

#p_bischl #p_boulesteix

KMB+20a

Sampling Uncertainty Versus Method Uncertainty: A General Framework With Applications to Omics Biomarker Selection

Biometrical Journal 62.3. May 2020

#p_boulesteix

Her20b

Fda-Ndr: Unsupervised Functional Data Analysis via Nonlinear Dimension Reduction. R Package

2020

#p_boulesteix

Her20a

Manifun: Collection of Functions to Work With Embeddings and Functional Data. R Package

2020

#p_boulesteix

WSC+19

Essential Guidelines for Computational Method Benchmarking

Genome Biology 20.125. Jun. 2019

#p_boulesteix

PBB19

Tunability: Importance of Hyperparameters of Machine Learning Algorithms

Journal of Machine Learning Research 20. Mar. 2019

#p_bischl #p_boulesteix

PWB19

Hyperparameters and Tuning Strategies for Random Forest

Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 9.3. Jan. 2019

#p_boulesteix
