tidymodels
Biostat 203B
1 Overview
- A typical data science project: [figure not reproduced]
tidymodels is an ecosystem for:
- Building and fitting a model;
- Feature engineering: coding qualitative predictors, transforming predictors (e.g., log), extracting key features from raw variables (e.g., getting the day of the week out of a date variable), creating interaction terms, …;
- Evaluating a model using resampling (such as cross-validation);
- Tuning model parameters.
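The four tasks above can be sketched in one short pipeline. This is a minimal illustration, not the course's own analysis. It assumes the `tidymodels` and `glmnet` packages are installed, and it uses the built-in `mtcars` data (with transmission type `am` as a binary outcome) purely as a stand-in. The choice of predictors, the penalty grid, and the object names (`cars`, `rec`, `mod`, `wf`, `folds`) are likewise illustrative assumptions.

```r
library(tidymodels)

set.seed(203)

# Stand-in data (assumption): mtcars, with `am` recoded as a two-level factor
cars <- mtcars |>
  mutate(am = factor(am, levels = c(0, 1), labels = c("automatic", "manual")))

# Feature engineering: log-transform one predictor, then center and scale all
rec <- recipe(am ~ mpg + hp + wt, data = cars) |>
  step_log(hp) |>
  step_normalize(all_numeric_predictors())

# Model: lasso-penalized logistic regression; the penalty is left to be tuned
mod <- logistic_reg(penalty = tune(), mixture = 1) |>
  set_engine("glmnet")

# Bundle the recipe and the model into a single workflow
wf <- workflow() |>
  add_recipe(rec) |>
  add_model(mod)

# Resampling: 5-fold cross-validation, stratified by the outcome
folds <- vfold_cv(cars, v = 5, strata = am)

# Tuning: evaluate 5 penalty values (range is on the log10 scale) on each fold
grid <- grid_regular(penalty(range = c(-3, 0)), levels = 5)
tuned <- tune_grid(
  wf,
  resamples = folds,
  grid = grid,
  metrics = metric_set(roc_auc, accuracy)
)

# Pick the penalty with the best cross-validated AUC and refit on all the data
best <- select_best(tuned, metric = "roc_auc")
final_fit <- wf |>
  finalize_workflow(best) |>
  fit(data = cars)
```

Calling `collect_metrics(tuned)` would then show the cross-validated AUC and accuracy for each candidate penalty. With only 32 rows, the estimates here are noisy; the point is the shape of the workflow rather than the numbers.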
2 Heart data example
We illustrate binary classification using the heart disease dataset from the Cleveland Clinic Foundation.