Skip to main content The Hungry Cat

Posts on Optuna

  1. XGBoost Dask API on yarn with hyperparameter optimization

    Purpose

    A practitioner’s guide to implementing a XGBoost model with hyperparameter optimization using the Dask API for distributed computing. This article will limit its scope to a working script running on a yarn cluster. A official guide on the Dask API for XGBoost is here. We will limit this to simply building the model and finding the best hyperparameters for a XGBoost classification model. We can extend this to a XGBoost regression model as well.