How to Set Up a Simple ETL Pipeline with AWS Lambda for Data Science
How to set up a simple ETL pipeline with AWS Lambda that can be triggered via an API endpoint or a schedule and write the results to an S3...
Posted by
Brian Roepke
on Tue 07 February 2023
A Real-World Approach with XGBoost on Marketing Attribution Data
Leverage Lassoo for ML-ready marketing data to predict which users will convert to paid. From feature engineering to model building, and...
Posted by
Brian Roepke
on Tue 31 January 2023
5-10x Faster Hyperparameter Tuning with HalvingGridSearch
How to optimize the hyperparameters of a machine learning model and how to speed up the process.
Posted by
Brian Roepke
on Sat 05 March 2022
2 Beautiful Ways to Visualize PCA
Plus model tuning and evaluation to select the best number of components.
Posted by
Brian Roepke
on Sat 26 February 2022
How to Set Up AWS EMR and Jupyter Notebooks Without Breaking the Bank
Deploy a Distributed Computing Environment in Minutes with AWS
Posted by
Brian Roepke
on Sat 05 February 2022
Don’t Get Caught in the Trap of Imbalanced Data When Building Your ML Model
Utilize These Techniques to Bring Balance and Improve Performance.
Posted by
Brian Roepke
on Thu 25 November 2021