Recently Published
How to face a majority class greater than a minority class in a classification predictive modeling: sampling methods with H2O
Overview of issues about imbalanced data set
DocumentHow to face a majority class greater than a minority class in a classification predictive modeling: Cost Sensitive models with Caret
Overview of issues about imbalanced data set
How to face a majority class greater than a minority class in a classification predictive modeling: sampling methods with Caret
Overview of issues about imbalanced data set
How to face a majority class greater than a minority class in a classification predictive modeling: baseline models overview
Overview of issues about imbalanced data set
Song Analysis
Applied NLP and LDA to song lyrics kaggle dataset.
Relevance of feature engineering to build a predictive model
Is showed the relevance of feature engineering step in the machine learning pipeline using a New York City Taxi Trip Duration dataset from Kaggle Competition.
Graphic Road Accidents Great Britain 2015 - Kaggle Datasets
Graphic analysis of road accidents registered in UK in 2015
Porto Seguro Modeling - Kaggle Competition
Final step applying XGBoost machine learning model to predict the probability that a driver will initiate an auto insurance claim in the next year.
Porto Seguro Statistical Analysis & Data Cleansing - Kaggle Competition
With this step a statistical analysis of the variables is carried out: relationship between variables and analysis of their distribution, management of outliers and management of missing values.
Then a cleansing of the dataset by removing unnecessary variables.
Porto Seguro Visualization - Kaggle Competition
This document shows a visualization analysis on the features