gravatar

claudio75

claudio g. giancaterino

Recently Published

Song Analysis
Applied NLP and LDA to song lyrics kaggle dataset.
Relevance of feature engineering to build a predictive model
Is showed the relevance of feature engineering step in the machine learning pipeline using a New York City Taxi Trip Duration dataset from Kaggle Competition.
Graphic Road Accidents Great Britain 2015 - Kaggle Datasets
Graphic analysis of road accidents registered in UK in 2015
Porto Seguro Modeling - Kaggle Competition
Final step applying XGBoost machine learning model to predict the probability that a driver will initiate an auto insurance claim in the next year.
Porto Seguro Statistical Analysis & Data Cleansing - Kaggle Competition
With this step a statistical analysis of the variables is carried out: relationship between variables and analysis of their distribution, management of outliers and management of missing values. Then a cleansing of the dataset by removing unnecessary variables.
Porto Seguro Visualization - Kaggle Competition
This document shows a visualization analysis on the features