gravatar

justmarkham

Kevin Markham

Recently Published

Going deeper with dplyr: New features in 0.3 and 0.4 (tutorial)
dplyr is an R package for data manipulation. Using a series of examples on a dataset you can download, this tutorial covers the new functionality in dplyr versions 0.3 and 0.4. You can also watch a 37-minute video tutorial that walks through this document: http://youtu.be/2mh1PqfsXVI
Data Science Capstone Project: Text Prediction Presentation
Presentation for the Data Science Specialization Capstone Project
Data Science Capstone Project: Text Prediction
Milestone report for the Data Science Specialization Capstone Project
Introduction to dplyr for Faster Data Manipulation in R
dplyr is a new R package for data manipulation. This document demonstrates the five basic dplyr "verbs" as well as a dozen other dplyr functions. You can also watch a 40-minute video tutorial that walks through this document: http://youtu.be/jWjqLW-u3hc
Health and Economic Impacts of Storm and Weather Events
An analysis of the impact of storm and weather events in the US (1950-2011) using the NOAA storm database.
Linear Regression and Regularization for Salary Prediction
Using linear regression and regularization techniques to predict salary for a class assignment based on a Kaggle competition.
Exploring the Abalone dataset with ggplot2
A couple simple plots of the Abalone dataset, using the ggplot2 package.