Recently Published
TypeAhead Word Predictor
TypeAhead is a software tool that predicts the probable word that will be typed by the user. The presentation explains the model used, accuracy and performance results as well as instructions on how to use the application.
Word Prediction based on Sequence of Words
Coursera Daya Science Capstone project Milestone Report
ToothGrowth Analysis Presentation
Analyze the effect of supplement and dosage on tooth growth
Weight Training the Right Way: Common Mistakes Predictor
One thing that people regularly do is quantify how much of a particular activity they do, but they rarely quantify how well they do it. In this report, our goal is to use data from accelerometers on the belt, forearm, arm, and dumbbell of 6 participants and build a model to quantify how well weight lifting exercises are done. Please see http://groupware.les.inf.puc-rio.br/har#weight_lifting_exercises for information on this research.
After data exploration and data cleaning, cross-validation technique is used to build models using four different techniques. The best model shows an accuracy of 99.51% and that model is used to predict the outcome for the supplied test data.
Alcohol Consumption by Country
This is based on 2010 data
MPG Analysis Motor Trend
Motor Trend collected data on various cars to understand the impact of several factors on Miles Per Gallon (MPG). Specifically, this report tries to answer these two questions: “Is an automatic or manual transmission better for MPG” and “Quantify the MPG difference between automatic and manual transmissions”.
The analysis shows manual transmissions are better for MPG. In addition, weight of the car and horse power of the engine are also influencing factors
Tooth Growth Analysis
In this report we will analyze the ToothGrowth data in the R datasets package. Analysis include the effect of supplement and dosage on tooth growth.
Expotential Disribution and Central Limit Theorem
In this report we will investigate the exponential distribution in R and compare it with the Central Limit Theorem. We will investigate with averages of 40 exponentials using 1000 simulations. We will compare sample mean and variance to the theoretical values.
Significant Weather Event Impact on People and Property
NOAA’s significant weather event data (also known as Storm data) for USA from years 1950 through 2011 were analyzed for fatalities, injuries, property damage and crop damages. Results of the analysis are presented here.