gravatar

rtheman

Rich Leung

Recently Published

Natural Language Processing - Introduction
A quick introduction to NLP. Using NLP techniques to explore data set via tokenization of sentence, word, person, location, organization. Moreover, create POS tags.
high_school_prospect
Using multinomial logistics regression to predict how entering high school students make program choices among general program, vocational program and academic program. Their choice might be modeled using their writing score and their social economic status.
Titanic Survival Prediction
Using binomial logistics regression to predict survival of the Titanic using Kaggle's dataset found at https://www.kaggle.com/c/titanic (login require).
Airfare Prediction for IAD
Using linear regression to predict airfare using U.S. Domestic Routes for 4th Quarter of 2002 from U.S. Department of Transportation.
Granular's Temperature Regression
Weather Data (Mountain View, CA)
A quick example of displaying weather data
EDA_Viz
Results of EDA using R. This results is compared to other languages/tools such as SAS and Python