Recently Published
Time Series - US Electricity Production
This analysis uses several methods to evaluate and model electricity data (monthly, between 1973-2013). Evaluation methods include testing for normalization, Auto-correlation, stationary. Several ARIMA methods used to model.
Text Analysis - US Presidential Inaugurations
Exploratory analysis of free-text from each of the 58 presidential inauguration speeches. Used several methods including: frequency counts, cosine similarity, n-grams, comparison (word) clouds, and lexical diversity
Spatial Analysis - NYC Airbnb
Using data from http://insideairbnb.com/get-the-data.html Applied several spatial analysis techniques including: Proximity Polygons, Trend Surface Analysis, Krigging, and spatial-based prediction using xgboost & h2o.
Factor Analysis - Socio-Economic Determinants (retail)
Used unsupervised learning to analyze social determinants of food availability.
Cluster Analysis - Employee Satisfaction Survey
Used hierarchical and k-means clustering to analyze data about the employees and feedback pertaining to their job satisfaction.