Recently Published
DATA689_Midterm_Draft
Here is a draft of my project progress
Data625=Project2-V5
This draft is a synthesis of everyone's work done by PK with the addition of Appendix 3 with updated MARS models giving modest performance gains
DATA624 Project 2-Collinearity Assessment, Feature Selection and MARS Models
This is my attempt and collinearity evaluation and feature selection. At the end I create some MARS models to capture some of the nonlinear features of the data. The data evaluation at the top belongs to Deepa Sharma a member of my group.
DATA624 HW9
This HW is using tree based methods for regression. The fact that the randomForest package doesn't have a function to plot a tree is insane.
DATA624 HW8
In this homework is looking at implementing non-linear regression models
DATA624 HW7
This homework assignment involves applying linear regression models (and cousins) to datasets with large numbers of predictors compared to the number of rows of data
DATA624 Project 1
In this project, I perform some ETS forecasting for some data sets provided. The data requires some fixing of outlier and missing values. The models selected by the model function may not be the models that best fit the data.
DATA624 HW6
Intro to ARIMA models
DATA624 HW5
This assignment is comparing different types of exponential smoothing
DATA624 HW4
This assignment involves resolving skewness and outliers and dealing with missing values
DATA624 HW3
This assignment is an introduction to simple modeling techniques and accuracy measures
DATA624 HW2
This homework in an introduction into time series decomposition
DATA624 HW1
This assignment is an introduction to plotting time series data
DATA622 HW4
In this assignment I'm using Random Forest and SVM to predict surgical complications from patient demographics.
DATA622 HW3
In this assignment, I continued the work done in HW2 where I evaluated the performance of Random Forest by applying SVM to the same dataset.
DATA622 HW2
This homework is an exploration of the performance of Decision Trees and Random Forest on a classification problem.
DATA622 HW1
This a homework assignment where I was asked to take two datasets of different size (small/large) and apply 2 ML algorithms to them and see what I find.
Data 607 HW7 William Aiken
Exploration of HTML, JSON and XML files
Data607 Project1 WilliamAiken
This is a data wrangling project where a text file is converted into a usable matrix format.