Recently Published
Public or Private? Logistic Regression on the College Dataset
ALY6015 Module 3 — Logistic Regression analysis using the ISLR College dataset to predict whether a university is private or public. Includes EDA, glm() model fitting, confusion matrices, ROC curve (AUC = 0.976), standardised coefficient plot, and predicted probability distribution. Built with R, ggplot2, and Plotly.
Predicting 30-Day Hospital Readmission in Diabetic Patients
Initial statistical analysis of 68,061 diabetic patient encounters from 130 US hospitals. Uses chi-square tests, ANOVA, and logistic regression to identify readmission predictors. Interactive Plotly visualizations show age effects, A1C testing impact, and a key finding: patients with 3+ prior admissions have 25-30% readmission rates. ALY6015 Module 4 Report.
Regularization-ridge-lasso-aly6015
An interactive R Markdown report for ALY6015 (Intermediate Analytics) at Northeastern University. This analysis applies Ridge regression, LASSO regression, and backward stepwise selection to the ISLR College dataset to predict graduation rates across 777 American colleges. The report features interactive plotly charts (hover for details, click to filter, drag to zoom), searchable coefficient tables, color-coded variable selection results, and a complete reproducible R code appendix organized by section. Key findings: LASSO achieves the lowest test RMSE (≈9.62) while automatically reducing the model from 17 to 8 predictors, outperforming both Ridge (9.77) and stepwise selection (9.76). All code is fully reproducible with fixed random seeds.