Recently Published
Data 621 Homework 1 - Moneyball Project
Using an MLB training dataset of each team from 1871 to 2006 to predict the number of wins on the test set.
Data 622 Homework 4
Building Machine Learning models to predict NBA player salaries using 2022-2023 NBA players dataset.
Data 624 Project 2
Group project using dataset to predict pH levels based on features in dataset.
Data 622 Homework 3
Comparing the results of Decision Tree and Random Forest models with Support Vector Machine models.
Data 624 Homework 10 - Market Basket Analysis
Using Data Mining and Association tools to create a cluster analysis using a Grocery list dataset.
Data 624 Homework 9
Exploring tree-based models such as Random Forest, Bagged Trees, and Gradient Boosting.
Data 624 Homework 8
Creating and comparing nonlinear regression models.
Data 622 Homework 2
Exploring and analyzing Decision Tree and Random Forest modeling techniques using a heart attack risk dataset.
Data 624 Homework 7
Answering questions from the Kuhn and Johnson textbook, creating models and comparing their performances on train and test sets.
Data 624 Project 1
Creating a Time-Series analysis for two business situations.
Data 624 Homework 6
Exploring ARIMA modeling using R.
Data 624 Homework 5
Answering questions pertaining to Exponential Smoothing.
Data 624 Homework 4
Exploring how to handle missing data in predictor variables.
Data 624 Homework 3
Applying Time Series methods to various datasets and analyzing its results.
Data 624 Homework 2
Using Box Cox Transformations, STL and X-11 decompositions on various datasets.
Data 624 Homework 1
Exploring data using time series functions.
Data 605 Final Exam/Project
Final Project using Crab Age dataset and answering questions pertaining to Probability, Calculus, Linear Algebra, and Multiple Linear Regression modeling. Revised to include Kaggle submission score
Data 605 Final Exam/Project
Final Project using Crab Age dataset and answering questions pertaining to Probability, Calculus, Linear Algebra, and Multiple Linear Regression modeling.
Data 605 HW15
Answering questions pertaining to what was covered during the semester.
Data 605 HW14
Answering math questions using Taylor Series expansion.
Data 605 HW13
Answering Calculus-related questions
Data 605 HW12
Answering questions related to dataset involving life expectancy, using simple and multiple linear regression, as well as variable transformation.
Data 605 Discussion 12
Using Sleep Efficiency dataset to create a multiple linear regression model.
Data 605 HW11
Creating a Simple Linear Regression Model using the cars dataset.
Data 605 HW10
Answering Gambler's Ruin question.
Data 605 HW9
Answering questions calculating the probability of independent random variables, and using moment generating functions to calculate the expected values and variances of binomial and exponential distributions, respectively.
Data 608 Story 4
Exploring annual salaries of Data Science jobs, examining the variation of salaries by role and by U.S. state.
Data 605 HW8
Answering questions pertaining to Sum of Independent Random Variables and Law of Large Numbers.
Data 605 HW7
Answering questions using Cumulative Distribution Function, Geometric, Exponential, Binomial, and Poisson Distributions.
Data 605 HW6
Answering probability and combinatoric questions.
Data 605 HW5
Answering probability questions related to Bayesian, Binomial, Poisson, Hypergeometric, and Geometric distribution methods.
Data 605 HW4
Building and visualizing eigenimagery that accounts for 80% of the variability using shoe images.
Data 605 HW3
Solving problems pertaining to finding the rank of a matrix, as well as finding the character polynomial, eigenvalues and eigenvectors of a matrix.
Data 605 HW2
Showing how a square matrix multiplied by its transpose is equal to the transpose of the square matrix multiplied by itself. Creating a function to find the LU of a square matrix.
Data 608 Assignment 1 - Infrastructure and Investment Jobs Act
An analysis of the allocation of funds based on state/territory population and 2020 Presidential Election results.
Data 605 HW1
Using a transformation matrix to create animations that shear, scale, rotate, and project the initials of my name.
Data 606 Final Project
Exploring NBA dataset obtained from Kaggle to determine factors that influence a player's WinShare.
Data 607 Final Project
Exploring NBA team and player clutch stats, extracting data using the hoopR library.
Data 606 Lab 9 - Multiple Linear Regression
Exploring Multiple Linear Regression using dataset of grading scores of professors.
Tidyverse Extend Assignment
Extending assignment focused on NCAA women's basketball dataset.
Analysis of Lobbying in the U.S.
Using datasets from OpenSecrets.org, I explored various areas of lobbying.
Data 606 Lab 8
Exploring Linear Regression using the Human Freedom Index dataset.
Data 607 Tidyverse Create Assignment
Exploring political donations made by Americans sports owners during the 2016, 2018, and 2020 election cycles.
Data 606 Final Project Proposal
Final project proposal using NBA Draft data of each player from 1989 to 2021.
Extra Credit Data 607 - Using JSON Files from Nobel Prize API
Exploring Nobel Prize data using JSON files extracted from Nobel Prize API.
Data 606 Lab 7
Inference for numerical data.
Data 607 Assignment 9
Using The New York Times' API to access articles that were most viewed on the NYTimes website in the last 30 days.
Data 606 Lab 6
Exploring inference for categorical data.
Data 607 Project 3 Team Document
A brief, detailed description of the scope of our project.
Data 607 Assignment 7
Creating data tables of selected books using html, xml, and json file types.
Data 606 Lab 5b - Foundations of Statistical Inference
A comprehensive look at confidence intervals and its impact on statistical analysis.
Data 606 Lab 5a - Foundations of Statistical Inference
An in-depth analysis of sampling distributions.
New York State Gas Price Breakdown
Examining weekly gas price data from 2007 to 2023.
Affects of Covid-19 in New York City
A breakdown of Covid-19 data in the five boroughs of New York City.
Data 607 Project 2 - MTA Daily Ridership
Using MTA ridership data, analyzing trends in subway ridership since March 2020.
Data 606 Lab 4
Breakdown of fast food dataset.
MTA Ridership Breakdown
A detailed look at MTA daily ridership, from March 1, 2020 until February 23, 2023.
Data 606 Lab 3
Breaking down the Hot Hand Basketball theory using the statistics of Kobe Bryant.
Data 607 Week 5 Assignment
Breaking down arrival on-time and delay performances of Alaska and Am West Airlines.
Data 607 Project 1
Creating a CSV file using chess data.
Data 606 Lab 2
Break down of NYC flights dataset.
DATA 607 Assignment 3
Using Regular Expressions to answer questions.
Data 607 Assignment 2
A breakdown of a movie ratings survey I conducted
Data 606 Lab 1
Comparison of birth rates among boys and girls, using a baptism dataset created by Arbuthnot from 1629 to 1710, and another dataset of births in the U.S. from 1940 to 2002.
American's Views on the Handling of the Coronavirus Response
Exploring data on how Americans view the job President Trump and President Biden did on responding to COVID-19.
DATA 607 - Assignment 1: Exploring American's Views on Response to Coronavirus Pandemic
This file examines a dataset pertaining to whether Americans approve or disapprove of how President Trump and President Biden handled the response to the Coronavirus pandemic.