gravatar

SimeonPaynter

Simeon Paynter

Recently Published

Flags of the World
This report analyzes national flags, using 193 countries flags to see if flag colors are associated with certain regions of the world. This report goes into detail about image processing, color identification, and categorical methods of testing the relationship between colors and global areas.
Discriminant Analysis and Logisitic Regression
Project 8, April 2021. Used Linear Discriminant Analysis and Logistic Regression to evaluate if bank customers were more likely to be loan acceptors or decliners.
Classification and Regression Trees
Project #6, March 2021. Used the rpart package to create trees and prune them, examining error rates and outliers, regarding the price of Toyota Corollas
Neural Networks on Toyota Corrolla and Airlines datasets
Project #7, March 2021. Used the neuralnet package to generate neural networks with various number of layers and nodes. Also used RMSE to compare models and reisudal plot to examine potential outliers.
Evaluating Predictive Performance
Project #5, March 2021. Used confusion matrices and lift charts to calculate model error rates.
Regularization Methods for Multiple Linear Regression
Project #4, March 2021. Used regularization methods: ridge, lasso, and elastic net to model housing in the Boston area.
Principal Component Analysis on Cereals dataset
Project #3 for Data Mining. February 2021. This project uses PCA to reduce the number of x variables in our model. Scree plots, biplots, and correlation matrices were used prior to fitting the linear regression model.
Text Mining President Trump's Speeches
Project #2 for Data Mining STAT 415. February 2021. This project helped me better understand some of the basics of text mining.
Kaggle 2020 Survey Exploration
Project #1 for Data Mining STAT 415. January 2021. This project explores and summarizes some of the responses Kaggle received in their 2020 survey.
Markov Chains
Using a probability transition matrix to generate 10,000 transitions of a markov chain to determine long-term probability for the # of users connected.
Generating Samples using the Rejection Method (Project-2)
This project uses the Acceptance-Rejection Method to generate random samples for X when the pdf is given, but the cdf is unknown and/or complicated.
Monte Carlo Simulation (Project-1)
Using Monte Carlo Simulation to simulate long-term probability
Visualizing Police Shootings
This is the documentation for the first version of the Shiny App I am creating. It uses Leaflet to geographically visualize the data in The Washington Post's 'fatal police shootings' dataset.