Recently Published
Group 2: Exploring Plotly
Following the assignment instructions, our group constructed a Plotly chart to depict the daily closing prices of the major European stock indices. We also created a box plot to depict the overall difference in IMDb and Rotten Tomatoes scores for a few popular streaming services.
Decision Tree Models
An exploration of a dataset related to breast cancer. This paper contains two decision trees, which attempt to predict the presence of progesterone receptors and a tumor's stage, respectively. Each tree is grown, tested, and evaluated using a variety of techniques.
Evaluation of Two K-NN Models
This document provides a quick look at various metrics used to evaluate two separate k-nearest neighbors models. The first uses heart attack data and attempts to classify patients as high risk or low risk. The second classifies used motorcycles as low, moderate, or high value. In short, neither model is particularly effective at classification.
SQLDF Package Review
An analysis of the sqldf package built for R. This review covers the main components of the package including pros and cons as well as a brief analysis of SQL. All SQLDF code examples are performed on NBA 2020-21 season data.
Commercial or Not? K-Nearest Neighbors
A k-nearest neighbors model that determines whether a given video clip is a commercial, based on a set of variables describing the clip.
Sentiment Analysis of "Text Mining"
From the LexisNexis service, we obtained articles from five separate newspapers whose content included the term "text mining." Using these articles, we attempted to determine the overall sentiment surrounding the term. Despite some flaws in the data, we agree that the general sentiment is positive.
Saving the Wizards
A k-means clustering approach to selecting NBA players to trade for in order to improve a team's overall quality.
DS 3002 Lab 5: Corruption and human development
A reimagining of a corruption and human development graph that first appeared in what I can only assume was The Economist. Certain features have been stripped away because of the ggplotly function; however, the original RMD file has the correct representation for the assignment.
Data Science in Finance - Lab 4
Submission for DS 3001 week four lab.