Recently Published
NYC Asbestos Complaints vs. Income
This project examines the dataset from NYC OpenData regarding asbestos complaints received by the Department of Environmental Protection (DEP) and the Department of Health and Mental Hygiene (DOHMH) from 2010 to present. I also decided to follow parts of the "Manipulating and mapping US Census data in R using the acs, tigris and leaflet packages by ZevRoss as well as the "Census Mapping Tutorial tutorial by Laura Krull lkrull and Jeff Rosenblum.
Creating Visualizations for Trump Tweets
Analyzing Donald Trump's tweets from 2009 to 2018.
Creating Interactive Maps in R
This document follows the tutorial at ComptuerWorld. I will be using the tmap, tmaptools, sf, rio, and leaflet libraries.
Visualizing Top 100 Women's Tennis Scores
This data is about the top 100 Women’s Singles Tennis scores. I will be utilizing it to create a quick visualization.
Examining the World Happiness Report 2015-2017
This project will be analyzing the World Happiness Reports of 2015, 2016, and 2017. This data is extrapolated from Gallup poll results. These data sets create scores of happiness for countries and regions, based on 6 factors: GDP per capita, family or social support, health as defined by years of life expectancy, perceived freedom to make life decisions, trust in government, and generosity measured by recent donations.
Comparing GDP from 1990 to 2015
For this assignment, I created two visualizations from the “nations” dataset. The first chart shows GDP in trillions, for 4 countries from the time period 1990 to 2015. The second chart shows GDP in trillions by World Bank Region, from the year 1990 to 2015.
Taylor Swift Webscraping
Extra credit scraping the Taylor Swift Wiki and combining a data frame with Taylor Swift's pronoun usage.
Webscraping Feature Films of 2016 from IMDB
“Webscraping is a technique for converting the data present in unstructured format (HTML tags) over the web to the structured format which can easily be accessed and used.” For this assignment, we will be examining the most popular feature films of 2016 from the IMDB website utilizing the “rvest” package and the SelectorGadget Chrome extension.
Analyzing Suicide Rates from 1985 to 2016
For this project, I am using a library that compiles data about Suicide Rates from 1985 to 2016. It pulls data from the United Nations Human Developments Report, The World Bank DataBank, World Health Organization (WHO) Suicide Statistics, and the World Health Organization National suicide prevention strategies report.
Sampling and Analyzing the Million Song Dataset
For this project, I used a library adapted from The Million Song Dataset. The Million Song Dataset is a “freely-available collection of audio features and metadata for a million contemporary popular music tracks”, provided by The Echo Nest. This adaption was developed for another project by Ryan Whitcomb.