An analysis of whether work experience, gender, and full-stack development ability predict income for US developers in the tech industry.
This project addresses the research question: Is smoking independent of gender? Data from NYC’s Community Health Survey (2020) is used to answer it, and the conclusion is then checked against the World Health Organization’s data on smoking. Ideally, the two datasets will agree.
Simple linear regression using the "cars" dataset in R.
First, some example code is run and explained so that the process of sentiment analysis becomes clear. Then, I perform my own sentiment analyses on Sir Arthur Conan Doyle's novel "The Hound of the Baskervilles" using various lexicons.
The New York Times' Books API is accessed to get the current bestsellers list for fiction (both print and ebooks).
This is my portion of a group project that attempts to identify important skills for data science. I cleaned a CSV file from Kaggle's survey of its users.
This assignment involves reading in unstructured data and organizing it into a rectangular (tabular) format.
This assignment focuses on applying the dplyr and tidyr packages (from the tidyverse).
Cleaning chess data and exporting it to a CSV file.
String manipulation using regex for DATA 607: Data Acquisition and Management
A rudimentary analysis of the strength of the top 6 European soccer leagues, using data from https://projects.fivethirtyeight.com/soccer-api/club/spi_global_rankings.csv
Final project for the CUNY SPS R workshop for the MSDS program.
Submitted for the CUNY SPS MSDS R bridge program.
This is the first assignment for the CUNY SPS MSDS R workshop, submitted by Prinon Mahdi.