RPubs

by RStudio

stanleycheng

stanley

Recently Published

KNN_Iris_Data

In this project I will try to use a KNN (K-nearest-neighbors) method for Iris Species identification. The iris dataset is a built-in dataset in R. The goal of this project is to find out which k-value (No. of Nearest Neighbor) is the best in species identification.

about 4 years ago

Random_Forest_Heart_Disease

This is the heart disease data set from the UCI machine learning repository. Here I will use these variables to build a random forest model, to predict if a patient have heart disease according to the data provided

about 4 years ago

Logistic_Regression_Heart_Disease

This is the heart disease data set from the UCI machine learning repository. Here I will use these variables to build a logistics regression model, to predict if he/ she have heart disease.

about 4 years ago

Logistic Regression Model - Determine the Student’s Success Admission Rate

about 4 years ago

The Top Billionaires

According to Bloomberg/Forbes, the 10 richest people on the planet. Data for the last five years. Interesting dynamics of capitalization of companies. How do people make money in technology companies. Sources: https://www.kaggle.com/datasets/alexandrparkhomenko/the-top-billionaires

about 4 years ago

Shiny-IO Project

Trend in Demographics and Income Explore the difference between people who earn less than 50K and more than 50K. You can filter the data by country, then explore various demographic information.

about 4 years ago