harrychangjr

Harry Chang

Recently Published

DSA2101 Group B
Cellphone Billing Project
Data Science Project - Biopics
Data Science Project on the Biopics dataset from Kaggle.
Source: https://www.kaggle.com/datasets/fivethirtyeight/fivethirtyeight-biopics-dataset
Methods explored on the dataset:
- Data preprocessing
- Visualizations to understand the dataset
- Regression methods to predict box office revenue (linear regression, random forest, SVM)
- K-means clustering with PCA to identify similar types of movies
- Content-based recommendation system using cosine similarity to recommend similar movies based on an input title
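The cosine-similarity recommendation step can be illustrated with a minimal sketch. This is not the project's own code: the catalogue, its titles ("Film A" and so on), the descriptor strings, and the function names `vectorize`, `cosine_similarity`, and `recommend` are all hypothetical, and a simple bag-of-words representation is assumed in place of whatever features the project actually used.

```python
import math
from collections import Counter

# Hypothetical toy catalogue: title -> free-text descriptors.
# These entries are made up for illustration and are NOT from the Biopics dataset.
CATALOGUE = {
    "Film A": "musician biography drama",
    "Film B": "musician concert drama",
    "Film C": "athlete sports biography",
    "Film D": "scientist war drama",
}


def vectorize(text):
    """Turn a descriptor string into a sparse bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse count vectors."""
    # Only terms present in both vectors contribute to the dot product.
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty description is similar to nothing
    return dot / (norm_a * norm_b)


def recommend(title, catalogue=CATALOGUE, top_n=2):
    """Return up to top_n (title, score) pairs most similar to `title`, best first."""
    if title not in catalogue:
        raise KeyError(f"unknown title: {title}")
    query = vectorize(catalogue[title])
    scores = [
        (other, cosine_similarity(query, vectorize(text)))
        for other, text in catalogue.items()
        if other != title  # never recommend the input title itself
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:top_n]


if __name__ == "__main__":
    for other, score in recommend("Film A"):
        print(f"{other}: {score:.3f}")
```

Because the score depends only on the angle between vectors, titles with long and short descriptions are compared on equal footing. A fuller version would likely weight terms (for example with TF-IDF) rather than use raw counts.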