gravatar

zlhTao2012

Linghuan Zeng

Recently Published

C.A.R.S Analysis
This project is to explore the transaction datasets of 2009 US Gorvenment C.A.R.S (also as known as “Cash for Clunkder”) program. During this program, consumers turned in gas guzzlers and bought nearly 700,000 more fuel-efficient vehicles in fewer tha 30 days. NHTSA website provides all the data collected from the program. In this analysis, we first try to create two metrics to identify 10 most successful states and 10 least successful states during the C.A.R.S program. Next, we would like to see whether the fuel-efficient car peference differs among different regions consumers by using the established metrics. Furthermore, by exploring the dataset, we find some behavioral patterns that can help us understand how consumers buy new cars. Lastly, we try to answer whether the data from the NHTSA sufficiently support that the program was “wildly successful” by the government. Although we only analyze Final Paid Transaction Data, other data and materials are also used to answer the last question. (C.A.R.S. data are available through http://www.nhtsa.gov/Laws+&+Regulations/CARS+Program+Transaction+Data+and+Reports)
Data Science Capstone Slides
Data Science Capstone Slides for Next Word Prediction Project
Capstone_MileStone_Report
Data Science Capstone MileStone Report
Capstone_MileStone_Report_new
The report is to explains my exploratory analysis on the all of three text files (blogs, news and twitter). Due to the large size of the all three file, this analyis will only randowly pick 3000 and 5000 lines from those three files. First, basic summaries analysis of the three files will be conducted. Next, some histograms will be plotted to present the frequency of Top 20 2-grams and 3-grams distributions. Then, two 2-gram word cloud charts for each file will be created as next part of this study. Finally, I will list some interesting findings from my sample trials.
Capstone_MileStone_Report
Capstone Milestone Report
BMI for Adult
A Project Presentation for Developing Data Product Course
Peer_Assessment_1
Cousera - Data Science Specilizations - Reproducible Research - Peer Assessment 1
Regression_Models_Project
Cousera - Data Science Specilizations - Regression Models - Regression Models Project
The Storm Types Assessment Based On The Heaviest Casualties And The Greatest Economic Consequences
Cousera - Data Science Specilizations - Reproducible Research - Peer Assignment 2