gravatar

hsarfraz76

Hussain Sarfraz

Recently Published

Analysis of nyc 'flights' and 'airports' dataset
In this assignment I am joining the nyc 'flights' and 'airports' dataset together. I use functions such as anti_join() and left_join() to derive insights from both datasets
Analysis of diamonds and flights dataset
I am analyzing the diamonds dataset in this assignment and have created box plots, heat maps, scatter plots with colors, and scatter plots with regression lines. The statistics concepts that I cover are: frequency distributions, measure of central tendency (mean, median, and mode), explanatory variables (a type of independent variable), and variability.
Summary of Statistics Concepts
This page is where I write about key statistics concepts and notes that I am picking up along my data analytics learning journey. This page also contains code on how to create different visualizations in R (boxplots, heatmaps, scatterplots).
Analysis of nyc 'flights' and Lahman 'Batting' dataset
Analyzing the flight delays and arrivals from each airline and destination from the 'flights' dataset from the nycflights13 library. Also looking at the batting averages of each baseball player from the 'Batting' dataset from the Lahman library. Graphs Produced: scatterplots, bar charts,
Analysis of mpg dataset
Examining the relationship between different car features (cylinders, highway mpg, car mpg, car displacement) Plots used: bar plots and scatter plots