Recently Published
Assignment 3 - Storytelling with Open Data
The goal of Assignment 3 is simple. Tell a compelling story using data visualisation.
MATH2270 - Assignment 2
Assignment 2 - Deconstruct, Reconstruct Web Report
Rashbir Singh Kohli
s3810585
MATH1324 Assignment 4
The purpose of this assignment is two solve two questions, i.e:
To verify the claim of South Western Sydney hospital that an average length of stay (ALOS) of a patient is of 4.5 days.
To determine whether tutorials are effective in improving student’s performance.
In this assignment two tests were identified, i.e:
One sample two-sided t-test.
Paired sample one-sided t-test.
Two data sets were used for this assignment, i.e:
The average length of stay in hospital data set downloaded from the AIHW website [1]
Assignment 4b-3.csv, data that have information about the impact of tutorials on the performance of the students.
To conduct an experiment we used a software called R-Studio and the on the two data samples.
Data exploration was done by calculating and visualized using summary statistics, data table, box plots, histogram, and Q-Q Plots.
To complete the test and verify its p-value was compared with the significance level and confidence interval was checked.
MATH1324 Assignment 3
The human body is very complex and various factors affect how the body will grow or reduce based on the growth of the other body parts.
So, it would be interesting to see how a person’s height and their chest size are related to each other.
How a person’s increase in height can lead to an increase or decrease in their chest size.
Or it could be that height does not affect one’s chest size at all.
The data was collected from 507 individuals and is sufficiently large.
Data exploration was done by calculating and visualized using summary statistics, data table, box plots, histogram, and Q-Q Plots.
Various tests performed on the data including the correlation test, test on the intercept and slope of the relation linear line and confidence interval, and p-value of t-statistics were also calculated.
For this experiment chest and height are identified as the variable of interest and all the tests were conducted on them only.
With height as the independent variable and chest as the dependent variable.
MATH1324 Assignment 2
Australia has 693 public hospitals and 657 private hospitals in 2017–18 report of the Australian Institute of Health and Welfare(AIHW).
So, we have taken a data called Average length of stay from the AIHW website.
The motive is to judge people’s nature towards the size of the hospital.
To conduct an experiment we used a software called R-Studio and the data have 14 variables and 30021 observations.
Data exploration was done by calculating and visualized using summary statistics, data table, box plots, histogram, and Q-Q Plots.
Based on the discoveries several tests were conducted to test the homogeneity of variance, calculate the difference in mean, and decide on a 95% confidence or 5% level of significance.
For this experiment Average length of stay was identified as the variable of interest and all the tests were conducted on it only.
MATH1324 Assignment 1
For this assignment, the student is required to choose only one measurement from the given data having 25 variables and 507 observations.
After selecting the variable of choice we have to determine that does it fits a normal distribution or not separately for male and female.
For this assignment, I have taken variable names ‘bic.gi’ (Bicep Girth) and created a different data frame for both the gender groups.
For the normal distribution fitting, I have used a histogram and shown the density distribution. along with normal distribution.
MATH2349 Semester 1, 2020
The following dataset(Novel Corona Virus 2019 Dataset) named COVID19_line_list_data.csv is about the latest epidemic diseases called COVID19 (What you need to know about coronavirus (COVID-19)) and is taken from Kaggle that is uploaded from Johns Hopkins Github repository (2019 Novel Coronavirus COVID-19 (2019-nCoV) Data Repository by Johns Hopkins CSSE) by one of the Kaggle Grandmaster (SRK ).
For this assignment data version 34 is used from Kaggle that is uploaded on 16/03/2020.
The following data source have 6 data CSV files.
COVID19_line_list_data.csv CSV file have 27 Variables and 1085 Observations.
Initially the data set have 27 variables named as:
MATH2349_2010_Assignment_2_Template
Assignment 2 for data wrangling by Rashbir Singh Kohli