Recently Published
MATH2349 Semester 1, 2019: Pre-processing of IMDB Data
This assignment demonstrates the key steps in data pro-processing as studied in this subject. Data processing techniques are applied to Movie data from the IMDB data set is used to demonstrate these steps. Firstly, the required packages are loaded into and then the data is read into R as data frames from CSV files based on the local computer. This data is then checked and with data pre processing methods and then selected features across a number tables are merged to apply further pre-processing and reading the data for further analysis. Outliers are identified and addressed using Tukeys univariate methods. Following this, a range of transformations are applied to the selected columns in the new data set to further improve and stabilise the data for further analysis.
MATH1324 Assignment 3: Supermarket Price Wars
A statistical analysis of supermarket prices.