gravatar

LucyMoto

Lucy Michaels

Recently Published

6-4B Logistic Regression Model
In this example, we are going to examine a dataset of part of the passenger list for the Titanic. Our aim is to use the data to fit a model which can predict each passenger’s survival/nonsurvival, given their age, sex and class of ticket.
6-2E Multiple Linear Regression Modelling
Multiple linear regression is a widely used statistical method for modelling the relationship between a dependent variable and one or more independent variables. In this analysis, we aim to develop a robust and interpretable linear regression model while addressing common challenges such as multicollinearity, overfitting, and variable selection. Key steps include evaluating the significance of predictors using the F-statistic and p-values, assessing multicollinearity through variance inflation factors, and exploring the impact of reducing the number of variables on model performance.
6-1C Simple Linear Regression Model - Automobili
A simple data set of 205 automobiles with attributes: horsepower and price is cleaned, examined, analysed and a linear regression model constructed and tested.
5-12 Comparison of Clustering Methods
Objective: Analyse the dataset 'macro' using tandem analysis (PCA, followed by k-means), k-means analysis and reduced k-means analysis (PCA and k-means simultaneously) and compare the results in order to find the best clustering solution.
5-11 - Simultaneous Use of Principal Components and Cluster Analysis
Examining the Economic Indicators Dataset ‘macro’ with PCA and Cluster Analysis
5-10 Comparison of Clustering Methods Lucy Michaels
Objective: To perform non-hierarchical (k-means) and hierarchical (Ward) clustering on the USArrests dataset, then compare the results.
5-08 Cluster Analysis - Hierarchy Methods
We are going to identify clusters in the ‘mtcars’ dataset and experiment with different types of bond method. We aim to develop an understanding of how the different bond types affect the solution(s).
5-09 Cluster Analysis - Non Hierarchy Methods - KMeans
Analysis of the iris dataset using kmeans clustering. Is 3 the magic number?