Newellay

Ashley

Box Plot
The sample size is n = 1599, and there are no missing data points. Residual sugar and chlorides both have outliers. Some bottles have residual sugar that is high for a red wine, so the distribution is right-skewed. The same is true for chlorides: there are a few high values, but most fall in the same range. Chloride levels in wine vary by region, and high levels can make the wine taste too salty. Because of the skew, the median is a better measure of center for these two variables than the mean. Having taken these points into consideration, I do not have concerns about data quality. I created histograms and boxplots to summarize each variable; the histograms show the distributions. The alcohol variable is almost symmetrical. The quality scores are slightly left-skewed: more wines scored above the mean than below.
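The reasoning about outliers and the median can be illustrated with a minimal sketch. The values below are hypothetical and are not taken from the wine data, and the `summarize` helper is an illustrative name. It assumes the usual boxplot convention of flagging points more than 1.5 times the interquartile range beyond the quartiles. Quartile conventions differ between tools, so the exact cutoffs may not match what a given plotting function draws.

```python
import statistics

# Hypothetical values for illustration only; NOT taken from the wine dataset.
# Most values sit in a narrow range, with two unusually high ones.
values = [1.8, 1.9, 2.0, 2.1, 2.2, 2.3, 2.5, 2.6, 9.0, 15.5]


def summarize(data):
    """Return the mean, the median, and points flagged by the 1.5 * IQR rule."""
    # statistics.quantiles with n=4 returns the three quartile cut points.
    q1, _, q3 = statistics.quantiles(data, n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    outliers = [x for x in data if x < low or x > high]
    return {
        "mean": statistics.mean(data),
        "median": statistics.median(data),
        "outliers": outliers,
    }


summary = summarize(values)
print(summary)
```

In this toy sample the two high values pull the mean well above the median, which is the pattern expected for right-skewed data and the reason the median is the more representative measure of center there.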