
ZarminaMuhammad

Zarmina Muhammad

Recently Published

Quantified Self
Document
"Exploratory Data Analysis (EDA) for Pima Indian Diabetes Dataset"
This is my first R vignette. It walks through exploratory data analysis (EDA) of the Pima Indian diabetes dataset (data source: https://www.kaggle.com/uciml/pima-indians-diabetes-database). EDA is generally used to summarize the main characteristics of a data set with visual methods and, possibly, to formulate hypotheses that could lead to new data collection and experiments. I used tidyr and ggplot2 to draw bar graphs for bivariate comparisons between each predictor variable and the target variable (Outcome), and I show the correlations before and after treating missing values. My blog can be found at: