gravatar

MTrungDang

Trung Dang

Recently Published

Scatter plots
Ridgeline Plots in R
Simple for loop
Cycle plot
Indexing
storytelling with data - SWD community
Undertake targeted practice by completing the latest exercise or browse the exercise bank and tackle one relevant to a current project or your data communication development goals.
Small multiples
Choosing an effective visual
One way ANOVA
TWO-SAMPLE T-TEST
Visualizing Times Series
Correlograms
Wilcoxon Rank Sum test
Document
Reshaping Data
Apache Hive and R
UIC heart desease data
Ensuring the quality of data
Data visualization
Crime in Toronto, Part 1: Data cleaning, initial analysis, and univariate analysis
In this post, the package tidyverse is used to wrangle the Toronto crime dataset (167525 records and 29 variables). * Finding 47 rows with missing values. Removing all of rows containing missing values as these rows are not useful for future analysis. * 11973 duplicates have been found and removed. * The number of columns reduced from 29 to 19. Columns were transformed to the appropriate data types. * Initial analysis and univariate analysis is performed.