Recently Published
Data Science Capstone
The *corpora* in this presentation consist of one or more 0.1% random samples of n = 2,000, 150 and 5,000 lines extracted from the blogs, news and Twitter text files archived in the Helsinki Corpora English language database. [link](www.corpora.heliohost.org)
Data Science Capstone
An n-gram is a sequence of n words, *e.g.*, "new york" is a sequence of two words or bigram (n=2). An n-gram model is a probabilistic language model for predicting the next item in such a sequence. For example, "new york city" is a trigram (n=3) with the third word "city" predicted from a probabilistic language model.
Data Science Capstone
An n-gram is a sequence of n words, *e.g.*, "new york" is a sequence of two words or bigram (n=2). An n-gram model is a probabilistic language model for predicting the next item in such a sequence. For example, "new york city" is a trigram (n=3) with the third word "city" predicted from a probabilistic language model.
Data Science Capstone
An n-gram model is a probabilistic language model for predicting the next item in a sequence of words, *e.g.*, "new york city churches" is a quadrigram (n=4) with the fourth word "churches" predicted by the model from the first three words.
Data Science Capstone
An n-gram model is a probabilistic language model for predicting the next item in a sequence of words. For example, "new york city" is a trigram (n=3) with the third word "city" predicted by the model from the first two words.
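The trigram example above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code behind the presentation: the function names (`ngrams`, `train`, `predict`), the whitespace tokenizer and the three toy sentences are all hypothetical, and the "probability" is a plain relative frequency with no smoothing or back-off.

```python
from collections import Counter, defaultdict


def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def train(lines, n=3):
    """Map each (n-1)-word context to a Counter of the words that follow it."""
    model = defaultdict(Counter)
    for line in lines:
        tokens = line.lower().split()  # assumption: naive whitespace tokenizer
        for gram in ngrams(tokens, n):
            model[gram[:-1]][gram[-1]] += 1
    return model


def predict(model, context):
    """Return (word, relative frequency) of the most frequent continuation, or None."""
    followers = model.get(tuple(context))
    if not followers:
        return None
    word, count = followers.most_common(1)[0]
    return word, count / sum(followers.values())


# Hypothetical toy corpus, not data from the presentation.
toy_lines = [
    "i love new york city",
    "new york city is large",
    "new york state is larger",
]
trigram_model = train(toy_lines, n=3)
prediction = predict(trigram_model, ["new", "york"])
print(prediction)
```

In this toy corpus "new york" is followed by "city" twice and by "state" once, so "city" is the predicted third word with a relative frequency of 2/3.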
Data Science Capstone
This n-gram prediction model uses bootstrap re-sampling, *i.e.*, successive *corpora* or 0.1% random samples, to predict the fourth word in a quadrigram (n=4) from the first three words, *e.g.*, "churches" is the fourth word in the quadrigram "new york city churches".
Data Science Capstone
This n-gram prediction model uses re-sampling, i.e., successive corpora or 0.1% random samples, to predict the fourth word in a quadrigram (n=4) from the first three words, e.g., “churches” is the fourth word in the quadrigram “new york city churches.”
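One way to read the re-sampling idea above is sketched below in Python. The description does not say how the successive samples are combined, so pooling the follower counts across samples is an assumption made here, as are the function names (`sample_lines`, `followers`, `predict_fourth_word`), the `rounds` and `seed` parameters, and the toy corpus. The samples are drawn without replacement, which is a simplification of a strict bootstrap.

```python
import random
from collections import Counter


def sample_lines(lines, rate, rng):
    """Draw a simple random sample of about rate * len(lines) lines (at least one)."""
    k = max(1, round(len(lines) * rate))
    return rng.sample(lines, k)


def followers(lines, context):
    """Count the words that directly follow the context words in the given lines."""
    n = len(context)
    counts = Counter()
    for line in lines:
        tokens = line.lower().split()  # assumption: naive whitespace tokenizer
        for i in range(len(tokens) - n):
            if tokens[i:i + n] == context:
                counts[tokens[i + n]] += 1
    return counts


def predict_fourth_word(lines, first_three, rate=0.001, rounds=20, seed=0):
    """Pool follower counts over successive random samples and return the top word."""
    rng = random.Random(seed)
    context = [word.lower() for word in first_three]
    totals = Counter()
    for _ in range(rounds):
        totals += followers(sample_lines(lines, rate, rng), context)
    return totals.most_common(1)[0][0] if totals else None


# Hypothetical toy corpus; the rate is raised to 0.5 because the corpus is tiny.
toy_corpus = ["new york city churches"] * 10 + ["something else entirely here"] * 10
guess = predict_fourth_word(toy_corpus, ["new", "york", "city"], rate=0.5)
print(guess)
```

Pooling counts keeps each individual sample small while still letting rare contexts accumulate evidence over several draws; a context that never appears in any sample yields `None` rather than a guess.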
Milestone Report for the Data Capstone Project
This Milestone Report outlines exploratory analysis and a hierarchical clustering model of several hundred online reviews of award-winning wines aggregated by Wilson Daniels from _The Wine Spectator_, _The Wine Enthusiast_ and other ezines.
PA1_template
Peer Assessment 1
R Studio Pitch Presentation
Galton Family Data
index
Galton Family Data Slidify pitch presentation.
slides
Slidify.
Prediction Assignment
This is a compiled HTML file for the Practical Machine Learning Course Project.
PA2_template
Reproducible Research Peer Assessments
PA1_template
Re-publication.
PA1_template
Peer Assignment 1.