RPubs

by RStudio

ww44ss

Winston Saunders

Recently Published

Summer-Dry Interval Computations Seattle

almost 4 years ago

Seattle Rainfall 2022

Seattle has had a very dry Fall. To put it into context I got weather records for the last 100 years and analyzed them.

almost 4 years ago

Upward Trend of Temperatures in Aberdeen WA

An analysis of the upward trend of nighttime temperatures in Aberdeen WA

almost 4 years ago

Regional COVID Late November 2020

over 5 years ago

Regional_COVID_October_2020

Regional analysis of COVID stats,

almost 6 years ago

Politics of COVID 2

including presidential politics of 2016

almost 6 years ago

Politics_of_Covid

Looking at the Red vs. Blue State Covid infection rate breakdown. (In an era where speaking truth is a political act).

almost 6 years ago

Regional_COVID_September_2020

Analysis of New York Times data

almost 6 years ago

Regional COVID-19 August 2020

Based on NYTImes data. #rstats

almost 6 years ago

COVID Regional Trends

Analysis based on NY Times dataset.

almost 6 years ago

Regional COVID-19

Analysis of COVID-19 Pandemic for the Continental US

almost 6 years ago

COVID_HOT_10

This is an analysis of the Hottest 10 COVID states.

about 6 years ago

US_NATIONAL_MAP_COVID

This is a map of the COVID cases similar to the one the New York Times publishes.

about 6 years ago

Regional Covid Estimated Reproducton Rates from NYTimes data

Evolving analysis of COVID cases.

about 6 years ago

IOWA COVID-19 HEAT MAP

Data are from the NYTImes. Data analysis shows total cases and growth rate.

about 6 years ago

California COVID Heat Map

Heat map of COVID cases in California counties. Data from NY Times Github https://github.com/nytimes/covid-19-data

about 6 years ago

Oregon Heat Map

Heat Map of COVID-19 Growth. Points are color coded to reflect growth rate.

over 6 years ago

North Carolina COVID Heatmap

Heatmap of COVID-19 cases in North Carolina Counties. From data compiled by the NYTimes.

over 6 years ago

South Carolina COVID heatmap

Heatmap of South Carolina from data compiled by the NYTimes

over 6 years ago

Georgia COVID-19 Heatmap

Heatmap of Covid 19 cases in Georgia. From data compiled by the NYTimes.

over 6 years ago

COLORADO COVID HEAT MAP

Heat Map of COVID-19 cases. From NYTimes data.

over 6 years ago

CALIFORNIA COVID HEAT MAP

Heat Map of COVID-19 cases and doubling times. Based on NYTimes Data

over 6 years ago

WASHINGTON COVID HEATMAP

Heatmap of Washington State Covid Cases. From NYTImes data

over 6 years ago

HEAT MAP US COVID revA

Covid data from NYTimes mapped with doubling rates encoded as color.

over 6 years ago

US COVID HEATMAP

Heat Map of COVID cases in the US. Case doubling time is estimated using a spline fit of the case data. Extrapolated values are used to estimate regional severity.

over 6 years ago

Washington Regional COVID-19 Growth

#COVID19 Regional Growth

over 6 years ago

Washington Regional COVID-19 Growth 2020_04_08

COVID-19 Case analysis from the #NYTimes #COVID19 data

over 6 years ago

US COVID-19 HEAT MAP

An adaptation of the New York Times COVID-19 Map which includes an analysis of regional case growth rates.

over 6 years ago

Washington Regional COVID-19 Growth

Tracking regional trends in COVID-19 growth rates. Cases are spreading much faster outside the Puget Sound region, with doubling times on the order of four days, while withing the Puget Sound region, cases are doubling approximately every 9 days. #COVID #rCOVID

over 6 years ago

Washington COVID

over 6 years ago

California Regional COVID-19 Trends

Regional Trends for the Bay Area, Greater California, and Southern California

over 6 years ago

Washington COVID-19 Case Growth

over 6 years ago

Oregon Covid Regional Trends

Trends in COVID-19 cases in the Willamette Valley versus Grater Oregon.

over 6 years ago

Washington Regional COVID-19 Growth

Analysis of COVID-19 case growth in Washington State. While cases are higher in the Puget Sound region, their growth rate is now effectively half of that in the rest of the State.

over 6 years ago

COVID-19 Cases in Oregon 2020_03_30

Compares COVID-19 cases in Greater Oregon to the Willamette Valley region.

over 6 years ago

Puget Sound & Greater Washington COVID-19 Cases

Analysis of COVID-19 cases fr the Puget Sound Region versus Greater Washington. Growth in Puget Sound has slowed while the rest of the state is accelerating.

over 6 years ago

Washington State COVID 19 Regional Analysis March 29 2020

A regional analysis of COVID-19 Cases for Washington State. Reveals substantial differences between the (largely urban) Puget Sound region and the (more rural) rest of the State. Growth in Puget Sound has achieved doubling times ~ 6 days while the rest of the State is still near ~3.

over 6 years ago

Washington State Opioid Network

Mapping analysis of the network of Opioid transations involving WA 2016 to 2912 based on the Washington Post Opioid Data

over 6 years ago

Elephant Opioid Shipments

Part of ongoing casual analysis of the Washington Post Opioid dataset.

over 6 years ago

Top Million Opioid Shipments 2006 to 2012

This is an analysis of the Washington Post Opiod Dataset looking at the top million shipments (by volume) in 2006 to 2012. An interesting relationship between shipment size and rank, akin to Zipf's Law, is observed.

over 6 years ago

kaggle hack

over 9 years ago

Education Team

This is a draft document of learning from initial Education Survey

over 9 years ago

Sentiment Analysis of the Three 2016 Presidential Debates

I used some sentiment analysis tools just to look at trends

almost 10 years ago

Oct 9 2016 Presidential Debate Text Sentiment

almost 10 years ago

Oct 2016 Vice Presidential debate

This is an attempt to play around with animating sentiment analysis

almost 10 years ago

#AI tweet fluxes for Sept 2016

#AI tweet fluxes measured using twitteR package. Now normalized to metro areas.

almost 10 years ago

#TrumpWon

Collection of #TrumpWon tweets from the 2016 Presidential Debate

almost 10 years ago

#HillaryWon

Updated method to include metro populations and areas.

almost 10 years ago

Tweet densities of #rstats in the US

I use `twitteR` and `ggmap` to display data on the usage of #rstats in tweets.

almost 10 years ago

Understanding Digital Privacy based on OkCupid data

I use the recently published OkCupid data to understand from a practical standpoint what private information can be inferred from more public information.

almost 10 years ago

Publish Document

Exploratory analysis of OkCupid data set from library(okcupid)

almost 10 years ago

Machine Learning for High Performance Reverse-Geo-Coding

Several options for reverse geo-coding (i.e. determining a specific State and County from (latitude, longitude) coordinate pairs) are explored for both performance and accuracy. Direct reverse-geo-code API calls, which take about 200msec per point, are compared to computation via “point-in-polygon”, as well as machine-learning randomForest and nnet classification models. A Random Forest model with accuracy approaching 98% improves throughput by a factor of 104 over a web-based API call. A neural network model has faster prediction times, but its accuracy was lower and modeling times were prohibitively long.

almost 10 years ago

CanonicalExamplesWordVectors

This explores some canonical examples of word vectors based on the GloVe vectors from Pennington et al.

over 10 years ago

Heat Map of Presidential Debate Speech

Plots a heat map of frequent words. Provides ability to filter for specific topics

over 10 years ago

GloVe 100d Word Vectors in R

playing aournd with GloVe 100d word vectors

over 10 years ago

GloVe 200d Word Vectors in R

Playing around with GloVe Vectors in R

over 10 years ago

GloVe 300d Word Vectors in R

This is just some playing around with word vectors

over 10 years ago

Publish Document

over 10 years ago

Canonical Examples of Word VectoRs

Using the GloVe word vectors I explore some well known examples of word vector relations. The GloVe word vectors are from Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014, http://nlp.stanford.edu/pubs/glove.pdf

over 10 years ago

2016 Presidential Debate Speech

An analysis of word frequencies used in the Debates by Presidential Candidates for the 2016 election cycle.

over 10 years ago

Santiam Pass Crash Analysis

Analysis of Santiam Pass Oregon Crashes and their correlation to snow and other factors.

over 10 years ago

Season 15-16

Visualization of ski days at Mt Bachelor Ski Area for the season of 2015-2016

over 10 years ago

Ski Trend HB

Draft of Ski Analytics Visualization

over 10 years ago

Debate Word Processing

Looks at different methods of analyzing speech from Presidential Debates 2015

over 10 years ago

Word Analysis of Presidential Debates 2015

This is a word-frequency analysis of the 2015 Presidential debate texts. The point of the analysis is to explore whether word analytics can reveal biases in the positions of candidates.

over 10 years ago

Tactical.Data Exascalar

almost 11 years ago

Exascalar June 2015

A data visualization exploration and analysis of the June2015 Top and Green500 Supercomputers.

almost 11 years ago

Exploratory Summary for PDX kaggle product classification

over 11 years ago

Word RippeR

Presentation on Coursera Capstone NLP word predictor

over 11 years ago

Ski Pass Exploratory

This is a "preliminary" analysis of data from 10 seasons of skiing.

over 11 years ago

Data_Text_Analysis_Rev_1_4

A short exploratory analysis of four different text corpora for the Coursera Capstone. In addition to looking at basic statistics like word number and frequency, adherence to Zipf's Law is examined.

over 11 years ago

Presentation on mtcars model

This is my presentation on the motor trend cars data. Very swank.

over 11 years ago

Predicting_Road_Safety_from_Twitter

This analysis uses data from the Oregon DOT real time Twitter feeds on road conditions to understand historical trends affecting road safety. It shows the data have reasonable predictive value.

over 11 years ago

Architectural Influencers

AN Exascalar analysis of the Top500/Green500 Supercomputers for architectural influencers. This is a first quick analysis. To be updated.

over 11 years ago

Exascalar as a lens correlating the Top500 and Green500

The Top500 and Green500 lists appear to be largely uncorrelated. However, but using Exascalar as an intermediate analysis point a linkage can be understood.

over 11 years ago

Exascalar November 2014 Delta

An analysis of the change of the Top500 and Green500 supercomputer populations between November and April 2014 using Exascalar

over 11 years ago

Power_Trend_of_Exascalar

Short blog on Exascalar trends and some extrapolation

over 11 years ago

Engle_Road

over 11 years ago

Twitter-based Santiam Pass Accident Analysis

This is an analysis of the locations, dates, and density of accidents on Santiam Pass, Oregon, based on ODOT twitter feed data which reports realtime accident data.

over 11 years ago

Looking for Evidence of Climate Change in Temp Records

This analysis uses threshold detection to look for increases in TMIN (daily minimum temperatures) in data from the NOAA National Climate Data Center http://www.ncdc.noaa.gov/cdo-web/datasets.

almost 12 years ago

Analysis of SPECCPU2006 dependence on MHz and Cores

This is a quick analysis of the dependency of CPU performance, as measured by SPECCPU2006 on core count and MHz.

almost 12 years ago

Number and Severity of Hacks and Data Breaches

Hardly as week goes by without a data breach being reported. I found some data online that I was able to clean and analyze to partially answer the question: are the number and severity of hacking attacks increasing relative to the overall population of data breaches. The answer is surprising.

almost 12 years ago

Storm_Analysis1

Analysis of storm event data for reproducible research class from JHU on Coursera.

almost 12 years ago

Sign In

ww44ss

Winston Saunders

Recently Published