RPubs

by RStudio

Recently Published

AESC40180 25/26 - Practical week 5

By V_Morera-Pujol

Code and explanation for practical #5 of Data Analysis for Biologists

4 months ago

Consumos Electricidad y Agua Facultad Ciencias de la Salud, Universidad de Granada

By watzilei

Evolución de consumos de electricidad y agua así como tarificación.

4 months ago

Map showing Queens Neckless & sports

By gharemp

Beautiful Queen's Neckless & Sports facilities around Marine Drive, Mumbai

4 months ago

Industry Best-Practice EDA for Resource Estimation

By ghoziankarami

Thalanga VMS Deposit | Pre-Estimation Workflow & Domain Modeling

4 months ago

text

By Nhi_Truong

first day using rpubs

4 months ago

Survey experiments lab session

By jordimunoz

4 months ago

Swiftkey_Nextword_Prediction_Pitch

By chidemannie

This project presents a next-word prediction application developed as part of the Johns Hopkins Data Science Capstone (Coursera). The objective was to build a lightweight and efficient predictive text model similar to those used in smart mobile keyboards. Using the HC Corpora English datasets (blogs, news, and Twitter), I performed exploratory data analysis, constructed n-gram frequency tables (2-gram, 3-gram, and 4-gram models), and implemented a backoff strategy to handle unseen word combinations. The final product is a deployed Shiny application that predicts the top three most likely next words given an input phrase. The model is optimized for performance and memory efficiency to ensure responsiveness in a web environment. Key Features: Frequency-based n-gram language model Backoff prediction logic (4-gram → 3-gram → 2-gram → fallback) Efficient storage using serialized RDS tables Deployed via shinyapps.io Live application: https://chidemannie.shinyapps.io/Swiftkey_Next_Word_Predictor/ Source code: https://github.com/chidemannie/swiftkey-capstone This slide deck summarizes the modeling approach, performance considerations, and demonstrates how the application works.

4 months ago

RPubs

Recently Published

AESC40180 25/26 - Practical week 5

Tarea 01

10. Informed prior

Consumos Electricidad y Agua Facultad Ciencias de la Salud, Universidad de Granada

Map showing Queens Neckless & sports

Industry Best-Practice EDA for Resource Estimation

text

HTML

Document

Plot

Survey experiments lab session

Swiftkey_Nextword_Prediction_Pitch

Sign In

Recently Published