Recently Published
AESC40180 25/26 - Practical week 5
Code and explanation for practical #5 of Data Analysis for Biologists
Consumos Electricidad y Agua Facultad Ciencias de la Salud, Universidad de Granada
Evolución de consumos de electricidad y agua así como tarificación.
Map showing Queens Neckless & sports
Beautiful Queen's Neckless & Sports facilities around Marine Drive, Mumbai
Industry Best-Practice EDA for Resource Estimation
Thalanga VMS Deposit | Pre-Estimation Workflow & Domain Modeling
text
first day using rpubs
Swiftkey_Nextword_Prediction_Pitch
This project presents a next-word prediction application developed as part of the Johns Hopkins Data Science Capstone (Coursera).
The objective was to build a lightweight and efficient predictive text model similar to those used in smart mobile keyboards. Using the HC Corpora English datasets (blogs, news, and Twitter), I performed exploratory data analysis, constructed n-gram frequency tables (2-gram, 3-gram, and 4-gram models), and implemented a backoff strategy to handle unseen word combinations.
The final product is a deployed Shiny application that predicts the top three most likely next words given an input phrase. The model is optimized for performance and memory efficiency to ensure responsiveness in a web environment.
Key Features:
Frequency-based n-gram language model
Backoff prediction logic (4-gram → 3-gram → 2-gram → fallback)
Efficient storage using serialized RDS tables
Deployed via shinyapps.io
Live application:
https://chidemannie.shinyapps.io/Swiftkey_Next_Word_Predictor/
Source code:
https://github.com/chidemannie/swiftkey-capstone
This slide deck summarizes the modeling approach, performance considerations, and demonstrates how the application works.