Recently Published

AESC40180 25/26 - Practical week 5
Code and explanation for practical #5 of Data Analysis for Biologists
Tarea 01
10. Informed prior
Consumos Electricidad y Agua Facultad Ciencias de la Salud, Universidad de Granada
Evolución de consumos de electricidad y agua así como tarificación.
Map showing Queens Neckless & sports
Beautiful Queen's Neckless & Sports facilities around Marine Drive, Mumbai
Industry Best-Practice EDA for Resource Estimation
Thalanga VMS Deposit | Pre-Estimation Workflow & Domain Modeling
text
first day using rpubs
HTML
Document
Plot
Swiftkey_Nextword_Prediction_Pitch
This project presents a next-word prediction application developed as part of the Johns Hopkins Data Science Capstone (Coursera). The objective was to build a lightweight and efficient predictive text model similar to those used in smart mobile keyboards. Using the HC Corpora English datasets (blogs, news, and Twitter), I performed exploratory data analysis, constructed n-gram frequency tables (2-gram, 3-gram, and 4-gram models), and implemented a backoff strategy to handle unseen word combinations. The final product is a deployed Shiny application that predicts the top three most likely next words given an input phrase. The model is optimized for performance and memory efficiency to ensure responsiveness in a web environment. Key Features: Frequency-based n-gram language model Backoff prediction logic (4-gram → 3-gram → 2-gram → fallback) Efficient storage using serialized RDS tables Deployed via shinyapps.io Live application: https://chidemannie.shinyapps.io/Swiftkey_Next_Word_Predictor/ Source code: https://github.com/chidemannie/swiftkey-capstone This slide deck summarizes the modeling approach, performance considerations, and demonstrates how the application works.