Recently Published
Principal Component Analysis for THE ranking 2021
In this home task, THE 2021 data is used to practice the principal component analysis (PCA). The results are plotted with special attention for Japanese, USA, and UK universities.
Web scraping with RSelenium
Some parsing tricks.
efa + regression
Here, I conduct the exploratory factor analysis (efa) on the Canadian data to construct the factors for predicting students' math achievement.
Poetic Topic Modeling using LDA: themes of Silver Age & Soviet times
The last paper for cmta course.
BA final project part 1
Just the beginning.
Cultural geography
Some graphs constructed for the project about cultural consumption in 2002-2010 St. Petersburg.
Simple maths, Log-likelihood, and so on: Tarkovsky's poetry and Silver Age poems
That is my first lab paper for the course "computational methods for text analysis" - the comparison of 2 corpora of poems. There are some mistakes, though some moments are good.
Text classifiers: logistic regression & random forest
I prepared this task as the second laboratory paper for the university course "computational methods for text analysis". It is interesting to look at, although there are some troubles with data.
Clusterization & churn prediction
I did this small research for my business analytics course and publish it mostly to send a link to a couple of friends. It is a bit ugly but might be helpful. Enjoy!
MPSP - final project
Here are some things we explored during our introductory course in Data Analysis in Sociology. Enjoy!