RPubs

by RStudio

aleksandra-szpakowska

Aleksandra Szpakowska

Recently Published

Spatial Analysis of Blood Test Points in Warsaw

5 months ago

Dimension reduction and clustering of high-variance gene expression data

This project explores various dimensiona reduction and clustering techniques applied to high-variance gene expression data. The primary objective is to identify meaningful patterns in gene expression by reducing dimensions while preserving key structures. The analysis begins with data preprocessing, scaling, and feature selection, followed by Principal Component Analysis (PCA) and Factor Analysis (FA) to determine the optimal number of components or factors. Clustering methods, including K-Means, Hierarchical Clustering, and t-SNE with K-Means, are then applied to uncover potential subgroups within the data.

12 months ago