Single cell RNA-seq data clustering using TF-IDF based methods


Single cell transcriptomics is critical for understanding cellular heterogeneity and identifying novel cell types. Taking advantage of recent advances in single cell RNA sequencing (scRNA-Seq) technology requires unsupervised clustering algorithms that are robust to high levels of technical and biological noise and that scale to datasets of millions of cells.

University of Connecticut researchers present novel computational approaches for clustering scRNA-seq data based on the Term Frequency – Inverse Document Frequency (TF-IDF) transformation that has been successfully used in the field of text analysis.
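To make the analogy with text analysis concrete, the following is a minimal sketch of a TF-IDF transformation applied to a cells-by-genes count matrix, treating each cell as a "document" and each gene as a "term". The function name `tfidf_transform`, the smoothed IDF formula, and the toy counts are assumptions made for illustration; the exact weighting and the downstream clustering steps used in the paper may differ.

```python
import numpy as np

def tfidf_transform(counts):
    """Illustrative TF-IDF weighting for a cells x genes count matrix.

    Assumption: cells play the role of documents and genes the role of terms.
    The smoothed IDF used here is one common variant, not necessarily the
    one used in the paper.
    """
    counts = np.asarray(counts, dtype=float)
    n_cells = counts.shape[0]

    # Term frequency: each gene's count as a fraction of the cell's total count.
    totals = counts.sum(axis=1, keepdims=True)
    tf = np.divide(counts, totals, out=np.zeros_like(counts), where=totals > 0)

    # Document frequency: number of cells in which each gene is detected.
    df = (counts > 0).sum(axis=0)

    # Smoothed inverse document frequency: genes detected in fewer cells
    # receive a larger weight; a gene detected in every cell gets weight 1.
    idf = np.log((1 + n_cells) / (1 + df)) + 1

    return tf * idf

# Toy data (hypothetical): 3 cells x 3 genes.
toy_counts = [[10, 0, 5],
              [8, 2, 0],
              [6, 0, 0]]
weights = tfidf_transform(toy_counts)
print(weights.round(3))
```

In this sketch, the first gene is detected in every cell, so its weight is just its within-cell frequency, while genes detected in a single cell are up-weighted. The resulting matrix could then be passed to any standard clustering routine.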

Figure: Compared scRNA-Seq clustering methods


Experimental results show that the TF-IDF based methods consistently outperform commonly used scRNA-Seq clustering approaches.

Moussa M, Măndoiu II. (2018) Single cell RNA-seq data clustering using TF-IDF based methods. BMC Genomics 19(Suppl 6):569. [article]

