Single-cell Preprocess: Add TF-IDF #339

mstrazar · 2019-05-22T06:50:58Z

An alternative to the log(CPM+1) transformation of count data is the TF-IDF transform, adapted from text analysis. Just as TF-IDF finds the characteristic words that describe a document's topic, it can be used to find stand-out genes ("terms") for each cell ("document").

It should be relatively straightforward to include this approach in Single-cell preprocess.

See https://bmcgenomics.biomedcentral.com/articles/10.1186/s12864-018-4922-4
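
To make the idea concrete, here is a minimal NumPy sketch of one common TF-IDF variant applied to a cells-by-genes count matrix. It is an illustration only: the function name `tfidf_transform`, the toy matrix, and the smoothed IDF formula `log((1 + n_cells) / (1 + df)) + 1` are assumptions, and the weighting used in the linked paper or in any eventual Single-cell preprocess implementation may differ.

```python
import numpy as np


def tfidf_transform(counts):
    """Hypothetical helper: TF-IDF weights for a (n_cells, n_genes) count matrix."""
    counts = np.asarray(counts, dtype=float)
    n_cells = counts.shape[0]

    # Term frequency: each gene's share of the total counts in its cell.
    # Cells with zero total counts are left as all zeros.
    totals = counts.sum(axis=1, keepdims=True)
    tf = np.divide(counts, totals, out=np.zeros_like(counts), where=totals > 0)

    # Document frequency: number of cells in which each gene is detected.
    df = (counts > 0).sum(axis=0)

    # Smoothed inverse document frequency (an assumed variant):
    # genes detected in every cell get weight 1, rarer genes get more.
    idf = np.log((1.0 + n_cells) / (1.0 + df)) + 1.0

    return tf * idf


# Toy data: 3 cells (rows) x 3 genes (columns).
counts = np.array([
    [10, 0, 5],
    [8, 2, 0],
    [9, 0, 0],
])
weights = tfidf_transform(counts)
```

In this toy matrix the first gene is detected in every cell, so its IDF is 1 and its weight equals its plain within-cell frequency, while the second and third genes, each detected in a single cell, are up-weighted relative to it.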

gabriellayi · 2020-06-23T06:30:53Z

Hi,

I read your papers about this method, but I’m new to coding. Would you mind sharing the code, or telling me where I can find code or a tutorial for it? I'd like to apply it to scRNA-seq gene clustering.

Thanks!!
Yi

mstrazar added the enhancement label May 22, 2019

JakaKokosar added idea and removed enhancement labels Sep 1, 2020

JakaKokosar changed the title ~~[ENH] Single-cell Preprocess: Add TF-IDF~~ Single-cell Preprocess: Add TF-IDF Sep 1, 2020
