In [[Information Retrieval (IR)]], tf–idf, short for term frequency–inverse document frequency, is a measure of importance of a word to a document in a collection or corpus, adjusted for the fact that some words appear more frequently in general.
Variant: c-TF-IDF, a Class-based TF-IDF procedure using scikit-learns TfidfTransformer as a base. Ex, BERTopic