Term Frequency, Inverse Document Frequency (TFIDF) - AFS

AFS Installation and Administration Guide

Product: AFS
AFS_Version: 7.10
Platform: RHEL
Category: Reference Guide

Term Frequency, Inverse Document Frequency (TFIDF) is a term-weighting scheme used in information retrieval and text mining.

This weight is a statistical measure of how important a word is to a document in a collection or corpus. The importance increases in proportion to the number of times the word appears in the document, but is offset by how common the word is across the corpus, so words that occur in most documents receive low weights. Variations of the TFIDF weighting scheme are used by search engines as a central tool for scoring and ranking a document's relevance to a user query.
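
The weighting described above can be illustrated with a minimal Python sketch. It assumes one common textbook variant: term frequency is the term's count divided by the document length, and inverse document frequency is log(N / df), where N is the number of documents and df is the number of documents containing the term. The function name `tf_idf` and the sample corpus are hypothetical, and this guide does not state which variant AFS itself uses.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed variant (not confirmed for AFS):
    tf = count / document length, idf = log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical three-document corpus, already split into words.
corpus = [
    "the cat sat on the mat".split(),
    "the dog sat on the log".split(),
    "the cat chased the dog".split(),
]
weights = tf_idf(corpus)
```

In this sketch, "the" appears in every document, so its weight is zero everywhere, while "mat" appears in only one document and receives the highest weight in the first document. Production systems typically add smoothing or normalization on top of this basic form.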