The following question asks about a corpus containing these documents.
Document 1: a a b c
Document 2: a c c c d e f
Document 3: a c d d d
Document 4: a d f
What is the tf-idf value for "d" in Document 3?
Round answers to two decimal places. Use the natural logarithm (log base e) when taking a logarithm.
0.00
0.57
0.69
0.86
2.07
3.46
6.00
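Term frequency (tf) = number of times the term occurs in the document (raw count)
tf (d) = 3
(Dividing by the document length, 3 / 5 = 0.60, would give a tf-idf of 0.17, which is not among the answer choices, so the raw count is used here.)
Inverse document frequency (idf) = ln (N / df), where N is the number of documents in the corpus and df is the number of documents that contain the term
"d" occurs in Documents 2, 3, and 4, so idf (d) = ln (4 / 3) ≈ 0.29
Hence,
tf-idf = tf × idf = 3 × ln (4 / 3) ≈ 0.86
The answer is 0.86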
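The calculation can be checked with a minimal Python sketch. It assumes tf is the raw count of the term in the document and idf = ln (N / df); the function name tf_idf and the dictionary layout are illustrative choices, not part of the question.

```python
import math

# Corpus copied from the question; each document is a list of tokens.
corpus = {
    1: "a a b c".split(),
    2: "a c c c d e f".split(),
    3: "a c d d d".split(),
    4: "a d f".split(),
}


def tf_idf(term, doc_id, docs):
    """Raw-count tf times ln(N / df).

    Assumes the term occurs in at least one document, so df is never zero.
    """
    tf = docs[doc_id].count(term)  # occurrences of the term in this document
    df = sum(1 for tokens in docs.values() if term in tokens)  # documents containing the term
    idf = math.log(len(docs) / df)  # natural logarithm
    return tf * idf


print(round(tf_idf("d", 3, corpus), 2))  # prints 0.86
```

A term that appears in every document, such as "a", has idf = ln (4 / 4) = 0, so its tf-idf is 0.00 in any document.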