Question

In: Statistics and Probability

Compare the TF-IDF pivoted normalization formula and Okapi formula analytically. Both formulas are given in the...

Compare the TF-IDF pivoted normalization formula and Okapi formula analytically.

Both formulas are given in the figure above. What are the common statistical information about documents and queries that they both use? How are the two formulas similar to each other, and how are they different?

Solutions

Expert Solution


Related Solutions

Compare the TF-IDF pivoted normalization formula and Okapi BM25 formula analytically. Both formulas are given in...
Compare the TF-IDF pivoted normalization formula and Okapi BM25 formula analytically. Both formulas are given in Table 1 of Singhal's review paper.(Note that there is an error in the Okapi formula.). What are the common statistical information about documents and queries that they both use? How are the two formulas similar to each other, and how are they different?
Implement the TF-IDF pivoted normalization method and Okapi retrieval formula above. For the Okapi formula, set...
Implement the TF-IDF pivoted normalization method and Okapi retrieval formula above. For the Okapi formula, set k3=1000 and k1=1.2, so that you have only one parameter b to vary. In this way, both algorithms have precisely one parameter to tune.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT