You want to improve your search engine by comparing it to a competitor. Your database holds over 30 million queries, and your target is to select only 5 thousand for evaluation. How would you sample? Note that we are not asking for an implementation of the sampling, but for the design.
Given: the database has 30 million queries.
Target: select only 5 thousand of them for evaluation, in order to compare the information-retrieval effectiveness of our search engine with that of our competitor.
Search engines are among the most important software on the internet today. Everyone wants fast and relevant results for their search queries.
Retrieval effectiveness is one of the most important criteria for evaluating search engines. Most tests of this criterion do not take into account actual user interaction during the search process; instead, they are based on a query-response paradigm. Selecting the queries is therefore the key part of this kind of evaluation.
The following categories of queries make a good combination:
A good mix of such queries can give a better picture of the performance of our search engine.
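One way to get such a mix is stratified sampling: group the queries by category, then draw a random sample from each group in proportion to its size. The following is only a minimal sketch of that idea, not a prescribed implementation. The function names, the `seed` parameter, and the `length_bucket` categorizer (which buckets queries by word count) are hypothetical placeholders; the real categories would come from whatever query taxonomy the evaluation uses.

```python
import random
from collections import defaultdict


def length_bucket(query):
    """Hypothetical categorizer: bucket a query by its number of words."""
    words = len(query.split())
    if words <= 1:
        return "short"
    if words <= 3:
        return "medium"
    return "long"


def stratified_sample(queries, category_of, total, seed=0):
    """Draw roughly `total` queries, allocated to categories proportionally.

    Each non-empty category contributes at least one query, so rounding can
    make the final count differ slightly from `total`.
    """
    rng = random.Random(seed)  # fixed seed so the sample is reproducible

    # Group queries into strata by category.
    strata = defaultdict(list)
    for q in queries:
        strata[category_of(q)].append(q)

    n = sum(len(items) for items in strata.values())
    sample = {}
    for cat, items in strata.items():
        # Proportional allocation, at least 1, never more than available.
        k = min(len(items), max(1, round(total * len(items) / n)))
        sample[cat] = rng.sample(items, k)  # sampling without replacement
    return sample
```

With 30 million queries and a target of 5 thousand, the same call would be `stratified_sample(all_queries, length_bucket, 5000)`. Proportional allocation keeps the sample representative of real traffic. If rare categories matter for the comparison, a fixed minimum per category (larger than the single query guaranteed here) may be the better design.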