Question

What is the difference between stemming and lemmatization, and how can each affect precision and recall? Explain with examples.

Solutions

Expert Solution

Stemming and lemmatization are closely related text-normalization techniques used in information retrieval and natural language processing (NLP).

Stemming is the process of keeping the relevant part of a word (the stem) and removing the rest. In practice this means stripping the inflected endings of a word, usually with simple heuristic rules, so that different forms of the word match the same query term.

For example, the Porter stemmer, one of the most widely used stemmers, removes inflections in NLP applications.

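So, if we apply stemming to a query, we can increase recall, because the query terms now match their other inflected forms. The cost can be lower precision, since unrelated words that happen to share a stem are matched as well.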
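The recall effect can be sketched in a few lines of Python. This is only an illustration: `toy_stem` is a made-up suffix stripper, not the Porter algorithm, and the four documents are hypothetical.

```python
# Toy suffix stripper for illustration only; this is NOT the Porter algorithm.
SUFFIXES = ("ing", "ed", "s")

def toy_stem(word):
    """Strip the first matching suffix if at least three letters remain."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

# Hypothetical mini-collection: document ID -> text.
docs = {
    1: "the team connected the servers",
    2: "connecting two networks",
    3: "a stable connection",
    4: "no related terms here",
}

def search(query, normalize):
    """Return IDs of documents with a token that normalizes like the query."""
    target = normalize(query)
    return sorted(
        doc_id
        for doc_id, text in docs.items()
        if any(normalize(token) == target for token in text.split())
    )

exact_hits = search("connect", str.lower)   # exact matching only
stemmed_hits = search("connect", toy_stem)  # matching on toy stems

print(exact_hits)    # []
print(stemmed_hits)  # [1, 2]
```

Without stemming the query "connect" finds nothing; with the toy stemmer it also matches "connected" and "connecting". Document 3 ("connection") is still missed because this toy rule set has no "-ion" rule, which shows that the gain depends on the rules the stemmer actually implements.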

Lemmatization also removes inflections from a word, but it relies on a lexical knowledge base such as WordNet to obtain the correct base form (the lemma) instead of simply stripping the endings.

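For example, the Porter stemmer reduces both "meanness" and "meaning" to "mean", so a query for one also retrieves documents about the other. A WordNet-based lemmatizer, treating both words as nouns, keeps "meanness" and "meaning" as separate lemmas.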

Thus lemmatization generally gives higher precision than stemming, because it conflates only words that really share a base form.
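The trade-off can be made concrete with a small Python sketch. Everything here is assumed for illustration: `toy_stem` is a crude suffix stripper (not Porter), `LEMMAS` is a one-entry dictionary standing in for a resource like WordNet, and the documents and relevance judgments are invented.

```python
def toy_stem(word):
    """Repeatedly strip a few suffixes (illustrative, not the Porter stemmer)."""
    changed = True
    while changed:
        changed = False
        for suffix in ("ness", "ing", "s"):
            if word.endswith(suffix) and len(word) - len(suffix) >= 3:
                word = word[: -len(suffix)]
                changed = True
                break
    return word

# Toy lemma dictionary standing in for a lexical knowledge base.
LEMMAS = {"meanings": "meaning"}

def toy_lemma(word):
    """Look the word up; unknown words are returned unchanged."""
    return LEMMAS.get(word, word)

# Hypothetical collection and assumed relevance judgments for "meaning".
docs = {
    1: "the meaning of the word",
    2: "two meanings of one term",
    3: "the meanness of the villain",
    4: "mean value of the sample",
}
relevant = {1, 2}

def retrieve(query, normalize):
    """Return the set of document IDs whose tokens normalize like the query."""
    target = normalize(query)
    return {
        doc_id
        for doc_id, text in docs.items()
        if any(normalize(token) == target for token in text.split())
    }

def precision_recall(retrieved, relevant_ids):
    """Precision = hits / retrieved, recall = hits / relevant."""
    hits = len(retrieved & relevant_ids)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant_ids) if relevant_ids else 0.0
    return precision, recall

exact = precision_recall(retrieve("meaning", lambda w: w), relevant)
stemmed = precision_recall(retrieve("meaning", toy_stem), relevant)
lemmatized = precision_recall(retrieve("meaning", toy_lemma), relevant)

print(exact)       # (1.0, 0.5)
print(stemmed)     # (0.5, 1.0)
print(lemmatized)  # (1.0, 1.0)
```

On this toy data, exact matching misses "meanings" (lower recall), stemming also pulls in "meanness" and "mean" (lower precision), and the dictionary lookup retrieves exactly the relevant documents. Real collections will not separate this cleanly; the numbers only illustrate the direction of each effect.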

SKIP POINTERS:

A skip pointer is another mechanism used in information retrieval: it lets the search engine skip part of a postings list while processing a query, which reduces retrieval time. The two challenges are where to place the skip pointers and how to merge results efficiently using them.

Skip pointers help with queries of the form X AND Y. Such a query has two terms connected by "and", so only documents that contain both X and Y should be retrieved. While intersecting the two postings lists, the search engine can follow a skip pointer to jump over document IDs in one list that cannot appear in the other.

Skip pointers are not useful for queries of the form X OR Y, because nothing can be skipped there. The search engine has to visit every document ID in the postings list of either X or Y, since each of those documents belongs in the result.

For example, if someone searches for "apple OR orange", every document that contains apple or orange is relevant. If skip pointers were used, the search engine would miss some of those documents. Therefore skip pointers cannot be used for this kind of query.
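The behaviour of skip pointers during a postings merge can be sketched in Python. The placement rule used here (one skip every floor(sqrt(n)) postings) is an assumed, commonly cited heuristic, and the function names and postings lists are hypothetical.

```python
import math

def intersect(p1, p2, use_skips):
    """Intersect two sorted docID lists; return (matches, loop_steps)."""
    # Assumed placement: one skip pointer every floor(sqrt(n)) postings.
    skip1 = max(1, math.isqrt(len(p1)))
    skip2 = max(1, math.isqrt(len(p2)))
    i = j = steps = 0
    matches = []
    while i < len(p1) and j < len(p2):
        steps += 1
        if p1[i] == p2[j]:
            matches.append(p1[i])
            i += 1
            j += 1
        elif p1[i] < p2[j]:
            # Follow the skip only if its target does not overshoot p2[j].
            if (use_skips and i % skip1 == 0 and i + skip1 < len(p1)
                    and p1[i + skip1] <= p2[j]):
                i += skip1
            else:
                i += 1
        else:
            if (use_skips and j % skip2 == 0 and j + skip2 < len(p2)
                    and p2[j + skip2] <= p1[i]):
                j += skip2
            else:
                j += 1
    return matches, steps

def union(p1, p2):
    """OR query: every docID from either list belongs in the answer."""
    return sorted(set(p1) | set(p2))

x_postings = list(range(100))  # hypothetical frequent term
y_postings = [5, 50, 99]       # hypothetical rare term

plain, plain_steps = intersect(x_postings, y_postings, use_skips=False)
skipped, skip_steps = intersect(x_postings, y_postings, use_skips=True)

print(skipped)                             # [5, 50, 99]
print(skip_steps < plain_steps)            # True
print(len(union(x_postings, y_postings)))  # 100
```

The intersection returns the same documents with fewer loop steps when skips are followed, because stretches of the long list that cannot match are jumped over. The union has to output all 100 document IDs, so there is nothing a skip pointer could safely pass over.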

