Question

In: Computer Science

What should the range be for each of the following for the most significant association rules:...

What should the range be for each of the following for the most significant association rules:

Support, Confidence, Lift and Conviction?

Solutions

Expert Solution

Support

support(A→C)=support(A∪C),range: [0,1]

The support metric is defined for itemsets, not assocication rules. The table produced by the association rule mining algorithm contains three different support metrics: 'antecedent support', 'consequent support', and 'support'. Here, 'antecedent support' computes the proportion of transactions that contain the antecedent A, and 'consequent support' computes the support for the itemset of the consequent C. The 'support' metric then computes the support of the combined itemset A ∪ C -- note that 'support' depends on 'antecedent support' and 'consequent support' via min('antecedent support', 'consequent support').

Typically, support is used to measure the abundance or frequency (often interpreted as significance or importance) of an itemset in a database. We refer to an itemset as a "frequent itemset" if you support is larger than a specified minimum-support threshold. Note that in general, due to the downward closure property, all subsets of a frequent itemset are also frequent.

Confidence

The confidence of a rule A->C is the probability of seeing the consequent in a transaction given that it also contains the antecedent. Note that the metric is not symmetric or directed; for instance, the confidence for A->C is different than the confidence for C->A. The confidence is 1 (maximal) for a rule A->C if the consequent and antecedent always occur together.

Lift

The lift metric is commonly used to measure how much more often the antecedent and consequent of a rule A->C occur together than we would expect if they were statistically independent. If A and C are independent, the Lift score will be exactly 1.

Conviction

A high conviction value means that the consequent is highly depending on the antecedent. For instance, in the case of a perfect confidence score, the denominator becomes 0 (due to 1 - 1) for which the conviction score is defined as 'inf'. Similar to lift, if items are independent, the conviction is 1.

Note: Plzzz don' t give dislike.....Plzzz comment if u have any problem i will try to resolve it.......


Related Solutions

Answer the following: 1a. What are the rules for counting significant figures? 1b. What are the...
Answer the following: 1a. What are the rules for counting significant figures? 1b. What are the rules of significant figures when multiplying or dividing numbers? 1c. What are the rules of significant figures when adding or subtracting numbers? 1d. What is the formula for an average? 1e. What is the formula for a standard deviation (assuming small sample size), also called the square root of the variance? 1f. How many significant figures should a standard deviation have? 1g. If you...
What are the most significant rules to consider when converting personal-use property to business use? Which...
What are the most significant rules to consider when converting personal-use property to business use? Which common mistakes may be made during this conversion process?
Classification and Reg Tress + Association Rules Question 11 The number of association rules increases Additively...
Classification and Reg Tress + Association Rules Question 11 The number of association rules increases Additively with number of items Multiplicatively with number of items Exponentially with number of items Question 12 If there are k items, number of association rules formed is formulated as : 3^k - 2^(k-1) 3^k - 2^(k+1) + 1 3^k - 2^(k+1) - 1 3^k - 2^(k-1) + 1 Question 13 Consider the rules A -> B, B -> A, where A, B are two...
Tell if each measure of association is statistically significant and briefly explain why or why not....
Tell if each measure of association is statistically significant and briefly explain why or why not. - RR=1.69; p=0.52 - OR=0.95; (95% CI: 0.85-1.05) -OR=2.10; p=.003 -RR= 2.56; (95% CI: 1.90-3.24)
Use the following set of data to answer the question: is there a significant association between...
Use the following set of data to answer the question: is there a significant association between an immigrant’s length of time in the country and his/her level of acculturative stress, as measured by a well-being scale? The data are listed in the table below. Years Well-being In country score X Y 12 6 15 8 9 4 7 5 18 9 24 10 15 7 16 6 21 3 15 9 M = 15.20 M = 6.70 SSx = 235.60...
Identify (using the most appropriate of lists, rules, or notation) the elements of each of the...
Identify (using the most appropriate of lists, rules, or notation) the elements of each of the following sample spaces: 1) Weight of a randomly selected crawfish 2) A cubic die and a octahedron (8 sided) die being rolled 3) Human IQ 4) Days till next hurricane hits Baton Rouge 5) A die being rolled until an odd number is obtained thrice in a row
1. What answer should be reported, with the correct number of significant figures, for the following...
1. What answer should be reported, with the correct number of significant figures, for the following calculation? (433.621 - 333.9)
What answer should be reported, with the correct number of significant figures, for the following calculation...
What answer should be reported, with the correct number of significant figures, for the following calculation (249.36+ 41.3)/63.7?
Select whether a chi-square test for association is the most appropriate for the following scenarios: Most...
Select whether a chi-square test for association is the most appropriate for the following scenarios: Most appropriate or Not appropriate a) Emma wishes to know whether gender has any relation with whether one prefers wine, beer, or liqueur. She surveys a group of men and women and records the number of participants who prefer wine, the number who prefer beer and the number who prefer liqueur. (yes or no) b) Danielle surveys a group of people on how often they...
What are the most significant contributions to epidemiology of Pasteur and Koch?
What are the most significant contributions to epidemiology of Pasteur and Koch?
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT