Question

In: Statistics and Probability

{1,2,3} {2,3,4} {3,4,5} {4,5,6} {1,3,5} {2,4,6} {1,3,4} {2,4,5} {3,5,6} {1,2,4} {2,3,5} {3,4,6} Exercise 6.4.2: Apply Toivonen’s...

{1,2,3} {2,3,4} {3,4,5} {4,5,6} {1,3,5} {2,4,6} {1,3,4} {2,4,5} {3,5,6} {1,2,4} {2,3,5} {3,4,6}

Exercise 6.4.2: Apply Toivonen’s Algorithm to the data, with a support threshold of 4. Take as the sample the first row of baskets: {1,2,3}, {2,3,4}, {3,4,5}, and {4,5,6}, i.e., one-third of the file. Our scaleddown support theshold will be 1.
(a) What are the itemsets frequent in the sample?
(b) What is the negative border?
(c) What is the outcome of the pass through the full dataset? Are any of the itemsets in the negative border frequent in the whole?

Solutions

Expert Solution

Answer:

By using ,given data

Here, 12 baskets are given.

Below are the steps or passes for PCY algorithm as follows.

Pass 1:

(i) Determine total number of occurences of allitems called as count.
(ii) For every bucket,consist of items fil; : : :;,hash all pairs to a bucket of hash table,Increment bucket count by 1.
(iii)  Determine ,L1 and the items with the count of atleasts at the end of the pass.
(iv) Determine,buckets with count atleast s at the ende buckets with counts at least s.

Pass 2:

(i) All the frequent items, i.e. L1. holds in main memory.
(ii)  Main memory also holds the bitmap summarizing the results of the hashing from pass 1.
(iii) Main memory also holds a table with all the candidate pairs and their counts.

A pair (x; y) can be a candidate in C2 only if al l of the following are true:

(a). x is in L1.
(b). y is in L1.
(c). (x; y) hashes to a frequent bucket.

(iv) We consider each basket, and each pair of its items and making the test as above.

If all three conditions meets by pair, add to its count in memory, or make an entry for it if one
not yet exist.


Related Solutions

{1,2,3} {2,3,4} {3,4,5} {4,5,6} {1,3,5} {2,4,6} {1,3,4} {2,4,5} {3,5,6} {1,2,4} {2,3,5} {3,4,6} Exercise 6.4.2: Apply Toivonen’s...
{1,2,3} {2,3,4} {3,4,5} {4,5,6} {1,3,5} {2,4,6} {1,3,4} {2,4,5} {3,5,6} {1,2,4} {2,3,5} {3,4,6} Exercise 6.4.2: Apply Toivonen’s Algorithm to the data, with a support threshold of 4. Take as the sample the first row of baskets: {1,2,3}, {2,3,4}, {3,4,5}, and {4,5,6}, i.e., one-third of the file. Our scaleddown support theshold will be 1. (a) What are the itemsets frequent in the sample? (b) What is the negative border? (c) What is the outcome of the pass through the full dataset? Are...
Calculus #3: 1. a) Let A = (2,4,6),B = (1,2,3) and C = (5,5,5). Find point...
Calculus #3: 1. a) Let A = (2,4,6),B = (1,2,3) and C = (5,5,5). Find point D so that ABCD is a parallelogram. b). Two points X and Y are colinear if they lie on the same line. Are the points A = (3,6,−1), B = (2,0,3) and C = (−1, 3, −4) colinear? Justify your answer.
Does {1,2,3} ,{3,4,5}, {1,4}, {1,5}, {2,4}, {2,5} form an incidence geometry? If so do any of...
Does {1,2,3} ,{3,4,5}, {1,4}, {1,5}, {2,4}, {2,5} form an incidence geometry? If so do any of the parallel postulates hold (Elliptic, Euclidean, Hyperbolic parallel postulates)?
The Objective of this exercise is to apply a systematic analysis of a real ethical dilemma....
The Objective of this exercise is to apply a systematic analysis of a real ethical dilemma. Reporting on Robin Williams When actor Robin Williams took his life in August of 2014, major news organizations covered the story in great detail. Most major news outlets reported on Marin County Sheriff’s Lt. Keith Boyd’s press conference, which revealed graphic details from the coroner’s report about the methods Williams used. While there was great interest on the part of the public in finding...
The Objective of this exercise is to apply a systematic analysis of a real ethical dilemma....
The Objective of this exercise is to apply a systematic analysis of a real ethical dilemma. Reporting on Robin Williams When actor Robin Williams took his life in August of 2014, major news organizations covered the story in great detail. Most major news outlets reported on Marin County Sheriff’s Lt. Keith Boyd’s press conference, which revealed graphic details from the coroner’s report about the methods Williams used. While there was great interest on the part of the public in finding...
Personal Selling Assignment: The objective for this exercise is for you to better understand and apply...
Personal Selling Assignment: The objective for this exercise is for you to better understand and apply the steps used in personal selling. This is an individual assignment. Please explain how you can use the seven steps of personal selling in the activity of getting a new job. For each step, write 1-2 sentences on how the step relates to job search, interviewing, etc. The seven steps are: PROSPECTING: The salesperson must develop a list of customers. PREAPPROACH: The salesperson must...
In this exercise you will apply you new understanding of class design to develop an advanced...
In this exercise you will apply you new understanding of class design to develop an advanced version of the array helper class we created in the classroom. This variation of the design will maintain the data stored in the array in an ordered fashion (0 -100 for our int storage) . The class should always maintain its stored data in order meaning at any time if the programmer iterates the items, they will come back in order. This means each...
Exercise 20-04 The following facts apply to the pension plan of Vaughn Inc. for the year...
Exercise 20-04 The following facts apply to the pension plan of Vaughn Inc. for the year 2020. Plan assets, January 1, 2020 $525,500 Projected benefit obligation, January 1, 2020 525,500 Settlement rate 8 % Service cost 36,800 Contributions (funding) 23,800 Actual and expected return on plan assets 49,800 Benefits paid to retirees 30,300 Using the preceding data, compute pension expense for the year 2020. As part of your solution, prepare a pension worksheet that shows the journal entry for pension...
Exercise 20-04 The following facts apply to the pension plan of Vaughn Inc. for the year...
Exercise 20-04 The following facts apply to the pension plan of Vaughn Inc. for the year 2020. Plan assets, January 1, 2020 $525,500 Projected benefit obligation, January 1, 2020 525,500 Settlement rate 8 % Service cost 36,800 Contributions (funding) 23,800 Actual and expected return on plan assets 49,800 Benefits paid to retirees 30,300 Using the preceding data, compute pension expense for the year 2020. As part of your solution, prepare a pension worksheet that shows the journal entry for pension...
Exercise 20-04 The following facts apply to the pension plan of Sheridan Inc. for the year...
Exercise 20-04 The following facts apply to the pension plan of Sheridan Inc. for the year 2020. Plan assets, January 1, 2020 $528,000 Projected benefit obligation, January 1, 2020 528,000 Settlement rate 8 % Service cost 43,400 Contributions (funding) 26,600 Actual and expected return on plan assets 51,600 Benefits paid to retirees 35,600 Using the preceding data, compute pension expense for the year 2020. As part of your solution, prepare a pension worksheet that shows the journal entry for pension...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT