Question

In: Statistics and Probability

{1,2,3} {2,3,4} {3,4,5} {4,5,6} {1,3,5} {2,4,6} {1,3,4} {2,4,5} {3,5,6} {1,2,4} {2,3,5} {3,4,6} Exercise 6.4.2: Apply Toivonen’s...

{1,2,3} {2,3,4} {3,4,5} {4,5,6} {1,3,5} {2,4,6} {1,3,4} {2,4,5} {3,5,6} {1,2,4} {2,3,5} {3,4,6}

Exercise 6.4.2: Apply Toivonen’s Algorithm to the data, with a support threshold of 4. Take as the sample the first row of baskets: {1,2,3}, {2,3,4}, {3,4,5}, and {4,5,6}, i.e., one-third of the file. Our scaleddown support theshold will be 1.
(a) What are the itemsets frequent in the sample?
(b) What is the negative border?
(c) What is the outcome of the pass through the full dataset? Are any of the itemsets in the negative border frequent in the whole?

Solutions

Expert Solution

Answer:

By using ,given data

Here, 12 baskets are given.

Below are the steps or passes for PCY algorithm as follows.

Pass 1:

(i) Determine total number of occurences of allitems called as count.
(ii) For every bucket,consist of items fil; : : :;,hash all pairs to a bucket of hash table,Increment bucket count by 1.
(iii)  Determine ,L1 and the items with the count of atleasts at the end of the pass.
(iv) Determine,buckets with count atleast s at the ende buckets with counts at least s.

Pass 2:

(i) All the frequent items, i.e. L1. holds in main memory.
(ii)  Main memory also holds the bitmap summarizing the results of the hashing from pass 1.
(iii) Main memory also holds a table with all the candidate pairs and their counts.

A pair (x; y) can be a candidate in C2 only if al l of the following are true:

(a). x is in L1.
(b). y is in L1.
(c). (x; y) hashes to a frequent bucket.

(iv) We consider each basket, and each pair of its items and making the test as above.

If all three conditions meets by pair, add to its count in memory, or make an entry for it if one
not yet exist.


Related Solutions

{1,2,3} {2,3,4} {3,4,5} {4,5,6} {1,3,5} {2,4,6} {1,3,4} {2,4,5} {3,5,6} {1,2,4} {2,3,5} {3,4,6} Exercise 6.4.2: Apply Toivonen’s...
{1,2,3} {2,3,4} {3,4,5} {4,5,6} {1,3,5} {2,4,6} {1,3,4} {2,4,5} {3,5,6} {1,2,4} {2,3,5} {3,4,6} Exercise 6.4.2: Apply Toivonen’s Algorithm to the data, with a support threshold of 4. Take as the sample the first row of baskets: {1,2,3}, {2,3,4}, {3,4,5}, and {4,5,6}, i.e., one-third of the file. Our scaleddown support theshold will be 1. (a) What are the itemsets frequent in the sample? (b) What is the negative border? (c) What is the outcome of the pass through the full dataset? Are...
Calculus #3: 1. a) Let A = (2,4,6),B = (1,2,3) and C = (5,5,5). Find point...
Calculus #3: 1. a) Let A = (2,4,6),B = (1,2,3) and C = (5,5,5). Find point D so that ABCD is a parallelogram. b). Two points X and Y are colinear if they lie on the same line. Are the points A = (3,6,−1), B = (2,0,3) and C = (−1, 3, −4) colinear? Justify your answer.
Does {1,2,3} ,{3,4,5}, {1,4}, {1,5}, {2,4}, {2,5} form an incidence geometry? If so do any of...
Does {1,2,3} ,{3,4,5}, {1,4}, {1,5}, {2,4}, {2,5} form an incidence geometry? If so do any of the parallel postulates hold (Elliptic, Euclidean, Hyperbolic parallel postulates)?
The Objective of this exercise is to apply a systematic analysis of a real ethical dilemma....
The Objective of this exercise is to apply a systematic analysis of a real ethical dilemma. Reporting on Robin Williams When actor Robin Williams took his life in August of 2014, major news organizations covered the story in great detail. Most major news outlets reported on Marin County Sheriff’s Lt. Keith Boyd’s press conference, which revealed graphic details from the coroner’s report about the methods Williams used. While there was great interest on the part of the public in finding...
In this exercise you will apply you new understanding of class design to develop an advanced...
In this exercise you will apply you new understanding of class design to develop an advanced version of the array helper class we created in the classroom. This variation of the design will maintain the data stored in the array in an ordered fashion (0 -100 for our int storage) . The class should always maintain its stored data in order meaning at any time if the programmer iterates the items, they will come back in order. This means each...
Exercise 20-04 The following facts apply to the pension plan of Vaughn Inc. for the year...
Exercise 20-04 The following facts apply to the pension plan of Vaughn Inc. for the year 2020. Plan assets, January 1, 2020 $525,500 Projected benefit obligation, January 1, 2020 525,500 Settlement rate 8 % Service cost 36,800 Contributions (funding) 23,800 Actual and expected return on plan assets 49,800 Benefits paid to retirees 30,300 Using the preceding data, compute pension expense for the year 2020. As part of your solution, prepare a pension worksheet that shows the journal entry for pension...
Exercise 20-04 The following facts apply to the pension plan of Vaughn Inc. for the year...
Exercise 20-04 The following facts apply to the pension plan of Vaughn Inc. for the year 2020. Plan assets, January 1, 2020 $525,500 Projected benefit obligation, January 1, 2020 525,500 Settlement rate 8 % Service cost 36,800 Contributions (funding) 23,800 Actual and expected return on plan assets 49,800 Benefits paid to retirees 30,300 Using the preceding data, compute pension expense for the year 2020. As part of your solution, prepare a pension worksheet that shows the journal entry for pension...
Exercise 20-04 The following facts apply to the pension plan of Sheridan Inc. for the year...
Exercise 20-04 The following facts apply to the pension plan of Sheridan Inc. for the year 2020. Plan assets, January 1, 2020 $528,000 Projected benefit obligation, January 1, 2020 528,000 Settlement rate 8 % Service cost 43,400 Contributions (funding) 26,600 Actual and expected return on plan assets 51,600 Benefits paid to retirees 35,600 Using the preceding data, compute pension expense for the year 2020. As part of your solution, prepare a pension worksheet that shows the journal entry for pension...
Exercise 20-7 The following defined pension data of Teal Corp. apply to the year 2017. Projected...
Exercise 20-7 The following defined pension data of Teal Corp. apply to the year 2017. Projected benefit obligation, 1/1/17 (before amendment) $538,000 Plan assets, 1/1/17 525,000 Pension liability 13,000 On January 1, 2017, Teal Corp., through plan amendment,    grants prior service benefits having a present value of 119,000 Settlement rate 9 % Service cost 62,100 Contributions (funding) 67,100 Actual (expected) return on plan assets 52,100 Benefits paid to retirees 42,000 Prior service cost amortization for 2017 18,600 For 2017, prepare...
Exercise 20-07 The following defined pension data of Cheyenne Corp. apply to the year 2020. Projected...
Exercise 20-07 The following defined pension data of Cheyenne Corp. apply to the year 2020. Projected benefit obligation, 1/1/20 (before amendment) $616,000 Plan assets, 1/1/20 601,600 Pension liability 14,400 On January 1, 2020, Cheyenne Corp., through plan amendment,    grants prior service benefits having a present value of 126,000 Settlement rate 9 % Service cost 61,500 Contributions (funding) 61,000 Actual (expected) return on plan assets 54,800 Benefits paid to retirees 42,700 Prior service cost amortization for 2020 15,600 For 2020, prepare...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT