In: Statistics and Probability
The motion Picture Industry is a competitive business. More than 50 studios produce a total of 300 to 400 new motion pictures each year, and financial success of each motion picture varies considerably. The opening weekend gross sales ($millions), the total gross sales ($millions), the number of theaters the movie was shown in, and the number of weeks the motion picture was in the top 60 for gross sales are common variables used to measure the success of a motion picture. Data collected for a sample of 100 motion pictures produced in 2005 are contained in the file 'Motion Pictures' table below.
Use the tabular, graphical and numerical methods of descriptive statistics to learn how these variables contribute to the success of a motion picture. Include the following in you report.
a) Tabular and graphical summaries for each of the four variables along with what each summary tells us about the motion picture industry.
b) A scatter diagram to explore the relationship between Total Gross Sales and Opening Weekend Gross Sales. What do you see?
c) A scatter diagram to explore the relationship between Total Gross Sales and Number of theaters. What do you see?
d) A scatter diagram to explore the relationship between Total Gross Sales and Number of Weeks in Top 60. What do you see?
e) Descriptive statistics for each of the four variables along with discussion of what the descriptive statistics tell us about the industry.
f) What motion pictures, if any, should be considered high-performance outliers? Explain.
Motion Picture |
Opening Gross |
Total Gross |
Number of Theaters |
Weeks in Top 60 |
Coach Carter |
29.17 |
67.25 |
2,574 |
16 |
Ladies in Lavender |
0.15 |
6.65 |
119 |
22 |
Batman Begins |
48.75 |
205.28 |
3,858 |
18 |
Unleashed |
10.90 |
24.47 |
1,962 |
8 |
Pretty Persuasion |
0.06 |
0.23 |
24 |
4 |
Fever Pitch |
12.40 |
42.01 |
3,275 |
14 |
Harry Potter and the Goblet of Fire |
102.69 |
287.18 |
3,858 |
13 |
Monster-in-Law |
23.11 |
82.89 |
3,424 |
16 |
White Noise |
24.11 |
55.85 |
2,279 |
7 |
Mr. and Mrs. Smith |
50.34 |
186.22 |
3,451 |
21 |
Be Cool |
23.45 |
55.81 |
3,216 |
8 |
Modigliani |
0.03 |
0.13 |
9 |
4 |
Flightplan |
24.63 |
89.69 |
3,424 |
21 |
Steamboy |
0.14 |
0.36 |
46 |
3 |
Lost Embrace |
0.02 |
0.05 |
5 |
1 |
Kung Fu Hustle |
0.27 |
17.08 |
2,503 |
16 |
Howl's Moving Castle |
0.43 |
4.61 |
202 |
11 |
War of the Worlds |
77.06 |
234.21 |
3,910 |
19 |
Balzac and the Little Chinese Seamstress |
0.02 |
0.42 |
22 |
6 |
Lords of Dogtown |
5.62 |
11.01 |
1,865 |
4 |
The Baxter |
0.04 |
0.04 |
47 |
1 |
The Amityville Horror |
23.51 |
64.26 |
3,323 |
6 |
House of Wax |
12.08 |
32.05 |
3,111 |
12 |
Uncle Nino |
0.17 |
0.17 |
189 |
1 |
Separate Lies |
0.07 |
0.85 |
127 |
6 |
Thumbsucker |
0.09 |
1.23 |
330 |
6 |
Sons of Provo |
0.03 |
0.03 |
7 |
1 |
Kingdom of Heaven |
19.64 |
47.31 |
3,219 |
12 |
Mrs. Henderson Presents |
0.06 |
4.36 |
260 |
10 |
Casanova |
0.23 |
11.24 |
1,011 |
8 |
The World's Fastest Indian |
0.40 |
0.87 |
121 |
2 |
Alone in the Dark |
2.83 |
5.13 |
2,124 |
3 |
Get Rich or Die Tryin' |
12.02 |
30.97 |
1,666 |
11 |
Cheaper by the Dozen 2 |
15.34 |
80.83 |
3,211 |
8 |
Red Eye |
16.17 |
57.86 |
3,134 |
8 |
Mughal-e-Azam |
0.06 |
0.11 |
32 |
2 |
Head On |
0.02 |
0.11 |
5 |
3 |
The Thing About My Folks |
0.24 |
0.78 |
145 |
4 |
Lucky |
0.15 |
0.15 |
44 |
1 |
Broken Flowers |
0.78 |
13.65 |
433 |
12 |
Paradise Now |
0.05 |
1.26 |
65 |
16 |
Dil Jo Bhi Kahe |
0.09 |
0.13 |
33 |
2 |
Look at Me |
0.07 |
1.66 |
75 |
13 |
D.E.B.S. |
0.06 |
0.06 |
45 |
1 |
Ek Khiladi Ek Hasina |
0.08 |
0.08 |
15 |
1 |
Viruddh |
0.11 |
0.29 |
37 |
3 |
Sin City |
29.12 |
74.00 |
3,230 |
16 |
Bee Season |
0.12 |
1.14 |
277 |
7 |
A Lot Like Love |
7.58 |
21.84 |
2,502 |
10 |
First Descent |
0.44 |
0.74 |
243 |
3 |
George A. Romero's Land of the Dead |
10.22 |
20.43 |
2,253 |
6 |
Ong Bak: The Thai Warrior |
1.34 |
4.51 |
387 |
8 |
Me and You and Everyone We Know |
0.03 |
3.69 |
160 |
13 |
Caterina in the Big City |
0.01 |
0.16 |
7 |
5 |
Shaadi No. 1 |
0.14 |
0.34 |
45 |
3 |
The Wild Parrots of Telegraph Hill |
0.04 |
2.81 |
66 |
27 |
Mindhunters |
1.91 |
4.45 |
1,040 |
5 |
Sahara |
18.07 |
68.64 |
3,200 |
17 |
Racing Stripes |
18.86 |
49.19 |
3,185 |
17 |
Mad Hot Ballroom |
0.05 |
7.90 |
202 |
19 |
The Exorcism of Emily Rose |
30.05 |
75.07 |
3,045 |
9 |
Nina's Tragedies |
0.04 |
0.20 |
15 |
2 |
Home Delivery |
0.05 |
0.05 |
15 |
1 |
Into the Blue |
7.06 |
18.47 |
2,789 |
5 |
Ek Ajnabee |
0.12 |
0.21 |
38 |
3 |
The Edukators |
0.03 |
0.07 |
32 |
1 |
Magnificent Desolation |
0.50 |
7.31 |
82 |
19 |
Memoirs of a Geisha |
0.68 |
56.07 |
1,654 |
10 |
Cronicas |
0.04 |
0.14 |
13 |
3 |
Bride and Prejudice |
0.39 |
6.57 |
288 |
13 |
Happily Ever After |
0.03 |
0.12 |
8 |
1 |
State Property 2 |
0.76 |
1.68 |
202 |
3 |
Star Wars: Episode III |
108.44 |
380.18 |
3,663 |
19 |
Indigo |
1.19 |
1.19 |
603 |
1 |
Imaginary Heroes |
0.04 |
0.09 |
19 |
1 |
Cinderella Man |
18.32 |
61.58 |
2,820 |
15 |
The Upside of Anger |
0.21 |
18.74 |
1,166 |
15 |
The Skeleton Key |
16.06 |
47.81 |
2,784 |
11 |
The Cave |
6.15 |
14.89 |
2,195 |
6 |
The Family Stone |
12.52 |
59.70 |
2,469 |
9 |
Jiminy Glick in La La Wood |
0.03 |
0.03 |
24 |
1 |
High Tension |
1.90 |
3.65 |
1,323 |
3 |
Yours, Mine and Ours |
17.46 |
53.30 |
3,210 |
12 |
Wedding Crashers |
33.90 |
209.22 |
3,131 |
23 |
Wallace and Gromit: Were-Rabbit |
16.03 |
56.07 |
3,656 |
13 |
Three... Extremes |
0.04 |
0.04 |
19 |
1 |
Nobody Knows |
0.03 |
0.49 |
23 |
6 |
Capote |
0.33 |
20.13 |
1,239 |
20 |
A History of Violence |
0.52 |
31.46 |
1,348 |
19 |
Palindromes |
0.06 |
0.51 |
46 |
7 |
The Devil's Rejects |
7.07 |
16.90 |
1,757 |
5 |
The Greatest Game Ever Played |
3.66 |
15.33 |
1,810 |
9 |
Proof |
0.19 |
7.53 |
517 |
10 |
Walk the Line |
22.35 |
113.63 |
3,160 |
13 |
Where the Truth Lies |
0.14 |
0.82 |
92 |
4 |
Wolf Creek |
4.91 |
16.04 |
1,761 |
7 |
My Summer of Love |
0.09 |
0.97 |
63 |
7 |
The Producers |
0.16 |
19.28 |
978 |
9 |
Happy Endings |
0.24 |
1.25 |
74 |
6 |
Last Days |
0.09 |
0.42 |
31 |
5 |
(a)
From the above boxplots we see that all four data sets are positively skewed and No. of Theaters and weeks in Top 60 have outliers.
From histograms we see that the all four data sets are positively skewed.
Stem-and-Leaf Display: Opening Gros, Total Gross, Number of Th, Weeks in Top
Stem-and-leaf of Opening Gross N = 100
Leaf Unit = 1.0
(70) 0
00000000000000000000000000000000000000000000000000000000001111234+
30 1 002222566678889
15 2 23334499
7 3 03
5 4 8
4 5 0
3 6
3 7 7
2 8
2 9
2 10 28
Stem-and-leaf of Total Gross N = 100
Leaf Unit = 10
(77) 0
00000000000000000000000000000000000000000000000000000001111111111+
23 0 5555555666677888
7 1 1
6 1 8
5 2 003
2 2 8
1 3
1 3 8
Stem-and-leaf of Number of Theaters N = 100
Leaf Unit = 100
(51) 0 000000000000000000000000000000000001111112222222334
49 0 569
46 1 001233
40 1 6677889
33 2 11224
28 2 555778
22 3 01111122222223444
5 3 66889
Stem-and-leaf of Weeks in Top 60 N = 100
Leaf Unit = 1.0
14 0 11111111111111
28 0 22223333333333
38 0 4444455555
(13) 0 6666666677777
49 0 8888889999
39 1 0000111
32 1 2222333333
22 1 455
19 1 6666677
12 1 899999
6 2 011
3 2 23
1 2
1 2 7
(b)
From above scatter plot we see that Opening Gross and total Gross are positvely correlated (since as opening gross increases total gross also increases) and strength is good. Hence these two variab les are linearly related.
(c)
From above scatter plot we see that Number of Theaters and total Gross are positvely correlated but strength is very poor (because most of the points are parallel to horizontal line). However there may be non linear relationship between these two variables.
(d)
From above scatter plot we see that Weeks in Top 60 and total Gross are positvely correlated but strength is very poor (because most of the points are parallel to horizontal line). However there may be non linear relationship between these two variables.
(e)