Question

In: Statistics and Probability

Many movies are released each year and it would be interesting to be able to predict...

Many movies are released each year and it would be interesting to be able to predict the Total Gross Revenues (in $1,000,000) from the box office based on a few predictors. The following predictors have been identified for 70 movies:

  • BUDGET: Estimated budget in $1,000,000
  • LENGTH: The length of each movie in minutes
  • SCREENS: Number of Screens on Opening Weekend
  • AWARDS: Number of Award nominations of entire cast in their careers
  • GENRE: Type of movie: Action, Comedy or Drama recoded as Genre 1 and Genre 2
    • Genre1 = 1 if Action, 0 if not
    • Genre2 = 1 if Comedy, 0 if not

                       

            The following partial output has been obtained:

Coefficients

Standard Error

t Stat

Intercept

-4.039

30.735

Budget

0.803

0.154

Length

-0.433

0.242

Screens

0.013

0.005

Awards

1.390

1.049

Genre 1

4.777

2.032

Genre 2

2.732

13.646

ANOVA

df

SS

MS

F

Regression

39.5

Residual

170.2

Total

Based on the above partial printout, answer the following questions:

  1. What is the regression model?
  2. Interpret the following coefficients: -0.433 and 4.777.
  3. Is there sufficient evidence at the 5% level of significance to conclude that the model is useful at predicting Total Gross Revenues?
  4. Determine the adjusted coefficient of determination and explain its meaning in the context of the problem.
  5. What is the standard error of the estimate?
  6. Does the “genre” of movie have a significant impact (at 5%) on the Total Gross Revenues? Justify.
  7. Estimate the Total Gross Revenues for a Drama movie of 90 minutes produced with a $25,000,000 budget with a cast of actors who were nominated 6 times for awards. In addition, that movie was shown on 2,000 screens in the first weekend.

Solutions

Expert Solution

First we fill the missing values in the table as:

t-Stat=Coefficients/ standard error of coefficients
i.e. t-Stat=betahat/SE(betahat)
where, SE=Standard Error

ANOVA table:
GIVEN: MSregression=39.5, SSresidual=170.2
n=70 (total number of movies)
k=6 (total number of predictors)
p=k+1=7 (total number of parameters)
df(Total)=n-1=69
df(Regression)=k=6
df(Residual)=n-p=70-7=63
MSregression = SSRegression/df(Regression)
Therefore, SSregression=MSRegression*df(Regression)=39.5*6=237

MSResidual= SSResidual/df(Residual)=170.2/63=2.701587
Total SS= SSregression + SSresidual = 237 + 170.2 = 407.2
F=MSregression / MSresidual = 39.5 / 2.701587 =14.62103

Completed table is as follows:

beta_hat SE(betahat) bhat/SE(bhat)
Coefficients Standard Error ratio t Stat   calc t p-value Formula in Excel
Intercept -4.039 30.735 -4.039/30.735 -0.13141 2.296237 0.895867 TDIST(ABS(D3),63,2)
Budget 0.803 0.154 0.803/0.154 5.214286 2.296237 2.18E-06 TDIST(ABS(D4),63,2)
Length -0.433 0.242 -0.433/0.242 -1.78926 2.296237 0.07838 TDIST(ABS(D5),63,2)
Screens 0.013 0.005 0.013/0.005 2.6 2.296237 0.0116 TDIST(ABS(D6),63,2)
Awards 1.39 1.049 1.39/01.049 1.325071 2.296237 0.189933 TDIST(ABS(D7),63,2)
Genre 1 4.777 2.032 4.777/2.032 2.350886 2.296237 0.021872 TDIST(ABS(D8),63,2)
Genre 2 2.732 13.646 2.732/13.646 0.200205 2.296237 0.841965 TDIST(ABS(D9),63,2)
TINV(0.025,63)=2.296237 FINV(0.05,6,63)=2.246408
ANOVA
df SS MS F
Regression 6 237 39.5 14.62103
Residual 63 170.2 2.701587
Total 69 407.2


Based on the above partial printout, answer the following questions:

a) What is the regression model?
Regression equation is given as:
## y=TGR=Total Gross Revenue
y= Intercept+ beta1*Budget+ beta2*Length+ beta3*Screens+ beta4*Awards+ beta5*Genre 1 + beta6*Genre 2
y= -4.039+ 0.803*BUDGET -0.433*LENGTH+ 0.013*SCREENS+ 1.390*AWARDS+ 4.777*GENRE1+ 2.732*GENRE2

b) Interpret the following coefficients: -0.433 and 4.777.
X=Independent variable/ Regressor
Y=Dependent variable / Response

  • Positive sign means that if X increases, Y also increases and if X decreases Y also decreases. [Movement in Same direction]
  • Negative sign of the coefficients means that if X increases Y decreases and if X decreases, Y increases.[Movement in OPPOSITE direction]

Here, (-0.433) indicates that if Length decreases , the revenue (response) increases and vice-versa.
Here, 4.777 indicates that: If Genre 1=1, then intercept will be (-4.039+4.777=0.738) and if Genre 1=0, then intercept remains unchanged.

c) Is there sufficient evidence at the 5% level of significance to conclude that the model is useful at predicting Total Gross Revenues?
H0: All beta_i's equal to zero i.e. regressors do not significantly contribute to the model.
Reject H0 if F > F(alpha,k,n-p)
Here, F(alpha,k,n-p)=FINV(0.05,6,63)=2.246408. Calculated F=14.62103
Calculated F > tabulated F, Thus, we Reject H0.
Yes, there sufficient evidence at the 5% level of significance to conclude that the model is useful at predicting Total Gross Revenues.

d) Determine the adjusted coefficient of determination and explain its meaning in the context of the problem.
## adjusted coefficient of determination = R2Adj

R-squared measures the proportion of the variation in your dependent variable (Y) explained by independent variables (X) for a linear regression model.The adjusted R-squared adjusts the statistic for the number of independent variables in the model. Importantly, its value increases only when the new term improves the model fit more than expected by chance alone.
R2Adj= 1 - [( SSresidual/ (n-p) ) / ( TotalSS / (n-1) )]= 1 - [ (170.2/63) / (407.2/69) ]=0.542216

54.2216 % of variation in the model is explianed by the regressors.

e) What is the standard error of the estimate?
Sigmasquare hat=MSresidual=2.701587
standard error of the estimate=SE=sigmahat=squareroot(2.701587) = 1.643651

f) Does the “genre” of movie have a significant impact (at 5%) on the Total Gross Revenues? Justify.
For Genre 1, calculated t-Stat > Tabulated t, we Reject H0 and conclude that Genre 1 is significant. [Also, p-value=0.021872 which is less than alpha=0.05, thus we Reject H0.]
BUT, for Genre 2, calculated t-Stat < Tabulated t, we Accept H0 and conclude that Genre 2 is insignificant. [Also, p-value=0.841965 which is greater than alpha=0.05, thus we Accept H0.]

We can conclude that Genre is significant.


g) Estimate the Total Gross Revenues for a Drama movie of 90 minutes produced with a $25,000,000 budget with a cast of actors who were nominated 6 times for awards. In addition, that movie was shown on 2,000 screens in the first weekend.

Since, it is drama movie, Genre1=Genre2=0.

Put Budget=$ 25 (in $1,000,000), Length=90, Awards=6, Screens=2000, Genre 1=0, Genre 2=0 in REGRESSION EQUATION:
y= -4.039+ 0.803*(25) -0.433*(90)+ 0.013*(2000)+ 1.390*(6)+ 4.777*(0)+ 2.732*(0)
y=11.406

Estimated Total Gross Revenue is 11.406 (in $1,000,000) i.e. $11,406,000.


Related Solutions

Predict the elasticity of demand for movies described by each of the following statements. Justify your answers.
Predict the elasticity of demand for movies described by each of the following statements. Justify your answers.We go to the movies two Fridays out of every month, without exception.When the price of movies went up to $7, we quit going to regular movies. We now wait for $3.50 Tuesday before we see a movie. (instead of drawing the actual graph, briefly explain the graph instead. Explain all the curves, shifts, points and axis, intersections and labels etc).
the manager of an amusement park would like to be able to predict daily attendance in...
the manager of an amusement park would like to be able to predict daily attendance in order to develop more accurate plans about how much food to order and how many ride operators to hire. after some consideration, he decided the following three factors are critical: 1. Yesterday's attendance 2. Weekday or Weekend 3. Predicted Weather He then took a random sample of 40 days. For each day, he recorded the attendance, day of the week, and weather forecast. The...
In which of the following cases would you be able to predict with certainty the change...
In which of the following cases would you be able to predict with certainty the change in the equilibrium price? There is more than one answer to this question. You must mark all of the correct answers to receive full credit for this question. There is a decrease in demand. There is a decrease in demand and an increase in supply. There is an increase in demand and an increase in supply. There is an increase in supply. There is...
The manager of an amusement park would like to be able to predict daily attendance in...
The manager of an amusement park would like to be able to predict daily attendance in order to develop more accurate plans about how much food to order and how many ride operators to hire. After some consideration, he decided that the following three factors are critical: Yesterday’s attendance Weekday or weekend (1 if weekend, 0 if weekday) Predicted weather Rain forecast ( 1 if forecast for rain, 0 if not) Sun ( 1 if mostly sunny, 0 if not)...
The manager of an amusement park would like to be able to predict daily attendance in...
The manager of an amusement park would like to be able to predict daily attendance in order to develop more accurate plans about how much food to order and how many ride operators to hire. After some consideration, he decided that the following three factors are critical: Yesterday’s attendance Weekday or weekend Predicted weather He then took a random sample of 36 days. For each day, he recorded the attendance, the previous day’s attendance, day of the week, and weather...
Many people have different concepts or ideas about the study of economics. It would be interesting...
Many people have different concepts or ideas about the study of economics. It would be interesting to know what are your thoughts as it relates to the study of economics and its importance in society.
1. It is quite interesting that most of us are likely to be able to read...
1. It is quite interesting that most of us are likely to be able to read and comprehend words, even if the alphabets of these words are scrambled (two of them) given the fact that the first and last alphabets remain the same. For example, “I dn'ot gvie a dman for a man taht can olny sepll a wrod one way.” (Mrak Taiwn) “We aer all moratls, Hamrbee is an immoratl, and tehre is no question abuot it.” (Kevin Unknown)...
A random sample of 76 teenagers were asked how many movies they see in a year....
A random sample of 76 teenagers were asked how many movies they see in a year. It came out that the mean number of movies-per-year is 51.2 and the standard deviation is 15.4. Use a .05 significance level to test the claim that the mean number of movies-per-year is 52. Test claim using the critical value method .
Assume that Tom spends $190 on books and movies each month. Movies cost $10 each and...
Assume that Tom spends $190 on books and movies each month. Movies cost $10 each and books cost $ 25 each. Tom’s total utility for books and movies are presented in the following table. Movies Books No. /month TU MU MU/$ No. /month TU MU MU/$ 0 0 ------- ---------- 0 0 --------- ----------- 1 22 1 50 2 42 2 80 3 52 3 100 4 57 4 110 5 60 5 116 6 62 6 121 7 63...
The authors of a paper studied characters who were depicted smoking in movies released between a...
The authors of a paper studied characters who were depicted smoking in movies released between a certain range of years. The smoking characters were classified according to sex and whether the character type was positive, negative, or neutral. The resulting data are given in the table. Assume that this sample is representative of smoking movie characters. Do these data provide evidence of an association between sex and character type for movie characters who smoke? Use α = 0.05. Character Type...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT