Question

In: Statistics and Probability

There are two variables: Favorite film genre and favorite literary genre. Calculate lambda using literary genre...

There are two variables: Favorite film genre and favorite literary genre. Calculate lambda

using literary genre to predict Film Genre.

Literary Genres

mystery sci fi classics

film action 22 42 24

genres comedy 58 52 18

drama 26 30 12

Solutions

Expert Solution

Lambda is defined as an asymmetrical measure of association that is suitable for use with nominal variables.Lambda provides us with an indication of the strength of the relationship between independent and dependent variables. As an asymmetrical measure of association, lambda’s value may vary depending on which variable is considered the dependent variable and which variables are considered the independent variable.

To calculate lambda, you need two numbers: E1 and E2.

  1. E1 is the error of prediction made when the independent variable is ignored. To find E1, you first need to find the mode of the dependent variable and subtract its frequency from N. E1 = N – Modal frequency.
  2. E2 is the errors made when the prediction is based on the independent variable. To find E2, you first need to find the modal frequency for each category of the independent variables, subtract it from the category total to find the number of errors, then add up all the errors.

The formula for calculating lambda is: Lambda = (E1 – E2) / E1.

Lambda may range in value from 0.0 to 1.0.

  • Zero indicates that there is nothing to be gained by using the independent variable to predict the dependent variable. In other words, the independent variable does not, in any way, predict the dependent variable.
  • A lambda of 1.0 indicates that the independent variable is a perfect predictor of the dependent variable. That is, by using the independent variable as a predictor, we can predict the dependent variable without any error.
Film Genre Frequency
Film action 88
Genres comedy 128
Drama 68
Total 284

Let's say you want to guess what the rating of another 284 people would be. Your best guess would be to pick to modal category, which is "Genres comedy" . If you consistently pick "Genres comedy," you will make the fewest number of wrong guesses. Original error=128 right and (284-128)=156 wrong (out of 284 total guesses).

Now, let's say that you are given one additional piece of information. You now know what the frequency of literary genre: mystery,sci fi and classics.

literary genre
Film Genre Mystery Sci fi Classics Total
Film action 22 42 24 88
Genres comedy 58 52 18 128
Drama 26 30 12 68
Total 106 124 54 284

Now, if you had to guess the lierary genre rating, you could qualify your best guess by knowing the literary genre. For each literary genre, you would guess the modal category.

literary genre N Modal category Right Guess Wrong guess
Mystery 106 Genres comedy 58 48
Sci fi 124 Genres comedy 52 72
Classics 54 Film action 24 30
Total 134 150

The total number of new errors (wrong guesses) is 150

To calculate Lambda, subtract the number of new errors from the number of original errors and divide by the number of original errors. In this case, [(156-150)/156]=0.038

By knowing literary genre, we can reduce the error in predicting Film Genre by 26.3%. This indicates a weak or negligible relationship between literary genre and Film Genre.

As the independent variable is measured on a nominal scale, there is no direction for the relationship (neither positive nor negative, just an association).


Related Solutions

Is it preferable to read a literary basis of a film before viewing the film itself,...
Is it preferable to read a literary basis of a film before viewing the film itself, or the other way around? If so why?
Define the Anti-Hero genre. Discuss the importance of the Anti-Hero film to American Cinema (1968-1977).
Define the Anti-Hero genre. Discuss the importance of the Anti-Hero film to American Cinema (1968-1977).
a. Calculate the covariance between variables X and Y. Is it a positive or negative relationship between the two variables?
Observation x y 1 -22 22 2 -33 49 3 2 8 4 29 -16 5 -13 10 6 21 -28 7 -13 27 8 -23 35 9 14 -5 10 3 -3 11 -37 48 12 34 -29 13 9 -18 14 -33 31 15 20 -16 16 -3 14 17 -15 18 18 12 17 19 -20 -11 20 -7 -22 Answer the following questions a. Calculate the covariance between variables X and Y. Is it a positive...
Suppose Y1,Y2, .. ,Y8 are independent and identically distributed as Poisson random variables with mean lambda....
Suppose Y1,Y2, .. ,Y8 are independent and identically distributed as Poisson random variables with mean lambda. a) Derive the most powerful test for testing Ho: lambda = 2, Ha: lambda = 3. Carefully show all work involved in the derivation. i) Give the form of the test. (In other words, for what general values of Y1,Y2, .. ,Y8 will Ho be rejected?) ii) Describe the rejection region as carefully as possible if alpha <= .05 (and is as close as...
Assume that it is appropriate and meaningful to calculate the correlation coefficient of two variables X...
Assume that it is appropriate and meaningful to calculate the correlation coefficient of two variables X and Y. If the correlation coefficient, r, has a value -0.95 Select one: a. it is impossible to tell if there is a linearly relationship between the two variables b. there is a strong linear relationship between the two variables c. there is no relationship between the two variables d. the slope of the regression line will be -0.95 2. The father's height (in...
Using the following variables, calculate an organization's cost of debt on a $100,000 bond. Rf: 2%...
Using the following variables, calculate an organization's cost of debt on a $100,000 bond. Rf: 2% Credit-risk rate: 6% t: 20% a.)$7,840 b.)$1,600 c.)$6,400 d.)$8,000 Calculate a company's total leverage given the following information: Net income = $35,000 Revenue = $70,000 Variable costs = $15,000 a.)Cannot calculate without ROE data b.)1.57 c.)2.33 d.)Cannot calculate without knowing the degree of operating leverage Eadie owns 300 shares of stock in Company A that were valued at $2/share. After Company A announces a...
What are the differences between results that demonstrate a correlation between two variables and results where a regression is run using two variables?
What are the differences between results that demonstrate a correlation between two variables and results where a regression is run using two variables? Think about your future clinical role and provide a clinical example of variables that you may want a correlation analysis run and explain. Think about your future clinical role and provide a clinical example of variables that you may want a regression analysis run and explain.
Why is it impossible to calculate a chi-square on two interval/ratio level variables?
Why is it impossible to calculate a chi-square on two interval/ratio level variables?
Using the analysis containing all four independent variables, calculate the 99% confidence internal for the coefficient...
Using the analysis containing all four independent variables, calculate the 99% confidence internal for the coefficient of V2. V1 V2 V3 V4 Y 10 2 5 2 24.4 11 2 4 2 22 12 2 3 2 22.4 13 4 2 3 24.5 14 3 1 2 21 15 5 5 2 37.4 16 5 4 2 37.8 17 4 2 1 25.5 18 5 1 2 24 19 5 5 1 25 20 2 3 1 22.1 21 4...
Let x be an exponential random variable with lambda equals 0.2. Calculate the probabilities described below....
Let x be an exponential random variable with lambda equals 0.2. Calculate the probabilities described below. a. ​P(x<7​) b. ​P(x>8​) c. ​P(7< = x < = 8​) d. ​P(x> = 6​) e. the probability that x is at most 8
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT