Question

In: Statistics and Probability

Which variable should be the dependent variable and which should be the independent variable? Why? Plot...

  1. Which variable should be the dependent variable and which should be the independent variable? Why?
  2. Plot the points on the scatterplot graph. NOTE: In a scatterplot the x axis will always reflect 1, 2, 3… not the Quarters specifically.

Label both axes with words.

In order to analyze the data we will use the line of best fit.

  1. Answer with 1-3 complete sentences. Does the line seem to fit the data? Why?
  1. Calculate the following use excel trendline:
  1. y-intercept to two decimal places a=
  2. slope to 3 decimal places     b=
  1. correlation coefficient (to 4 decimal places) =
  2. n =
  3. correlation of determination (to 3 decimal places) =
  4. line of best fit equation
  5. What does the correlation imply about the relationship between the time and the number of users?
  1. Is the correlation significant? Why or why not? (Answer in 1-2 complete sentences.) (Use the Pearson calculator).
  1. Interpret the coefficient of determination in this real world context:

12) Can the regression line be used for prediction?

  1. Predict the following:
  1. For the First quarter of 2008, predict the total monthly users.
  2. For the First quarter of 2020, predict the total monthly users.

Quarter

Number in millions

Q4 '08

100

Q1 '09

197

Q2 '09

242

Q3 '09

305

Q4 '09

360

Q1 '10

431

Q2 '10

482

Q3 '10

550

Q4 '10

608

Q1 '11

680

Q2 '11

739

Q3 '11

800

Q4 '11

845

Q1 '12

901

Q2 '12

955

Q3 '12

1,007

Q4 '12

1,056

Q1 '13

1,110

Q2 '13

1,155

Q3 '13

1,189

Q4 '13

1,228

Q1 '14

1,276

Q2 '14

1,317

Q3 '14

1,350

Q4 '14

1,393

Q1 '15

1,441

Q2 '15

1,490

Q3 '15

1,545

Q4 '15

1,591

Q1 '16

1,654

Q2 '16

1,712

Q3 '16

1,788

Q4 '16

1,860

Q1 '17

1,936

Solutions

Expert Solution

Answer:

Dependent variable: Number in millions

Independent variable: Period (Quarterly)

Explanation:

Since we want to predict the number of users (in million) based on the quarterly time, the number depends on the period. Hence the number of users is a dependent variable and the quarterly time is the independent variable.

Scatterplot

Answer:

Explanation:

The scatterplot is obtained in excel by following these steps,

Step 1: Write the data values in excel. The screenshot is shown below,

Step 2: Select the time and Number of users columns then go to the graph area then INSERT > Recommended Chart > XY Scatter > OK.

Line of best fit.

Answer:

From the scatterplot, we can observe that there is a positive linear trend between and the number of users and the quarterly time. There is a strong association between the two variables which means the data values are aligned in a straight line that is why the linear trendline fits the data values very well.

y-intercept, a= 127.604

Slope to 3 decimal places, b= 52.024

The correlation coefficient, r = 0.9976

n = 34

The correlation of determination, r^2 = 0.995

line of the best-fit equation

There is a very strong positive linear correlation between the time and the number of users.

The t-test from the parameter estimation table shows that the slope coefficient is statistically significant at a 5% significance level (i.e. p-value = 1.31E-18 < 0.05) which indicates that there is a significant positive association between the time and the number of users.

The coefficient of determination (R-square value) tells, how well the regression model fits the data values. The R-square value of the model is 0.9952 which means, the model explains approximately 99.52% of the variance of the data value. Based on this evidence we can conclude the model is a good fit.

Hence we can predict the number of users based on the time very accurately.

Explanation:

The best fit line and the regression analysis is done in excel by following these steps,

Step 1: Write the data values in excel. The screenshot is shown below,

Step 2: DATA > Data Analysis > Regression > OK. The screenshot is shown below,

Step 3: Select Input Y Range: 'Number of users' column, Input X Range: 'Time' column, and select the Line Fit Plots then OK. The screenshot is shown below,

The result is obtained. The screenshot is shown below,

Best-fit line

the regression equation is defined as,

where a = intercept and b = slope of the best-fit equation

From the regression output summary,

Correlation coefficient

From the regression output summary,

Multiple R 0.997573

Correlation of determination

From the regression output summary,

R Square 0.995151

Prediction

Answer:

For first-quarter of 2008 => total monthly users = 23.556 millions

For first-quarter of 2020 => total monthly users = 2520.722 millions

Explanation:

The regression equation is,

For first-quarter of 2008 => Time = -2

For first-quarter of 2020 => Time = 46


Related Solutions

For the following, identify the independent variable and the dependent variable and explain why we should...
For the following, identify the independent variable and the dependent variable and explain why we should use a parametric or nonparametric procedure. (a) When ranking the intelligence of a group of people given a smart pill. (b) When compar- ing the median income for a group of college professors to that of the national population of all incomes. (c) When comparing the mean reading speed for a sam- ple of hearing-impaired children to the average reading speed in the population...
a. Develop a scatter plot with income as the dependent variable and age as the independent...
a. Develop a scatter plot with income as the dependent variable and age as the independent variable. Include the estimated regression equation and the coefficient of determination on your scatter plot. Briefly comment on the relationship between the two variables, and fully interpret the coefficient of determination. b. Using the Excel’s Regression Tool, develop the estimated regression equation to show how income (y annual income in $1000s) is related to the independent variables education (?_1level of education attained in number...
a. If they are going to run a linear regression, identify which variable should be the independent variable and which should be the dependent variable in a regression equation.
In seeking to determine how influential advertising is, the management of a recently established retail chain collected data on sales revenue and advertising expenditure from its' stores over the last ten (10) weeks. The table below shows the data collected: Advertising Expenditure ($ 000) Sales ($ 000) 3 5 76 50 250 700 450 3.5 75 4 150 4.5 7 200 750 7.5 800 8.5 1,100 a. If they are going to run a linear regression, identify which variable should...
For the following variables, identify which is the Independent Variable and the Dependent Variable. Then create...
For the following variables, identify which is the Independent Variable and the Dependent Variable. Then create a hypothesis (be sure to also include the direction of the relationship) and then create a null hypothesis. Temperature and crime rates Hours studied and test scores Number of police officers and crime rate
1.) Use Excel to plot the dependent vs the independent variable. Show the regression equation from...
1.) Use Excel to plot the dependent vs the independent variable. Show the regression equation from the computer output. Lannie Karner- GPA 3.6 Income 75k Courtney Sheperd Gpa 3.3 Income 74K Zenobia Roussel- GPA 2.9 Income 66K Elaine Doody- GPA 3.8 Income 80k Maudie Hocker-GPA 3.1 Income 65k Rick Hoover-GPA 3.2 Income 53k Franinca Ortez-GPA 2.7 Income 65k Li Kinder-GPA 3.3 Income 71k Brad Clem-GPA 3.8 Income 80k Soon Nettleton-GPA 4.0 Income 95k Vertie Yousesef-GPA 3.9 Income 110k Love Au-GPA...
Having data of two independent variables and a dependent variable, how do I plot a linear...
Having data of two independent variables and a dependent variable, how do I plot a linear graph (trendline fit)? (Linear equation) Possible to use excel.
Identify the independent and dependent variable.
A researcher is studying relationship between speed of cycling and the presence of people. Formulate a relevant hypothesis and identify the independent and dependent variable.
Why and where do we use dependent and independent variable?
Why and where do we use dependent and independent variable?
15. In an experiment, an independent variable is _______ and a dependent variable is _______. Group...
15. In an experiment, an independent variable is _______ and a dependent variable is _______. Group of answer choices Manipulated, measured Measured, manipulated Discrete, summation Continuous, manipulated 16. Outliers are Group of answer choices The lowest and highest scores in a data set Extreme or unusual values All options present The lowest value in a data set 17. Assume that we have the following set of data:     Score                 11, 12, 17, 18, 19, 20, 21, 22, 23, 24,...
Discuss if Uber drivers should be independent contractors or dependent employees. Why yes, why no.
Discuss if Uber drivers should be independent contractors or dependent employees. Why yes, why no.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT