Question

In: Statistics and Probability

Consider the data in the Excel file Olympic Track and Field Results. The Olympic records in...

Consider the data in the Excel file Olympic Track and Field Results. The Olympic records in Discus Throw, High Jump and Long Jump all show a clear increasing trend. Can one be predicted from the others(s)? Run a regression analysis using Discus Throw as the dependent variable and High Jump as the independent variable. Interpret the key regression results.

Dataset - https://easyupload.io/wc1oz2

Solutions

Expert Solution

The regression analysis for discuss throw as dependent variable and High jump as independent variable is given below in the attached images

Analysis from the Summary Output

1. Multiple R value is 0.964358 which indicates

Multiple R value is the correlation coefficient which measures the strength of the linear relationship between independent and dependent variable. The values range between -1 and 1 including them

The relationship is stronger for higher values of R

Our Multiple R value is 0.96 which indicates a very good relationship between our variables Discus throw and High Jump

2. R2  Value is 0.929987

R2 value suggests how much % of dependent values are explained by independent variables

Here 92.99% of Discus throw values are explained by High jump values which is a good fit

3. Standard error is 128.1227 which represents the distance the points are away from the regression line which is low when we compare with the dependent variable values which are each more than 1000

ANOVA analysis

1. SS value is the sum of squares. The residual SS component value should be low for the model to be a good fit. Our residual SS is 393970.2 which is very low when compared to the total 5627121 indicating this is a good fit

2. Significance F value is 2.31-15 . If Significance F value is less than 0.05 (our significance level), it shows that results are statistically significant. Our Significant F value is way less than 0.05 hence the results are statistically significant

Coefficient analysis

The regression equation is

Discuss Throw = (High Jump * 58.71) - 2712.54

The P-values for Intercept and High jump are 7.08704-10 and 2.3118-15 which are way less than 0.05 (our significance level) indicating that the predictor variable or independent variable High jump is statistically significant

Residuals

The actual values and the predicted values (from the regression) will not be equal and their difference is residuals. Residuals tells how far the actual values are from the predicted values using regression equation. The difference arises because independent variables will not perfectly predict the dependent variable


Related Solutions

develop frecasting models for each of the events. Olympic Track and Field Results Year High Jump...
develop frecasting models for each of the events. Olympic Track and Field Results Year High Jump (in.) Discus (in.) Long Jump (in.) 1896 71.250 1147.500 249.750 1900 74.800 1418.900 282.875 1904 71.000 1546.500 289.000 1908 75.000 1610.000 294.500 1912 76.000 1780.000 299.250 1920 76.250 1759.250 281.500 1924 78.000 1817.125 293.125 1928 76.375 1863.000 304.750 1932 77.625 1946.875 300.750 1936 79.938 1987.375 317.313 1948 78.000 2078.000 308.000 1952 80.320 2166.850 298.000 1956 83.250 2218.500 308.250 1960 85.000 2330.000 319.750 1964 85.375...
The attached file contains trials data of a school's Track & Field meet's long jump competition....
The attached file contains trials data of a school's Track & Field meet's long jump competition. Find a 95% confidence interval for the difference in the means of populations Grade7 and Grade8. Step 1: Perform an ANOVA and find MSE . Step 2: State the statistic. Step 3: Find t* from the t-distribution. Step 4: Find a 95% confidence interval for the difference in the means of populations Grade7 and Grade8. Step 5: Using Bootstrap, find a 95% confidence interval...
During the 2016 Summer Olympic the Russian track and field team was banned from participating because...
During the 2016 Summer Olympic the Russian track and field team was banned from participating because of a drug doping scandal. Over 1000 Russian athletes were banned from competition because of this suspension. What role do ethics play in sports? Do you think it is important to hold individuals accountable when they are found in to be in violation of the rules? Did the punishment fit the crime for the Russians?
The Excel file Myatt Steak House provides five years of data on key business results for...
The Excel file Myatt Steak House provides five years of data on key business results for a restaurant. Identify the leading and lagging measures, find the correlation matrix, and propose a cause-and-effect model using the strongest correlations. Myatt Steak House 2010 2011 2012 2013 2014 Order Accuracy X1 86.0% 86.0% 89.0% 90.0% 95.0% Timeliness of Delivery X2 84.0% 82.0% 86.0% 93.0% 95.0% Table Cleanliness X3 4.8 4.8 5.1 5.6 5.8 Customer Satisfication X4 93.4% 93.2% 94.2% 95.3% 96.7% Total #...
Following is the normalized distance matrix for the first four records of the Excel file Credit...
Following is the normalized distance matrix for the first four records of the Excel file Credit Approval Decisions. Apply single linkage clustering to these records until only one option remains. What conclusions can you make from this analysis? Applicant 1 2 3 4 1 0 2.874 2.326 1.769 2 0 1.530 1.798 3 0 1.317 4 0
how to export source data to excel file in python?
how to export source data to excel file in python?
Consider the data set below. Excel File: data12-33.xls The estimated regression equation is ŷ = 30.33...
Consider the data set below. Excel File: data12-33.xls The estimated regression equation is ŷ = 30.33 - 1.88x. Estimate the standard deviation of ŷ p when x = 3 (to 3 decimals). Develop a 95% confidence interval for the expected value of y when x = 3 (to 2 decimals). (  ,  ) Estimate the standard deviation of an individual value of y when x = 3 (to 2 decimals). Develop a 95% prediction interval for y wh
In the Excel data file, the tab labeled Question 1 contains data on the number of...
In the Excel data file, the tab labeled Question 1 contains data on the number of times boys and girls raise their hands in class. Conduct the t-test: Two-Sample Assuming Equal Variances. Males 9,8,4,9,3,8,10,8,9,10,7,6,12 Females 3,5,1,2,6,4,3,6,7,9,7,3,7,6,8,8 a. What is the null hypothesis? b. What is the research hypothesis? c. Why run a Two-Sample Assuming Equal Variances t-test? d. Interpret the findings. What are the results of the hypothesis test? Can you reject the null hypothesis?
The data set for this question set (Tab Q1 in the Excel data file) comes from...
The data set for this question set (Tab Q1 in the Excel data file) comes from a research project that tracks the elderly residents in a community to monitor their cognitive function and general health. Based on the literature, education is considered a protective factor against dementia, and memory decline is usually the first sign of dementia. So the researchers would like to know whether education level (measured in number of years of formal schooling) is correlated with memory function...
Enter the Shoe Size and Height data from the In class data Excel file (use the...
Enter the Shoe Size and Height data from the In class data Excel file (use the excel in class data lab file loaded into the files section of Canvas) into the week 7 Regression Excel sheet. Create a scatter plot for the data, store and graph the regression equation, and note the r2 and r value. Looking at the graph and the r, and r^2 value, do you feel that shoe size is a good predictor for height? Explain your...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT