Question

In: Statistics and Probability

4. A researcher intends to determine which variables are more likely to predict public assaults against...

4. A researcher intends to determine which variables are more likely to predict public assaults against the police in Kentucky. The variable represents the 2012-2016 average assault rates per 100,000 people.  The following correlation matrix shows the relationship between assaults against the police in the state and the 2006-2010 average homicide rates per 100,000, and other socio-demographic variables at the county level in KY, based on 2010 US Census data. Relationships among the independent variables are included as well. Table 1 includes the Pearson correlation coefficients corresponding to bivariate relationships among five variables. The unit of analysis is the county (N = 120). Note: The analysis is based on real data.

Table 1. Assaults against the police in Kentucky (N = 120): Intercorrelation matrix     

Measures

Police

Assault

Poverty

Homicide Rate

AA%

FHH%

Assault against POLICE rate

1

Poverty level

-.297

1

Homicide rate

-.176

.452

1

% African Americans (AA%)

.316

-.292

.040

1

% Female Head of Households (FHH)

.325

.291

.199

.408

1

  1. Identify the dependent (DV) and the independent (IV) variables
  2. Formulate the null and alternative hypotheses [Note: a null hypothesis should be formulated for each relationship the DV has with the selected IVs]. You should have 4 null hypotheses [see column 1 of the correlation matrix].
  3. Knowing that N = 120 (the number of counties in KY), test the null hypotheses you’ve formulated [see the critical values of Pearson’s r, at the corresponding df and p< .05, 2-tail test]. You should list the value of the critical Pearson’s r.
  4. Reach a statistical conclusion for each test [it should be 4 tests] that examines the bivariate relationship between the dependent variable and the selected independent variables and interpret your findings. What characteristics appear to have those counties that registered a higher incidence of public assaults against the police in KY? What variables are significantly related to assault against the police and what is the direction of the relationship?
  5. What percentage of the variation in assaults against the police is explained by the percentage of female-headed households at the county level? [Hint: calculate the coefficient of determination to answer the question].  


  1. The table also shows the relationship between percent African Americans at the county level and the homicide rate in the county. Based on the results included in the matrix, with an increase in the percentage of blacks would homicide rate decrease or increase? Is the relationship statistically significant? Justify your answer.
  2. What characteristics have KY counties that have a higher percentage of female-headed households?

Solutions

Expert Solution

Answer:

Given that,

A researcher intends to determine which variables are more likely to predict public assaults against the police in Kentucky. The variable represents the 2012-2016 average assault rates per 100,000 people.

The following correlation matrix shows the relationship between assaults against the police in the state and the 2006-2010 average homicide rates per 100,000, and other socio-demographic variables at the county level in KY, based on 2010 US Census data.

Relationships among the independent variables are included as well. Table 1 includes the Pearson correlation coefficients corresponding to bivariate relationships among five variables.

The unit of analysis is the county (N = 120).

Note: The analysis is based on real data.

In the given problem, we are asked to determine which variables are more likely to predict public assaults against the police in Kentucky, the possible predictors being Poverty level, Homicide rate, % African Americans and % Female Head of Households:

(a).

The variable dependent on all the given factors (probable predictors) is Public assaults against the police in Kentucky. And the variables that probably causes the same are the predictors.

Hence, the Dependent variable is Public assaults against the police in Kentucky. The independent variables are Poverty level, Homicide rate, % African Americans, and % Female Head of Households.

(b).

In order to attain our objective, we must first determine whether a strong relationship exists between the dependent variable and the predictors. Since all the given variables are continuous in nature, the appropriate measure of correlation, here, would be Pearson's Correlation - which measures the strength and direction of the relationship.

Let Pp, Phi Pas Pl denote the Pearson's correlation coefficient between Public assaults against the police in Kentucky and each of the 4 predictors - Poverty level, Homicide rate, % African Americans, and % Female Head of Households respectively.

We may confirm a significant relationship between the two variables if the correlation coefficient between them is significantly different from zero. By definition of r, r lies between -1 and 1, with negative and positive values representing negative and positive relationships respectively. And values close to 1 representing strong relation and those close to zero representing little or no correlation.

Hence, we may test:

(c).

The significance of correlation coefficient r is tested using the following t-statistic:

Where the critical value of t can be obtained from t-table for n-2 degrees of freedom.

For Poverty level, substituting the given values,

=0.179

Similarly, for correlation between Public assaults against the police in Kentucky and the predictors - Poverty level, Homicide rate, % African Americans and % Female Head is 0.179.

(d).

Comparing the r values obtained with the critical value, we may reject the null hypotheses if |r| > r_{crit}:

Based on the given data, we find that the coefficients for the Poverty level, % African Americans and % Female Head - 0.297, 0.316, 0.325, respectively, all lie in the rejection / critical region.

Based on the given data, we may conclude that Public assaults against the police in Kentucky exhibit a significant correlation with predictors Poverty level, % African Americans, and % Female Head, except with Homicide rate.

Also, based on the given data, it is to be noted that % African Americans and % Female Head are positively significantly correlated with Public assaults against the police in Kentucky. And there exists a significant negative correlation between Public assaults and poverty level.

(e).

The percentage of variation explained can be computed using the measure - Coefficient of determination which is nothing but the square of the correlation coefficient:

For a percentage of the variation in assaults against the police that is explained by the percentage of female-headed households at the county level,

r_{f}^{2}=(0.325)^{2}=0.106

Based on the given data, we find that percentage of female-headed households at the county level explains about 10.6% of the variation in assaults against the police.

(f).

Based on the given data, the Pearson's correlation coefficient for African Americans at the county level, and the homicide rate is obtained as 0.04. As the coefficient is positive, we may say, that the Homicide rate increases with % African Americans at the county level and vice versa. But since r = 0.040 < 0.179 , we may conclude that this correlation is not significant at 5% level. Hence, a relationship as stated above cannot be established.

(g).

Based on the given data, we find that KY counties that have a higher percentage of female-headed households have higher No. of Police assault cases (r = 0.325 > 0.179), higher % of African -Americans ( r = 0.408 >0.179), higher % of Homicide rate (r = 0.199 > 0.179) and higher level of poverty (r = 0.291 > 0.179) based on the given data.


Related Solutions

A researcher intends to determine which variables are more likely to predict public assaults against the...
A researcher intends to determine which variables are more likely to predict public assaults against the police in Kentucky. The variable represents the 2012-2016 average assault rates per 100,000 people. The following correlation matrix shows the relationship between assaults against the police in the state and the 2006-2010 average homicide rates per 100,000, and other socio-demographic variables at the county level in KY, based on 2010 US Census data. Relationships among the independent variables are included as well. Table 1...
A researcher performed a regression analysis to predict achievement in physics using the following predictor variables:...
A researcher performed a regression analysis to predict achievement in physics using the following predictor variables: student gender (1=male, 0=female), emotional intelligence (ei), intelligence quotient (iq), verbal SAT (vsat), and math SAT (msat). The results are shown below. Call: lm(formula = physics ~ Gender + iq + ei + vsat + msat, data = physachv) Residuals:      Min       1Q   Median       3Q      Max -18.8370 -5.3765 -0.6288   5.5483 21.7685 Coefficients: Estimate Std. Error t value Pr(>|t|)    (Intercept) -60.106031   9.784708 -6.143 4.69e-09 ***...
In the year in which it intends to go public, a firm has revenues of $20...
In the year in which it intends to go public, a firm has revenues of $20 million and net income after taxes of $2 million. The firm has no debt, and revenue is expected to grow at 20% annually for the next five years and 5% annually thereafter. Net profit margins are expected remain constant throughout. Capital expenditures are expected to grow in line with depreciation and working capital requirements are minimal. The average beta of a publicly traded company...
7-5) A researcher intends to determine if exposure to a documentary about death penalty would impact...
7-5) A researcher intends to determine if exposure to a documentary about death penalty would impact attitudes toward capital punishment. From a large class of undergraduate students she selects a random sample of 40 students who will watch the movie [experimental group] and a random sample of 40 students [control group] who will not watch the documentary. She then measures one’s opposition toward penalty using a single-item question asking students to rank their opinions on a scale from zero [strong...
A marketing researcher predicts that college students will be more likely to purchase tickets for... A...
A marketing researcher predicts that college students will be more likely to purchase tickets for... A marketing researcher predicts that college students will be more likely to purchase tickets for the next football home game if their team won (vs. lost) the last game. The researcher asked 6 WSU students their willingness to purchase tickets (1= not likely at all, 7=very likely) and 6 EWU students their willingness to purhcase tickets (1=not likely, 7=very likely) (WSU won the game in...
1. A researcher would like to predict the dependent variable YY from the two independent variables...
1. A researcher would like to predict the dependent variable YY from the two independent variables X1X1 and X2X2 for a sample of N=15N=15 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test the significance of the overall regression model. Use a significance level α=0.05α=0.05. X1X1 X2X2 YY 66.4 76.4 58 34.6 39 65.5 32.7 23.1 65.8 44.4 71.2 73.3 57.3 50.8 57.9 32.7 48 74.6 53.3 51.4 64.4 48.3 51.1 59.2 66.9 81.4 59.4...
A researcher would like to predict the dependent variable Y from the two independent variables X1...
A researcher would like to predict the dependent variable Y from the two independent variables X1 and X2 for a sample of N=13 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test statistics to assess the significance of the regression model and partial slopes. Use a significance level α=0.02. X1X1 X2X2 YY 58 64.7 35.9 45 35.4 78.8 69.4 45.5 64.1 63.2 71.9 9 58 42.8 84.3 24.5 55.5 63.5 27.5 49.3 77.3 41.8 50.5...
A researcher would like to predict the dependent variable YY from the two independent variables X1X1...
A researcher would like to predict the dependent variable YY from the two independent variables X1X1 and X2X2 for a sample of N=11N=11 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test statistics to assess the significance of the regression model and partial slopes. Use a significance level α=0.05α=0.05. X1X1 X2X2 YY 55.3 51.1 56.2 72.1 51.6 76.6 35.2 41.7 51.8 70.4 58 47.9 51 71.6 39.8 66.6 60.4 61.9 61.9 48.9 63.4 46.8 54.3...
A researcher would like to predict the dependent variable YY from the two independent variables X1X1...
A researcher would like to predict the dependent variable YY from the two independent variables X1X1 and X2X2 for a sample of N=18N=18 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test the significance of the overall regression model. Use a significance level α=0.05α=0.05. X1X1 X2X2 YY 48.6 52.9 39.2 40.8 58.8 45.5 43.5 64.3 50.1 45.3 32.7 40.8 50.4 47.4 42.9 46.9 44.1 38.4 90.6 46.6 49.3 50.2 33.6 37.3 54.2 28.2 38.8 24.9...
A researcher would like to predict the dependent variable Y from the two independent variables X1...
A researcher would like to predict the dependent variable Y from the two independent variables X1 and X2 for a sample of N=16 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test the significance of the overall regression model. Use a significance level α=0.05. X1             X2             Y 48 42.3 47.1 36.3 58.7 65.4 43.4 40.2 63.6 49.5 37.9 45.6 45.5 37.2 50.8 40.6 64.7 42.4 42.5 46.7 63.1 42.7 40 35.8 55.8 10.6 52.1 40.9...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT