Questions
PSY-520 Graduate Statistics Topic 5 – Benchmark – Correlation and Regression Project Directions: Use the following...

PSY-520 Graduate Statistics

Topic 5 – Benchmark – Correlation and Regression Project

Directions: Use the following information to complete the questions below. While APA format is not required for the body of this assignment, solid academic writing is expected, and documentation of sources should be presented using APA formatting guidelines, which can be found in the APA Style Guide, located in the Student Success Center.

  1. Select at least three variables that you believe have a linear relationship.
    1. Specify how you will measure each of these variables (i.e., what instrument will you use and provide an APA reference for the instrument)
  2. Collect the data for these variables and describe the data collection technique and why it was appropriate as well as why the sample size was best.
    1. Submit the data collected by submitting the SPSS data file with your submission.
  3. Find the Correlation coefficient for each of the possible pairings of variables and describe the relationship in terms of strength and direction.
  4. Find a linear model of the relationship between the three (or more) variables of interest. Identify the predictor variables and the criterion variable.
  5. Provide the output of the SPSS results and interpret the results using correct APA style.

This benchmark assignment assesses the following programmatic competencies: 3.1: Interpret psychological phenomena using scientific reasoning

In: Math

1. A researcher wants to determine the average given below. Describe how the researcher should apply...

1. A researcher wants to determine the average given below. Describe how the researcher should apply the five basic steps in a statistical study. (Assume that all the people in the poll answered truthfully.) The average number of light bulbs per household that function as designed. Determine how to apply the first basic step in a statistical study in this situation. Choose the correct answer below. a. The population is all households. The researcher wants to estimate the average b. The population is all light bulbs that function as designed. The researcher wants to estimate the average number of light bulbs per household that function as designed. c. The population is all light bulbs that function as designed. The researcher wants to estimate the average number of light bulbs per household that do not function as designed. d. The population is all households. The researcher wants to estimate the average number of light bulbs per household that function as designed. 2. Determine how to apply the second basic step in a statistical study in this situation. Choose the correct answer below. a. The researcher should gather data about functionality from the largest sample of light bulbs about whom the researcher can gather data. b. The researcher should only gather raw data from light bulbs that function as designed. c. The researcher should gather raw data from all light bulbs. d. The researcher should only gather raw data from light bulbs that do not function as designed. 3. Determine how to apply the third basic step in a statistical study in this situation. Choose the correct answer below. a. The sample statistic of interest is the average number of light bulbs per household that function as designed. b. The sample statistic of interests is the average number of light bulbs per household that do not function as designed. c. The sample statistic of interest is the percentage of light bulbs that do not function as deigned. d. The sample statistic of interest is the percentage of light bulbs that function as designed. 4. Determine how to apply the fourth basic step in a statistical study in this situation. Choose the correct answer below. a. The researcher should use the sample statistic as an estimate for the population value of the average number of light bulbs that function as designed and then use the methods of statistics to determine how good that estimate is. b. If the percentage of light bulbs that function as designed in the sample is greater than 50%, then the researcher can be confident that all light bulbs function as designed. c. The sample statistic provides no useful information to the researcher in this situation. d. If the researcher followed correct procedures, he or she can be confident that the sample statistics is equal to the average number of light bulbs that function as designed. 5. Determine how to apply the fifth basic step in a statistical study in this situation. Choose the correct answer below. a. The researcher should use the methods of statistics to determine the quality of the estimate of the population parameter and draw conclusions based on this estimate accordingly. b. The researcher cannot draw any conclusions based on the value of the sample statistic. c. There is no way to determine how well the sample statistic estimates the population parameter. d. The researcher knows that the sample statistic is equal to the population parameter, so he or she may draw conclusions with complete confidence.

In: Math

1.5 Suppose you believe that, in general, graduates who have majored in your subject are offered...

1.5 Suppose you believe that, in general, graduates who have majored in your subject are offered higher salaries upon graduating than are graduates of other programs. Describe a statistical experiment that could help test your belief.

1.6 You are shown a coin that its owner says is fair in the sense that it will produce the same number of
heads and tails when flipped a very large number of times.

a.Describe an experiment to test this claim.

b. What is the population in your experiment?

c. What is the sample?

d. What is the parameter?
e. What is the statistic?

f. Describe briefly how statistical inference can be used to test the claim.

1.7 Suppose that in Exercise

1.6 you decide to flip the coin 100 times.

a.What conclusion would you be likely to draw if you observed 95 heads?
b. What conclusion would you be likely to draw if you observed 55 heads?
c.Do you believe that, if you flip a perfectly fair coin 100 times, you will always observe exactly 50 heads? If you answered “no,” then what numbers do you think are possible? If you answered “yes,” how many heads would you observe if you flipped the coin twice? Try flipping a coin twice and repeating this experiment 10 times and report the results.

1.8 Xr01-08 The owner of a large fleet of taxis is trying to estimate his costs for next year’s operations. One major cost is fuel purchase. To estimate fuel purchase, the owner needs to know the total distance his taxis will travel next year, the cost of a gallon of fuel, and the fuel mileage of his taxis. The owner has been provided with the first two figures (distance estimate and cost of a gallon of fuel). However, because of the high cost of gasoline, the owner has recently converted his taxis to operate on propane. He has measured and recorded the propane mileage (in miles per gallon) for 50 taxis.

a.What is the population of interest?
b. What is the parameter the owner needs?

c. What is the sample?
d. What is the statistic?

e. Describe briefly how the statistic will produce the kind of information the owner wants.

In: Math

1.1 In your own words, define and give an example of each of the following statistical...


1.1 In your own words, define and give an example of each of the following statistical terms. a.
population
b. sample statistic
e.
c. parameter d.
statistical inference

1.2 Briefly describe the difference between descriptive statistics and inferential statistics.

1.3 A politician who is running for the office of mayor of a city with 25,000 registered voters commissions a survey. In the survey, 48% of the 200 registered voters interviewed say they plan to vote for her. a.
What is the population of interest? b. What is the sample?
c.Is the value 48% a parameter or a statistic? Explain.

1.4 A manufacturer of computer chips claims that less than 10% of its products are defective. When 1,000 chips were drawn from a large production, 7.5% were found to be defective.

a.What is the population of interest?

b. What is the sample?

c. What is the parameter?
d. What is the statistic?

e.Does the value 10% refer to the parameter or to the statistic?

f. Is the value 7.5% a parameter or a statistic?

g. Explain briefly how the statistic can be used to make inferences about the parameter to test the claim.


1.5 Suppose you believe that, in general, graduates who have majored in your subject are offered higher salaries upon graduating than are graduates of other programs. Describe a statistical experiment that could help test your belief.

1.6 You are shown a coin that its owner says is fair in the sense that it will produce the same number of
heads and tails when flipped a very large number of times.

a.Describe an experiment to test this claim.

b. What is the population in your experiment?

c. What is the sample?

d. What is the parameter?
e. What is the statistic?

f. Describe briefly how statistical inference can be used to test the claim.

1.7 Suppose that in Exercise

1.6 you decide to flip the coin 100 times.

a.What conclusion would you be likely to draw if you observed 95 heads?
b. What conclusion would you be likely to draw if you observed 55 heads?
c.Do you believe that, if you flip a perfectly fair coin 100 times, you will always observe exactly 50 heads? If you answered “no,” then what numbers do you think are possible? If you answered “yes,” how many heads would you observe if you flipped the coin twice? Try flipping a coin twice and repeating this experiment 10 times and report the results.

1.8 Xr01-08 The owner of a large fleet of taxis is trying to estimate his costs for next year’s operations. One major cost is fuel purchase. To estimate fuel purchase, the owner needs to know the total distance his taxis will travel next year, the cost of a gallon of fuel, and the fuel mileage of his taxis. The owner has been provided with the first two figures (distance estimate and cost of a gallon of fuel). However, because of the high cost of gasoline, the owner has recently converted his taxis to operate on propane. He has measured and recorded the propane mileage (in miles per gallon) for 50 taxis.

a.What is the population of interest?
b. What is the parameter the owner needs?

c. What is the sample?
d. What is the statistic?

e. Describe briefly how the statistic will produce the kind of information the owner wants.

In: Math

3. The heights of female students attending a sixth form college have a mean of 168...

3. The heights of female students attending a sixth form college have a mean of 168 cm and standard deviation of 4.5 cm. The heights can be modeled by a normal distribution: a. Find the probability that the height of a randomly selected female student attending this college is less than 172.5 cm? b. Find the probability that the mean height of a random sample of 11 female students from this college exceeds 172 cm?

In: Math

Employees from Company A and Company B both receive annual bonuses. What information would you need...

Employees from Company A and Company B both receive annual bonuses. What information would you need to test the claim that the difference in annual bonuses is greater than $100 at the 0.05 level of significance? Write out the hypothesis and explain the testing procedure.

In: Math

Part 2 – SPSS APPLICATION from this week’s assigned readings and presentations (28 pts) Problem Set...

Part 2 – SPSS APPLICATION from this week’s assigned readings and presentations (28 pts)

Problem Set 1: The independent-samples t test (7 pts)

Research Scenario: A clinical psychologist is studying whether there is a difference between veterans diagnosed with PTSD and veterans without PTSD in reduction of anxiety after aerobic exercise. All veterans in this study agree to complete 30 minutes of aerobic exercise 5 days a week for 4 weeks and began with similar levels of reported anxiety. Scores shown are calculated difference scores in anxiety, with higher scores indicating more of a reduction in anxiety (scale measurement).

Using this table, enter the data into a new SPSS data file and run an independent samples t test to test whether aerobic exercise differentially affects anxiety in veterans with and without PTSD. Remember to name and define your variables under the “Variable View,” then return to the “Data View” to enter and analyze the data. Remember, data will be entered differently than in a paired samples t-test. Specifically, you will have one variable (“Group”), with 0 = no PTSD and 1 = PTSD. The other variable is “Change”. Thus, data will not be entered exactly as shown below.

NoPTSD

PTSD

24

25

23

23

22

27

30

19

31

22

30

13

38

18

25

28

33

21

38

31

  1. Paste relevant SPSS output. (2 pts)
  1.     Create an appropriate graph to display this data. (2 pts)
  1. Write an APA-style Results section based on your analysis. All homework “Results sections” should follow the examples provided in the presentations and textbooks. They should include the statistical statement within a complete sentence that mentions the type of test conducted, whether the test was significant, and if relevant, the effect size. Don’t forget to include a decision about the null hypothesis. (3 pts)

Problem Set 2: Pearson’s correlation (7 pts)

Research Scenario: Is there a positive relationship between grit and dieting success? A researcher examined this issue by having people complete a grit inventory using a Likert-based scale (range 1 – 7), where higher numbers indicate more “grit”. Dieting success was measured using a likert-based inventory as well, with higher numbers indicating more success (range 1-7). Enter the data shown here into SPSS to run an analysis to test whether increased grit is associated with higher dieting success.

Grit

Success

3.5

2.3

2.0

1.1

1.7

0.3

1.8

0.1

2.7

0

3.5

0.7

3.9

3.7

4.1

5.5

3.5

3.7

4.7

6.7

5.1

5.9

3.2

5.3

5.6

4.4

4.1

2.8

  1.     Paste relevant SPSS output. (2 pts)
  1.     Create an appropriate graph to display this data. (2 pts)
  1.     Write an APA-style Results section based on your analysis. All homework “Results sections” should follow the examples provided in the presentations and textbooks. They should include the statistical statement within a complete sentence that mentions the type of test conducted, whether the test was significant, and if relevant, coefficient of determination and its interpretation. Don’t forget to include a decision about the null hypothesis. (3 pts)

Note: Please solve it by text not in hand written form.

In: Math

An amateur astronomer is researching statistical properties of known stars using a variety of databases. They...

An amateur astronomer is researching statistical properties of known stars using a variety of databases. They collect the color index, or B−V index, and distance (in light years) from Earth for 30 stars. The color index of a star is the difference in the light absorption measured from the star using two different light filters (a B and a V filter). This then allows the scientist to know the star's temperature and a negative value means a hot blue star. A light year is the distance light can travel in 1 year, which is approximately 5.9 trillion miles. The data is provided below. Use Excel to calculate the correlation coefficient r between the two data sets, rounding to two decimal places.

B-V index   Distance (ly)
1.1   1380
0.4   556
1.0   771
0.5   304
1.4   532
1.0   751
0.5   267
0.8   229
0.5   552
0.2   896
1.5   1819
0.5   381
0.5   257
1.1   541
0.7   133
0.5   300
0.0   985
0.4   525
1.0   408
1.1   1367
1.07   2848
1.1   128.9
1.12   1766.2
0.64   186.5
0.87   8269.2
0.19   828.9
1.03   153
0.55   223.6
1.39   963.9
0.89   91.7

R=

In: Math

1. T F The number of defects on a product produced by a process is distributed...

1. T F The number of defects on a product produced by a process is distributed as a Poisson distribution.

2. T F For large binomially distributed populations and small samples you can use sampling without replacement

3. T F Given a triangular distribution with minimum 2, maximum 8 and mode 7, the probability of being smaller than 7 is greater than 60%

4. T F The hypergeometric distribution describes sampling without replacement

5. T F The relative frequency of a value is the proportion of times the value occurs.

6. T F The permutation of n numbers is always smaller than the combination of the same n numbers, taken the same number of times

7. T F The intersection of two events is greater than zero when they are mutually exclusive

8. T F The probability of rolling a 6 with a fair six sided die is 1/6.

9. T F The product of the probability of two independent events is called conditional probability.

10. T F When we use both a beta and a triangular distribution to represent a variable, the probability of a low value will always be grater with the triangular distribution

In: Math

Alper, Beatta, and Grandma each pick five cards from a shuffled standard deck. Alper replaces the...

Alper, Beatta, and Grandma each pick five cards from a shuffled standard deck. Alper replaces the card and reshuffles each time he picks. Beatta picks from the deck without replacement. Grandma repeatedly picks the top card from the deck and puts it back on the top of the deck. Count an ace as 14, a king as 13, and so on. Let X,Y,Z be the sum of the numbers Alper, Beatta, and Grandma get, respectively. Which of X,Y,Z has or have the largest expected value?
Which of X,Y,Z has or have the largest variance?

Would someone please help me out with correct and detailed answer please?

In: Math

1. A survey administered to a random sample from a local grocery store finds that political...

1. A survey administered to a random sample from a local grocery store finds that political party is related to whether or not they favor an increase in sales tax, leading to a news story: Republicans support proposed tax increase. This is an example of what type of statistic?

2. What are measures of central tendency? When is each appropriate? What influences each measure of central tendency? (ex. What is affected by every score in the distribution? If you change only the highest score, what measure does that change?). What is the preferred measure of central tendency for a skewed variable? What is the purpose of measures of central tendency?

In: Math

Georgia Cabinets manufactures kitchen cabinets that are sold to local dealers throughout the Southeast. Because of...

Georgia Cabinets manufactures kitchen cabinets that are sold to local dealers throughout the Southeast. Because of a large backlog of orders for oak and cherry cabinets, the company decided to contract with three smaller cabinetmakers to do the final finishing operation. For the three cabinetmakers, the number of hours required to complete all the oak cabinets, the number of hours required to complete all the cherry cabinets, the number of hours available for the final finishing operation, and the cost per hour to perform the work are shown here:

Cabinetmaker 1 Cabinetmaker 2 Cabinetmaker 3
Hours required to complete all the oak cabinets 47 40 27
Hours required to complete all the cherry cabinets 64 52 36
Hours available 40 30 35
Cost per hour $34 $41 $52

a. Formulate a linear programming model that can be used to determine the proportion of the oak cabinets and the proportion of the cherry cabinets that should be given to each of the three cabinetmakers in order to minimize the total cost of completing both projects.

Let O1 = proportion of Oak cabinets assigned to cabinetmaker 1
O2 = proportion of Oak cabinets assigned to cabinetmaker 2
O3 = proportion of Oak cabinets assigned to cabinetmaker 3
C1 = proportion of Cherry cabinets assigned to cabinetmaker 1
C2 = proportion of Cherry cabinets assigned to cabinetmaker 2
C3 = proportion of Cherry cabinets assigned to cabinetmaker 3
Min __________O1 + __________O2 + __________O3 + __________C1 + __________C2 + __________C3
s.t.
__________O1 __________C1 __________ Hours avail. 1
__________O2 + __________C2 __________ Hours avail. 2
__________O3 + __________C3 __________ Hours avail. 3
__________O1 + __________O2 + __________O3 = __________ Oak
__________C1 + __________C2 + __________C3 = __________ Cherry
O1, O2, O3, C1, C2, C3 ≥ 0

b. Solve the model formulated in part (a). What proportion of the oak cabinets and what proportion of the cherry cabinets should be assigned to each cabinetmaker? What is the total cost of completing both projects? If required, round your answers for the proportions to three decimal places, and for the total cost to two decimal places.

Cabinetmaker 1 Cabinetmaker 2 Cabinetmaker 3
Oak O1 = _______ O2 = _______ O3 = _______
Cherry C1 = _______ C2 = _______ C3 = _______

Total cost = $ __________

c. If Cabinetmaker 1 has additional hours available, would the optimal solution change? YES OR NO
Explain.

d. If Cabinetmaker 2 has additional hours available, would the optimal solution change? YES OR NO
Explain.

e. Suppose Cabinetmaker 2 reduced its cost to $38 per hour. What effect would this change have on the optimal solution? If required, round your answers for the proportions to three decimal places, and for the total cost to two decimal places.

Cabinetmaker 1 Cabinetmaker 2 Cabinetmaker 3
Oak O1 = _______ O2 = _______ O3 = _______
Cherry C1 = _______ C2 = _______ C3 = _______

Total cost = $ __________

Explain.

In: Math

Poly(3-hydroxybutyrate) (PHB), a semicrystalline polymer that is fully biodegradable and biocompatible, is obtained from renewable resources....

Poly(3-hydroxybutyrate) (PHB), a semicrystalline polymer that is fully biodegradable and biocompatible, is obtained from renewable resources. From a sustainability perspective, PHB offers many attractive properties though it is more expensive to produce than standard plastics. The accompanying data on melting point (°C) for each of 12 specimens of the polymer using a differential scanning calorimeter appeared in an article.

180.6 181.7 180.8 181.5 182.6 181.6 181.2 182.1 182.2 180.4 181.8 180.4

Compute the following. (Round your answers to two decimal places.)

(a) the sample range

(b) the sample variance s^2 from the definition [Hint: First subtract 180 from each observation.]

(c) the sample standard deviation

(d) s^2 using the shortcut method

In: Math

La Jolla Beverage Products is considering producing a wine cooler that would be a blend of...

La Jolla Beverage Products is considering producing a wine cooler that would be a blend of a white wine, a rose wine, and fruit juice. To meet taste specifications, the wine cooler must consist of at least 50% white wine, at least 20% and no more than 30% rose, and exactly 20% fruit juice. La Jolla purchases the wine from local wineries and the fruit juice from a processing plant in San Francisco. For the current production period, 10000 gallons of white wine and 8000 gallons of rose wine can be purchased; an unlimited amount of fruit juice can be ordered. The costs for the wine are $1 per gallon for the white and $1.5 per gallon for the rose; the fruit juice can be purchased for $0.5 per gallon. La Jolla Beverage Products can sell all of the wine cooler it can produce for $2.5 per gallon.

a. Is the cost of the wine and fruit juice a sunk cost or a relevant cost in this situation? Explain.

b. Formulate a linear program to determine the blend of the three ingredients that will maximize the total profit contribution. Solve the linear program to determine the number of gallons of each ingredient La Jolla should purchase and the total profit contribution it will realize from this blend. If required, round your answers to one decimal place. For subtractive or negative numbers use a minus sign even if there is a + sign before the blank.

Let W = gallons of white wine
Let R = gallons of rose wine
Let F = gallons of fruit wine
Max __________W + _________R + __________F
s.t.
__________W + __________R + __________F __________ % white
__________W + __________R + __________F __________ % rose minimum
__________W + __________R + __________F __________ % rose maximum
__________W + __________R + __________F = __________ % fruit juice
__________W __________ Available white
__________R __________ Available rose
W, R, F ≥ 0
Optimal Solution:
W __________
R __________
F __________

Profit Contribution = __________

c. If La Jolla could obtain additional amounts of the white wine, should it do so? YES OR NO

If so, how much should it be willing to pay for each additional gallon, and how many additional gallons would it want to purchase?

d. If La Jolla Beverage Products could obtain additional amounts of the rosé wine, should it do so? YES OR NO.

If so, how much should it be willing to pay for each additional gallon, and how many additional gallons would it want to purchase?

e. Interpret the shadow price for the constraint corresponding to the requirement that the wine cooler must contain at least 50% white wine. What is your advice to management given this shadow price?

f. Interpret the shadow price for the constraint corresponding to the requirement that the wine cooler must contain exactly 20% fruit juice. What is your advice to management given this shadow price?



In: Math

1. An impartial judge of a local garden competition collects scores from two groups of judges...

1. An impartial judge of a local garden competition collects scores from two groups of judges on what he suspects to be the number one garden. He finds the first group (n = 5) has a mean score of 76. The second group (n = 3) has a mean score of 91. So that he may declare the final score, what is the weighted mean of these two groups?

2. A social scientist measures the number of minutes (per day) that a small hypothetical population of college students spends online. Student Score Student Score A 94 F 96 B 88 G 25 C 74 H 61 D 88 I 82 E 98 J 98

(a) What is the range of data in this population? min

(b) What is the IQR of data in this population? min

(c) What is the SIQR of data in this population? min

(d) What is the population variance?

(e) What is the population standard deviation? (Round your answer to two decimal places.) min

3. A sociologist records the annual household income (in thousands of dollars) among a sample of families living in a high-crime neighborhood. Locate the lower, median, and upper quartiles for the times listed below. Hint: First arrange the data in numerical order.

a) lower quartile thousand dollars

b) median thousand dollars

c) upper quartile thousand dollars

42 22 46 33 37 32 37 47 51 25

4. A theme park owner records the number of times the same kids from two separate age groups ride the newest attraction. Age 13–16 Time Age 17–21 Time 1 10 1 4 2 9 2 3 3 3 3 7 4 1 4 4 5 10 5 8 6 3 6 1 7 8 7 2 8 9 8 5 9 6 9 4 10 5 10 2 Using the computational formula, for the age group of 13–16? (Round your answers for variance and standard deviation to two decimal places.) what is the

a) SS

b) sample variance

c) standard deviation

In: Math