Questions
7.20 Body measurements, Part III. Exercise 7.13 introduces data on shoulder girth and height of a...

7.20 Body measurements, Part III. Exercise 7.13 introduces data on shoulder girth and height of a group of individuals. The mean shoulder girth is 107.30 cm with a standard deviation of 10.34 cm. The mean height is 171.14 cm with a standard deviation of 9.41 cm. The correlation between height and shoulder girth is 0.69. 1. (a) Write the equation of the regression line for predicting height. 2. (b) Interpret the slope and the intercept in this context. 3. (c) Calculate R2 of the regression line for predicting height from shoulder girth, and interpret it in the context of the application. 4. (d) A randomly selected student from your class has a shoulder girth of 100 cm. Predict the height of this student using the model. 5. (e) The student from part (d) is 160 cm tall. Calculate the residual, and explain what this residual means. 6. (f) A one year old has a shoulder girth of 56 cm. Would it be appropriate to use this linear model to predict the height of this child? *answers to 3 decimal points*!!!

In: Math

Roll two ordinary dice and let X be their sum. Draw the pmf and cmf for...

Roll two ordinary dice and let X be their sum. Draw the pmf and cmf for X. Compute the mean and standard deviation of X. Solve using R studio coding.

In: Math

Suppose there is a basket containing one apple and two oranges. A student randomly pick one...

Suppose there is a basket containing one apple and two oranges. A student randomly pick one fruit from the basket until the first time the apple is picked. (Sampling with replacement)

(a) What is the sample space for this experiment? What is the probability that the student pick the apple after i tosses?

(b) What is the expected number of times the students need to pick the apple?

(c) Let E be the event that the first time an apple is picked up is after an even number of picks. What set of outcomes belong to this event? What is the probability that E occurs?

In: Math

Your friend texted you a question. You did some research and found that a Minifig is...

Your friend texted you a question. You did some research and found that a Minifig is a package containing one figurine. You also found out that, for every 60 packs produced, 4 of those are Chip and 4 are Dale.

A) What is the probability of opening one box and getting Chip?

B) What is the probability of opening one box and getting Chip OR Dale?

C) What is the probability of opening two boxes and specifically getting Chip in the first box and Dale in the second box?

D) What is the probability of opening two boxes and getting one Chip and one Dale (in either order)?

ANSWER ALL PARTS A-D

In: Math

In a certain population, 25% of the person smoke and 7% have a certain type of...

In a certain population, 25% of the person smoke and 7% have a certain type of heart disease. Moreover, 10% of the persons who smoke have the disease.

What percentage of the population smoke and have the disease?

What percentage of the population with the disease also smoke?

In: Math

1. Assume that a sample is used to estimate a population proportion p. Find the 90%...

1. Assume that a sample is used to estimate a population proportion p. Find the 90% confidence interval for a sample of size 112 with 28% successes. Enter your answer as a tri-linear inequality using decimals (not percents) accurate to three decimal places.

2.  You want to obtain a sample to estimate a population proportion. Based on previous evidence, you believe the population proportion is approximately 88%. You would like to be 98% confident that your estimate is within 3% of the true population proportion. How large of a sample size is required? n= ________

3.  You want to obtain a sample to estimate a population mean. Based on previous evidence, you believe the population standard deviation is approximately σ=43.7σ=43.7. You would like to be 90% confident that your estimate is within 3 of the true population mean. How large of a sample size is required? n= _____

In: Math

An investor holding a certain portfolio consisting of two stocks invests 20% in Stock A and...

An investor holding a certain portfolio consisting of two stocks invests 20% in Stock A and 80% in Stock B. The expected return from Stock A is 4% and that from Stock B is 12%. The standard deviations are 8% and 10% for Stocks A and B respectively. Compute the expected return of the portfolio. b) Compute the standard deviation of the portfolio assuming the correlation between the two stocks is 0.75. c) Compute the standard deviation of the portfolio assuming the correlation between the two stocks is 1. d) Compare your answers in (b) and (c). What do you observe and why?

In: Math

From public records, individuals were identified as having been charged with drunken driving not less than...

From public records, individuals were identified as having been charged with drunken driving not less than 6 months or more than 12 months from the starting date of the study. Two random samples from this group were studied. In the first sample of 29 individuals, the respondents were asked in a face-to-face interview if they had been charged with drunken driving in the last 12 months. Of these 29 people interviewed face to face, 11 answered the question accurately. The second random sample consisted of 43 people who had been charged with drunken driving. During a telephone interview, 28 of these responded accurately to the question asking if they had been charged with drunken driving during the past 12 months. Assume the samples are representative of all people recently charged with drunken driving. Please show all steps in getting the answer.

(a) Categorize the problem below according to parameter being estimated, proportion p, mean μ, difference of means μ1 – μ2, or difference of proportions p1 – p2. Then solve the problem.

μ

μ1 – μ2

p

p1 – p2

(b) Let p1 represent the population proportion of all people with recent charges of drunken driving who respond accurately to a face-to-face interview asking if they have been charged with drunken driving during the past 12 months. Let p2 represent the population proportion of all people who respond accurately to the question when it is asked in a telephone interview. Find a 95% confidence interval for p1 – p2. (Use 3 decimal places.)

lower limit

upper limit

(c) Does the interval found in part (a) contain numbers that are all positive? all negative? mixed? Comment on the meaning of the confidence interval in the context of this problem. At the 95% level, do you detect any differences in the proportion of accurate responses to the question from face-to- face interviews as compared with the proportion of accurate responses from telephone interviews?

Because the interval contains only positive numbers, we can say that there is a higher proportion of accurate responses in face-to-face interviews.

Because the interval contains both positive and negative numbers, we can not say that there is a higher proportion of accurate responses in face-to-face interviews.

We can not make any conclusions using this confidence interval.

Because the interval contains only negative numbers, we can say that there is a higher proportion of accurate responses in telephone interviews.

In: Math

(1) A researcher wants to test H0: x1 = x2 versus the two-sided alternative Ha: x1...

(1) A researcher wants to test H0: x1 = x2 versus the two-sided alternative Ha: x1 ≠ x2.

(a) This alternative hypothesis indicates a one-sided hypothesis instead of a two-sided hypothesis.

(b) The alternative hypothesis Ha should indicate that x1 ≥ x2 .

(c)The null hypothesis (but not the alternative hypothesis) should involve μ1 and μ2 (population means) rather than x1 and x2 (sample means).

(d) Hypotheses should involve μ1 and μ2 (population means) rather than x1 and x2 (sample means).

(e) The null hypothesis H0 should indicate that the two means are not equal.

(2) A study recorded the IQ scores of 50 college freshmen. The scores of the 24 males in the study were compared with the scores of all 50 freshmen using the two-sample methods of this section.

(a)The samples are too large to be used for hypothesis testing.

(b)The samples are not independent; we would need to compare the 24 males to the 26 females.    

(c) The samples are too small to be used for hypothesis testing.

(d)The sample sizes are too different to be used for hypothesis testing; we would need to have more males in the sample.

(e)A two-sample method is not appropriate in this situation.

(3)A two-sample t statistic gave a P-value of 0.93. From this we can reject the null hypothesis with 90% confidence.

(a)We can reject the null hypothesis, but with less than 90% confidence.

(b) We need the P-value to be small to reject H0.    

(c) We can reject the null hypothesis, but with more than 90% confidence.

(d) We need the P-value to be negative to reject H0.

(e)A P-value of this size is impossible.

(4)A researcher is interested in testing the one-sided alternative Ha: μ1 < μ2. The significance test gave t = 2.25. Since the P-value for the two-sided alternative is 0.04, he concluded that his P-value was 0.02.

(a) The alternative hypothesis should state that Ha: μ1μ2.

(b) A one-sided alternative should never be used.    

(c) The alternative hypothesis should state that Ha: μ1μ2.

(d) Assuming the researcher computed the t statistic using x1x2,a positive value of t does not support Ha.

(e)A t statistic of this size should have a much larger P-value associated with it.

In: Math

Solve this word problem using step by step procedure: The math department at a local university...

Solve this word problem using step by step procedure: The math department at a local university has customarily advised students to purchase Calculator A. The manufacturer has recently released a new model, Calculator B, which is reputed to be more user-friendly. The faculty decided to determine if there is a difference in the time required to perform a certain common statistical calculation. Twelve students chosen at random are given drills with both calculators so that they are familiar with the operation of each type. Then the time they take to complete the test calculation on each device is measured in seconds (which calculator they are to use first is determined by some random procedure to control for any additional learning during the first calculations). To be clear, each student worked through a particular type of statistics problem with one calculator and then did a similar problem on the other calculator. For each student, the amount of time in seconds that they spent on the problem with each calculator was recorded. Is Calculator B likely to be a more effective device than Calculator A?

Student CalculatorA Calculator B
1 23 19
2 18 18
3 29 24
4 22 23
5 33 31
6 20 22
7 17 16
8 25 23
9 27 24
10 30 26
11 25 24
12 27 28

b) Please use step by step guidelines so I can follow in this excel problem: In the sheet entitled “Part 2 Question 7”, you will find the death rate (per 1,000 resident population) for random samples of counties in Alaska and Texas. Is the average death rate among Alaska counties likely to be lower than that among Texas counties?

Alaska Texas
1.4 7.2
4.2 5.8
7.3 10.5
4.8 6.6
3.2 6.9
3.4 9.5
5.1 8.6
5.4 5.9
6.7 9.1
3.3 5.4
1.9 8.8
8.3 6.1
3.1 9.5
6 9.6
4.5 7.8
2.5 10.2
5.6
8.6

In: Math

The head start program provides a wide range of services to low-income children up to the...

The head start program provides a wide range of services to low-income children up to the age of 5 years and their families. Its goals are to provide services to improve social and learning skills and to improve health and nutrition status so that the participants can begin school on an equal footing with their more advantaged peers. The distribution of ages for participating children is as follows 12% five-year-olds, 47% four-year-olds, 33% three-year-olds, and 8% under three years. When the program was assessed in a particular region, it was found that of the 215 randomly selected participants, 17 were 5 years old, 101 were 4 years old, 60 were 3 years old, 37 were under 3 years old. Perform a test to see if there sufficient evidence at αα = 0.10 that the region's proportions differ from the national proportions.

The correct hypotheses are:

  • H0:H0: The regional distribution is a good fit to the national distribution. HA:HA: The regional distribution is not a good fit to the national distribution.(claim)
  • H0:H0: The regional distribution is not a good fit to the national distribution. HA:HA: The regional distribution is a good fit to the national distribution.(claim)
  • H0:H0: The regional distribution and the national distribution are independent. HA:HA: The regional distribution and the national distribution are dependent.(claim)
  • H0:H0: The regional distribution and the national distribution are dependent. HA:HA: The regional distribution and the national distribution are independent.(claim)

The test value is (round to 3 decimal places)

The p-value is (round to 3 decimal places)

According to the Bureau of Transportation Statistics, on-time performance by airlines is described as follows:

Action % of time
On Time 71
National Aviation System Delay 8
Aircraft arriving late 8
Other (weather and other conditions) 13

When a study was conducted it was found that of the 250 randomly selected flights, 180 were on time, 18 were a National Aviation System Delay, 20 arriving late , 32 were due to weather. Perform a test to see if there is sufficient evidence at αα = 0.05 to see if these differ from the governments statistics.

The correct hypotheses are:

  • H0:H0: The sample is a good fit to the Bureau of Transportation Statistics. HA:HA: The sample is not a good fit to the Bureau of Transportation Statistics.(claim)
  • H0:H0: The sample is not a good fit to the Bureau of Transportation Statistics. HA:HA: The sample is a good fit to the Bureau of Transportation Statistics.(claim)
  • H0:H0: The sample and the Bureau of Transportation Statistics are independent. HA:HA: The sample and the Bureau of Transportation Statistics are dependent.(claim)
  • H0:H0: The sample and the Bureau of Transportation Statistics are dependent. HA:HA: The sample and the Bureau of Transportation Statistics are independent.(claim)

The test value is  (round to 3 decimal places)

The p-value is (round to 3 decimal places)

The head start program provides a wide range of services to low-income children up to the age of 5 years and their families. Its goals are to provide services to improve social and learning skills and to improve health and nutrition status so that the participants can begin school on an equal footing with their more advantaged peers. The distribution of ages for participating children is as follows 10% five-year-olds, 50% four-year-olds, 28% three-year-olds, and 12% under three years. When the program was assessed in a particular region, it was found that of the 202 randomly selected participants, 20 were 5 years old, 99 were 4 years old, 46 were 3 years old, 37 were under 3 years old. Perform a test to see if there sufficient evidence at αα = 0.01 that the region's proportions differ from the national proportions.

The correct hypotheses are:

  • H0:H0: The regional distribution is a good fit to the national distribution. HA:HA: The regional distribution is not a good fit to the national distribution.(claim)
  • H0:H0: The regional distribution is not a good fit to the national distribution. HA:HA: The regional distribution is a good fit to the national distribution.(claim)
  • H0:H0: The regional distribution and the national distribution are independent. HA:HA: The regional distribution and the national distribution are dependent.(claim)
  • H0:H0: The regional distribution and the national distribution are dependent. HA:HA: The regional distribution and the national distribution are independent.(claim)

The test value is (round to 3 decimal places)

The p-value is (round to 3 decimal places)

According to the Bureau of Transportation Statistics, on-time performance by airlines is described as follows:

Action % of time
On Time 68
National Aviation System Delay 7
Aircraft arriving late 8
Other (weather and other conditions) 17

When a study was conducted it was found that of the 230 randomly selected flights, 159 were on time, 21 were a National Aviation System Delay, 21 arriving late , 29 were due to weather. Perform a test to see if there is sufficient evidence at αα = 0.01 to see if these differ from the governments statistics.

The correct hypotheses are:

  • H0:H0: The sample is a good fit to the Bureau of Transportation Statistics. HA:HA: The sample is not a good fit to the Bureau of Transportation Statistics.(claim)
  • H0:H0: The sample is not a good fit to the Bureau of Transportation Statistics. HA:HA: The sample is a good fit to the Bureau of Transportation Statistics.(claim)
  • H0:H0: The sample and the Bureau of Transportation Statistics are independent. HA:HA: The sample and the Bureau of Transportation Statistics are dependent.(claim)
  • H0:H0: The sample and the Bureau of Transportation Statistics are dependent. HA:HA: The sample and the Bureau of Transportation Statistics are independent.(claim)

The test value is (round to 3 decimal places)

The p-value is (round to 3 decimal places)

In: Math

A graduate picks 25 exams at random from a total of 194 exams. Those 25 exams...

A graduate picks 25 exams at random from a total of 194 exams. Those 25 exams have a mean of 78.2% ± 17.1% (mean ± 1 standard deviation). You may assume that these 25 exam scores are approximately normally distributed.

a) Estimate the population mean exam score and its confidence interval for the 95% confidence level.

b) Estimate the population exam score standard deviation and its confidence interval for the 95% confidence level.

c) Estimate how many students failed the exam if the passing grade is 60%.

d) Estimate how many students scored above 90% on the exam.

e) The graduate draws another set of exams of 25 exams, and these exam scores are not normally distributed: they are skewed towards a high score. What would you tell the graduate so that s/he can estimate the population mean using methods we learned in class?

In: Math

For the given reports, are the given values of the correlation coefficient reasonable? 1. A) A...

For the given reports, are the given values of the correlation coefficient reasonable?

1. A) A correlation of r = +0.7 between gender and height.

i) Reasonable ii) Unreasonable

1. B) A correlation of r = +1.0 between outdoor temperature and sales of ice cream.

i) Reasonable ii) Unreasonable

1. C) A correlation of r = 0 between shoe size and IQ scores in a study of adults.

i) Reasonable ii) Unreasonable

1. D) A correlation of r = +0.7 between distance from the equator for North American cities and average January temperature.

i) Reasonable ii) Unreasonable

1. E) A correlation of r = -0.8 between outside temperature and sales of hot chocolate.

i) Reasonable ii) Unreasonable

In: Math

Describe what qualitative forecasting models are. What are the advantages and disadvantages of this modelling? (5...

  1. Describe what qualitative forecasting models are. What are the advantages and disadvantages of this modelling?
  2. “High correlation between two variables means that one is the cause and the other is the effect”. Explain this statement

In: Math

A type of new car is offered for sale with 4 option packages. A customer can...

A type of new car is offered for sale with 4 option packages. A customer can buy any number of​ these, from none to all 4. A manager proposes the null hypothesis that customers pick packages at​ random, implying the number of packages bought by a customer should be binomial with n equals=4.
This table shows the number of packages chosen by 400 customers.

Packages Customers
0 30
1 104
2 147
3 107
4 12

​(a) Find the binomial parameter p needed to calculate the expected counts.

​(b) Find the estimated probability that a customer picks one option.

​(c) Find the degrees of freedom for chiχsquared2.

(d) Find the value of chiχsquared2 for testing Upper H0

​(e) Find the​ p-value for testing Upper H0


.

In: Math