Questions
The following relative frequency distribution was constructed from a population of 650. Calculate the population mean,...

The following relative frequency distribution was constructed from a population of 650. Calculate the population mean, the population variance, and the population standard deviation.

Class Relative Frequency
−20 up to −10 0.30
−10 up to 0 0.20
0 up to 10 0.40
10 up to 20 0.10

Population Mean -

Population Variance -

Population Standard Deviation -

In: Math

Define each and provide an example. -1 Convenience Sample -2 Cross-sectional research design -3 Research Ethics...

Define each and provide an example.

-1 Convenience Sample

-2 Cross-sectional research design

-3 Research Ethics

-4 Randomization “ Data Collection”

In: Math

You have colleagues who reside in different zip codes. Which measure(s) of central tendency or other...

You have colleagues who reside in different zip codes. Which measure(s) of central tendency or other descriptive statistics would you use to describe this information? (please answer in full paragraphs - thank you)

In: Math

A recent debate about where in the United States skiers believe the skiing is best prompted...

A recent debate about where in the United States skiers believe the skiing is best prompted the following survey. Using α = 0.10, test to see if the best ski area is independent of the level of the skier. Write the hypotheses, calculate the expected counts, check the condition, calculate the test statistic, and use either the critical value approach or the p-value approach to make a conclusion.

Level of the Skier
U.S. Ski Area Beginner Intermediate Advanced
Tahoe 20 30 40
Utah 10 30 60
Colorado 10 40 50

In: Math

A researcher was interested in the effects of alcohol on attractiveness of mate selected (referred to...

A researcher was interested in the effects of alcohol on attractiveness of mate selected (referred to as the beer goggle effect). She compared the rating (by independent raters) on level of attractiveness of the partners that young people chose at a party. She compared a group before using alcohol then after using alcohol. Assessment was whether their chosen partners differed in levels of attractiveness. Higher scores indicated more attractive partners. The attractiveness of partner for each participant (measured by independent raters) in 2 scenarios for the same group (before using alcohol and after using alcohol) was as follows:

Before using alcohol

After using alcohol

5

10

12

14

7

15

5

9

8

11

6

10

10

13

8

11

7

9

  1. Test the hypothesis (Use alpha level of 0.05) using both manual and MS Excel calculations that there is no statistically significant difference between alcohol using and non-alcohol using group with regard to the level of attractiveness of the partners they chose by the end of the party. In your answer determine and interpret the strength of this relationship.

In: Math

Think of a problem that you may be interested in that deals with a comparison of...

Think of a problem that you may be interested in that deals with a comparison of two population means. Propose either a confidence interval or a hypothesis test question that compares these two means. Gather appropriate data and post your problem (without a solution) in the discussion topic. Later, respond to your own post with the solution for others to check their work.

For example, you may want to know if the average weight of a rippled potato chip is the same as the average weight of a non-rippled potato chip. You may weigh rippled regular potato chips from a large bag and find weights of 1.7, 1.9, 2.4, 1.3, 1.7, and 2.0 grams. You may weigh non-rippled potato chips from another large bag and find weights of 1.8, 1.6, 1.9, 1.9, and 1.4 grams. Assume a random sample was drawn.

In: Math

The evidence supporting obesity as a risk factor for colon cancer remains inconclusive, especially among women....

The evidence supporting obesity as a risk factor for colon cancer remains inconclusive, especially among women. A study reported the association between obesity (measured at baseline) and colon cancer morbidity as determined from review of medical records and death certificates in a nationally representative cohort of men and women age 25-74 years who participated in the First National Health and Nutrition Examination Survey from 1971 to 1975 and were subsequently followed up through 1992. The following table is from this study for men and women combined.

Baseline body mass index

(kg/height)2

Number of incident cases of colon cancer

Person-years of follow up

Crude incidence rate/100,000 PY

<22

29

54,475

22-<24

42

39,919

24-<26

37

37,610

26-<28

41

33,635

28-<30

36

22,122

30+

43

35,904

a. Which of the following best describes the research design used in this study? Choose the ONE best answer. (1 point)

  1. Cross-sectional survey
  2. Ecological study
  3. Population based case control study
  4. Cohort study
  5. None of the above

b. Complete the table above by calculating the crude body mass index-specific incidence rates. (Show your work in the table above.) (3 points – ½ point for each correct answer)

c. Calculate the relative risk (rate ratio) of colon cancer associated with a BMI of 30+. Use the lowest BMI category as the reference group. In one sentence interpret your answer. (2 points)

d. Calculate the attributable fraction among those in the 30+ BMI category. In one sentence interpret your answer. (The attributable fraction formulas provided in class can be used even though the data provided here is for rates.) (2 points)

In: Math

Outline the best clustering method for the following tasks (and briefly describe the reason you would...

Outline the best clustering method for the following tasks (and briefly describe the reason you would make such a design) :

(a) Finding oil spills along a coast line.

(b) Clustering employees in a company based on their salaries and years of working experience.

In: Math

There is a strong linkage between statistical data analysis and data mining. Some people think of...

There is a strong linkage between statistical data analysis and data mining. Some people think of data mining as automated and scalable methods for statistical data analysis. Do you agree or disagree with this perception? Present one statistical analysis method that can be automated and/or scaled up nicely by integration with current data mining methodology.

In: Math

You are a news vendor selling the FTU Daily Times every morning. Before you get to...

You are a news vendor selling the FTU Daily Times every morning. Before you get to work, you go to the printer and buy the day’s paper for $0.35 a copy. You sell a copy of the Sun Times for $1.20. Daily demand is distributed normally with mean = 300 and standard deviation = 75. At the end of each morning, any leftover copies are worthless and they go to a recycle bin. How many copies of the FTU Times should you buy each morning? Based on part (a), what is the probability that you will run out of stock?

In: Math

What are the major differences among the three methods for the evaluation of the accuracy of...

What are the major differences among the three methods for the evaluation of the accuracy of a classifier:

(a) hold-out method,

(b) cross-validation, and

(c) bootstrap?

In: Math

The publisher of a sports magazine plans to offer new subscribers one of three gifts: a...

The publisher of a sports magazine plans to offer new subscribers one of three gifts: a sweatshirt with the logo of their favorite team, a coffee cup with the logo of their favorite team, or a pair of earrings with the logo of their favorite team. In a sample of 500 new subscribers, the number selecting each gift is reported below. At the .05 significance level, is there a preference for the gifts or should we conclude that the gifts are equally well liked?

Gift

Frequency

Sweatshirt

183

Coffee cup

175

Earrings

142

In: Math

A bakery opens every day from Monday to Saturday, but only in the morning on Wednesdays....

A bakery opens every day from Monday to Saturday, but only in the morning
on Wednesdays. It is known that the number of bread rolls sold daily follows
a Gaussian distribution with mean 130 and standard deviation 20 with the
exception of Wednesdays for which the distribution of the number of bread
rolls sold is still Gaussian but with mean 100 and standard deviation 30.
(a) What is the probability that on a Wednesday the bakery will sell more
than 140 bread rolls?
(b) What is the probability that on a random opening day the bakery will
sell more than 140 bread rolls?
(c) What is the probability that in a week the bakery will sell more than
800 bread rolls?

In: Math

Define data processing and explain the steps to be followed for data processing

Define data processing and explain the steps to be followed for data processing

In: Math

You’re a researcher looking at whether or not applicants are accepted to a prestigious law internship....

You’re a researcher looking at whether or not applicants are accepted to a prestigious law internship. Only 10% of applicants receive a call to a first interview. You’re interested in two samples: all students from a rural community college who applied (24) and all students from Arizona who applied over the last five years (5804).

What are the mean and standard deviation for the large sample?

Explain in the context of this scenario what the mean represents, with appropriate rounding and units.

For the large sample, what is the probability that at least 600 students in the past five years received a first call?

In: Math