(just how to solve those problems..) 3. You are asked to study the relationship between maternal smoking and low birthweight. You have a Stata dataset of babies' birth weights and whether the mother smoked during pregnancy. Let Yi be a binary variable that equals 1 if a baby is born with low birthweight. Unless otherwise indicated, assume that {Y1, Y2, ..., Yn} are independent and identically distributed. Use the dataset bwght2.dta for this question.

(a) Use Stata to compute the mean of Yi for mothers who didn't smoke. Using only the mean and number of observations, show how you can compute the sample standard deviation.

(b) Your estimate for the proportion of babies with low birthweight is Ȳ = .014. Provide an estimate for the variance of Ȳ.

(c) Suppose you want to test the null hypothesis that the proportion of babies born with low birthweight in this population is equal to .02. Conduct a two-sided test at the 5% significance level manually by constructing the following:
i. The test statistic
ii. The distribution of the test statistic under the null. Explain why you do not need to know the distribution of Ȳ in order to know the distribution of the test statistic. What feature(s) of the setup make it possible to know this distribution?
iii. The rejection rule
iv. The outcome of the test

(d) What is the p-value of the test?

(e) Compute the 95% confidence interval for the proportion of babies with low birthweight.

(f) Confirm the above test results using the built-in Stata command (Hint: to perform a t-test, use the ttest command).

(g) Now use Stata to compute the mean of Yi for mothers who smoked. Test whether mothers who smoke have a different incidence of low birthweight than mothers who don't smoke. Note: you may need to create a variable that indicates whether a mother smoked. (Hint: the first part is
gen cigs_10=1 if cigs>0 & cigs<.
The extra part of the if condition ensures that your new variable is set to missing when cigs is missing. You can type
tabulate cigs cigs_10, missing
when you are done to confirm that your new variable is reasonable.) Conduct a two-sided test at the 5% significance level manually by constructing the following:
i. The null hypothesis
ii. The test statistic

Expert Solution

[Since you asked how to solve these questions, I give only hints. If you have any doubts or need further help, feel free to ask.]

(a) Here the Yi are binary random variables taking the value 0 or 1, and they are independent and identically distributed (iid). If we think of each Yi as a Bernoulli trial with some success probability, the number of low-birthweight babies follows a Binomial model. Here the success probability (P, say) is the population proportion of low-birthweight babies.

Now observe that the mean of the Yi is nothing but the sample proportion of low-birthweight babies (p, say).

Let n = number of observations.

Because Yi^2 = Yi for a 0/1 variable, the sum of squared deviations from the mean is np(1-p), so the sample variance is s^2 = np(1-p)/(n-1) (both n and p are available to us). Note that np(1-p) on its own is the estimated variance of the binomial count, not of a single Yi.

From the sample variance, we get the sample standard deviation by taking the square root: s = sqrt(np(1-p)/(n-1)).
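As a quick check of part (a): for a 0/1 variable the sample standard deviation depends only on the mean p and the sample size n, since Yi^2 = Yi gives s = sqrt(n p (1-p) / (n-1)) with the usual n-1 divisor. The sketch below is only an illustration in Python. The values of n and the number of ones are made up, not taken from bwght2.dta, and the function name is hypothetical.

```python
import math
import statistics

def sd_from_mean(p, n):
    """Sample SD (n-1 divisor) of a 0/1 variable, from its mean p and size n."""
    return math.sqrt(n * p * (1 - p) / (n - 1))

# Hypothetical data: 1000 babies, 14 with low birthweight (illustrative only).
n, ones = 1000, 14
y = [1] * ones + [0] * (n - ones)
p = sum(y) / n

shortcut = sd_from_mean(p, n)   # uses only the mean and n
direct = statistics.stdev(y)    # computed from the raw data, n-1 divisor
print(shortcut, direct)
```

The two numbers agree, which is why the mean and the number of observations are enough.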

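(b) Use the steps from (a): since the Yi are iid, Var(Ȳ) = Var(Yi)/n, which is estimated by Ȳ(1-Ȳ)/n with Ȳ = .014 and n taken from the data.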
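For part (b), the iid assumption gives Var(Ȳ) = Var(Yi)/n, and for a 0/1 variable Var(Yi) is estimated by Ȳ(1-Ȳ). A small Python sketch of the arithmetic follows. Ȳ = .014 comes from the question, but n = 1000 is an assumed placeholder because the question does not state the sample size.

```python
def var_of_mean(p_hat, n):
    """Estimated variance of the sample proportion under iid sampling."""
    return p_hat * (1 - p_hat) / n

p_hat = 0.014   # given in the question
n = 1000        # hypothetical sample size; replace with the N reported by Stata

est = var_of_mean(p_hat, n)
print(est)
```

Using the sample variance with the n-1 divisor instead gives Ȳ(1-Ȳ)/(n-1), which is nearly identical for large n.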

(c) Let the null hypothesis be

H0: P = 0.02 against the alternative H1: P ≠ 0.02

Let the joint distribution of (Y1, Y2, ..., Yn) be f(P).

Now compute the likelihood ratio LR = f(P = 0.02)/f(P = p), where p is the sample proportion.

From this we can construct the test statistic, rejection rule, and p-value: in large samples, -2 ln(LR) is approximately chi-squared with 1 degree of freedom under H0.
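Parts (c) to (e) can also be worked with a t-type statistic, which is what the ttest hint in (f) points to. By the central limit theorem (iid draws with finite variance), (Ȳ - 0.02)/SE(Ȳ) is approximately standard normal under H0 in large samples, whatever the exact distribution of Ȳ. The Python sketch below is only an illustration. n = 1000 is an assumed placeholder, the function names are hypothetical, and the standard error uses the estimated proportion rather than the null value.

```python
import math

def normal_cdf(z):
    """Standard normal CDF via the error function."""
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))

def one_sample_test(p_hat, p0, n, crit=1.96):
    """Two-sided large-sample test of H0: P = p0, plus a 95% interval."""
    se = math.sqrt(p_hat * (1 - p_hat) / n)      # estimated SE of the mean
    t = (p_hat - p0) / se                        # test statistic
    p_value = 2 * (1 - normal_cdf(abs(t)))       # two-sided p-value
    ci = (p_hat - crit * se, p_hat + crit * se)  # 95% confidence interval
    reject = abs(t) > crit                       # rejection rule at 5%
    return t, p_value, ci, reject

# p_hat and p0 are from the question; n is a hypothetical sample size.
t, p_value, ci, reject = one_sample_test(0.014, 0.02, n=1000)
print(t, p_value, ci, reject)
```

The rejection rule |t| > 1.96, the p-value, and the confidence interval always agree: H0 is rejected at the 5% level exactly when 0.02 falls outside the 95% interval.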

(g) Apply the same steps to the two groups of Yi: mothers who smoked and mothers who did not smoke. The null hypothesis is that the two groups have the same proportion of low-birthweight babies.
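For the two-group comparison in (g), one common large-sample statistic is the difference in sample proportions divided by its estimated standard error, with the two group variances added because the groups are independent. The Python sketch below shows the arithmetic. All four inputs are made-up numbers, not results from bwght2.dta, and the function names are hypothetical.

```python
import math

def normal_cdf(z):
    """Standard normal CDF via the error function."""
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))

def two_group_test(p1, n1, p2, n2, crit=1.96):
    """Two-sided large-sample test of H0: P1 = P2 (unpooled variances)."""
    se = math.sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    t = (p1 - p2) / se
    p_value = 2 * (1 - normal_cdf(abs(t)))
    return t, p_value, abs(t) > crit

# Hypothetical inputs: smokers (p1, n1) and non-smokers (p2, n2).
t, p_value, reject = two_group_test(0.030, 200, 0.012, 800)
print(t, p_value, reject)
```

With the real data, p1 and p2 are the two group means of Yi and n1 and n2 are the group sizes reported by Stata.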

orchestra answered 2 years ago
