Question

You are a data science consultant! In each of the following cases, decide whether you would suggest a flexible regression model or an inflexible one. Provide your reasons as clearly as possible.

(a) In the study of breast cancer, a scientist is trying to find the genes associated with breast cancer. The total number of genes in the study is 50,000 and the number of patients is 120.

(b) The Ministry of Education in a certain country wants to identify students who need extra help. They wish to design a system which estimates student performance in the final 8th grade math exam based on their math, science and history grades in the 7th grade. To do this, they want to run a regression on the data from all the students who have graduated from the 8th grade in the last 10 years.

(c) Kelly is a very hardworking chemistry student and she has run an experiment to find a mathematical expression that relates the speed of corrosion of iron to the humidity and temperature of the environment, and the percentage of different elements in the alloy. Unfortunately, the lab that she is working in was established in 1967 and the equipment has not been changed since then. This has caused measurements to vary significantly between different experimental runs, even when the parameters were the same. She is skeptical about the quality of her measurements of the speed of corrosion.

(d) Kelly’s advisor won the Nobel prize in chemistry and used the prize money to outfit the lab with the most modern equipment. Kelly ran her experiments again with the new equipment and now she can trust her numbers. However, her advisor believes that she should not expect the real relationship to be linear.

Solutions

Expert Solution

(a) An inflexible regression model is the better choice. With 50,000 genes and only 120 patients, the number of predictors far exceeds the number of observations, so a flexible model has enough freedom to fit the training data perfectly and would overfit. The goal is also inference (identifying which genes are associated with breast cancer), which favors a simple, interpretable model, for example a sparse linear model that selects a small subset of genes.
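To see why having far more predictors than observations is a problem, here is a minimal simulation sketch. It is not the scientist's data: the sizes (120 observations, 2,000 predictors instead of 50,000, to keep it fast), the function name `pure_noise_fit` and the random data are all assumptions for illustration. The response is generated independently of the predictors, yet an unconstrained least-squares fit still reproduces the training responses almost exactly.

```python
import numpy as np

def pure_noise_fit(n=120, p=2000, seed=0):
    """Fit unconstrained least squares when p > n and y is unrelated to X.

    All sizes and data here are hypothetical; p is smaller than the
    50,000 genes in the question only to keep the example quick.
    """
    rng = np.random.default_rng(seed)
    X_train = rng.standard_normal((n, p))
    y_train = rng.standard_normal(n)      # pure noise, no real signal
    X_test = rng.standard_normal((n, p))
    y_test = rng.standard_normal(n)

    # With p > n, lstsq returns the minimum-norm solution, which
    # interpolates the training responses.
    coef, *_ = np.linalg.lstsq(X_train, y_train, rcond=None)

    train_mse = float(np.mean((X_train @ coef - y_train) ** 2))
    test_mse = float(np.mean((X_test @ coef - y_test) ** 2))
    return train_mse, test_mse

train_mse, test_mse = pure_noise_fit()
print(f"train MSE: {train_mse:.2e}")
print(f"test MSE:  {test_mse:.2f}")
```

The training error is essentially zero even though there is nothing to learn, while the error on fresh data is no better than predicting the mean. Any model with this much freedom will "find" associations that are not there, which is why a constrained, inflexible model is preferred when predictors greatly outnumber observations.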

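(b) A flexible regression model is a reasonable choice here. There are only three predictors (the 7th grade math, science and history grades), and ten years of graduating students gives a very large sample, so a flexible model can capture nonlinearities and interactions between the grades with little risk of overfitting. The goal is prediction of exam performance rather than interpretation, so the loss of interpretability matters less. A linear model is still a useful baseline: if it predicts about as well on held-out students, the simpler model can be kept.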

(c) An inflexible regression model is the better choice. Measurements that vary significantly between runs with identical parameters indicate a large noise variance in the response. A flexible model would tend to fit this noise rather than the underlying relationship, which increases the variance of the fit. An inflexible model averages over the noise and gives more stable estimates, at the cost of some bias. Note that large variability between runs does not by itself tell us whether the errors are normally distributed, so the choice here rests on the noise level, not on the error distribution.

(d) A flexible regression model is the better choice. With the new equipment the measurement noise is small, so there is little risk of a flexible model fitting noise. Since the relationship is not expected to be linear, an inflexible model that assumes linearity would be biased and is not appropriate here. A flexible model can capture the nonlinear dependence of the corrosion speed on humidity, temperature and alloy composition.
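The contrast between parts (c) and (d) comes down to the noise level, and it can be sketched with a small simulation. Everything below is an assumption made for illustration, not Kelly's experiment: a single predictor, a hypothetical mildly nonlinear truth `true_f`, 30 measurements per run, a straight line as the inflexible model and a degree-5 polynomial as the flexible one. The error is measured against the true curve on a dense grid and averaged over repeated experiments.

```python
import numpy as np

def true_f(x):
    # Hypothetical, mildly nonlinear "real" relationship.
    return x + 0.5 * x**2

def mean_test_error(degree, noise_sd, n=30, reps=200, seed=0):
    """Average squared error of a polynomial fit against the true curve."""
    rng = np.random.default_rng(seed)
    x_train = np.linspace(-1.0, 1.0, n)
    x_test = np.linspace(-1.0, 1.0, 201)
    errors = []
    for _ in range(reps):
        # Simulate one noisy experimental run.
        y_train = true_f(x_train) + noise_sd * rng.standard_normal(n)
        coef = np.polyfit(x_train, y_train, degree)
        pred = np.polyval(coef, x_test)
        errors.append(np.mean((pred - true_f(x_test)) ** 2))
    return float(np.mean(errors))

for noise_sd in (0.05, 1.0):  # precise equipment vs. noisy equipment
    rigid = mean_test_error(degree=1, noise_sd=noise_sd)
    flexible = mean_test_error(degree=5, noise_sd=noise_sd)
    print(f"noise sd {noise_sd}: linear {rigid:.4f}, degree-5 {flexible:.4f}")
```

Under these assumptions the degree-5 fit wins when the noise is small, because the straight line cannot follow the curvature, and the straight line wins when the noise is large, because the extra coefficients mostly chase noise. The exact crossover depends on the sample size and on how nonlinear the truth is.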

