Question

In: Statistics and Probability

Consider the dataset regarding the drop in light percent output (output) and two input variables –...

Consider the dataset regarding the drop in light percent output (output) and two input variables – bulb surface (qualitative) and length of the operation hours (quantitative). The data is available in a sheet named ‘Problem 3’. Answer the following.

(a) Write down the overall model form if one wishes to build a second order model for each value of the qualitative variable

(c) Build a regression model showing the 90% confidence ranges of the regression parameters. Write down the mean estimates of the regression parameters for the model in (a)

(d) Write down the 90% bounds of the estimate of the y-intercept (constant term)
(e) Compute the model prediction for a bulb with a dirty surface and with 1500 operation hours.

(f) Perform one-way hypothesis testing for the existence of ?2 term, where ?2 is the qualitative input. Use ? = 0.05

DATA

Drop in light output percent (Output) Bulb surface (Qualitative input) Length of operation hours (Quantitative input)
0 Clean 0
16 Clean 400
22 Clean 800
27 Clean 1200
32 Clean 1600
36 Clean 2000
38 Clean 2400
0 Dirty 0
4 Dirty 400
6 Dirty 800
8 Dirty 1200
9 Dirty 1600
11 Dirty 2000
12 Dirty 2400

Solutions

Expert Solution

SOLUTION

(a) y=b0+b1x+b2x2

y=Drop in light output percent (Output) and x=Length of operation hours (Quantitative input)

(c)y=1.2738+0.0183*x-3.6E-06*x2

(d)

Lower 90% Upper 90
-10.8200 13.3677

(e) for x=1500, y=1.2738+0.0183*1500-3.6E-06*1500*1500=20.6238

(f) here we use t-test and t=coefficient/SE(coefficient)=-3.6E-06/5.26E-06=-0.6859

the one-tailed critical t(0.05,11)=1.7959 is greater than absolute value of calculated t=0.6859, so x2 term is not significant and it may be removed from the model ( i.e. no existence)

following regression analysis information has been generated using ms-excel

SUMMARY OUTPUT
Regression Statistics
Multiple R 0.637572
R Square 0.406499
Adjusted R Square 0.298589
Standard Error 10.9107
Observations 14
ANOVA
df SS MS F Significance F
Regression 2 896.881 448.4405 3.767037 0.056730925
Residual 11 1309.476 119.0433
Total 13 2206.357
Coefficients Standard Error t Stat P-value Lower 90% Upper 90%
Intercept 1.27381 6.734228 0.189155 0.853417 -10.82008801 13.36770706
X 0.018348 0.013142 1.396123 0.190214 -0.005253775 0.041950204
X2 -3.6E-06 5.26E-06 -0.68591 0.506971 -1.3057E-05 5.83973E-06

Related Solutions

Consider a relational dataset and specify your input and output variables, then: Train the model using...
Consider a relational dataset and specify your input and output variables, then: Train the model using 80% of this dataset and suggest an appropriate GLM model output to input variables. Specify the significant variables on the output variable at the level of ?=0.05 and explore the related hypotheses test. Estimate the parameters of your model. Predict the output of the test dataset using the trained model. Provide the functional form of the optimal predictive model. Provide the confusion matrix and...
Consider the dataset between a quantitative input variable, ? and a quantitative response (output) variable, ?...
Consider the dataset between a quantitative input variable, ? and a quantitative response (output) variable, ? . Which of the following provides an optimal fit between them - a linear model, a complete quadratic model or a complete third order model? (Hint: You can use adjusted multiple coefficient of determination, ?2 to determine the optimal? model. Your answers below must be accompanied by appropriate computation in Excel)?2 value for the linear model = ________________ ?2 value for the quadratic model...
6.25 LAB: Swapping variables Write a program whose input is two integers and whose output is...
6.25 LAB: Swapping variables Write a program whose input is two integers and whose output is the two integers swapped. Ex: If the input is: 3 8 The output is: 8 3 Your program must define and call the following function. swap_values() returns the two values in swapped order. def swap_values(user_val1, user_val2) **in Python, please
The below provides information about the dataset. Input variables (based on physicochemical tests) Fixed acidity Numeric...
The below provides information about the dataset. Input variables (based on physicochemical tests) Fixed acidity Numeric Volatile acidity Numeric Citric acid Numeric Residual sugar Numeric Chlorides Numeric Free sulfur dioxide Numeric Total sulfur dioxide Numeric Density Numeric pH Numeric Sulphates Numeric Alcohol Numeric (%) Wine Type red or white Output variable (based on sensory data) Quality Score between 0 and 10 in ordinal Data set fixed acidity volatile acidity citric acid residual sugar chlorides free sulfur dioxide total sulfur dioxide...
For the following system provide the input and output variables. Justify. *One Gadget in your home...
For the following system provide the input and output variables. Justify. *One Gadget in your home *An example of a socio-economic system *One system that you are using as a part of your work
5. Suppose that a data scientist have 200 observations, 300 input variables, and a categorical output...
5. Suppose that a data scientist have 200 observations, 300 input variables, and a categorical output variable. Here is his/her analysis procedure: Step 1. Find a small subset of good predictors that show fairly strong (univariate) association with the output variable. Step 2. Using the input variables selected in Step 1, build LDA (linear discriminant analysis) and QDA (quadratic discriminant analysis) models. Step 3. Perform 5-fold cross-validation for both LDA and QDA models with input variables selected in Step 1....
5. Suppose that a data scientist have 200 observations, 300 input variables, and a categorical output...
5. Suppose that a data scientist have 200 observations, 300 input variables, and a categorical output variable. Here is his/her analysis procedure: Step 1. Find a small subset of good predictors that show fairly strong (univariate) association with the output variable. Step 2. Using the input variables selected in Step 1, build LDA (linear discriminant analysis) and QDA (quadratic discriminant analysis) models. Step 3. Perform 5-fold cross-validation for both LDA and QDA models with input variables selected in Step 1....
Given the production function Y = AK0.3L0.7 if output grows by five percent, the capital input...
Given the production function Y = AK0.3L0.7 if output grows by five percent, the capital input grows by five percent, and the labor input grows by two percent, calculate the Solow residual growth. Calculate the growth rates of output per worker and capital per worker.
Consider 2-bit input functions that has either constant output (0 or 1) or balanced output (the...
Consider 2-bit input functions that has either constant output (0 or 1) or balanced output (the number of {0,1} outputs are equal to each other). First a class is selected with1/2 probability and then one of the functions that belong to the chosen class is selected uniform randomly. We would like to guess the class of the function with minimum function evaluations. • What is the average number of guesses before we identify the chosen class using classical bits? •...
Consider the following data for two variables, and
Consider the following data for two variables,  and . 7 30 21 18 25   10 27 23 16 21   a. Develop an estimated regression equation for the data of the form . Comment on the adequacy of this equation for predicting . Enter negative value as negative number. The regression equation is     s= (to 3 decimals)   R2= % (to 1 decimal)   R Adjusted= % (to 1 decimal)   Analysis of Variance SOURCE DF SS(to 2...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT