Question

In: Statistics and Probability

I have a statistics question: An orange juice processing plant has three production lines. The production...

I have a statistics question:

An orange juice processing plant has three production lines. The production lines fill juice into 400 ml packages. The production manager of the plant would like to know if the production lines are all filling the packages the same amount.

A sample of 25 packages from each production line are taken and the data is saved in juice.csv. The production manager would like to know:

“are the three production lines filling packages with the same amount of juice, on average?”.

The production manager already attempted to answer their question by applying a one-way ANOVA to the data. This resulted in a test where the assumption of normality failed. This was unable to be corrected by transforming the response variable. Use another method to try and answer the production managers question. The production manager is only interested in whether a difference exists, not where the differences are.

Test at the 5% significance level.

An orange juice processing plant has three production lines. The production lines fill juice into 400 ml packages. The production manager of the plant would like to know if the production lines are all filling the packages the same amount.

A sample of 25 packages from each production line are taken and the data is saved in juice.csv. The production manager would like to know:

“are the three production lines filling packages with the same amount of juice, on average?”.

The production manager already attempted to answer their question by applying a one-way ANOVA to the data. This resulted in a test where the assumption of normality failed. This was unable to be corrected by transforming the response variable. Use another method to try and answer the production managers question. The production manager is only interested in whether a difference exists, not where the differences are.

Test at the 5% significance level.

What test should I use I am not sure, I am using Rstudio what would the process be? What code would I run?

Expert Solution

Solution :

Answer : Use Kruskal-wallis Rank Sum Test to test whether the three production lines fill packages with the same amount of packages, on average.

Explanation : Process for conducting test using R is as follows:

Here the manager wants to know if there is a significant difference between the means of three production lines.

Here in fruit.csv we have been given 25 samples of each production line . Here the assumption of normality has been violated.

Therefore the assumption of normality required for one way anova has failed. Thus the distribution of data in fruit.csv is non-parametric.

The non-parametric alternative to One Way ANOVA is the Kruskal-Wallis Rank sum Test.

kruskal.test function in r is used to test equality between means or averages of different groups in same as in one way anova but the difference here is kruskal test is used when the data is non parametric.

Kruskal Wallis Test in R is computed as follows :

Here there are three production lines hence we have 3 samples of size 25 each.

Step 1 : Read the data file in r and store it in fruit_data

Code :

fruit_data=read.table("FileLocation/fruit.csv",sep=",",header=T);fruit_data

attach(fruit_data)

Once the data is loaded in R software we can use the Kruskal wallis test.

Step 2: compute the Kruskal-Wallis Test

Code:

kruskal.test(response~group),data=fruit_data)

where,

response is the values of the juice filled into 400 ml packages from different production lines

group are the three production lines, thus having 3 groups

data is the data from the fruit.csv file provided.

Step 3: Interpretating the result on basis of p-value obtained .

After running the code provided in Step 2 we get the chi-squared and p-value.

chi-squared is the value of the test statistics and the

p-value is the value calculated at 5% level of significance

Now if the p-value is less than 0.05 we reject the null hypothesis and conclude that the means of three different production lines differ significantly.

Or if the p-value is greater than 0.05 then we accept the null hypothesis and conclude that the three production lines fill the packages with same amount of juice on average.

When p-value is greater than 0.05 we can say that there is no statistically significant difference between the means or averages of the three probuction lines.

[ Here native stats package fron r library is used]

____________________________________________________________________

orchestra answered 1 year ago

An orange juice processing plant has three production lines. The production lines fill juice into 400...

An orange juice processing plant has three production lines. The production lines fill juice into 400 ml packages. The production manager of the plant would like to know if the production lines are all filling the packages the same amount. A sample of 25 packages from each production line are taken and the data is saved in juice.csv. The production manager would like to know: “are the three production lines filling packages with the same amount of juice, on average?”....

Three return steam lines in a chemical processing plant enter a collection tank operating at steady...

Three return steam lines in a chemical processing plant enter a collection tank operating at steady state at 5 bar. Steam enters inlet 1 with flow rate of 2 kg/s and quality of 0.9. Steam enters inlet 2 with flow rate of 2 kg/s at 200°C. Steam enters inlet 3 with flow rate of 1.2 kg/s at 95°C. Steam exits the tank at 5 bar. The rate of heat transfer from the collection tank is 40 kW. Neglecting kinetic and...

Question 1 Jane operates a company called “Fruity Shop”. It sells apple juice, orange juice and...

Question 1 Jane operates a company called “Fruity Shop”. It sells apple juice, orange juice and melon juice in boxes. Jane rents a retail space to sell these items and the cost of goods sold (COGS), selling price and daily demand for each item is given in the table below: Description COGS($) Unit Selling Price($) Demand Per Day Apple Juice $ 10.00 $ 14.00 400 Orange Juice $ 12.00 $ 15.00 300 Melon Juice $ 13.00 $ 16.00 250 The...

An orange juice producer buys oranges from a large orange grove that has one variety of...

An orange juice producer buys oranges from a large orange grove that has one variety of orange. The amount of juice squeezed from these oranges is approximately normally distributed, with a mean of 4.40 ounces and a standard deviation of 0.32 ounce. Suppose that you select a sample of 16 oranges. a. What is the probability that the sample mean amount of juice will be at least 4.27 ounces? b. The probability is 72% that the sample mean amount of...

5. A sample of orange juice is found to have a pH of 3.45. What is...

5. A sample of orange juice is found to have a pH of 3.45. What is the hydroxide concentration in the orange juice? a. 1.03x10-5 b. 10.55x10-11 c. 3.16x10-4 d. 2.818x10-11 6. The Kc for the following reaction is 3.2. What is the Kp at 230 K? PCl5 (g) ↔ PCl3 (g) + Cl2 (g) a. 60.4 b. 1139 c. 30.2 d. 22.8 7. A + B + C à D + E Kc = 1.39 D + E à...

Nestlé’s water bottling plant in rural Mecosta County, Michigan has ten production lines that each have...

Nestlé’s water bottling plant in rural Mecosta County, Michigan has ten production lines that each have a theoretical rated capacity of 1200 bottles per minute. A natural disaster has occurred in Flint Michigan and the plant is now being ran at full capacity 24/7. There are five processes that are involved in the bottling of water: Per line Process Bottles per minute Processing time minutes Molding 1200 0.19 Capping 2600 0.1506 Inspecting 6000 0.019 Labeling 3000 0.025 Printing 2500 0.032...

Operational Statistics Georgia plant: This plant has newer equipment, has higher productivity, employs 100 nonunion production...

Operational Statistics Georgia plant: This plant has newer equipment, has higher productivity, employs 100 nonunion production workers, and ships $12 million in carpets a year; hourly base wage is $16 California plant: California plant employs 80 union production workers and ships $8 million in carpets a year: hourly base wage is $20 Financial implications Savings by closing California plant: (1) increased productivity by 17%; (2) reduce labor cost by 20% (total labor savings would be $1 million per year; see...

The market for orange juice in North Dakota has a demand function of Q = 120...

The market for orange juice in North Dakota has a demand function of Q = 120 – 0.4P and a supply function of Q = 0.4P – 20. Imagine the government sets the price to $125. What is the dead weight loss?

One manufacturer has developed a quantitative index of the "sweetness" of orange juice. (The higher the...

One manufacturer has developed a quantitative index of the "sweetness" of orange juice. (The higher the index, the sweeter the juice). Is there a relationship between the sweetness index and a chemical measure such as the amount ofwater-soluble pectin (parts per million) in the orange juice? Data collected on these two variables for 24 production runs at a juice manufacturing plant are shown in the accompanying table. Suppose a manufacturer wants to use simple linear regression to predict the sweetness...

Henderson Company has three product lines: baked goods, milk and fruit juice, and frozen foods. Company...

Henderson Company has three product lines: baked goods, milk and fruit juice, and frozen foods. Company has experienced net operating losses in its Milk & Fruit Juice line during the last few periods. Company management thinks that the store will improve its profitability if it discontinued the Milk & Fruit Juice line. For product line profitability analysis purposes, currently company is allocating operating expenses as a percentage of sales dollars which approximately is 30% of sales dollar. Sales Revenues $89,250 ...