Question

Describe a fictional example and generate a small data set in which data are heteroscedastic and you need to apply a GLS.

Solutions

Expert Solution

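One example takes a person's monthly income (in dollars) as the independent variable and their monthly food expenses as the dependent variable. In general, as income increases, people tend to spend more on food, but the variability of that spending also increases: a low-income person has little room to vary the food budget, while a high-income person may spend modestly or lavishly. The error variance therefore grows with income, which makes the data heteroscedastic. Ordinary least squares is then inefficient and its standard errors are unreliable, so GLS is the appropriate estimator. The plot shown below illustrates this pattern.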

The R code
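A minimal sketch of such a data set and fit, not the original listing. Everything here is an assumption made for illustration: the seed, the sample size of 30, the income range, the coefficients, and the choice to let the error standard deviation grow in proportion to income. Under that assumed variance structure the error covariance matrix is diagonal, so GLS reduces to weighted least squares with weights 1 / income^2.

```r
# Hypothetical data: monthly income ($) and monthly food expenses ($).
# All values are simulated; seed, n, and coefficients are assumptions.
set.seed(42)
n <- 30
income <- round(runif(n, min = 1000, max = 10000))

# Assumed model: error standard deviation proportional to income,
# which produces heteroscedastic data.
food <- 200 + 0.10 * income + rnorm(n, mean = 0, sd = 0.03 * income)
dat <- data.frame(income = income, food = round(food, 2))

# OLS fit, which ignores the unequal variances.
ols <- lm(food ~ income, data = dat)

# GLS with Var(e_i) proportional to income_i^2 is weighted least squares
# with weights 1 / income^2.
wls <- lm(food ~ income, data = dat, weights = 1 / income^2)

# The same estimate from the GLS formula (X' W X)^{-1} X' W y,
# where W is the inverse of the assumed (diagonal) error covariance.
X <- cbind(1, dat$income)
W <- diag(1 / dat$income^2)
beta_gls <- solve(t(X) %*% W %*% X, t(X) %*% W %*% dat$food)

print(coef(ols))
print(coef(wls))
print(drop(beta_gls))

# Residuals against income: the spread should fan out as income rises.
plot(dat$income, resid(ols),
     xlab = "Monthly income ($)", ylab = "OLS residual")
abline(h = 0, lty = 2)
```

The weighted `lm` fit and the matrix formula should agree up to numerical precision. If the form of the variance were unknown, it would have to be estimated from the data (feasible GLS) rather than fixed in advance as it is here.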

And here is the plot

