Question

In: Statistics and Probability

Q5. [20] We would like to determine whether any association exists between the survival time and...

Q5. [20] We would like to determine whether any association exists between the survival time and level of water toxicity, region and age of the patients.

Survival time is coded as 1 if < 1 month, 2 = 1-3 months, and 3 = more than 3 months.

Survival

Region

Toxic Level

Age

1

1

62.00

67

1

2

46.00

72

2

1

48.50

56

3

2

32.00

35

2

1

63.50

60

1

1

41.25

65

2

2

40.00

45

3

1

34.25

40

2

1

34.75

54

1

2

46.25

63

2

1

43.50

60

2

2

46.00

55

3

1

72.50

29

1

2

53.00

89

1

2

43.50

75

1

1

56.00

59

2

1

40.00

51

3

2

48.00

51

2

1

46.50

61

2

2

72.00

57

3

2

31.00

42

1

1

48.00

61

2

2

36.50

57

2

2

43.75

55

2

1

34.25

61

2

1

41.25

47

3

1

38.00

52

2

2

59.00

55

2

1

52.50

81

3

1

57.50

35

  1. [14] Fit an appropriate model to this data and test the goodness of fit.
  2. [6] Do toxic level, region and age have any significant impact on the survival time?

Solutions

Expert Solution

Hello,

Here, according to the given dataset, we observe that our dependent variable, survival is ordinal in nature, Hence the most appropriate model to apply to this dataset for the required analysis is the Ordinal Logistic Regression Model. We will be using the R programming language for our analysis of the given dataset:

Note: Copy the dataset into an excel sheet, and save it in a comma-delimited format(.csv) first

R codes:

install.packages("MASS")

library(MASS)

data1=read.csv(file.choose(), header=TRUE) # Acess the dataset from the destined location

attach(data1)

surv=as.ordered(survival)

model= polr(surv~region+toxic_level+age, data=data1, Hess=TRUE)

ctable=coef(summary(model))

p=pnorm(abs(table[,"t value"]), lower.tail=FALSE)*2

ctable=cbind(table, "p value"=p)

Results and Interpretation:

From the above test, we obtained a value of Residual Deviance of 34.91346 and the corresponding values of p for the independent variables of the region, toxic_level, and age are 0.93228, 0.67567, 0.00065 and the corresponding values of the coefficients are: -0.07708, -0.01901, 0.23033.

a] We have used an Ordinal Logistic Regression Model and as per the value of the DEVIANCE is concerned, it is a small value, indicating that our model is a Good Fit to the given dataset.

b] As per the values of coefficients and p values are concerned, The variable AGE has the most significant impact upon the survival time when compared to the region and toxic_level factors, given in our dataset

Thank You ...


Related Solutions

The manager of a telemarketing call centre wishes to determine whether an association exists between the...
The manager of a telemarketing call centre wishes to determine whether an association exists between the communicating time of its employees and the level of stress-related problems observed on the job. A study of 100 employees reveals the following: stress stress stress Commuting time (High) (Moderate) (low) 20 mins and under 15 12 33 over 20 mins 25 8 7 At the 5% level of significance, is there evidence of a significant relationship between commuting time and stress?
. In a study to determine whether an association exists between maternal rubella and congenital cataracts,...
. In a study to determine whether an association exists between maternal rubella and congenital cataracts, samples of 20 children with congenital cataracts and 25 children without congenital cataracts were selected. The mother of each child was asked whether she had rubella while carrying the child. The data are given below. Assume that all z-based methods are valid. RUBELLA CATARACTS Frequency Row Pct 1_YES 2_NO Total 1_YES 14 58.33 10 41.67 24 2_NO 6 28.57 15 71.43 21 Total 20...
In a study to determine whether an association exists between maternal rubella and congenital cataracts, samples...
In a study to determine whether an association exists between maternal rubella and congenital cataracts, samples of 20 children with congenital cataracts and 25 children without congenital cataracts were selected. The mother of each child was asked whether she had rubella while carrying the child. The data are given below. Assume that all z-based methods are valid. RUBELLA CATARACTS Frequency Row Pct 1_YES 2_NO Total 1_YES 14 58.33 10 41.67 24 2_NO 6 28.57 15 71.43 21 Total 20 25...
A psychologist would like to determine whether there is any consistent relationship between intelligence and creativity.  The...
A psychologist would like to determine whether there is any consistent relationship between intelligence and creativity.  The psychologist obtains a random sample of n = 18 people and administers a standardized IQ test and a creativity test to each individual.  Using these data, the psychologist obtains a Pearson correlation of r = +.20 between IQ and creativity. a.  Do the sample data provide sufficient evidence to conclude that a real (non-zero)      correlation exists in the population?  Test at the .05 level of significance. b.  If...
In an effort to determine whether any correlation exists between the price of stocks of airlines,...
In an effort to determine whether any correlation exists between the price of stocks of airlines, an analyst sampled six days of activity of the stock market. Using the following prices of Delta stock and Southwest stock, compute the coefficient of correlation. Stock prices have been rounded off to the nearest tenth for ease of computation. Delta Southwest 47.6 15.1 46.4 15.4 50.6 16.1 52.6 15.6 52.4 16.4 53.2 18.1
A research conducts a study to determine whether there is an association between time students spent...
A research conducts a study to determine whether there is an association between time students spent on revising the exam and level of level of exam anxiety among undergraduate. The researcher recruited 10 nursing students. To help the researcher, you need to do the hand calculation and then you use the SPSS Time spent revision Exam anxiety .40 10 1.30 9 2.15 8.5 3.40 6 4.30 1.5 4.00 3 3.45 5 2.30 7 2.00 8.5 1.15 9
In order to determine whether there was a difference in the survival rate between females and...
In order to determine whether there was a difference in the survival rate between females and males, a two-sample proportion test was applied. The following is the output for the test with some entries missing: Two sample proportion hypothesis test: p1 : Proportion of successes (Success = Survived) for Survival where Gender=Female p2 : Proportion of successes (Success = Survived) for Survival where Gender=Male p1 - p2 : Difference in proportions H0 : p1 - p2 = 0 HA :...
A psychologist would like to determine whether there is a relation between depression and aging. It...
A psychologist would like to determine whether there is a relation between depression and aging. It is known that the general population averages μ = 40 on a standardized depression test. The psychologist obtains a sample of n = 9 individuals who are all more than 70 years old. The depression scores for this sample are as follows. 50, 47, 41, 49, 44, 42, 43, 47, 48 On the basis of this sample, can the psychologist conclude that depression for...
Dr. Rueckert would like to determine whether the time and day a class is held at...
Dr. Rueckert would like to determine whether the time and day a class is held at NEIU makes a difference in how much students learn.  To study this, she decides to compare three different SRM II classes:  one held at 9:25 am on Tuesdays and Thursdays, one held at 5:40 pm on Tuesdays and Thursdays, and one held at 8 am on Saturday.  At the end of the semester, she gives a test of statistical concepts to all 3 classes. Another professor points...
You are conducting a case control study to determine if an association exists between melanoma and...
You are conducting a case control study to determine if an association exists between melanoma and indoor tanning. From a statewide cancer registry, you identify 1,107 people who were diagnosed with melanoma during the last three years. You select 1,500 controls for your study. Through a follow-up survey, you find that 696 of those with melanoma had a history of indoor tanning. 48.2% of the control group reported no exposure to indoor tanning. Part I: Calculate the odds ratio. Part...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT