Question

In: Statistics and Probability

f) Discuss the following concepts as used in categorical data modeling Multinomial sampling vs binomil sampling...

f) Discuss the following concepts as used in categorical data modeling

  • Multinomial sampling vs binomil sampling
  • Poisson sampling process vs probality proportional to size sampling
  • Goodness of fit test vs Test of association
  • Relative risk vs odds ratio

Solutions

Expert Solution

a) - multinomial sampling may be considered as a generalization of binomial sampling. Data are collected on a pre-determined number of individuals that is units and classified according to levels of of a categorical variable of interest. - when the data are collected on a pre-determined number of units and are then classified according to two levels of a categorical variable, a binomial sampling emerges.

b) -poission sampling is a sampling process where each element of population is subjected to an independent Bernoulli trial which determines whether the element becomes part of the sample each element of the population may have a different probability of being included in the sample. If all first order inclusion probability are equal -poission sampling become equivalent to Bernoulli sampling which can therefore be considered to be a special case of poission sampling.

- probability proportional to size sampling is a method of sampling from a finite population in which a size measures is available for each population unit before sampling and where the probability of selecting a unit is proportional to size. It's use arises in two particular context multistage sampling and single stage sampling.

c) - the goodness of fit test is a statistical hypothesis test to see how well sample data fit a distribution from population with normal distribution. Goodness of fit establishes the discrepancy between the observed values and those that would be expected of the model in a normal distribution case. - test of association in statistic is usually means the chi-square test. It's used in informal sense to mean many things. Any time you are trying to find out if two variables are linked in some way, you are testing for association. The test of association involves cochran- mantel-haenszel test, fisher's exact test, gamma coffecient test.

d) odds ratio and relative risk are often confused dispute being unique concepts the basic difference is that the odds ratio is a ratio of two odds where as the relative risks is a ratio of two probabilities .

- the relative risk of an event is the likelyhood of its occurrence after exposure to a risk variable as comapre with the likelyhood of its occurrence in a control or reference group. The relative risk is estimated as the absolute risk with the risk variable divided by the absolute risk in the control group.   

- the odd ratio is a comparison of the odds of an event after exposure to a risk factor with the odds of that event. In a control or reference situation. The odd ratio is estimated as the odds of an event other the exposure group divided by the odds of that event in the control or reference book. The result is expressed as a ratio to denominator 1.


Related Solutions

Multinomial logistic regression can be used on: a)Categorical predictor variables only. b)Both categorical and continuous predictor...
Multinomial logistic regression can be used on: a)Categorical predictor variables only. b)Both categorical and continuous predictor variables. c)Continuous predictor variables only. d)Ordinal predictor variables only.
What are the statistics used to summarize categorical data?
What are the statistics used to summarize categorical data?
discuss how sampling distributions are used in inference.
discuss how sampling distributions are used in inference.
Discuss how sampling distributions are used for inference
Discuss how sampling distributions are used for inference
Data Modeling and Database Design (Database Concepts, Eighth Edition) The relational model is the most important...
Data Modeling and Database Design (Database Concepts, Eighth Edition) The relational model is the most important standard in database processing today. Why do you feel that this model has continued to be successful in the world of IT? What would happen if large corporations decided to reject this theory, and store their data using a non-relational model? Are there any success stories where this has happened?
Discuss the following concepts: Debt vs. Equity financing. The Weighted Average Cost of capital. Why might...
Discuss the following concepts: Debt vs. Equity financing. The Weighted Average Cost of capital. Why might preferred stock cost have a higher cost for a company than common stock?
Question 2 Dummy variables can be used to represent categorical data ___ a) only when the...
Question 2 Dummy variables can be used to represent categorical data ___ a) only when the categorical data is used as a response variable b) only when the categorical data is used as an explanatory variable c) when the categorical is used as either the response or explanatory variable d) Dummy variables can never be used to represent categorical data Question 3 Consider the following OLS regression equation: predicted y = b0 + b1X1 + b2d. The "X1" refers to...
In non-probability sampling, the sample means can be used to estimate population means. T or F...
In non-probability sampling, the sample means can be used to estimate population means. T or F when developing a questionnaire, researchers should avoid: Spending too many resources Targeting busy areas Simple questions Making assumptions If Don t-Know response is legitimate, it should be kept as a separate reply category. T or F
1.   Define each of the following fundamental concepts Interpreted vs. Compiled Languages Interpretive Overhead Scripting (Procedural) vs....
1.   Define each of the following fundamental concepts Interpreted vs. Compiled Languages Interpretive Overhead Scripting (Procedural) vs. Object Orientation vs. Logic Programing vs. Event Driven Programming Paradigms Programming Libraries/Modules Command Line (Prompt) ASCII Functions Control Flow Hashes/Dictionaries List Comprehension True or False 2.   A dictionary is a random-access data structure. 3.   An array is a sequential access data structure. 4.   Using functions to isolate code is a good programming practice. 5.   Lexicographical ordering starts by comparing the first letters of two strings to determine the...
From the following data, construct 3 graphs: [A] vs t, ln[A] vs t, and 1/[A] vs...
From the following data, construct 3 graphs: [A] vs t, ln[A] vs t, and 1/[A] vs t a) determine the order of the reaction from the graphs. b) from the one graph that is linear, determine the rate constant, k. You do not need to calculate the slopes of the nonlinear graphs. time in seconds [A] in (M) 0 1 25 0.914 50 0.829 75 0.744 100 0.659 125 0.573 150 0.488 175 0.403 200 0.318
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT