Question

The following scenarios are drawn from real research articles. Imagine you were the one conducting these studies. Which type of statistical test would you use in each of the following situations, and why?

  1. In a study from 2009, researchers set out to examine whether call light use rate and average call light response time contribute to patient fall rates and injurious fall rates in acute care settings. As part of this study they compared the average call light response time between patients in medical, surgical, combined medical-surgical, and other settings.

     a. What test would the authors have run to determine whether there was a difference in call light response times between these groups? Why?

  2. While effective communication is critical during the handover of patients between hospital shifts, to date (at least in 2013 when this study was conducted) there is no standard handover protocol. In one study of 56 ICU nurses in a large-scale Iranian teaching hospital, nurses were trained to use a standard protocol tool. Their adherence to (or deviation from) standards and protocols deemed vital to patient outcomes was assessed with a 20-point scale called the Nurses’ Safe Practice Evaluation Checklist (NSPEC) before and after the nurses were trained to use the standard protocol tool for handing over patients between shifts.

     a. What test would the authors run to determine whether nurses’ mean score on the NSPEC increased significantly after training with the standard protocol tool? Why?

Solutions

Expert Solution

Today statistics provides the basis for inference in most medical research. Yet, for want of exposure to statistical theory and practice, it continues to be regarded as the Achilles heel by all concerned in the loop of research and publication – the researchers (authors), reviewers, editors and readers.

Most of us are familiar to some degree with descriptive statistical measures such as those of central tendency and those of dispersion. However, we falter at inferential statistics. This need not be the case, particularly with the widespread availability of powerful and at the same time user-friendly statistical software. As we have outlined below, a few fundamental considerations will lead one to select the appropriate statistical test for hypothesis testing. However, it is important that the appropriate statistical analysis is decided before starting the study, at the stage of planning itself, and the sample size chosen is optimum. These cannot be decided arbitrarily after the study is over and data have already been collected.

The great majority of studies can be tackled through a basket of some 30 tests from over 100 that are in use. The test to be used depends upon the type of research question being asked. The other determining factors are the type of data being analyzed and the number of groups or data sets involved in the study. The following schemes, based on five generic research questions, should help.[1]

Question 1: Is there a difference between groups that are unpaired? Groups or data sets are regarded as unpaired if there is no possibility of the values in one data set being related to or influenced by the values in the other data sets. Different tests are required for quantitative (numerical) data and qualitative (categorical) data, as shown in Fig. 1. For numerical data, it is important to decide whether they follow a normal (Gaussian) distribution, in which case parametric tests are applied. If the distribution of the data is not normal, or if one is not sure about the distribution, it is safer to use non-parametric tests. When comparing more than two sets of numerical data, a multiple group comparison test such as one-way analysis of variance (ANOVA) or the Kruskal-Wallis test should be used first. Only if it returns a statistically significant p value (usually meaning p < 0.05) should it be followed by a post hoc test to determine between exactly which two data sets the difference lies. Repeatedly applying the t test or its non-parametric counterpart, the Mann-Whitney U test, to a multiple group situation increases the possibility of incorrectly rejecting the null hypothesis.

Figure 1

Tests to address the question: Is there a difference between groups – unpaired (parallel and independent groups) situation?
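As an illustration of Question 1, the first scenario in the question (call light response times across medical, surgical, combined medical-surgical, and other settings) involves four independent groups of numerical data, which is the situation where one-way ANOVA is used in place of repeated t tests. The sketch below uses only the Python standard library; the response times are made-up numbers, not data from the study, and the function name `one_way_anova_f` is hypothetical. It computes the F statistic only; a p value would come from the F distribution with the returned degrees of freedom (for example via a statistics package).

```python
from math import comb
from statistics import mean

def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of independent samples."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    # Variation of the group means around the grand mean
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of the observations around their own group mean
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between = k - 1
    df_within = n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Hypothetical response times in minutes (illustrative only)
medical = [4.0, 5.0, 6.0]
surgical = [6.0, 7.0, 8.0]
combined = [8.0, 9.0, 10.0]
other = [6.0, 7.0, 8.0]

f_stat, df_between, df_within = one_way_anova_f([medical, surgical, combined, other])

# Why not run every pairwise t test instead? With 4 groups there are 6 pairs,
# and if each test used alpha = 0.05 independently, the chance of at least one
# false positive would be roughly 1 - 0.95**6.
n_pairs = comb(4, 2)
familywise_error = 1 - (1 - 0.05) ** n_pairs

print(f"F({df_between}, {df_within}) = {f_stat:.2f}")
print(f"{n_pairs} pairwise tests -> family-wise error of about {familywise_error:.3f}")
```

If the response times were clearly non-normal, the same four-group layout would point to the Kruskal-Wallis test instead, as the paragraph above describes.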

Question 2: Is there a difference between groups which are paired? Pairing signifies that data sets are derived by repeated measurements (e.g. before-after measurements or multiple measurements across time) on the same set of subjects. Pairing will also occur if subject groups are different but values in one group are in some way linked or related to values in the other group (e.g. twin studies, sibling studies, parent-offspring studies). A crossover study design also calls for the application of paired group tests for comparing the effects of different interventions on the same subjects. Sometimes subjects are deliberately paired to match baseline characteristics such as age, sex, severity or duration of disease. A scheme similar to Fig. 1 is followed in paired data set testing, as outlined in Fig. 2. Once again, multiple data set comparison should be done through appropriate multiple group tests followed by post hoc tests.

Figure 2

Tests to address the question: Is there a difference between groups – paired situation?
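The second scenario in the question (NSPEC scores for the same nurses before and after training) is a before-after design on the same subjects, so it falls under the paired scheme; for roughly normal score differences the usual choice is the paired t test. The following is a minimal sketch using only the Python standard library. The scores are invented for illustration (the study had 56 nurses; four are shown here), and `paired_t` is a hypothetical helper name. It returns the t statistic and degrees of freedom; the p value would be read from the t distribution.

```python
from math import sqrt
from statistics import mean, stdev

def paired_t(before, after):
    """Return (t, df) for a paired t test on after-minus-before differences."""
    if len(before) != len(after):
        raise ValueError("paired samples must have the same length")
    diffs = [a - b for a, b in zip(after, before)]
    n = len(diffs)
    # Standard error of the mean difference
    se = stdev(diffs) / sqrt(n)
    return mean(diffs) / se, n - 1

# Hypothetical NSPEC scores (illustrative only), one pair per nurse
before_training = [10, 11, 12, 13]
after_training = [11, 13, 15, 15]

t_stat, df = paired_t(before_training, after_training)
print(f"t({df}) = {t_stat:.3f}")
```

If the differences were clearly non-normal, the non-parametric counterpart for paired data would be the Wilcoxon signed-rank test.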

Question 3: Is there any association between variables? The various tests applicable are outlined in Fig. 3. It should be noted that the tests meant for numerical data are for testing the association between two variables. These are correlation tests and they express the strength of the association as a correlation coefficient. An inverse correlation between two variables is depicted by a minus sign. All correlation coefficients vary in magnitude from 0 (no correlation at all) to 1 (perfect correlation). A perfect correlation may indicate but does not necessarily mean causality. When two numerical variables are linearly related to each other, a linear regression analysis can generate a mathematical equation, which can predict the dependent variable based on a given value of the independent variable.[2] Odds ratios and relative risks are the staple of epidemiologic studies and express the association between categorical data that can be summarized as a 2 × 2 contingency table. Logistic regression is actually a multivariate analysis method that expresses the strength of the association between a binary dependent variable and two or more independent variables as adjusted odds ratios.

Figure 3

Tests to address the question: Is there an association between variables?
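The association measures named under Question 3 can be sketched in a few lines. The example below, assuming made-up data and hypothetical function names, computes a Pearson correlation coefficient with the matching least-squares regression line for two numerical variables, and an odds ratio and relative risk from a 2 × 2 table. It is an illustration of the definitions, not a substitute for a statistics package, and it does not compute confidence intervals or p values.

```python
from math import sqrt

def pearson_and_line(x, y):
    """Return (r, slope, intercept) for paired numerical observations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    r = sxy / sqrt(sxx * syy)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return r, slope, intercept

def odds_ratio_and_relative_risk(a, b, c, d):
    """2 x 2 table: a = exposed with outcome, b = exposed without,
    c = unexposed with outcome, d = unexposed without."""
    odds_ratio = (a * d) / (b * c)
    relative_risk = (a / (a + b)) / (c / (c + d))
    return odds_ratio, relative_risk

# Hypothetical data (illustrative only): a perfectly inverse linear relation
x = [1, 2, 3, 4]
y = [8, 6, 4, 2]
r, slope, intercept = pearson_and_line(x, y)

# Hypothetical 2 x 2 table: 20/100 exposed and 10/100 unexposed have the outcome
odds_ratio, relative_risk = odds_ratio_and_relative_risk(20, 80, 10, 90)

print(f"r = {r:.2f}, y = {intercept:.1f} + ({slope:.1f}) * x")
print(f"OR = {odds_ratio:.2f}, RR = {relative_risk:.2f}")
```

The minus sign on r reflects the inverse correlation mentioned above, while its magnitude of 1 reflects the perfectly linear made-up data.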

Question 4: Is there agreement between data sets? This can be a comparison of a new screening technique against the standard test, a new diagnostic test against the available gold standard, or agreement between the ratings or scores given by different observers. As seen from Fig. 4, agreement between numerical variables may be expressed quantitatively by the intraclass correlation coefficient or graphically by constructing a Bland-Altman plot in which the difference between two variables x and y is plotted against the mean of x and y. In the case of categorical data, Cohen’s kappa statistic is frequently used, with kappa (where 0 indicates agreement no better than chance and 1 indicates perfect agreement) indicating strong agreement when it is > 0.7. It is inappropriate to infer agreement by showing that there is no statistically significant difference between means or by calculating a correlation coefficient.

Figure 4

Tests to address the question: Is there an agreement between assessment (screening / rating / diagnostic) techniques?
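To make the kappa statistic from Question 4 concrete, here is a small sketch for two raters classifying the same subjects. The ratings are invented and `cohens_kappa` is a hypothetical helper; the calculation follows the standard definition, kappa = (observed agreement − chance agreement) / (1 − chance agreement).

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters assigning categories to the same items."""
    if len(rater_a) != len(rater_b):
        raise ValueError("both raters must rate the same items")
    n = len(rater_a)
    # Proportion of items on which the raters agree
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's category frequencies
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    return (observed - expected) / (1 - expected)

# Hypothetical ratings (illustrative only): the raters agree on 8 of 10 items
rater_a = ["pos", "pos", "pos", "pos", "pos", "neg", "neg", "neg", "neg", "neg"]
rater_b = ["pos", "pos", "pos", "pos", "neg", "neg", "neg", "neg", "neg", "pos"]

kappa = cohens_kappa(rater_a, rater_b)
print(f"kappa = {kappa:.2f}")
```

In this made-up example the raw agreement is 80%, but half of that would be expected by chance alone, so kappa falls below the 0.7 threshold mentioned above.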

Question 5: Is there a difference between time-to-event trends or survival plots? This question is specific to survival analysis[3] (the endpoint for such analysis could be death or any event that can occur after a period of time), which is characterized by censoring of data, meaning that a sizeable proportion of the original study subjects may not reach the endpoint in question by the time the study ends. Data sets for survival trends are always considered to be non-parametric. If there are two groups, the applicable tests are the Cox-Mantel test, Gehan’s (generalized Wilcoxon) test or the log-rank test. In the case of more than two groups, Peto and Peto’s test or the log-rank test can be applied to look for a significant difference between time-to-event trends.
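The two-group log-rank test mentioned above can be sketched as follows. At each distinct event time the observed number of events in one group is compared with the number expected if both groups shared the same hazard, and the summed differences are scaled by their variance to give a chi-square statistic with 1 degree of freedom. The follow-up times are invented, an event flag of 0 marks a censored subject, and `logrank_chi2` is a hypothetical function name; this is a minimal standard-library sketch rather than a validated implementation.

```python
def logrank_chi2(times_a, events_a, times_b, events_b):
    """Two-group log-rank chi-square statistic (1 degree of freedom).

    events_* hold 1 if the endpoint was observed and 0 if the subject was censored.
    """
    data = [(t, e, 0) for t, e in zip(times_a, events_a)]
    data += [(t, e, 1) for t, e in zip(times_b, events_b)]
    event_times = sorted({t for t, e, _ in data if e})

    observed_minus_expected = 0.0
    variance = 0.0
    for t in event_times:
        # Subjects still at risk just before time t, per group
        n_a = sum(1 for ti, _, g in data if ti >= t and g == 0)
        n_b = sum(1 for ti, _, g in data if ti >= t and g == 1)
        # Events occurring exactly at time t
        d_a = sum(1 for ti, e, g in data if ti == t and e and g == 0)
        d_b = sum(1 for ti, e, g in data if ti == t and e and g == 1)
        n, d = n_a + n_b, d_a + d_b
        observed_minus_expected += d_a - d * n_a / n
        if n > 1:
            variance += d * (n_a / n) * (n_b / n) * (n - d) / (n - 1)
    return observed_minus_expected ** 2 / variance

# Hypothetical follow-up times in months (illustrative only);
# the last subject in each group is censored
times_a, events_a = [1, 2, 5], [1, 1, 0]
times_b, events_b = [3, 4, 6], [1, 1, 0]

chi2 = logrank_chi2(times_a, events_a, times_b, events_b)
print(f"log-rank chi-square = {chi2:.3f} on 1 degree of freedom")
```

Censored subjects contribute to the at-risk counts up to their last follow-up time but never count as events, which is how the test accommodates incomplete follow-up.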

It can be appreciated from the above outline that distinguishing between parametric and non-parametric data is important. Tests of normality (e.g. Kolmogorov-Smirnov test or Shapiro-Wilk goodness of fit test) may be applied rather than making assumptions. Some of the other prerequisites of parametric tests are that samples have the same variance i.e. drawn from the same population, observations within a group are independent and that the samples have been drawn randomly from the population.
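As a rough illustration of a normality check, the sketch below computes the Kolmogorov-Smirnov distance between a sample and a normal distribution fitted to that sample, using only the Python standard library. The data are invented and `ks_statistic_normal` is a hypothetical name. One caveat is an assumption worth stating plainly: because the mean and standard deviation are estimated from the same data, the usual Kolmogorov-Smirnov critical values do not apply directly (the Lilliefors correction addresses this), so the code reports the distance only and does not attempt a p value.

```python
from statistics import NormalDist, mean, stdev

def ks_statistic_normal(sample):
    """Largest gap between the empirical CDF and a normal CDF fitted to the sample."""
    xs = sorted(sample)
    n = len(xs)
    fitted = NormalDist(mean(xs), stdev(xs))
    distance = 0.0
    for i, x in enumerate(xs, start=1):
        cdf = fitted.cdf(x)
        # Compare the fitted CDF with the empirical CDF just after and just before x
        distance = max(distance, i / n - cdf, cdf - (i - 1) / n)
    return distance

# Hypothetical samples (illustrative only)
symmetric = [1, 2, 3, 4, 5, 6, 7, 8, 9]
skewed = [1, 1, 1, 1, 2, 2, 3, 5, 20]

d_symmetric = ks_statistic_normal(symmetric)
d_skewed = ks_statistic_normal(skewed)
print(f"symmetric sample: D = {d_symmetric:.3f}")
print(f"skewed sample:    D = {d_skewed:.3f}")
```

A larger distance means the sample departs further from the fitted normal curve, which would favor the non-parametric branch of the schemes above.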

A one-tailed test calculates the probability of deviation from the null hypothesis in a specific direction, whereas a two-tailed test calculates the probability of deviation from the null hypothesis in either direction. When Intervention A is compared with Intervention B in a clinical trial, the null hypothesis assumes there is no difference between the two interventions. Deviation from this hypothesis can occur in favor of either intervention in a two-tailed test, but in a one-tailed test it is presumed that only one intervention can show superiority over the other. Although for a given data set a one-tailed test will return a smaller p value than a two-tailed test (provided the observed effect lies in the hypothesized direction), the latter is usually preferred unless there is a watertight case for one-tailed testing.
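The relationship between one-tailed and two-tailed p values can be seen with a short calculation. The sketch below assumes, purely for simplicity, a test statistic that follows the standard normal distribution (a z statistic) and a one-tailed alternative that predicts a positive effect; `p_values_from_z` is a hypothetical helper, and the value 1.96 is chosen only because it is the familiar two-tailed 5% cutoff.

```python
from statistics import NormalDist

def p_values_from_z(z):
    """Return (one_tailed, two_tailed) p values for a standard normal statistic.

    The one-tailed value assumes the alternative hypothesis predicts z > 0.
    """
    standard_normal = NormalDist()
    one_tailed = 1.0 - standard_normal.cdf(z)
    two_tailed = 2.0 * (1.0 - standard_normal.cdf(abs(z)))
    return one_tailed, two_tailed

# Effect in the hypothesized direction
one_tailed, two_tailed = p_values_from_z(1.96)
# Same size of effect, but in the opposite direction
opposite_one_tailed, opposite_two_tailed = p_values_from_z(-1.96)

print(f"z = +1.96: one-tailed p = {one_tailed:.4f}, two-tailed p = {two_tailed:.4f}")
print(f"z = -1.96: one-tailed p = {opposite_one_tailed:.4f}, two-tailed p = {opposite_two_tailed:.4f}")
```

When the effect is in the predicted direction the one-tailed p value is half the two-tailed one, but an equally large effect in the other direction is effectively invisible to the one-tailed test, which is one reason two-tailed testing is usually preferred.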

It is obvious that we cannot refer to all statistical tests in one editorial. However, the schemes outlined will cover the hypothesis testing demands of the majority of observational as well as interventional studies. Finally, one must remember that there is no substitute for actually working hands-on with dummy or real data sets, and for seeking the advice of a statistician, in order to learn the nuances of statistical hypothesis testing.

