How to read variables in SPSS and R for unstandardized and Standardized data
In: Statistics and Probability
Rosetta is an occupational therapist working in a Head Start program. The program serves 4-year-old children at risk of not passing the kindergarten screening test at age 5. One of the tasks on the kindergarten screening test is fine motor control at a level that makes cutting, coloring, and writing (name, alphabet) possible. To assist the children in developing fine motor control, Rosetta assists each child in fine motor exercises for a period of 1/2 hr a day over a period of 6 months. She wonders if the children’s fine motor skills significantly improved as a result of her instruction. Here are each child’s scores on a 10-point fine motor scale before and after instruction. Higher scores indicate more fine motor control:
Subject
Fine Motor Score Before Instruction Fine Motor Score After
Instruction
Quincy 3 5
Rolf 2 6
Stu 4 6
Tevya 5 6
Ulysses 3 3
Victor 4 7
Willy 3 4
Xerxes 2 4
Yekio 3 7
Zack 5 5
(a) Calculate t and compare it to a one-tailed critical t at the .05 level. Did Rosetta’s instruction result in significantly better fine motor control?
(b) The design of this study is not ideal because it is possible that the children’s motor ability would have improved over the 6-month period even without instruction. How could the design of this study be changed in order to eliminate this problem?
In: Statistics and Probability
Suppose you flip a biased coin (that lands heads with probability p) until 2 heads appear. Let X be the number of flips needed for this two happen. Let Y be the number of flips needed for the first head to appear. Find a general expression for the condition probability mass function pY |X(i|n) when n ≥ 2. Interpret your answer, i.e., if the number of flips required for 2 heads to appear is n, what can you say about the arrival of the first head?
In: Statistics and Probability
An environmentalist wants to find out the fraction of oil tankers that have spills each month. Step 2 of 2 : Suppose a sample of 355 tankers is drawn. Of these ships, 299 did not have spills. Using the data, construct the 85% confidence interval for the population proportion of oil tankers that have spills each month. Round your answers to three decimal places.
In: Statistics and Probability
In the following problem, check that it is appropriate to use
the normal approximation to the binomial. Then use the normal
distribution to estimate the requested probabilities.
What are the chances that a person who is murdered actually knew
the murderer? The answer to this question explains why a lot of
police detective work begins with relatives and friends of the
victim! About 66% of people who are murdered actually knew the
person who committed the murder.† Suppose that a detective file in
New Orleans has 62 current unsolved murders. Find the following
probabilities. (Round your answers to four decimal places.)
(a) at least 35 of the victims knew their murderers
(b) at most 48 of the victims knew their murderers
(c) fewer than 30 victims did not know their murderers
(d) more than 20 victims did not know their murderers
In: Statistics and Probability
The ski season started in Mount Sunapee! The ski and snowboard rental shop is located in the ski lodge area and offers a large selection of current snowboards, shaped skis, and helmets for children, women, and men. The winter rental shop is open every day that Mount Sunapee is open for skiing. The rental store maintains a current inventory of quality rental equipment with all rental equipment inventory replaced on a normal rotation basis every three to four years.
The owner of the store wants to understand better the factors that affect the demand for his ski and snowboard rental shop, so he can plan his capacity better. In particular, he wants to know how different factors, as the day of month, weekends, weather or school breaks affect their demand.
You will find data on number of rentals per day for December 2015 in this spreadsheet:
Date | Day | Weekend | Weather (good snow) | School Break | Shaped Ski Rentals | Snowboard Rentals | Helmet Rentals |
01/12/2015 | 1 | 0 | 0 | 0 | 25 | 8 | 15 |
02/12/2015 | 2 | 0 | 0 | 0 | 22 | 5 | 10 |
03/12/2015 | 3 | 0 | 0 | 0 | 26 | 7 | 15 |
04/12/2015 | 4 | 0 | 0 | 0 | 35 | 10 | 20 |
05/12/2015 | 5 | 1 | 0 | 0 | 90 | 25 | 65 |
06/12/2015 | 6 | 1 | 0 | 0 | 85 | 20 | 62 |
07/12/2015 | 7 | 0 | 0 | 0 | 28 | 6 | 15 |
08/12/2015 | 8 | 0 | 0 | 0 | 22 | 5 | 12 |
09/12/2015 | 9 | 0 | 0 | 0 | 25 | 5 | 15 |
10/12/2015 | 10 | 0 | 0 | 0 | 28 | 7 | 17 |
11/12/2015 | 11 | 0 | 0 | 0 | 35 | 8 | 20 |
12/12/2015 | 12 | 1 | 1 | 0 | 120 | 35 | 85 |
13/12/2015 | 13 | 1 | 1 | 0 | 105 | 28 | 95 |
14/12/2015 | 14 | 0 | 1 | 0 | 48 | 15 | 32 |
15/12/2015 | 15 | 0 | 0 | 0 | 25 | 5 | 15 |
16/12/2015 | 16 | 0 | 0 | 0 | 35 | 8 | 21 |
17/12/2015 | 17 | 0 | 1 | 0 | 45 | 12 | 28 |
18/12/2015 | 18 | 0 | 1 | 0 | 60 | 15 | 37 |
19/12/2015 | 19 | 1 | 1 | 0 | 130 | 42 | 90 |
20/12/2015 | 20 | 1 | 1 | 0 | 110 | 32 | 80 |
21/12/2015 | 21 | 0 | 1 | 0 | 70 | 15 | 39 |
22/12/2015 | 22 | 0 | 1 | 0 | 53 | 7 | 30 |
23/12/2015 | 23 | 0 | 1 | 1 | 85 | 18 | 49 |
24/12/2015 | 24 | 0 | 1 | 1 | 90 | 22 | 55 |
25/12/2015 | 25 | 0 | 1 | 1 | 110 | 35 | 70 |
26/12/2015 | 26 | 1 | 1 | 1 | 170 | 45 | 105 |
27/12/2015 | 27 | 1 | 1 | 1 | 190 | 50 | 120 |
28/12/2015 | 28 | 0 | 1 | 1 | 110 | 42 | 75 |
29/12/2015 | 29 | 0 | 1 | 1 | 90 | 35 | 65 |
30/12/2015 | 30 | 0 | 1 | 1 | 75 | 28 | 50 |
Part 1
Let’s start creating a linear regression model of daily shaped ski rentals as a function of the day of month only. Are the regression coefficients statistically significant at p=0.01?
Select the best answer
1.Neither β0 nor β1 are statistically significant at p=0.01.
2.Both β0 and β1 are statistically significant at p=0.01.
3.Only β0 is statistically significant at p=0.01.
4.Only β1 is statistically significant at p=0.01.
Part 2
The owner of the winter rental shop wants you to analyze how weekends affect his daily demand for shaped ski rentals, since he has noticed that more people rent shaped skis on weekend days (e.g. Saturday and Sunday) than on weekdays (e.g. Monday through Friday). Create a regression model of shaped skis daily rentals as a function of weekends. Use as only independent variable a dummy variable with a value of 1 for weekend days and 0 for the weekdays.
What is the ?2 value for this model?
Part 3
Based on your previous experience working on another ski rental shop located in Vermont (US), you believe that school breaks affect the daily rental demand for shaped skis. You also think that a better model could be obtained by using a multiple regression approach. Your proposal is to analyze three different models to predict shaped ski rentals using two independent variables in each model: (A) one model using the day of month and weekend as the independent variables, (B) another using the day of month and school break as independent variables, and (C) a last one using weekend and school break as independent variables. Which of these models is the best considering predictive power and statistical significance of coefficients?
Select the best answer
1.The model using the day of month and weekend is the best
2.The model using the day of month and school break is the best
3.The model using weekend and school break is the best
4.The model using only weekend (from Part 2) is better than any of these
Part 4
Create now a multivariate regression model using three independent variables: weekend, school break, and weather as predictors of the shaped ski rentals. For weather use a dummy variable with a value of 1 for those days that the quality of snow were good.
What is the adjusted ?2 value for this model?
Part 5
According to the previous (Part 4) multivariate regression model that you have created using three independent variables: weekend, school break, and weather as predictors of the shaped ski rentals, which is the value of the intercept?
In: Statistics and Probability
Here is the monthly stock price data for Ford Corp. and GM corp:
Prices for Ford and GM stock
Date Ford GM
8-Nov-99 24.44 66.08
1-Dec-99 25.79 65.09
3-Jan-00 24.32 72.14
1-Feb-00 20.35 68.54
1-Mar-00 22.45 74.63
3-Apr-00 27.00 84.37
1-May-00 23.95 64.02
1-Jun-00 22.08 52.63
3-Jul-00 24.17 51.61
1-Aug-00 21.95 63.97
1-Sep-00 23.14 59.40
2-Oct-00 23.98 56.77
1-Nov-00 20.89 45.64
1-Dec-00 21.52 46.96
2-Jan-01 26.16 49.51
1-Feb-01 25.30 51.77
1) What are the monthly returns for Ford and GM in April 00?
2) What is the covariance between returns of Ford and GM?
3) What is the slope of the regression?
4) What is the value of the intercept?
5) What is the r-squared of the regression?
In: Statistics and Probability
Biotech Co tracks their daily profits and has found that the
distribution of profits is approximately normal with a mean of
$22,400.00 and a standard deviation of about $650.00. Using this
information, answer the following questions.
For full marks your answer should be accurate to at least three
decimal places
Compute the probability that tomorrow's profit will be:
a.) less than $22,315.50 or greater than $22,348.00
b.)between $23,589.50 and $23,693.50
c.) greater than $21,217.00
d.) between $20,502.00 and $21,932.00
e.) less than $22,666.50 or greater than $23,895.00
In: Statistics and Probability
DATA:
Nursing | Psychology | |
Nervous | 16 | 3 |
Excited | 4 | 17 |
Required output for Chi square test is given as below and I need help inputting the data above into SPSS to achieve these results:
Observed Frequencies |
|||
Column variable |
|||
Row variable |
Nursing |
Psychology |
Total |
Nervous |
16 |
3 |
19 |
Excited |
4 |
17 |
21 |
Total |
20 |
20 |
40 |
Expected Frequencies |
|||
Column variable |
|||
Row variable |
Nursing |
Psychology |
Total |
Nervous |
9.5 |
9.5 |
19 |
Excited |
10.5 |
10.5 |
21 |
Total |
20 |
20 |
40 |
Data |
|
Level of Significance |
0.05 |
Number of Rows |
2 |
Number of Columns |
2 |
Degrees of Freedom |
1 |
Results |
|
Critical Value |
3.841459149 |
Chi-Square Test Statistic |
16.94235589 |
p-Value |
0.0000385 |
Reject the null hypothesis |
In: Statistics and Probability
Fogle Enterprises, Inc uses statistics in its quality assurance program. In the table below are the important measurements used at some of Fogle Enterprises, Inc international warehouses. For each warehouse they have collected the following information: warehouse location, maximum inventory level in cases, amount of space in square footage, and the percent of humidity expressed as Extreme = e; High = h; moderate = m; or low = l.
City |
Max Level |
Square Foot |
Humidity |
Acapulco |
99,000 |
7700.50 |
e |
Bangkok |
92,000 |
7800.00 |
m |
Mexico City |
77,000 |
5700.25 |
l |
Montreal |
72,000 |
5600.75 |
m |
Paris |
77,000 |
5800.00 |
h |
Rome |
88,000 |
6800.00 |
e |
Toronto |
78,000 |
6100.00 |
l |
a. How many elements and how many variables are in this data set?
b. Identify the variables and indicate whether they are qualitative or quantitative.
c. What “scale of measurement” represents each of the variables?
d. Which variables are discrete or continuous, and why?
e. In the actual “SiP” what is the variable being measured and what is the scale of measurement?
In: Statistics and Probability
When testing a random sample of 180 patients with a disease, the procedure yielded 16 “false negatives”.
a. Estimate the true proportion of all cases the diagnostic will yield a false negative, using 95% confidence. (18)
b. Verify that the sample size used in part A is large enough for the procedure to be considered valid. (3)
c. How many cases would need to be sampled to estimate the desired proportion to within a margin of error of 2%, using 95% confidence? (8)
In: Statistics and Probability
A random sample of 22 students’ weights is drawn from student population. Investigate whether the average weight of student population is different from 140 lb. 135 119 106 135 180 108 128 160 143 175 170 205 195 185 182 150 175 190 180 195 220 235 State the null and alternative hypothesis (Ho and Ha). What are the n,X ̅, s? Compute the t-statistic. What is the degree of freedom (df)? Find P-value from the table-D. Test the hypothesis at the significance level α=0.05. Reject Ho or Ha? Why? What conclusion can you make about the mean weight of students? Construct 95% Confidence Interval for the students’ mean weight. What value for t* should you use? Find the t* value from table-D. Do the calculation for the 95% Confidence Interval. Based on the 95% Confidence Interval from i), what conclusion can you make about the hypothesis in a)? Why?
(Answer all the questions and in a word document plz)
In: Statistics and Probability
Allegiant Airlines charges a mean base fare of $88. In addition, the airline charges for making a reservation on its website, checking bags, and inflight beverages. These additional charges average $38 per passenger. Suppose a random sample of 80 passengers is taken to determine the total cost of their flight on Allegiant Airlines. The population standard deviation of total flight cost is known to be $39. Use z-table. a. What is the population mean cost per flight? $ b. What is the probability the sample mean will be within $10 of the population mean cost per flight (to 4 decimals)? c. What is the probability the sample mean will be within $5 of the population mean cost per flight (to 4 decimals)?
In: Statistics and Probability
To see if police are more likely to pull over certain color cars, researchers gathered 28 Volkswagen Jettas and had them painted one of four colors: red, grey, black, and white. Participants drove the cars around for a month and recorded the number of traffic stops that occurred for each color. The results are below.
∑X2 = 2030
Red Cars: M = 12, T = 84, SS = 70, n = 7
Grey Cars: M = 4, T = 28, SS = 35, n = 7
Black Cars: M = 8, T = 56, SS = 63, n = 7
White Cars: M = 6, T = 42, SS = 42, n = 7
1.Specify the null and alternative hypotheses that car color affected the number of traffic stops.
2. Report the SS-Total, the SS-Within, and the SS-Between of an ANOVA
3. Report the MS-Within, the MS-Between, and the F-ratio of an ANOVA.
4. Using alpha = .01, report your decision about the impact of car color on traffic stops.
In: Statistics and Probability
1. in a recent study, the centers for disease and control centers reported that diastolic blood pressures of adult women in the united states are approximately normally distributed with a mean of 80.1 and a standard deviation of 9.5.
a) what proportion of women have blood pressures lower then 62?
b) what proportion of women have blood pressures between 69 and 87?
c) a dialostic blood pressure greater than 90 is classified as hypertension (high blood pressure), what porportion of women have hypertension?
d) is it unusual for a women to have blood pressure lower then 63?
round your answer to four decimal places.
2. the mean serum cholesterol levels for united states adults was 203, with the standard deviation of 42. a simple random sample of 109 adults were chosen. use the ti 84 calculator. round the answers to four decimal places.
a) what is the probability that the sample mean cholesterol levels is greater than 213.
b) what is the probability that the sample mean cholesterol levels is between 191 and 201 is:
c) would it be unusual for the sample mean to be less then 191?
it would be unusual or usual since the probability is:
3. ghgj
4. jj
In: Statistics and Probability