Question

In: Math

Open the files for the Course Project and the data set. For each of the five...

Open the files for the Course Project and the data set.

For each of the five variables, process, organize, present, and summarize the data. Analyze each variable by itself using graphical and numerical techniques of summarization. Use Excel as much as possible, explaining what the results reveal. Some of the following graphs may be helpful: stem-leaf diagram, frequency/relative frequency table, histogram, boxplot, dotplot, pie chart, and bar graph. Caution: not all of these are appropriate for each of these variables, nor are they all necessary. More is not necessarily better. In addition, be sure to find the appropriate measures of central tendency, the measures of dispersion, and the shapes of the distributions (for the quantitative variables) for the above data. Where appropriate, use the five number summary (the Min, Q1, Median, Q3, Max). Once again, use Excel as appropriate, and explain what the results mean. Analyze the connections or relationships between the variables. There are 10 possible pairings of two variables. Use graphical as well as numerical summary measures. Explain the results of the analysis. Be sure to consider all 10 pairings. Some variables show clear relationships, whereas others do not. Report Requirements From the variable analysis above, provide the analysis and interpretation for three individual variables. This would include no more than one graph for each, one or two measures of central tendency and variability (as appropriate), the shapes of the distributions for quantitative variables, and two or three sentences of interpretation. For the 10 pairings, identify and report only on three of the pairings, again using graphical and numerical summary (as appropriate), with interpretations. Please note that at least one pairing must include a qualitative variable, and at least one pairing must not include a qualitative variable. Prepare the report in Microsoft Word, integrating graphs and tables with text explanations and interpretations. Be sure to include graphical and numerical back up for the explanations and interpretations. Be selective in what is included in the report to meet the requirements of the report without extraneous information. All DeVry University policies are in effect, including the plagiarism policy. Project Part A report is due by the end of Week 2. Project Part A is worth 100 total points. See the grading rubric below. Submission: The report, including all relevant graphs and numerical analysis along with interpretations Format for report: Brief Introduction Discuss the first individual variable, using graphical, numerical summary and interpretation. Discuss the second individual variable, using graphical, numerical summary and interpretation. Discuss the third individual variable, using graphical, numerical summary and interpretation. Discuss the first pairing of variables, using graphical, numerical summary and interpretation. Discuss the second pairing of variables, using graphical, numerical summary and interpretation. Discuss the third pairing of variables, using graphical, numerical summary and interpretation. Conclusion

Sales (Y) Calls (X1) Time (X2) Years (X3) Type
48 168 12.3 5 ONLINE
36 131 16.4 4 NONE
46 162 15.7 3 NONE
47 183 13.0 3 ONLINE
44 177 15.3 3 ONLINE
49 181 12.4 2 ONLINE
35 123 19.0 3 NONE
46 169 14.8 3 GROUP
44 158 13.9 1 GROUP
39 146 15.4 3 GROUP
48 178 12.6 4 ONLINE
42 142 17.0 0 ONLINE
45 137 13.0 2 ONLINE
54 195 15.2 2 ONLINE
43 146 16.4 0 ONLINE
44 165 17.4 3 ONLINE
34 121 13.2 2 NONE
44 146 16.5 1 NONE
40 132 18.2 1 NONE
51 182 17.9 2 ONLINE
41 151 18.0 1 NONE
45 146 15.6 3 ONLINE
52 190 13.2 3 ONLINE
39 150 19.4 0 GROUP
41 149 13.2 3 GROUP
45 167 14.5 4 GROUP
46 189 20.0 1 GROUP
47 162 16.4 3 ONLINE
42 147 13.2 3 GROUP
45 171 19.4 2 ONLINE
44 165 15.0 0 ONLINE
50 175 15.1 3 ONLINE
46 161 13.2 3 GROUP
53 188 11.0 2 ONLINE
39 136 17.3 0 NONE
39 135 17.7 1 ONLINE
48 168 15.9 5 ONLINE
46 167 10.1 0 ONLINE
43 150 17.4 3 GROUP
44 151 15.2 2 GROUP
42 141 12.2 3 NONE
39 131 19.4 2 NONE
49 174 18.3 0 ONLINE
41 154 14.5 4 NONE
42 131 20.2 3 GROUP
39 128 15.3 1 GROUP
37 126 13.4 4 NONE
46 180 15.1 4 NONE
45 166 19.5 5 NONE
44 152 16.0 2 ONLINE
50 179 12.8 3 ONLINE
39 140 18.2 1 NONE
43 154 15.3 1 ONLINE
45 164 17.2 3 ONLINE
42 139 18.6 2 NONE
44 165 19.2 2 NONE
45 172 12.6 3 GROUP
41 147 18.5 3 GROUP
43 152 17.2 1 GROUP
48 160 15.8 2 ONLINE
42 159 13.6 4 GROUP
46 186 14.1 3 GROUP
46 150 20.7 2 GROUP
43 155 11.2 3 ONLINE
45 157 16.3 4 ONLINE
48 170 12.1 1 ONLINE
45 175 18.3 2 GROUP
49 186 17.5 1 GROUP
51 181 11.4 4 GROUP
47 171 17.3 2 ONLINE
50 185 16.4 0 ONLINE
39 146 15.8 1 GROUP
42 156 18.6 2 GROUP
46 157 19.3 2 ONLINE
43 163 11.7 1 GROUP
54 175 14.2 1 ONLINE
51 175 12.0 2 ONLINE
50 173 13.3 1 ONLINE
41 140 14.9 3 NONE
43 156 20.5 2 ONLINE
40 146 18.2 2 NONE
42 148 10.5 2 GROUP
50 183 11.7 1 GROUP
49 191 13.1 2 GROUP
40 149 14.2 4 ONLINE
40 143 18.3 2 NONE
47 185 15.2 2 ONLINE
41 136 17.4 3 GROUP
51 198 13.0 1 ONLINE
43 153 13.2 3 GROUP
38 129 15.2 3 NONE
44 158 11.8 3 ONLINE
43 149 12.7 1 GROUP
47 175 13.9 2 GROUP
40 154 16.4 3 GROUP
43 151 14.3 1 GROUP
46 153 22.0 0 ONLINE
46 167 14.8 1 ONLINE
46 167 15.8 0 ONLINE
39 143 17.7 3 NONE

Solutions

Expert Solution

Solution:

1) 5 number summary of the data.

2) Box plots

3) Stem and leaf plot

4) Histograms

5) Dot plots

6) Type category

7) Relationship between Two variables.

8) Regression equation and output

Y = 14.7567 + 0.1977*X1 - 0.0938*X2 - 0.1945*X3


Related Solutions

The link to the data is below, just click the link & open up the files...
The link to the data is below, just click the link & open up the files please. Listed under MOISTURE http://www.mediafire.com/download/thnnoaaqqefdwcf/excel_files.zip An important quality characteristic used by the manufacturer of Boston and Vermont asphalt shingles is the amount of moisture the shingles contain when they are packaged. Customers may feel that they have purchased a product lacking in quality if they find moisture and wet shingles inside the packaging. In some cases, excessive moisture can cause the granules attached to...
OK I have two data sets with 30 million rows each each data set is five...
OK I have two data sets with 30 million rows each each data set is five columns with four attributes and an amount. I want to confirm that the two data sets are exactly the same no two rows of data in the 30 million rolls are duplicates For my proof I will confirm each data set has the same number of rows. And I will also do the following: I will create four smaller data sets from each of...
OK I have two data sets with 30 million rows each each data set is five...
OK I have two data sets with 30 million rows each each data set is five columns with four attributes and an amount. I want to confirm that the two data sets are exactly the same no two rows of data in the 30 million rolls are duplicates For my proof I will confirm each data set has the same number of rows. And I will also do the following: I will create four smaller data sets from each of...
Use the data set named Store_Visits located in the folder Data Files for HW Assignment (outside...
Use the data set named Store_Visits located in the folder Data Files for HW Assignment (outside of Minitab folder) in the K-drive. The response variable y is the number of visits of a customer to a particular food store in a large suburban area within the period of a month, and the independent variable x is the distance (in miles) of the customer’s home to the store. Fit a simple linear regression model to the data, and answer the following...
Use the data set named Store_Visits located in the folder Data Files for HW Assignment (outside...
Use the data set named Store_Visits located in the folder Data Files for HW Assignment (outside of Minitab folder) in the K-drive. The response variable y is the number of visits of a customer to a particular food store in a large suburban area within the period of a month, and the independent variable x is the distance (in miles) of the customer’s home to the store. Fit a simple linear regression model to the data, and answer the following...
Python - files: find a solution for each following: -Open the file hostdata.txt for reading. -Store...
Python - files: find a solution for each following: -Open the file hostdata.txt for reading. -Store four file objects corresponding to the files winter2003.txt , spring2003.txt, summer2003.txt, and fall2003.txt in the variables winter, spring, summer, and fall (respectively), and open them all for reading. -Write a statement to open the file yearsummary.txt in a way that erases any existing data in the file. -Use the file object output to write the string "3.14159" to a file called pi. -A file...
Construct a scattergram for each data set. Then calculate r and r2 for each data set....
Construct a scattergram for each data set. Then calculate r and r2 for each data set. Interpret their values. Complete parts a through d. a. x −1 0 1 2 3 y −3 0 1 4 5 Calculate r. r=. 9853.​(Round to four decimal places as​ needed.) Calculate r2. r2=0.9709​(Round to four decimal places as​ needed.) Interpret r. Choose the correct answer below. A.There is not enough information to answer this question. B.There is a very strong negative linear relationship...
*Work the problems in EXCEL. 1. Problem 1: =======>>>>>>Open data files: JeepSales.xlsx and JeepTable.xlsx ( copy...
*Work the problems in EXCEL. 1. Problem 1: =======>>>>>>Open data files: JeepSales.xlsx and JeepTable.xlsx ( copy this) (a) Learn the PivotTable to derive JeepTables.xlsx from JeepSales.xlsx. [Hint: Click Insert,   PivotTable, highlight the data] (b) Construct a Frequency Bar chart, Pie chart.   Reduce graph size. Copy/paste into a MS Word file. Jeep Model W L G L G C L G C G L W L G W G L W W G L L L G L G L G...
The following data set represents the final grades assigned in a statistics course. The grades are...
The following data set represents the final grades assigned in a statistics course. The grades are as followed, 60, 50, 85, 85, 85, 90, 100, 70, 83, 92, 68, 70, 88, 88, 85, 90, 20, 100, 90, 80, 77 1. Professor Williamson believes that the average grade she would assign would be an 85. Is she correct? 2. Determine an appropriate alpha level for the given data set and justify your reason 3. Create your null and alternate hypothesis 4....
Open the State of the States data set and codebook, and answer the following questions. 1....
Open the State of the States data set and codebook, and answer the following questions. 1. Calculate the following univariate statistics for the variable, childabuserate (i.e., maximum, minimum, mean, standard deviation (STDEV), and skewness (SKEW). Be sure to determine N (the sample size) for this variable. 2. Diagram, by hand, the shape, mean, minimum & maximum values of child abuse rate. In other words, draw, by hand, a figure for a 5 number summary. 3.   Write a full paragraph giving...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT