Question

In: Statistics and Probability

Each value in the data set is called a ? .    Variables whose values are...

  1. Each value in the data set is called a ? .

  

  1. Variables whose values are determined by chance are called ? .

  1. A Blank 1 consists of all subjects (human or otherwise) that are being studied.

  

  1. A Blank 1 is a circle that is divided into sections or wedges according to the percentage of frequencies in each category of the distribution.

  

  1. Tell whether Descriptive or Inferential Statistics has been used. In the upcoming election, it is predicted that candidate Murphy will get 52% of the votes and candidate Patterson will receive 46% of the votes.
    1.

    Descriptive

    2.

    Inferential

  

  1. Tell whether Descriptive or Inferential Statistics has been used. Results of the election last week showed that candidate Patterson received 53% of the votes and candidate Murphy received 44% of the votes.
    1.

    Descriptive

    2.

    Inferential

  1. Classify as nominal-level, ordinal-level, interval-level, or ratio-level data. Salaries of employees of the United Nations.
    1.

    Nominal

    2.

    Ordinal

    3.

    Interval

    4.

    Ratio

  1. Classify as nominal-level, ordinal-level, interval-level, or ratio-level data. Temperatures of automobile engines.
    1.

    Nominal

    2.

    Ordinal

    3.

    Interval

    4.

    Ratio

  1. Classify each variable as discrete or continuous. Capacity of water in automobile engines.
    1.

    Discrete

    2.

    Continuous

  1. Classify each variable as discrete or continuous. Temperatures of airplane interiors at a given airport.
    1.

    Discrete

    2.

    Continuous

  

  1. Classify each variable as qualitative or quantitative. Classification of a speaker as Excellent, Good, Fair, Poor, Very Poor.
    1.

    Qualitative

    2.

    Quantitative

  1. Classify each variable as qualitative or quantitative. Number of cups of coffee sold by McDonald's in a day.
    1.

    Qualitative

    2.

    Quantitative

  

  1. Identify which sampling technique was used to select the given sample. On a large University campus, students attending classes in 3 buildings on a Tuesday during the hours of 10am and 5PM were selected to complete a survey.
    1.

    Random

    2.

    Systematic

    3.

    Stratified

    4.

    Cluster

  1. Identify which sampling technique was used to select the given sample. On a large college campus, faculty were selected using random numbers to determine annual salaries.
    1.

    Random

    2.

    Systematic

    3.

    Stratified

    4.

    Cluster

Solutions

Expert Solution

  • Each value in the data set is called a sample unit.
  • Variables whose values are determined by chance are called Random variables.
  • A population consists of all subjects(human or otherwise) that are being studied.
  • A pie chart is a circle that is divided into sections or wedges according to the percentage of frequencies in each category of the distribution.
  • In prediction purpose, we generalise some result about population based on sample data. So, in this case inferential statistics has been used.
  • Result of election is declared. Clearly this is done based on all population values. So, in this case descriptive statistics has been used.
  • In the case of salaries of employees of the United Nations, the data have the sense of true zero (like zero salary) as well as order and the exact value between units. So, this is an example of ratio level data.
  • In the case of temperature of automobile engines, the data have both the order and the exact differences between the values, but no sense of true zero (as true zero temperature does not occur, also zero Celsius does not correspond zero Fahrenheit). So, this is an example of interval level data.
  • Capacity of water in automobile engines is non-negative real number. So, this variable is continuous.
  • Temperatures of airplane interiors at agiven airport is non-negative real number. So, this variable is continuous.
  • Classification of speaker as Excellent, Good, Fair, Poor, Very Poor reflects quality of speaker. So, it is qualitative variable.
  • Number of cups of coffee sold by McDonald's in a day is non negative integer. So, it is quantitative variable.
  • Here classes of different times in different buildings of different days can be considered as heterogeneous (more or less) clusters. Few clusters are selected (not samples from cluster groups) based on choosing certain day (Tuesday), certain time of classes (10 AM to 5 PM) and certain selected (3) buildings. So, this is an example of using Cluster sampling technique.
  • Neither grouping into strata, cluster is done nor sample is selected using some pre defined systematic procedure. Sample is selected in purely randomised way by using random numbers. So, this is an example of Random sampling technique.

Related Solutions

Using the following lines of data, create a temporary SAS data set called ThreeDates. Each line...
Using the following lines of data, create a temporary SAS data set called ThreeDates. Each line of data contains three dates, the first two in the form mm/dd/yyyy descenders and the last in the form ddmmmyyyy. Name the three date variables Date1, Date2, and Date3. Format all three using the MMDDYY10. format. Include in your data set the number of years from Date1 to Date2 (Year12) and the number of years from Date2 to Date3 (Year23). Round these values to...
Using values from the Appendix of Thermodynamic data, calculate the value of H° for each of...
Using values from the Appendix of Thermodynamic data, calculate the value of H° for each of the following reactions. (a) 3 Fe(s) + 4 CO2(g) 4 CO(g) + Fe3O4(s) H° = 13.6 Correct: Your answer is correct. kJ (b) CH4(g) + 4 Cl2(g) CCl4(l) + 4 HCl(g) H° = 308.6 Incorrect: Your answer is incorrect. kJ (c) Fe2O3(s) + 3 CO(g) 2 Fe(s) + 3 CO2(g) H° = 24.8 Incorrect: Your answer is incorrect. kJ (d) 4 NH3(g) + O2(g)...
2. Take data sets A and B and delete duplicated values such that each value is...
2. Take data sets A and B and delete duplicated values such that each value is unique even when pooling the two data sets. Just like with the previous problem, treat data sets A and B as hypothetical data on the weights of children whose parents smoke cigarettes, and those whose parents do not respectively. a) Calculate the expected value of the wilcoxon Rank-Sum test statistic E(Wx) assuming the null hypothesis of equal medians being true. b) Conduct a Wilcoxon-Rank-Sum...
calculate r and r2 for each data set. Interpret their values. Complete parts a through d....
calculate r and r2 for each data set. Interpret their values. Complete parts a through d. a. x   y -2   -4 -1   -1 0   0 1   3 2   6 b. x   y -2   6 -1   3 0   1 1   0 2   -2 c. x   y 2   3 3   2 3   4 4   2 4   3 4   4 5   3 d. x   y -3   0 -2   1 0   2 2   1 3   0
Construct a scattergram for each data set. Then calculate r and r2 for each data set....
Construct a scattergram for each data set. Then calculate r and r2 for each data set. Interpret their values. Complete parts a through d. a. x −1 0 1 2 3 y −3 0 1 4 5 Calculate r. r=. 9853.​(Round to four decimal places as​ needed.) Calculate r2. r2=0.9709​(Round to four decimal places as​ needed.) Interpret r. Choose the correct answer below. A.There is not enough information to answer this question. B.There is a very strong negative linear relationship...
Data set : Data Set G: Assume the population values are normally distributed. Random variable: x...
Data set : Data Set G: Assume the population values are normally distributed. Random variable: x = weight of border collie in pounds sample size = 25 34.1 40.8 36.0 34.9 35.6 43.4 35.4 29.3 33.3 37.8 35.8 37.4 39.0 38.6 33.9 36.5 37.2 37.6 37.3 37.7 34.9 33.2 36.2 33.5 36.9 1. Choose another confidence level similar to one found in the homework (do not use the confidence level from the posted example). Based on the second confidence level,...
. Choose a data set with 10-30 data values. (This could include but Is not limited...
. Choose a data set with 10-30 data values. (This could include but Is not limited to the closing price of a stock for 10-20 days, 10-20 of your exam grades, etc) You must all choose a different data set!! Compute the mean, median, mode, quartiles, IQR, SIQR, variance and standard deviation. Then report the relative position (from the median in terms of amount of SIQRs and also from the mean in terms of amount of standard dev's) of both...
Variables in Wooldridge's data set (description): Cross-sectional data set from Wooldridge 1. return % change stock...
Variables in Wooldridge's data set (description): Cross-sectional data set from Wooldridge 1. return % change stock price, 90-94 2. dkr debt/capital, 1990 3. eps earnings per share, 1990 4. netinc net income, 1990 (millions $) 5. salary CEO salary, 1990 (thousands $) Dataset: return dkr eps netinc salary -20.84211 4 48.1 1144 1090 -9.138381 27.3 -85.3 35 1923 86.21795 36.8 -44.1 127 1012 131.8367 46.4 192.4 367 579 -8.189655 36.2 -60.4 214 600 -26.00733 18.7 -79.8 118 735 52.27273 34.4...
Identify the possible values of each of the 3 variables in this dataset and describe what...
Identify the possible values of each of the 3 variables in this dataset and describe what information each of the 3 variables tells us about the data Heart rate before and after exercise M=0 F=1 Resting After Exercise 0 85.9 87.5 0 67.7 79.4 0 80.3 93.4 0 85.2 97.7 0 86.3 99.7 0 76.6 83.7
25. In Data Mining, ___ is a set of input variables used to predict an observation's...
25. In Data Mining, ___ is a set of input variables used to predict an observation's outcome class or continuous outcome value. 26. During each iteration of cluster analysis, the distances between new clusters are determined until any two clusters are sufficiently close to be linked using an algorithm called ___. 27. In the CRISP-DM process for data mining, which phase is the cleaning of the data so it is ready for modeling tools?
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT