Question

In: Statistics and Probability

A major bank collected data on 100,000 of its customers (income, sex, location, number of cards,...

A major bank collected data on 100,000 of its customers (income, sex, location, number of cards, etc.) and then computed how much profit is made from the account of these customers during the past calendar year.

a) Identify whether the data are cross-sectional or time-series

(b) Give a name to each variable and indicate if the variable categorical, ordinal, or numerical

(c) List any concerns you might have for the accuracy of the data.

Solutions

Expert Solution

Data is collected over 100000 customers under variables Income, Sex, Location, No. of Cards

A)

data which is being collected is an Cross sectional data

because

we know that if data is collected over several variables on a particular time period then it is a  cross sectional data

and if data of one variable is collected on a variable over time period then it is a Time series data .

B)

Now we will define whether which variable is categorical, ordinal and numerical

Income : Since income variable is continous then it is a numerical variable .

Sex : Since a person can only be male, female, or other then it will be categorical variables .

Location : If location is defined by Zip codes then it is a categorical variable since one person can belong to only one pin code i.e., to a one category it belongs .

Number of cards : Since no of cards can be ordered

SO it is an ordinal data

since it can be only take values 0, 1, 2, 3, ....

C)

It can be checked if we have dataset at our hand .


Related Solutions

Cross selling is a major activity at financial institutions.  One regional bank classifies its 100,000 individual customers...
Cross selling is a major activity at financial institutions.  One regional bank classifies its 100,000 individual customers into 4 groups: a) Basic services (checking, savings accounts), b) Lending (mortgages, loans), c) Investment (mutual funds, bonds), and d) Financial planning (retirement, trusts, comprehensive financial planning). Customers in each of the four categories generate net (of servicing costs) revenue of $100, $200, $300, and $1,000 per year respectively.  Currently the mix of customers is 70%, 10%, 15%, and 5% in the four types.  Retention rates...
Data were collected on the amount spent by 64 customers for lunch at a major Houston...
Data were collected on the amount spent by 64 customers for lunch at a major Houston restaurant. Based upon past studies the population standard deviation is known with $6. Round your answers to 2 decimal places. Use the critical value with 3 decimal places. At 99% confidence, what is the margin of error? Develop a 99% confidence interval estimate of the mean amount spent for lunch.   Amount 20.50 14.63 23.77 29.96 29.49 32.70 9.20 20.89 28.87 15.78 18.16 12.16 11.22...
Data were collected on the amount spent for lunch by 64 customers at a major Houston...
Data were collected on the amount spent for lunch by 64 customers at a major Houston restaurant. The sample provided a sample mean of $21.5. Based upon past studies the population standard deviation is known with σ = $6. Develop a 99% confidence interval estimate of the mean amount spent for lunch.
Suppose data were collected on the number of customers that frequented a grocery stores on randomly...
Suppose data were collected on the number of customers that frequented a grocery stores on randomly selected days before and after the governor of the state declared a lock down due to COVID 19. A sample of 6 days before the lockdown were chosen as well as 6 days randomly chosen after the lock down was in place. The number of shoppers each day were as follows: Before lock down After lock down 100 60 110 50 115 70 120...
Suppose data were collected on the number of customers that frequented a grocery stores on randomly...
Suppose data were collected on the number of customers that frequented a grocery stores on randomly selected days before and after the governor of the state declared a lock down due to COVID 19. A sample of 6 days before the lockdown were chosen as well as 6 days randomly chosen after the lock down was in place. The number of shoppers each day were as follows: Before lock down After lock down 100 60 110 50 115 70 120...
The following table shows data on the average number of customers processed by several bank service...
The following table shows data on the average number of customers processed by several bank service units each day. The hourly wage rate is $15, the overhead rate is 1.2 times labor cost, and material cost is $4 per customer. Unit Employees Customers Processed / Day A 5 38 B 6 46 C 7 61 D 3 33 a. Compute the labor productivity and the multifactor productivity for each unit. Use an eight-hour day for multifactor productivity.(Round your "Labor Productivity"...
The data below collected for the number of at bats and the number of hits in...
The data below collected for the number of at bats and the number of hits in the world series: Subject At Bats Hits     A 51 19 B 67 25 C 77 30 D 44 20 E 55 23 F 39 16 G 45 18 1. Draw a scatterplot (10 points) 2. Find the correlation coefficient r (45 points) 3. Find the equation of the regression line and graph the equation on the scatterplot (35 points) 4. How many hits...
Chase Bank account division has collected data on the age of credit accounts. The data collected...
Chase Bank account division has collected data on the age of credit accounts. The data collected indicate that the age of the accounts follows a normal distribution with mean 18 years and standard deviation 6 years. a. What proportion of the accounts are between 20 and 32 years old? b. What is the number of years in which 75% of all accounts are above?
A bank that offers charge cards to customers studies the yearly purchase amount (in thousands of...
A bank that offers charge cards to customers studies the yearly purchase amount (in thousands of dollars) on the card as related to the age, income (in thousands of dollars), and the and years of education of the cardholder. Using the Excel printout answer the following questions. SUMMARY OUTPUT Regression Statistics Multiple R 0.9629 R Square _ _ _ _ Adjusted R Square 0.9144 Standard Error 0.0871 Observations 21.0000 ANOVA df SS MS F Regression - - - - -...
The last time that information on employees working at this bank was collected, the number of...
The last time that information on employees working at this bank was collected, the number of months of previous work experience was distributed such that 15% of employees had a less than 2 years of experience, 25% had between 2 and 5 years of experience, 20% had more than 5 years and up to 10 years of previous work experience, and 40% had more than 10 years of experience. Verify at a 0.05 level of significance if the current data...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT