Question

In: Statistics and Probability

Know Thy Customer (KTC) is a financial consulting company that provides personalized financial advice to its...

Know Thy Customer (KTC) is a financial consulting company that provides personalized financial advice to its clients. As a basis for developing this tailored advising, KTC would like to segment its customers into several representative groups based on key characteristics. Peyton Avery, the director of KTC’s fledging analytics division, plans to establish the set of representative customer profiles based on 600 customer records in the file KnowThyCustomer. Each customer record contains data on age, gender, annual income, marital status, number of children, whether the customer has a car loan, and whether the customer has a home mortgage. KTC’s market research staff has determined that these seven characteristics should form the basis of the customer clustering. Peyton has invited a summer intern, Danny Riles, into her office so they can discuss how to proceed. As they review the data on the computer screen, Peyton’s brow furrows as she realizes that sis task may not be trivial. The data contains both categorical variables (Female, Married, Car, Mortgage), and interval variables (Age, Income, and Children). Managerial Report Playing the role of Peyton, you must write a report documenting the construction of the representative customer profiles. Because Peyton would like to use this report as a training reference for interns such as Danny, your report should experiment with several approaches and explain the strengths and weaknesses of each. In particular, your report should include the following analyses: 1. Using k-means clustering on all seven variables, experiment with different values of k. Recommend a value of k and describe these k clusters according to their “average” characteristics. Why might k-means clustering not be a good method to use for these seven variables? 2. Using hierarchical clustering all seven variables, experiment with using complete linkage and group average linkage as the clustering method. Recommend a set of customer profiles (clusters). Describe these clusters according to their “average” characteristics. Why might hierarchical clustering not be a good method to use for these seven variables?

Solutions

Expert Solution

1.1 In K means clustering the convention is to use the Euclidean distance between feature vectors. But here as the feature vector are consists of both the qualitative and quantitative variable so using Euclidean distance will not work here. Rather this is meaningless. Instead of that one can use Bray Curtis dissimilarity measure to calculate the distance between the feature vectors as they take into account both the qualitative such as gender and the quantitative variables such as Annual income in the consideration.

1.2 The next thing is to choose the number of clusters. At first, any no of clusters can be chosen, the default can be chosen as if there are n data points available, then we can start with n/30 no of class centroids. There is no thumb rule or method to choose exact no of class centroids, but what we can do we can split the data 70-30 ratio. train the K means clustering on the 70% of the data. Then use the rest of 30% to assign them to the different cluster, after assigning this 30% of the point if the original class centroids change drastically then we need to change the no of clusters, there may be a possibility that more number of clusters are required.

1.3. The main disadvantage is that we do not know in prior how many clusters could be formed optimally. and also that I have indicated earlier the distance measure should have been changed after seeing the nature of the data.

2. For hierarchical clustering group average linkage is a very crude method to use. It is not very helpful. On the other hand, if we use complete linkage then it reveals the maximum distance between two clusters .

2.1 It may be useful to use the hierarchical clustering because at all the stages we can see the full picture and according to our necessity we can stop at any of the stages with the number of cluster.

There may be noisy data and some of the features may unnecessarily add noise to the data. So it might not be good to use hierarchical clustering .


Related Solutions

Know Thy Customer (KTC) is a financial consulting company that provides personalized financial advice to its...
Know Thy Customer (KTC) is a financial consulting company that provides personalized financial advice to its clients. As a basis for developing this tailored advising, KTC would like to segment its customers into several representative groups based on key characteristics. Peyton Avery, the director of KTC’s fledging analytics division, plans to establish the set of representative customer profiles based on 600 customer records in the file KnowThyCustomer. Each customer record contains data on age, gender, annual income, marital status, number...
Know Thy Customer (KTC) is a financial consulting company that provides personalized financial advice to its...
Know Thy Customer (KTC) is a financial consulting company that provides personalized financial advice to its clients. As a basis for developing this tailored advising, KTC would like to segment its customers into several representative groups based on key characteristics. Peyton Avery, the director of KTC’s fledging analytics division, plans to establish the set of representative customer profiles based on 600 customer records in the file KnowThyCustomer. Each customer record contains data on age, gender, annual income, marital status, number...
Financial Statements and Closing Entries Last Chance Company offers legal consulting advice to prison inmates. Last...
Financial Statements and Closing Entries Last Chance Company offers legal consulting advice to prison inmates. Last Chance prepared the end-of-period spreadsheet that follows at June 30, 20Y3, the end of the fiscal year. Last Chance Company End-of-Period Spreadsheet For the Year Ended June 30, 20Y3 Unadjusted Adjusted Trial Balance Adjustments Trial Balance Account Title    Dr.    Cr.    Dr.    Cr.    Dr.    Cr. Cash 5,100 5,100 Accounts Receivable 22,750 3,750 26,500 Prepaid Insurance 3,600 1,300 2,300 Supplies 2,025 1,500 525 Land 80,000 80,000...
39) Sunland Company provides financial consulting and has collected the following data for the next year’s...
39) Sunland Company provides financial consulting and has collected the following data for the next year’s budgeted activity for a lead consultant. Consultants’ wages $90000       Fringe benefits $22500       Related overhead $17500 Supply clerk’s wages $19000       Fringe benefits $4000       Related overhead $22000 Profit margin per hour $20 Profit margin on materials 15% Total estimated consulting hours 5000 Total estimated material costs $180000 A consulting job takes 20 hours of consulting time and $180 of materials. The client’s...
Sheridan Company provides financial consulting and has collected the following data for the next year’s budgeted...
Sheridan Company provides financial consulting and has collected the following data for the next year’s budgeted activity for a lead consultant. Consultants’ wages $80000       Fringe benefits $32500       Related overhead $27500 Supply clerk’s wages $19000       Fringe benefits $5000       Related overhead $22000 Profit margin per hour $20 Profit margin on materials 15% Total estimated consulting hours 5000 Total estimated material costs $166000 The labor rate per hour is $46.50. $48.00. $28.00. $47.50.
Powerjob Inc. provides employment consulting services. The company adjusts its accounts monthly but performs closing entries...
Powerjob Inc. provides employment consulting services. The company adjusts its accounts monthly but performs closing entries annually on December 31. The firms’s unadjusted trial balance dated December 31, 2019 is shown below. Other data: 1.       Accrued but unrecorded and uncollected consulting fees earned at December 31 total : $25000. 2.       The company determined that $15000 of previously unearned consulting fees had been earned at December 31. 3.       Office supplies on hand at December 31 total $300 4.       The company purchased all of its equipment...
Tripoli Consulting provides financial and estate planning services on a retainer basis for the executive officers...
Tripoli Consulting provides financial and estate planning services on a retainer basis for the executive officers of its corporate clients. It incurred the following labor costs on services for three corporate clients during March 2018: Direct Labor Contact 1 $9,000 Contract 2 $3,600 Contract 3 $14,400 TOTAL $27,000 Tripoli allocated March overhead costs of $13,500 to the contracts based on the amount of direct labor costs incurred on each contract. a. Assuming the revenue from Contract 3 was $40,000, what...
question Last Chance Company offers legal consulting advice to prison inmates. Last Chance Company prepared the...
question Last Chance Company offers legal consulting advice to prison inmates. Last Chance Company prepared the end-of-period spreadsheet that follows at June 30, 2019, the end of the The annual accounting period adopted by a business.fiscal year: Last Chance Company End-of-Period Spreadsheet For the Year Ended June 30, 2019 Unadjusted Adjusted Trial Balance Adjustments Trial Balance Account Title    Dr.    Cr.    Dr.    Cr.    Dr.    Cr. Cash 5,100 5,100 Accounts Receivable 22,750 (a) 3,750 26,500 Prepaid Insurance 3,600 (b) 1,300 2,300 Supplies...
We are a consulting company. Our customer is the BEST COMPUTER STORE which is targeted to...
We are a consulting company. Our customer is the BEST COMPUTER STORE which is targeted to begin operation next year. The store will need an information system to help store customer/sales & inventory information. Our system will be able to add/edit customers/sales & inventory and creates necessary output transactions such as packing slip & invoice. We did Statement of Work, Team & Clear Objectives & Scope, Timeline (PERT and Gantt charts), Assumption and Budget following requirements. Project Name: Project Manager:...
Concord Corporation provides financial consulting and has collected the following data for the next year’s budgeted...
Concord Corporation provides financial consulting and has collected the following data for the next year’s budgeted activity for a lead consultant. Consultants’ wages $85000       Fringe benefits $22500       Related overhead $17500 Supply clerk’s wages $17000       Fringe benefits $3000       Related overhead $21000 Profit margin per hour $20 Profit margin on materials 15% Total estimated consulting hours 5000 Total estimated material costs $164000 A consulting job takes 20 hours of consulting time and $180 of materials. The client’s bill...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT