Question

In: Statistics and Probability

Salmons Stores operates a national chain of women's apparel stores. Five thousand copies of an expensive...

Salmons Stores operates a national chain of women's apparel stores. Five thousand copies of an expensive four-color sales catalog have been printed, and each catalog includes a coupon that provides a $50 discount on purchases of $200 or more. Salmons would like to send the catalogs only to customers who have the highest probability of using the coupon. The DATAile Salmons contains data from an earlier promotional campaign. For each of 1,000 Salmons customers, three variables are tracked: last year's total spending at Salmons, whether they have a Salmons store credit card, and whether they used the promotional coupon they were sent.

Create a standard partition of the data with all the tracked variables and 50% of observations in the training set, 30% in the validation set, and 20% in the test set. Use logistic regression to classify observations as a promotion-responder or not by using Spending and Card as input variables and Coupon as the output variable. Perform Variable Selection with the best subsets procedure with the number of best subsets equal to two.

Click on the datafile logo to reference the data.

(a) Evaluate the logistic regression models based on their classification error. Recommend a final model and express the model as a mathematical equation relating the output variable to the input variables.
If required, round your answers to three decimal places. For subtractive or negative numbers use a minus sign even if there is a + sign before the blank. (Example: -300)
Value =  +  *Spending +  *Card
(c) What is the area under the ROC curve on the test set?
If required, round your answer to three decimal places.
To achieve a sensitivity of at least 0.80, how much Class 0 error rate must be tolerated?

If required, round your answer to three decimal places.

This is part of the data and i need the solution but not the answer, it is better to provide screen shoot on excel, thanks a lot

Customer Spending Card Coupon
1 2291 1 0
2 3215 1 0
3 2135 1 0
4 3924 0 0
5 2528 1 0
6 2473 0 1
7 2384 0 0
8 7076 0 0
9 1182 1 1
10 3345 0 0
11 2140 1 0
12 3255 0 1
13 1512 0 0
14 2148 0 1
15 6737 0 0
16 6486 0 0
17 1307 0 0
18 3470 1 0
19 2936 0 0
20 6404 0 1
21 2229 0 0
22 2933 0 0
23 2118 0 0
24 2050 0 0
25 4998 0 1
26 1394 0 0
27 3993 1 1
28 2059 0 1
29 1677 0

0  

Solutions

Expert Solution

The data set is very small to divide data in training testing and validation so have only divided this data in the training and testing.

Also provided R code for your reference,

The total observation in data is 29

> dim(data)
[1] 29  
> set.seed(100)
> sample_split = createDataPartition(data$Coupon, p=0.7, list=FALSE)
> Train = data[sample_split, ]
> dim(Train )
[1] 21  
> Test = data[-sample_split,]
> dim(Test )
[1] 8

70% data in train and 30% in the test.

The fitted model on train data is,

trainCV = trainControl(method='repeatedcv',
number=3,
repeats=2,
verbose=TRUE)


modLog = train(x = Train[,-'Coupon'],
y = Train$Coupon,
method='glm',
trControl = trainCV)

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 2.098e-02 2.439e-01 0.086 0.932
Spending 4.717e-05 5.661e-05 0.833 0.416
Card 1.825e-01 2.238e-01 0.815 0.426

The fitted model is,

Value of Coupon = 0.02098 + 0.00004717 * Spending + 0.1825 * Card

The p-values are greater than 0.05 so variables are not significant but would be those both affect on response together.

Results for train model,

Accuracy : 0.7619   

95% CI : (0.5283, 0.9178)

Sensitivity : 1.0000   

Specificity: 0.0000

Results for the test model:

Accuracy : 0.625
95% CI : (0.2449, 0.9148)

Sensitivity : 1.000
Specificity: 0.000

The training model performs well then the testing model because accuracy is high than the test model.

ROC curve:

ROC curve is the probability that how much area comes under the curve it is the plot between True positive rate and false positive rate.

for train model and test model specificity is 0 so not possible to draw curve.

>>>>>>>>>>>>>>> Best Luck >>>>>>>>>>>>>>


Related Solutions

Pelican Stores Pelican Stores, a division of National Clothing, is a chain of women’s apparel stores...
Pelican Stores Pelican Stores, a division of National Clothing, is a chain of women’s apparel stores operating throughout the country. The chain recently ran a promotion in which discount coupons were sent to customers of other National Clothing stores. data collected for a sample of 100 in-store credit card transactions at Pelican Stores during one day while the promotion was running are contained in the file named PelicanStores. Table 2.19 shows a portion of the data set. The Proprietary Card...
Pelican Stores, a division of National Clothing, is a chain of women’s apparel stores operating throughout...
Pelican Stores, a division of National Clothing, is a chain of women’s apparel stores operating throughout the country. The chain recently ran a promotion in which discount coupons were sent to customers of other National Clothing stores. Data collected for a sample of 100 in-store credit card transactions at Pelican Stores during one day while the promotion was running are contained in the file named PelicanStores. The Proprietary Card method of payment refers to charges made using a National Clothing...
Eagle Outfitters is a chain of stores specializing in outdoor apparel and camping gear. They are...
Eagle Outfitters is a chain of stores specializing in outdoor apparel and camping gear. They are considering a promotion that involves mailing discount coupons to all their credit card customers. This promotion will be considered a success if more than 10% of those receiving the coupons use them. Before going national with the promotion, coupons were sent to a sample of 100 credit card customers. Click on the datafile logo to reference the data. Excel or Minitab users: The data...
The Apparel Company makes expensive polo-style men's and women's short-sleeve knit shirts at its plant in...
The Apparel Company makes expensive polo-style men's and women's short-sleeve knit shirts at its plant in Jamaica. The production process requires that material be cut into large patterned squares by operators, which are then sewn together at another stage of the process. If the squares are not of a correct length, the final shirt will be either too large or too small. In order to monitor the cutting process, management takes a sample of 4 squares of cloth every other...
The Cotton Mill is an upscale chain of women's clothing stores, located in the southwestern United...
The Cotton Mill is an upscale chain of women's clothing stores, located in the southwestern United States. Do to recent success, The Cotton Mill's top management is planning to expand by locating new stores in other regions of the country. The director of planning has been asked to study the relationship between yearly sales and the store size. As part of the study, the director selects a sample of 25 stores and determines the size of the store in square...
A national chain of clothing goods stores recently sent shipments to the following stores. The number...
A national chain of clothing goods stores recently sent shipments to the following stores. The number of items shipped to each store and their total wholesale cost is shown in the table below. Find the wholesale price of one of each item. (A) cost of a pair of jeans is? (B) cost of a jacket is? (C) cost of a sweater is? (D) cost of a shirt is? JEANS JACKETS SWEATERS SHIRTS TOTAL COST Store A 4000 3000 2100 4300...
A national chain of clothing goods stores recently sent shipments to the following stores. The number...
A national chain of clothing goods stores recently sent shipments to the following stores. The number of items shipped to each store and their total wholesale cost is shown in the table below. Find the wholesale price of one of each item. JEANS JACKETS SWEATERS SHIRTS TOTAL COST 4000 4000 2300 4100 592,525 3700 3500 2200 4300 547,825 5000 2000 1500 6500 499,625 6000 1700 700 7000 485,550
The Walk Rite Shoe Company operates a chain of shoe stores. The stores sell ten different...
The Walk Rite Shoe Company operates a chain of shoe stores. The stores sell ten different styles of inexpensive​ men's shoes with identical unit costs and selling prices. A unit is defined as a pair of shoes. Each store has a store manager who is paid a fixed salary. Individual salespeople receive a fixed salary and a sales commission. Walk Rite is trying to determine the desirability of opening another​ store, which is expected to have the following revenue and...
Buckeye Department Stores, Inc. operates a chain of department stores in Ohio. The company’s organization chart...
Buckeye Department Stores, Inc. operates a chain of department stores in Ohio. The company’s organization chart appears below. Operating data for 20x1 follow. BUCKEYE DEPARTMENT STORES, INC. Operating Data for 20x1 (in thousands) Columbus Division Olentangy Store Scioto Store Downtown Store Cleveland Division (total for all stores) Sales revenue $ 5,000 $ 2,500 $ 12,000 $ 22,000 Variable expenses: Cost of merchandise sold 3,000 2,100 6,000 13,000 Sales personnel—salaries 500 310 760 1,700 Sales commissions 60 50 90 220 Utilities...
Big Box Chain Inc., (BBC) together with its subsidiaries,operates a chain of retail stores. The...
Big Box Chain Inc., (BBC) together with its subsidiaries, operates a chain of retail stores. The company sells a range of domestics’ merchandise, including bed linens and related items, bath items, and kitchen textiles; and home furnishings, such as kitchen and tabletop items, fine tabletop, basic housewares, general home furnishings, consumables, and various juvenile products. BBC just paid a year-end dividend of $2.00. Investors expect the dividends to grow at 20% for the next three years and after which the...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT