Question

In: Statistics and Probability

Salmons Stores operates a national chain of women's apparel stores. Five thousand copies of an expensive...

Salmons Stores operates a national chain of women's apparel stores. Five thousand copies of an expensive four-color sales catalog have been printed, and each catalog includes a coupon that provides a $50 discount on purchases of $200 or more. Salmons would like to send the catalogs only to customers who have the highest probability of using the coupon. The DATAile Salmons contains data from an earlier promotional campaign. For each of 1,000 Salmons customers, three variables are tracked: last year's total spending at Salmons, whether they have a Salmons store credit card, and whether they used the promotional coupon they were sent.

Create a standard partition of the data with all the tracked variables and 50% of observations in the training set, 30% in the validation set, and 20% in the test set. Use logistic regression to classify observations as a promotion-responder or not by using Spending and Card as input variables and Coupon as the output variable. Perform Variable Selection with the best subsets procedure with the number of best subsets equal to two.

Click on the datafile logo to reference the data.

(a) Evaluate the logistic regression models based on their classification error. Recommend a final model and express the model as a mathematical equation relating the output variable to the input variables.
If required, round your answers to three decimal places. For subtractive or negative numbers use a minus sign even if there is a + sign before the blank. (Example: -300)
Value =  +  *Spending +  *Card
(c) What is the area under the ROC curve on the test set?
If required, round your answer to three decimal places.
To achieve a sensitivity of at least 0.80, how much Class 0 error rate must be tolerated?

If required, round your answer to three decimal places.

This is part of the data and i need the solution but not the answer, it is better to provide screen shoot on excel, thanks a lot

Customer Spending Card Coupon
1 2291 1 0
2 3215 1 0
3 2135 1 0
4 3924 0 0
5 2528 1 0
6 2473 0 1
7 2384 0 0
8 7076 0 0
9 1182 1 1
10 3345 0 0
11 2140 1 0
12 3255 0 1
13 1512 0 0
14 2148 0 1
15 6737 0 0
16 6486 0 0
17 1307 0 0
18 3470 1 0
19 2936 0 0
20 6404 0 1
21 2229 0 0
22 2933 0 0
23 2118 0 0
24 2050 0 0
25 4998 0 1
26 1394 0 0
27 3993 1 1
28 2059 0 1
29 1677 0

0  

Solutions

Expert Solution

The data set is very small to divide data in training testing and validation so have only divided this data in the training and testing.

Also provided R code for your reference,

The total observation in data is 29

> dim(data)
[1] 29  
> set.seed(100)
> sample_split = createDataPartition(data$Coupon, p=0.7, list=FALSE)
> Train = data[sample_split, ]
> dim(Train )
[1] 21  
> Test = data[-sample_split,]
> dim(Test )
[1] 8

70% data in train and 30% in the test.

The fitted model on train data is,

trainCV = trainControl(method='repeatedcv',
number=3,
repeats=2,
verbose=TRUE)


modLog = train(x = Train[,-'Coupon'],
y = Train$Coupon,
method='glm',
trControl = trainCV)

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 2.098e-02 2.439e-01 0.086 0.932
Spending 4.717e-05 5.661e-05 0.833 0.416
Card 1.825e-01 2.238e-01 0.815 0.426

The fitted model is,

Value of Coupon = 0.02098 + 0.00004717 * Spending + 0.1825 * Card

The p-values are greater than 0.05 so variables are not significant but would be those both affect on response together.

Results for train model,

Accuracy : 0.7619   

95% CI : (0.5283, 0.9178)

Sensitivity : 1.0000   

Specificity: 0.0000

Results for the test model:

Accuracy : 0.625
95% CI : (0.2449, 0.9148)

Sensitivity : 1.000
Specificity: 0.000

The training model performs well then the testing model because accuracy is high than the test model.

ROC curve:

ROC curve is the probability that how much area comes under the curve it is the plot between True positive rate and false positive rate.

for train model and test model specificity is 0 so not possible to draw curve.

>>>>>>>>>>>>>>> Best Luck >>>>>>>>>>>>>>


Related Solutions

Pelican Stores Pelican Stores, a division of National Clothing, is a chain of women’s apparel stores...
Pelican Stores Pelican Stores, a division of National Clothing, is a chain of women’s apparel stores operating throughout the country. The chain recently ran a promotion in which discount coupons were sent to customers of other National Clothing stores. data collected for a sample of 100 in-store credit card transactions at Pelican Stores during one day while the promotion was running are contained in the file named PelicanStores. Table 2.19 shows a portion of the data set. The Proprietary Card...
Pelican Stores, a division of National Clothing, is a chain of women’s apparel stores operating throughout...
Pelican Stores, a division of National Clothing, is a chain of women’s apparel stores operating throughout the country. The chain recently ran a promotion in which discount coupons were sent to customers of other National Clothing stores. Data collected for a sample of 100 in-store credit card transactions at Pelican Stores during one day while the promotion was running are contained in the file named PelicanStores. The Proprietary Card method of payment refers to charges made using a National Clothing...
Eagle Outfitters is a chain of stores specializing in outdoor apparel and camping gear. They are...
Eagle Outfitters is a chain of stores specializing in outdoor apparel and camping gear. They are considering a promotion that involves mailing discount coupons to all their credit card customers. This promotion will be considered a success if more than 10% of those receiving the coupons use them. Before going national with the promotion, coupons were sent to a sample of 100 credit card customers. Click on the datafile logo to reference the data. Excel or Minitab users: The data...
The Apparel Company makes expensive polo-style men's and women's short-sleeve knit shirts at its plant in...
The Apparel Company makes expensive polo-style men's and women's short-sleeve knit shirts at its plant in Jamaica. The production process requires that material be cut into large patterned squares by operators, which are then sewn together at another stage of the process. If the squares are not of a correct length, the final shirt will be either too large or too small. In order to monitor the cutting process, management takes a sample of 4 squares of cloth every other...
1. NuDayWear Inc. is a nationwide retail chain specializing in women's apparel. The company's most popular...
1. NuDayWear Inc. is a nationwide retail chain specializing in women's apparel. The company's most popular lines are West and SeaWear. West offers executive wear for women in the middle to high-end markets, and SeaWear features sportswear and stylish clothes, also targeted at women in the middle to high-end markets. The company has 95 million shares outstanding with a current market price of $10.55 per share. The company is expected to show net income of $41 million in the next...
The Cotton Mill is an upscale chain of women's clothing stores, located in the southwestern United...
The Cotton Mill is an upscale chain of women's clothing stores, located in the southwestern United States. Do to recent success, The Cotton Mill's top management is planning to expand by locating new stores in other regions of the country. The director of planning has been asked to study the relationship between yearly sales and the store size. As part of the study, the director selects a sample of 25 stores and determines the size of the store in square...
4. The Apparel Company makes expensive polo-style men's and women's short-sleeve knit shirts at its plant...
4. The Apparel Company makes expensive polo-style men's and women's short-sleeve knit shirts at its plant in Jamaica. The production process requires that material be cut into large patterned squares by operators, which are then sewn together at another stage of the process. If the squares are not of a correct length, the final shirt will be either too large or too small. In order to monitor the cutting process, management takes a sample of 4 squares of cloth every...
A national chain of clothing goods stores recently sent shipments to the following stores. The number...
A national chain of clothing goods stores recently sent shipments to the following stores. The number of items shipped to each store and their total wholesale cost is shown in the table below. Find the wholesale price of one of each item. (A) cost of a pair of jeans is? (B) cost of a jacket is? (C) cost of a sweater is? (D) cost of a shirt is? JEANS JACKETS SWEATERS SHIRTS TOTAL COST Store A 4000 3000 2100 4300...
A national chain of clothing goods stores recently sent shipments to the following stores. The number...
A national chain of clothing goods stores recently sent shipments to the following stores. The number of items shipped to each store and their total wholesale cost is shown in the table below. Find the wholesale price of one of each item. JEANS JACKETS SWEATERS SHIRTS TOTAL COST 4000 4000 2300 4100 592,525 3700 3500 2200 4300 547,825 5000 2000 1500 6500 499,625 6000 1700 700 7000 485,550
The Walk Rite Shoe Company operates a chain of shoe stores. The stores sell ten different...
The Walk Rite Shoe Company operates a chain of shoe stores. The stores sell ten different styles of inexpensive​ men's shoes with identical unit costs and selling prices. A unit is defined as a pair of shoes. Each store has a store manager who is paid a fixed salary. Individual salespeople receive a fixed salary and a sales commission. Walk Rite is trying to determine the desirability of opening another​ store, which is expected to have the following revenue and...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT