Question

In: Statistics and Probability

I need practical advice on choosing the right data mining method for this problem. I am not sure which model is best.

Choose a data mining method to automate the appraisal. Use your model to make predictions on the testing dataset. Evaluate the predictive accuracy of your model using root mean square error (rmse) and correlation (cor) as metrics.
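The two metrics named in the task can be computed directly from the actual and predicted prices. The following is a minimal sketch in Python, assuming both are plain lists of numbers of equal length. The function names simply follow the task wording, "correlation" is taken to mean the Pearson coefficient (an assumption), and the sample prices are hypothetical.

```python
import math


def rmse(actual, predicted):
    """Root mean square error between two equal-length sequences."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def cor(actual, predicted):
    """Pearson correlation between two equal-length sequences.

    Raises ZeroDivisionError if either sequence is constant.
    """
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    var_a = sum((a - mean_a) ** 2 for a in actual)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_a * var_p)


# Hypothetical prices in US dollars, for illustration only.
actual = [500.0, 1500.0, 4000.0, 9000.0]
predicted = [600.0, 1400.0, 4200.0, 8700.0]
print("rmse:", rmse(actual, predicted))
print("cor:", cor(actual, predicted))
```

A lower rmse means smaller typical dollar errors, while a cor close to 1 means the predictions rank and scale with the true prices. The two can disagree, so it is worth reporting both.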
You are hired as a data analyst for Diamonds Inc., a company that appraises diamonds. The company would like to automate appraisals for future shipments, and you are responsible for developing the automation method. For reference, you are given a dataset containing 48,547 diamonds with the following characteristics (variables):

price: price in US dollars (\$326--\$18,823)

carat: weight of the diamond (0.2--5.01)

cut: quality of the cut (Fair, Good, Very Good, Premium, Ideal)

color: diamond color, from J (worst) to D (best)

clarity: a measurement of how clear the diamond is (I1 (worst), SI2, SI1, VS2, VS1, VVS2, VVS1, IF (best))

x: length in mm (0--10.74)

y: width in mm (0--58.9)

z: depth in mm (0--31.8)

depth: total depth percentage = z / mean(x, y) = 2 * z / (x + y) (43--79)

table: width of top of diamond relative to widest point (43--95)
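Because price is a continuous value, this is a regression problem rather than a classification one, which is also what the rmse and cor metrics imply. One possible starting point is sketched below in Python, under stated assumptions: cut, color and clarity are ordered, so they are encoded as ranks; rows with a 0 mm dimension are treated as invalid; and a simple log-log line of price against carat serves as a baseline. The function names (`is_valid`, `encode`, `split`, `fit_baseline`, `predict`) and the sample records are hypothetical, and the baseline is chosen for simplicity, not because the assignment prescribes it.

```python
import math
import random

# Ordered levels copied from the variable list (worst to best).
CUT = ["Fair", "Good", "Very Good", "Premium", "Ideal"]
COLOR = ["J", "I", "H", "G", "F", "E", "D"]
CLARITY = ["I1", "SI2", "SI1", "VS2", "VS1", "VVS2", "VVS1", "IF"]


def is_valid(row):
    """Reject rows with a 0 mm dimension, which cannot describe a real stone."""
    return min(row["x"], row["y"], row["z"]) > 0


def encode(row):
    """Map one record (a dict) to numeric features; ordered categories become ranks."""
    return [
        math.log(row["carat"]),
        CUT.index(row["cut"]),
        COLOR.index(row["color"]),
        CLARITY.index(row["clarity"]),
    ]


def split(rows, test_fraction=0.2, seed=0):
    """Shuffle a copy of the rows and hold out a fraction for testing."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def fit_baseline(train):
    """Least-squares line for log(price) against log(carat) only."""
    xs = [math.log(r["carat"]) for r in train]
    ys = [math.log(r["price"]) for r in train]
    mean_x, mean_y = sum(xs) / len(xs), sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope  # (intercept, slope)


def predict(model, row):
    """Predicted price in dollars for one record."""
    intercept, slope = model
    return math.exp(intercept + slope * math.log(row["carat"]))


# Hypothetical records for illustration only; real rows come from the dataset.
rows = [
    {"carat": 0.3, "cut": "Ideal", "color": "E", "clarity": "SI1", "x": 4.3, "y": 4.3, "z": 2.7, "price": 500},
    {"carat": 0.5, "cut": "Good", "color": "G", "clarity": "VS2", "x": 5.1, "y": 5.1, "z": 3.2, "price": 1400},
    {"carat": 0.7, "cut": "Premium", "color": "F", "clarity": "VS1", "x": 5.7, "y": 5.7, "z": 3.5, "price": 2800},
    {"carat": 1.0, "cut": "Very Good", "color": "H", "clarity": "SI2", "x": 6.4, "y": 6.4, "z": 4.0, "price": 4500},
    {"carat": 1.5, "cut": "Fair", "color": "J", "clarity": "I1", "x": 7.3, "y": 7.3, "z": 4.6, "price": 6000},
    {"carat": 1.2, "cut": "Ideal", "color": "D", "clarity": "IF", "x": 0.0, "y": 6.8, "z": 4.2, "price": 9000},
]

clean = [r for r in rows if is_valid(r)]  # drops the record with x = 0
train, test = split(clean)
model = fit_baseline(train)
for r in test:
    print(encode(r), round(predict(model, r)), "actual:", r["price"])
```

The predictions on the held-out rows can then be scored with rmse and cor as the task requires. If the baseline is too coarse, richer regression methods such as multiple linear regression, regression trees, or random forests can take the full output of `encode` as input and be compared on the same test set.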
