Question

In: Computer Science

Compare and contrast SEMMA and CRISP-DM. Discuss when application of SEMMA or CRISP-DM might be most...

Compare and contrast SEMMA and CRISP-DM. Discuss when application of SEMMA or CRISP-DM might be most appropriate by providing a specific example related to each of the two processes.

Solutions

Expert Solution

Answer:-----------

SEMMA:----------------
SEMMA is the methodology for data mining processes proposed by the SAS Institute--one of the most important companies that develop statistical software applications--with the software package Enterprise Miner .
In SEMMA, SAS offers a data mining process that consists of five steps: sample, explore, modify, model, and assess. This methodology begins by analyzing a small portion of a large data set.
The next step is to explore the data and the information by looking for trends and anomalies in the data with the purpose of gaining some information about the data.
In the third phase, data is modified to create, select, and transform the variables for the study.
A valid model is then created using the software tools, which search automatically for combinations of rules and patterns that reliably predict the observed results.
Finally, the last step of the SEMMA methodology consists of evaluating the usefulness and reliability of the findings.

CRISP-DM:------------------
Another data mining methodology is CRISP-DM (cross-industry standard process for data mining).
CRISP-DM was originally conceived in late 1996, but it was not completed until 1999; it is intended to be industry-, tool-, and application-neutral.It was developed by a consortium of data mining vendors and companies through an effort funded by the European Commission.
The four partners of this project were NCR, Daimler Chrysler, OHRA, and Integral Solutions Limited (ISL), which became part of SPSS in 1998. The CRISP-DM 1.0 methodology comprises a hierarchical breakdown in which the data mining process is divided into four levels of 28 abstraction: phases, generic tasks, specialized tasks, and process instances.

CRIPS-DM 1.0 also recognizes four different dimensions of data mining context that drive the generic and specialized levels of the CRISP-DM.
The four dimensions are :
1) application domain
2) problem type
3) technical aspect,
4) tools and techniques.


Related Solutions

Discuss the theory behind the assertion that: "Selection is unimportant for most polymorphisms". Compare and contrast...
Discuss the theory behind the assertion that: "Selection is unimportant for most polymorphisms". Compare and contrast the selectionist and neutralist view of molecular evolution. Which view do you agree with and why?
Compare and contrast Web applications and native applications. When should a native application be selected over...
Compare and contrast Web applications and native applications. When should a native application be selected over a Web application and vice versa? What advantage does one have over the other? • What are the disadvantages or limitations of each? Provide URL of your sources.
Compare and contrast strategic planning and strategic management. Discuss what you think are the three most...
Compare and contrast strategic planning and strategic management. Discuss what you think are the three most significant reasons why organizations do not actively engage in strategic planning.
Discuss apoptosis and necrosis. Compare and contrast the two.
Discuss apoptosis and necrosis. Compare and contrast the two.
How might you compare and contrast the organization of the healthcare systems of the United States,...
How might you compare and contrast the organization of the healthcare systems of the United States, Germany, and the United Kingdom?
Compare and Contrast Mobile Device Management (MDM) and Mobile Application Management (MAM) tools.
Compare and Contrast Mobile Device Management (MDM) and Mobile Application Management (MAM) tools.
Compare and contrast the costs and benefits to a nation when it becomes part of an...
Compare and contrast the costs and benefits to a nation when it becomes part of an optimum currency area
Discuss ideas for when you might use calculations in tables, and when you might be better...
Discuss ideas for when you might use calculations in tables, and when you might be better off creating a document in Excel and pasting it into their Word document.
Compare and contrast the Classical Macroeconomic Model with the Keynesian Macroeconomic Model. a. When was the...
Compare and contrast the Classical Macroeconomic Model with the Keynesian Macroeconomic Model. a. When was the Classical Macroeconomic Model Developed? b. Why was the Classical Macroeconomic Model Developed? c. Can the Classical Model explain economic fluctuations why or why not? d. Can fiscal policy increase real economic output in the Classical Model why or why not? e. Can monetary policy increase real economic output in the Classical Model why or why not? f. What assumptions does the Classical Model make...
Compare and contrast emic and etic models of multiculturalism. Are there times when it is better...
Compare and contrast emic and etic models of multiculturalism. Are there times when it is better to have one perspective over the other? Defend your position.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT