Question

IN 200 WORDS OR MORE

Analyze the steps of knowledge discovery in data processing. Give an example.

Solutions

Expert Solution

Knowledge discovery in databases (KDD) is the process of discovering useful knowledge from a collection of data. Data mining is the pattern-finding step within this wider process, which also includes selecting and preparing data, cleansing it, incorporating prior knowledge about the data sets, and interpreting the observed results accurately. Major KDD application areas include marketing, fraud detection, telecommunications and manufacturing.

Traditionally, data mining and knowledge discovery were performed manually. As time passed, the amount of data in many systems grew beyond terabyte size and could no longer be analyzed manually. Moreover, discovering underlying patterns in data is considered essential to the success of any business. As a result, several software tools were developed to uncover hidden patterns and form hypotheses about the data, drawing on techniques from artificial intelligence.

KDD has developed rapidly in recent years. It now encompasses many different approaches to discovery, including inductive learning, Bayesian statistics, semantic query optimization, knowledge acquisition for expert systems and information theory. The ultimate goal is to extract high-level knowledge from low-level data.

KDD is a multidisciplinary activity. It encompasses data storage and access, scaling algorithms to massive data sets and interpreting results. The data cleansing and data access processes included in data warehousing facilitate the KDD process. Artificial intelligence also supports KDD by discovering empirical laws from experimentation and observations. The patterns recognized in the data must be valid on new data and hold with some degree of certainty. Such patterns are considered new knowledge. The steps involved in the entire KDD process are:

1) Identify the goal of the KDD process from the customer’s perspective.

2) Understand the application domains involved and the knowledge that is required.

3) Select a target data set or subset of data samples on which discovery is to be performed.

4) Cleanse and preprocess the data by deciding on strategies to handle missing fields and transforming the data as required.

5) Simplify the data sets by removing unwanted variables. Then, analyze useful features that can be used to represent the data, depending on the goal or task.

6) Match KDD goals with data mining methods to suggest hidden patterns.

7) Choose data mining algorithms to discover hidden patterns. This process includes deciding which models and parameters might be appropriate for the overall KDD process.

8) Search for patterns of interest in a particular representational form, such as classification rules or trees, regression and clustering.

9) Interpret essential knowledge from the mined patterns.

10) Use the knowledge and incorporate it into another system for further action.

11) Document the findings and prepare reports for interested parties.
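
As an example, steps 4 to 9 above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the customer records, the field names (spend, churned), the holdout set and the helper functions are all hypothetical, and the "mining" step is a simple one-variable threshold search standing in for a real data mining algorithm. The goal assumed for step 1 is to find out which customers are likely to stop buying.

```python
from statistics import mean

# Hypothetical customer records; every name and value here is made up.
RAW = [
    {"id": 1, "region": "N", "spend": 20.0, "churned": True},
    {"id": 2, "region": "S", "spend": 25.0, "churned": True},
    {"id": 3, "region": "N", "spend": None, "churned": True},
    {"id": 4, "region": "E", "spend": 80.0, "churned": False},
    {"id": 5, "region": "S", "spend": 90.0, "churned": False},
    {"id": 6, "region": "W", "spend": 30.0, "churned": True},
    {"id": 7, "region": "E", "spend": 85.0, "churned": None},
    {"id": 8, "region": "N", "spend": 70.0, "churned": False},
]
# Hypothetical new data used to check that the pattern still holds.
HOLDOUT = [
    {"spend": 22.0, "churned": True},
    {"spend": 75.0, "churned": False},
    {"spend": 40.0, "churned": True},
    {"spend": 65.0, "churned": True},
]


def clean(records):
    """Step 4: drop rows with no label, fill missing spend with the mean."""
    labeled = [r for r in records if r["churned"] is not None]
    fill = mean(r["spend"] for r in labeled if r["spend"] is not None)
    return [{**r, "spend": fill if r["spend"] is None else r["spend"]}
            for r in labeled]


def reduce_features(records):
    """Step 5: keep only the variables relevant to the goal."""
    return [(r["spend"], r["churned"]) for r in records]


def accuracy(threshold, pairs):
    """Share of rows where 'spend < threshold' matches the churn label."""
    hits = sum((spend < threshold) == churned for spend, churned in pairs)
    return hits / len(pairs)


def mine_threshold(pairs):
    """Steps 7-8: try midpoints between sorted spend values, keep the best."""
    values = sorted({spend for spend, _ in pairs})
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])]
    return max(candidates, key=lambda t: accuracy(t, pairs))


train = reduce_features(clean(RAW))
threshold = mine_threshold(train)
holdout = reduce_features(HOLDOUT)

# Step 9: interpret the mined pattern and check it on new data.
print(f"Rule: spend < {threshold} -> likely to churn")
print(f"Training accuracy: {accuracy(threshold, train):.2f}")
print(f"Holdout accuracy: {accuracy(threshold, holdout):.2f}")
```

In this toy data the rule fits the cleaned records perfectly but is right for only three of the four holdout customers, which illustrates why a discovered pattern must be checked on new data and reported with its degree of certainty before it is used (steps 10 and 11).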

