Question

In: Operations Management


The Data Mining Process: Manual extraction of patterns from data has occurred for centuries. Early methods of identifying patterns and trends in data include Bayes' theorem (1700s) and regression analysis (1800s). The proliferation, ubiquity, and increasing power of computer technology have dramatically increased data collection, storage, and manipulation capabilities.

As data sets have grown and increased in complexity forming “Big Data” farms and structured Data Warehouses, "hands-on" data analysis has increasingly been enhanced with automated data processing and aided by other discoveries in computer science, such as neural networks, cluster analysis, genetic algorithms (circa 1950s), decision trees and decision rules (circa 1960s), and support vector machines (circa 1990s).

Data mining is the process of applying these methods with the intention of uncovering hidden patterns and trends within large data warehouses. It bridges the gap from applied statistics to artificial intelligence (AI) by exploiting the way data is stored and indexed in databases to execute the learning and discovery algorithms more efficiently, which allows such methods to be applied to even larger data sets.

Discussion Topic #1:

Data Mining

Research the latest Privacy Issues with Data Mining and determine whether they are substantiated.

Also, research the most common mistakes and myths surrounding data mining.

Solutions

Expert Solution

Privacy Issues with Data Mining:

1. Data mining can sometimes violate the privacy of users by gathering online as well as offline information to build a digital profile of a user.

2. Businesses such as insurers use data mining to obtain customer information faster. Knowing the customer better helps them sell their products.

3. Data mining also serves as a marketing tool that gathers information virtually, at no overhead cost to the company. A recent survey conducted by Georgetown University reported that 92.8% of websites collect visitors' personal data.

Mistakes and Myths Surrounding Data Mining:

1. Analysts often use the analysis techniques to ask obvious questions instead of unusual ones.

2. Analysts sometimes overreact to results when follow-up studies are required to evaluate and analyze the information gathered.

3. Small samples can produce misleading results, which lead to inaccurate conclusions.

4. Data mining is assumed to be based on highly developed algorithms, whereas in reality only 10% of the data mining process involves new and improved algorithms; the rest relates to tasks such as setting business goals.
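The small-sample mistake (item 3) can be illustrated with a short simulation. This is only a sketch under stated assumptions: the 10% "true" rate, the sample sizes of 20 and 2000, and the function name estimate_spread are all hypothetical choices made for illustration and do not come from the question or any data set. It repeatedly draws samples from a population with a known rate and measures how much the estimated rate varies from sample to sample.

```python
import random
import statistics


def estimate_spread(true_rate, sample_size, trials=500, seed=0):
    """Return the standard deviation of the estimated rate across repeated samples.

    true_rate, sample_size, and trials are illustrative assumptions,
    not values taken from any real data set.
    """
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    estimates = []
    for _ in range(trials):
        # Draw one sample and count how many records show the pattern.
        hits = sum(1 for _ in range(sample_size) if rng.random() < true_rate)
        estimates.append(hits / sample_size)
    return statistics.pstdev(estimates)


# Same underlying rate, two different sample sizes.
small = estimate_spread(true_rate=0.10, sample_size=20)
large = estimate_spread(true_rate=0.10, sample_size=2000)

print(f"spread of estimates with 20 records:   {small:.4f}")
print(f"spread of estimates with 2000 records: {large:.4f}")
```

The estimates from 20-record samples scatter far more widely around the true rate than those from 2000-record samples, which is why a pattern "found" in a small sample may simply be noise.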

