Question

In: Math

Research the role of ETL tools in providing clean and purposely transformed data as part of...

Research the role of ETL tools in providing clean and purposely transformed data as part of data mining processes. Then explain the role of ETL in data mining and statistical analysis.

Solutions

Expert Solution

1.

To Clean data:-

To ensure data quality in data warehouse which can be done using data unification rules like:-

  • Making unique identifiers. Example:- sex categories Male/Female/Unknown can be given as M/F/null, Man/Woman/Not Available.
  • To Convert null values into standardized Not Available/Not Provided values.
  • To Convert phone numbers and ZIP codes into a standardized form.
  • To validate address fields and to convert them into proper naming.
  • To Validate address fields against each other Example:-State/Country or City/State or City/ZIP code.

To Transform data:-

Here set of rules are applied to transform data from source to target.

  • converting any measured data to same dimension using same units in order to join them later.
  • To join data from several sources, generating surrogate keys, generating aggregates, sorting, and to apply advanced validation rules.

2.

ETL stands for extract, transform, load. These three functions are combined into one tool in order to pull data from one database and transfer it to another database.

ETL tools helps in bringing data from diverse sources to gather them in a single, accessible structure. Then load that data into data marts or data warehouse. Data mining tools include techniques such as neural networks, advanced statistics in order to locate patterns within data and to develop hypotheses from them.


Related Solutions

What are the tools used in providing expert judgement?
What are the tools used in providing expert judgement?
What are the research data collecting tools that can be used to assess the knowledge of...
What are the research data collecting tools that can be used to assess the knowledge of students toward research?
What are EAI, EII, and ETL and how are each used to support data integration applications?...
What are EAI, EII, and ETL and how are each used to support data integration applications? Your answer should include a specific example of an application for each (not a tool/software example, but an example of how they are used). ***Limit your post to 300 words or less
Discuss the financial-crisis management tools and strategies of the US government. How have those been transformed...
Discuss the financial-crisis management tools and strategies of the US government. How have those been transformed by Dodd-Frank?
Market research tools can be used to collect risk related information and data ready for analysis....
Market research tools can be used to collect risk related information and data ready for analysis. Explain what they are and how they work. (Approx. 200 words
Part 1. Discuss the differences and similarities between Operations Research and Data Science. Part 2. What...
Part 1. Discuss the differences and similarities between Operations Research and Data Science. Part 2. What role does optimization play in Operations Research?
What is the role of the nurse in providing input for the design of this healthcare...
What is the role of the nurse in providing input for the design of this healthcare program/
The key part of Griffith's experiment was showing that.... Select one: a. Bacteria could become transformed...
The key part of Griffith's experiment was showing that.... Select one: a. Bacteria could become transformed with new abilities because of infection by a virus (phage) which brought in DNA from another organism b. Bacteria could acquire antibiotic resistance genes from dead bacteria c. Bacteria could pick up genes for producing toxins from dead bacteria d. Bacteria that couldn't form a capsule could get the ability to form one from dead bacteria e. Bacteria could acquire antibiotic resistance genes from...
On August 1, 2013, Bee Clean entered its second year of operations, providing cleaning services to...
On August 1, 2013, Bee Clean entered its second year of operations, providing cleaning services to community centres and sports, fitness and recreation arenas as well as doing small repairs (such as to ice). On July 31, 2014, Bee Cummins, the owner, finalized the company’s records, which showed the following items.   Accounts payable $ 11,000 Office equipment $ 20,800      Accounts receivable 58,000 Prepaid rent 5,600      Bee Cummins, capital, Rent expense 22,000         July 31, 2013* 80,900 Repair revenue...
How do cells get transformed with the SV40 DNA virus. What is role of extracts where...
How do cells get transformed with the SV40 DNA virus. What is role of extracts where antibodies are made in and the T antigen in this process.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT