Data preprocessing is an important step in obtaining quality input data. The four methods discussed are data cleaning, data integration, data transformation, and data reduction.
a. Explain each of these methods in 2 or 3 paragraphs. (16 points)
b. Suppose we were to develop software to implement each of these techniques. Discuss how easy or difficult it would be to develop software for each technique. Give this some thought and write a well-reasoned answer. Imagine that you had to write such software: how easy or hard would it be? (7 points)
c. After the software has been developed, we are going to run the programs. For each of the programs implementing the four techniques, explain how much CPU processing time is required. I am not looking for an actual number. Rather, I am looking for your analysis of how much CPU processing is needed for each technique. (7 points)
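
For anyone approaching parts b and c, it may help to see the four methods as concrete steps. The following is only a minimal sketch in Python using the standard library. The records, the field names (`id`, `amount`, `region`), and the specific choices (mean imputation, a key-based join, min-max scaling, per-region totals) are all assumptions made for illustration, not something given in the question.

```python
from statistics import mean

# Hypothetical toy records (assumed for illustration); None marks a missing value.
sales = [
    {"id": 1, "amount": 100.0},
    {"id": 2, "amount": None},
    {"id": 3, "amount": 300.0},
    {"id": 3, "amount": 300.0},  # exact duplicate of the previous row
]
# Hypothetical second data source, keyed by the same id.
regions = {1: "north", 2: "south", 3: "north"}


def clean(rows):
    """Data cleaning: drop exact duplicates, fill missing amounts with the mean."""
    seen, unique = set(), []
    for row in rows:
        key = (row["id"], row["amount"])
        if key not in seen:
            seen.add(key)
            unique.append(dict(row))
    known = [r["amount"] for r in unique if r["amount"] is not None]
    fill = mean(known)
    for r in unique:
        if r["amount"] is None:
            r["amount"] = fill
    return unique


def integrate(rows, region_by_id):
    """Data integration: join a second source on the shared id key."""
    return [{**r, "region": region_by_id.get(r["id"])} for r in rows]


def transform(rows):
    """Data transformation: min-max normalize amount into [0, 1]."""
    amounts = [r["amount"] for r in rows]
    lo, hi = min(amounts), max(amounts)
    span = (hi - lo) or 1.0  # avoid division by zero when all values are equal
    return [{**r, "amount_scaled": (r["amount"] - lo) / span} for r in rows]


def reduce_rows(rows):
    """Data reduction: aggregate to one total amount per region."""
    totals = {}
    for r in rows:
        totals[r["region"]] = totals.get(r["region"], 0.0) + r["amount"]
    return totals


cleaned = clean(sales)
integrated = integrate(cleaned, regions)
transformed = transform(integrated)
summary = reduce_rows(transformed)
print(summary)
```

Each function here makes a simple pass over a handful of rows. Real implementations of the same four steps can look quite different, so treat this as a starting point for the discussion rather than an answer to it.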