Question

What is good data? What is meant by bad data? A term that you may have already encountered is "GIGO". This term refers to Garbage In, Garbage Out. In other words, if incorrect/bad data is entered into a database, the same useless data will be extracted. This results in poor decisions, lost revenue, and unhappy customers. Have you ever been the victim of bad data?

Discuss the importance of queries and good/bad data as they relate to database reports. Describe the impact on business of erroneous reports generated by bad data or faulty queries.

Solutions

Expert Solution

Good Data:

Good data is error-free, consistent, and well cleaned. When we collect data from various sources to use in a project or business model, those sources may be reliable or unreliable, complete or incomplete. Good data, then, is complete, consistent, reliable, and accurate, with null values handled properly.

Bad Data:

Bad data may contain:

  • Empty records
  • Inconsistent records
  • Typos
  • Errors
  • Redundant data and data that lacks integrity
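
The categories above can be checked for mechanically. What follows is only a minimal sketch, assuming a small set of hypothetical customer records: the field names, the list of valid country codes, and the very loose email check are all illustrative assumptions, not part of the original answer.

```python
from collections import Counter

# Hypothetical customer records; the fields and values are illustrative only.
records = [
    {"id": 1, "name": "Alice", "email": "alice@example.com", "country": "US"},
    {"id": 2, "name": "", "email": None, "country": ""},
    {"id": 3, "name": "Bob", "email": "bob@example", "country": "usa"},
    {"id": 1, "name": "Alice", "email": "alice@example.com", "country": "US"},
]

# Assumed reference list of accepted country codes.
VALID_COUNTRIES = {"US", "IN", "GB"}


def find_problems(rows):
    """Return (row_index, problem) pairs for the bad-data categories above."""
    problems = []
    id_counts = Counter(row["id"] for row in rows)
    for index, row in enumerate(rows):
        # Empty record: every field other than the id is missing or blank.
        values = [value for key, value in row.items() if key != "id"]
        if all(value in (None, "") for value in values):
            problems.append((index, "empty record"))
            continue
        # Inconsistent record: value does not follow the agreed coding.
        if row["country"] not in VALID_COUNTRIES:
            problems.append((index, "inconsistent country code"))
        # Typo or error: a deliberately loose email shape check.
        email = row["email"] or ""
        if "@" not in email or "." not in email.split("@")[-1]:
            problems.append((index, "malformed email"))
        # Redundant data: the same id appears more than once.
        if id_counts[row["id"]] > 1:
            problems.append((index, "duplicate id"))
    return problems


for index, problem in find_problems(records):
    print(f"row {index}: {problem}")
```

Real data quality tools apply far richer rules, but the idea is the same: each kind of bad data becomes an explicit check that runs before the data reaches a report.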

Today we often scrape data from websites and other sources. We might not be able to scrape what we wanted, or we may find that what we scraped or collected does not meet the standard our business model requires. This kind of data is called bad data. Data is also bought and sold, and fraudulent sellers may pose as genuine sources. Such sellers are another source of bad data.

Garbage In Garbage Out:

In Business Intelligence or Data Analysis, the first step is data preprocessing. The data we receive can be bad, and training models or performing analyses on bad data leads to inconsistent results that neither our organisation nor its stakeholders can trust. Bad data can cause losses running into millions or even billions of dollars. Regular surveys by Gartner put the cost of bad data at more than $10 million per enterprise, even after companies spend close to $200,000 annually on data quality tools. This gives an idea of how much bad data affects businesses and how harmful it is in the long run.

I have personally been a victim of bad data: I once wanted to analyse a dataset while training a machine learning model, and preprocessing the data took much longer than making the analysis decisions. So the most important step is cleaning the data. Tools built specifically for cleaning and preprocessing data are available today.
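
To make "garbage in, garbage out" concrete, here is a small sketch that computes the same report figure (an average order amount) twice: once from the raw values and once after a basic cleaning step. The amounts, the `clean_amounts` helper, and the plausibility limit of 1000 are all hypothetical assumptions chosen for illustration.

```python
from statistics import mean

# Hypothetical order amounts as they might arrive from a raw export:
# blanks, a placeholder string, and one implausibly large value.
raw_amounts = ["120.50", "", "95", "N/A", "9500", "95", "110.25"]


def naive_amounts(values):
    """Garbage in: treat anything unparseable as 0 and keep every value."""
    result = []
    for value in values:
        try:
            result.append(float(value))
        except ValueError:
            result.append(0.0)
    return result


def clean_amounts(values, upper_limit=1000.0):
    """Drop unparseable entries and values outside an assumed plausible range."""
    cleaned = []
    for value in values:
        try:
            amount = float(value)
        except ValueError:
            continue  # skip blanks and placeholders instead of inventing a 0
        if 0 < amount <= upper_limit:
            cleaned.append(amount)
    return cleaned


naive_average = mean(naive_amounts(raw_amounts))
clean_average = mean(clean_amounts(raw_amounts))
print(f"average from raw data:     {naive_average:.2f}")
print(f"average from cleaned data: {clean_average:.2f}")
```

With this sample the raw average is more than ten times the cleaned one, so any decision based on the first figure would rest on garbage. Whether a value such as 9500 is truly an error is a business judgement; the limit here only stands in for that rule.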

Importance of a good database:

Storing good data in the database, and retrieving and accessing it correctly, is crucial for any organization. Suppose Google misused its database of users: the company's reputation would suffer and it would incur huge losses. Writing correct queries and generating trustworthy, accurate reports from the database is vital in any modern organisation. Data is the heart of an organisation, and if the data is good, the organisation will go a long way.
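
A faulty query can produce a wrong report even when the stored data is perfectly good. The sketch below uses Python's built-in `sqlite3` module with two hypothetical tables, `orders` and `shipments`, invented for illustration. The faulty revenue query joins to `shipments`, so an order shipped in two parts is counted twice and an unshipped order is dropped.

```python
import sqlite3

# In-memory database with hypothetical tables; the data itself is correct.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL NOT NULL);
    CREATE TABLE shipments (id INTEGER PRIMARY KEY, order_id INTEGER NOT NULL);
    INSERT INTO orders VALUES (1, 100.0), (2, 50.0), (3, 25.0);
    -- Order 1 was shipped in two parts; order 3 has not shipped yet.
    INSERT INTO shipments VALUES (1, 1), (2, 1), (3, 2);
    """
)

# Faulty report: the join repeats order 1 and silently omits order 3.
faulty_total = conn.execute(
    "SELECT SUM(o.amount) FROM orders o JOIN shipments s ON s.order_id = o.id"
).fetchone()[0]

# Correct report: total revenue needs only the orders table.
correct_total = conn.execute("SELECT SUM(amount) FROM orders").fetchone()[0]

print(f"faulty revenue report:  {faulty_total}")
print(f"correct revenue report: {correct_total}")
conn.close()
```

Here the faulty query reports 250.0 against a true total of 175.0. A business acting on the inflated figure would overstate revenue, which is the kind of erroneous report the question asks about.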

