In: Statistics and Probability
Assignment: Data Quality
Directions: Complete the table below by defining each data characteristic and providing an example of each quality characteristic as it relates to coding.
Data Characteristic | Definition | Coding Example |
Accuracy | ||
Accessibility | ||
Comprehensiveness | ||
Consistency | ||
Currency | ||
Definition | ||
Granularity | ||
Precision | ||
Relevancy | ||
Timeliness |
Accuracy: In term of model accuracy means its ratio of the truly predicted value by the model to the total values in the model. like model predicted 80 values correctly out of 100 then model accuracy is 80%.
Consistency: consistency of a data in rough word stands for how consistent statistic is where you change the data but the statistics are not going to change much or you can say that this statistics is good for this data.
Precision: Precision refers to the closeness of two or more measurements to each other. Using the example above, if you weigh a given substance five times, and get 3.2 kg each time, then your measurement is very precise. Precision is independent of accuracy. You can be very precise but inaccurate.
Definition: A definition is a statement of the meaning of a term. how you will define yourself or how you can explain yourself in a precise manner.
Relevancy: sufficiency to infer the conclusion(meaning) A data is a relevance if you have sufficient evidence to say about in general terms.
sorry, I don't know all the terms but I tried my level best to define what I know! sorry :(
thanks