Data quality - a characteristic showing the degree to which the data is suitable for use. [2] [3]
A concept may also refer to the state of a set of values ​​of qualitative or quantitative variables. There are many definitions of data quality, but data is generally considered high quality if it is “suitable for the intended use in operations, decision making and planning”. [4] According to another approach, data is considered high-quality if it correctly represents the events or objects of the real world to which this data relates. [five]
In addition to these definitions, as the amount of data increases, the question of the consistency of internal data becomes important, regardless of its suitability for use for any particular external purpose. People’s opinions about data quality can often be dissenting, even when they discuss the same set of data used for the same purpose. [6]
Notes
- ↑ Lewoniewski, Włodzimierz. Measures for Quality Assessment of Articles and Infoboxes in Multilingual Wikipedia . - 2019 .-- Vol. 339. - P. 619-633. - ISBN 978-3-030-04849-5 . - DOI : 10.1007 / 978-3-030-04849-5_53 .
- ↑ Beyond accuracy: What data quality means to data consumers
- ↑ Data quality
- ↑ Data Driven: Profiting from Your Most Important Business Asset
- ↑ Extending the ER Model to Represent Data Quality Requirements
- ↑ Lewoniewski, Włodzimierz; Węcel, Krzysztof. Relative Quality Assessment of Wikipedia Articles in Different Languages ​​Using Synthetic Measure // Lecture Notes in Business Information Processing: journal. - 2017 .-- Vol. 303 . - P. 282-292 . - ISBN 978-3-319-69022-3 . - DOI : 10.1007 / 978-3-319-69023-0_24 .