Please use this identifier to cite or link to this item: https://rfos.fon.bg.ac.rs/handle/123456789/1816
Full metadata record
DC FieldValueLanguage
dc.creatorIgić, Nemanja
dc.creatorTerzić, Branko
dc.creatorMatić, Milan
dc.creatorIvančević, Vladimir
dc.creatorLuković, Ivan
dc.date.accessioned2023-05-12T11:15:35Z-
dc.date.available2023-05-12T11:15:35Z-
dc.date.issued2018
dc.identifier.issn2190-3018
dc.identifier.urihttps://rfos.fon.bg.ac.rs/handle/123456789/1816-
dc.description.abstractThe Dermatology Clinic at the Clinical Center of Vojvodina, Novi Sad, Serbia, has actively collected data regarding patients' treatment, health insurance and examinations. These data were stored in documents in the comma-separated values (CSV) format. Since many fields in these documents were presented as free form text or allow null values, there are many data records that are inconsistent with the real-world system. Currently, there is a large need for an analytic system that can analyze these data and find relevant patterns. Since such an analytic system would require clean and accurate data, there is a need to assess data quality. Therefore, a data quality system should be designed and built with a goal of identifying inaccurate records so that they can be aligned with the real-world state. In our approach to data quality assessment, the domain knowledge about data is used to define rules which are then used to evaluate the quality of the data. In this paper, we present the architecture of a data quality system that is used to define and apply these rules. The rules are first defined by a domain expert and then applied to data in order to determine the number of records that do not match the defined rules and identify the exact anomalies in the given records. Also, we present a case study in which we applied this data quality system to the data collected by the Dermatology Clinic.en
dc.publisherSpringer-Verlag Berlin, Berlin
dc.relationinfo:eu-repo/grantAgreement/MESTD/Integrated and Interdisciplinary Research (IIR or III)/44010/RS//
dc.rightsrestrictedAccess
dc.sourceIntelligent Decision Technologies 2017, KES-IDT 2017, Pt II
dc.subjectDomain knowledge applicationen
dc.subjectDermatologyen
dc.subjectData quality assessmenten
dc.titleApplying Domain Knowledge for Data Quality Assessment in Dermatologyen
dc.typeconferenceObject
dc.rights.licenseARR
dc.citation.epage156
dc.citation.other73: 147-156
dc.citation.spage147
dc.citation.volume73
dc.identifier.doi10.1007/978-3-319-59424-8_14
dc.identifier.rcubconv_2038
dc.identifier.scopus2-s2.0-85020449131
dc.identifier.wos000432720100014
dc.type.versionpublishedVersion
item.cerifentitytypePublications-
item.fulltextWith Fulltext-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.grantfulltextrestricted-
item.openairetypeconferenceObject-
Appears in Collections:Radovi istraživača / Researchers’ publications
Files in This Item:
File Description SizeFormat 
1812.pdf
  Restricted Access
364.32 kBAdobe PDFView/Open    Request a copy
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.