Pengimplementasian Fitur Data Profiling Pada Aplikasi Data Governance Berbasis Open Source Tools dengan Metode Itterative/Incremental


  • Ekky Chandra Wibowo * Mail Telkom University, Bandung, Indonesia
  • Tien Fabrianti Kusumasari Telkom University, Bandung, Indonesia
  • Ekky Novriza Alam Telkom University, Bandung, Indonesia
  • (*) Corresponding Author
Keywords: Data Governance; Data Quality; Data Quality Management; Profiling; Website

Abstract

Data is one element that is considered important in a business. If the processing is carried out correctly and as needed, the data has high quality. High data quality can have a positive impact, that is, it can create information and knowledge to support decision-making. In fact, many organizations accumulate data without a good data storage or management system and cause the data to become large so that it is difficult to understand and produce information in it that cannot be conveyed properly which makes a decision not have a maximum positive impact. Data Quality Management (DQM) is considered to be able to assist in maintaining quality in the data and ensure that no data is lost or damaged. Quality data is obtained by one of the stages in DQM, namely data profiling, which performs the process of checking the available and stored data to determine a content structure and data quality. The research focuses on implementing the DQM module data profiling feature on the Data Governance website application. This research produces data profiling features that can help people in data processing where the modules produced are value distribution, data completeness, clustering, value similarity and data deduplication.

Downloads

Download data is not yet available.

References

Dama International, DAMA-DMBOK2 (Data Management Body of knowledge), 2 ed. Basking Ridge, NJ 07920 USA: Technics Publication, 2017.

T. F. Kusumasari dan Fitria, “Data profiling for data quality improvement with OpenRefine,” 2016 Int. Conf. Inf. Technol. Syst. Innov. ICITSI 2016 - Proc., no. June, 2017, doi: 10.1109/ICITSI.2016.7858197.

Shelly Kramer, “The High Costs of Dirty Data - V3B: Marketing and Social Media Agency,” Mar 25, 2015. https://v3b.com/2015/03/the-high-costs-of-dirty-data/#ixzz3Y2t602Ex (diakses Nov 08, 2021).

M. R. Effendy, T. F. Kusumasari, dan M. A. Hasibuan, “Analysis and Design of Data Quality Monitoring Application using Open Source Tools: A Case Study at a Government Agency,” no. Iccetim 2019, hal. 214–219, 2020, doi: 10.5220/0009867402140219.

A. N. Laksono, T. F. Kusumasari, dan M. A. Hasibuan, “Implementation of Data Quality Management Application Architecture,” SCITEPRESS – Sci. Technol. Publ., no. Iccetim 2019, hal. 268–274, 2020, doi: 10.5220/0009868302680274.

H. A. Sulistyo, T. F. Kusumasari, dan E. N. Alam, “Implementation of Data Cleansing Pattern Module for Data Quality Management Application using Open Source Tools,” 2020 3rd Int. Conf. Comput. Informatics Eng. IC2IE 2020, hal. 7–12, 2020, doi: 10.1109/IC2IE50715.2020.9274628.

F. Ridzuan dan W. M. N. Wan Zainon, “A review on data cleansing methods for big data,” Procedia Comput. Sci., vol. 161, hal. 731–738, 2019, doi: 10.1016/j.procs.2019.11.177.

J. vom Brocke, A. Hevner, dan A. Maedche, “Introduction to Design Science Research,” https//www.researchgate.net/publication/345430098 Introd. to Des. Sci. Res. Chapter · Sept. 2020 DOI 10.1007/978-3-030-46781-4_1 CITATIONS, no. November, hal. 1–13, 2020, doi: 10.1007/978-3-030-46781-4_1.

A. M. Kuhn, Code Complete, vol. 47, no. 4. 2005.

A. Alshamrani dan A. Bahattab, “A Comparison Between Three SDLC Models Waterfall Model, Spiral Model, and Incremental/Iterative Model,” IJCSI Int. J. Comput. Sci. Issues, vol. 12, no. 1, hal. 106–111, 2015, [Daring]. Tersedia pada: https://www.academia.edu/10793943/A_Comparison_Between_Three_SDLC_Models_Waterfall_Model_Spiral_Model_and_Incremental_Iterative_Model.

S. Supriyono, “Software Testing with the approach of Blackbox Testing on the Academic Information System,” Int. J. Inf. Syst. Technol., vol. 3, no. 2, hal. 227–233, 2020.

A. Stojkov, L. Kazi, dan M. Bla, “Software Reengineering with Object-Oriented n-Tier Architecture : Case of Desktop-to-Web Transformation,” Int. Conf. Inf. Technol. Dev. Educ. – ITRO 2020, 2020.

S. K. Dewanti, T. F. Kusumasari, dan E. N. Alam, “ANALISIS DAN KLASIFIKASI PROSES DATA PROFILING PADA DATA QUALITY MANAGEMENT MENGGUNAKAN PENTAHO DATA INTEGRATION ANALYSIS AND CLASSIFICATION OF DATA PROFILING PROCESSES IN DATA QUALITY,” 2021.

D. A. Rahmani, T. F. Kusumasari, dan E. N. Alam, “Addition of Process Decomposition in Open Source Tools-Based Cleansing Data Modules,” ICODSA, hal. 129–134, 2021, doi: 10.1109/icodsa53588.2021.9617555.

R. A. Nugroho, T. F. Kusumasari, dan E. N. Alam, “INTEGRASI PAKET MODUL DATA CLEANSING PADA APLIKASI DATA QUALITY MANAGEMENT MENGGUNAKAN OPEN SOURCE TOOLS INTEGRATION OF DATA CLEANSING MODULE PACKAGES IN DATA QUALITY,” 2021.

S. Nidhra, “Black Box and White Box Testing Techniques - A Literature Review,” Int. J. Embed. Syst. Appl., vol. 2, no. 2, hal. 29–50, 2012, doi: 10.5121/ijesa.2012.2204.

K. F. Salmawati dan U. Telkom, ANALISIS PERFORMANSI MODUL PROFILING PADA APLIKASI DATA QUALITY MANAGEMENT BERBASIS OPEN SOURCE TUGAS AKHIR Tugas akhir disusun sebagai salah satu syarat untuk memperoleh gelar Sarjana dari Universitas Telkom ANALISIS PERFORMANSI MODUL PROFILING PADA APLI, vol. 1202160041. 2020.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Pengimplementasian Fitur Data Profiling Pada Aplikasi Data Governance Berbasis Open Source Tools dengan Metode Itterative/Incremental

Dimensions Badge
Article History
Submitted: 2022-09-29
Published: 2022-10-29
Abstract View: 531 times
PDF Download: 438 times
How to Cite
Wibowo, E., Kusumasari, T., & Alam, E. (2022). Pengimplementasian Fitur Data Profiling Pada Aplikasi Data Governance Berbasis Open Source Tools dengan Metode Itterative/Incremental. Journal of Information System Research (JOSH), 4(1), 117-126. https://doi.org/10.47065/josh.v4i1.2315
Section
Articles