Improving the ETL process through declarative transformation languages

Authors

  • Sarbaree Mishra Program Manager at Molina Healthcare Inc., USA Author
  • Sairamesh Konidala Vice President, JP Morgan & Chase, USA Author
  • Jeevan Manda Project Manager, Metanoia Solutions Inc, USA Author

Keywords:

ETL, data processing

Abstract

In the ever-evolving data management landscape, the Extract, Transform, Load (ETL) process ensures that organizations can efficiently manage and utilize their data. However, traditional ETL processes often suffer inefficiencies and complexities hindering data integration and quality. This project explores using declarative transformation languages to enhance the ETL process. By focusing on the "what" rather than the "how," declarative languages simplify data transformation tasks, making them more intuitive and easier to manage. These languages allow data engineers to express complex transformation logic succinctly, reducing the likelihood of errors and improving maintainability. Moreover, declarative transformation languages facilitate a more agile approach to ETL by abstracting the underlying implementation details, enabling organizations to adapt quickly to changing data requirements. This research will analyze various declarative languages and their impact on the ETL process, showcasing case studies demonstrating their effectiveness in real-world applications. The findings provide insights into best practices for leveraging declarative transformation languages to streamline ETL workflows, enhance data quality, and support better organizational decision-making. By adopting these innovative approaches, businesses can improve the efficiency of their ETL processes and gain a competitive edge in an increasingly data-driven world. Through this exploration, we aim to highlight the significant potential that declarative transformation languages hold in transforming the future of ETL, making data integration more straightforward and effective for organizations of all sizes.

Downloads

Download data is not yet available.

References

Raminhos, R. F., & Moura-Pires, J. (2007, June). Extraction and transformation of data from semi-structured text files using a declarative approach. In Ninth International Conference on Enterprise Information Systems, Madeira, Portugal.

Theodorou, V., Abelló, A., Thiele, M., & Lehner, W. (2014, November). A framework for user-centered declarative etl. In Proceedings of the 17th international workshop on data warehousing and OLAP (pp. 67-70).

Jörg, T., & Deßloch, S. (2008, September). Towards generating ETL processes for incremental loading. In Proceedings of the 2008 international symposium on Database engineering & applications (pp. 101-110).

Bansal, S. K. (2014, June). Towards a semantic extract-transform-load (ETL) framework for big data integration. In 2014 IEEE International Congress on Big Data (pp. 522-529). IEEE.

El-Sappagh, S. H. A., Hendawi, A. M. A., & El Bastawissy, A. H. (2011). A proposed model for data warehouse ETL processes. Journal of King Saud University-Computer and Information Sciences, 23(2), 91-104.

Vassiliadis, P., & Simitsis, A. (2009). Extraction, Transformation, and Loading. Encyclopedia of Database Systems, 10, 14.

Deufemia, V., Giordano, M., Polese, G., & Tortora, G. (2014). A visual language‐based system for extraction–transformation–loading development. Software: Practice and Experience, 44(12), 1417-1440.

Vassiliadis, P., Simitsis, A., Georgantas, P., Terrovitis, M., & Skiadopoulos, S. (2005). A generic

and customizable framework for the design of ETL scenarios. Information Systems, 30(7), 492-525.

Chakraborty, J., Padki, A., & Bansal, S. K. (2017, January). Semantic etl—State-of-the-art and open 16.research challenges. In 2017 IEEE 11th International Conference on Semantic Computing (ICSC) (pp. 413-418). IEEE.

Sellis, T. K., & Simitsis, A. (2007, September). Etl workflows: From formal specification to optimization.In East European Conference on Advances in Databases and Information Systems (pp. 1-11). Berlin, Heidelberg: Springer Berlin Heidelberg.

Vassiliadis, P., Vagena, Z., Skiadopoulos, S., Karayannidis, N., & Sellis, T. (2001). ARKTOS: towards the modeling, design, control and execution

of ETL processes. Information Systems, 26(8), 537-561.

Samimi-Dehkordi, L., Khalilian, A., & Zamani, B. (2016). Applying Programming Language EvaluationCriteria for Model Transformation Languages. International Journal of Software & Informatics, 10(4).

Schubert, L. (2010). An evaluation of model transformation languages for uml quality engineering (Doctoral dissertation, Master’s thesis, Masterarbeit im Studiengang Angewandte Informatik am Institute für Informatik, ZFI-MSC-2010-01, ISSN 1612-6793, Zentrum für Informatik, Georg-August-Universität Göttingen).

Albrecht, A., & Naumann, F. (2009, August). METL: Managing and Integrating ETL Processes. In VLDB PhD workshop.

dos Santos, V. N. C. (2015). A Relational Algebra Approach to ETL Modeling (Doctoral dissertation, Universidade do Minho (Portugal)).

Gade, K. R. (2017). Integrations: ETL vs. ELT: Comparative analysis and best practices. Innovative Computer Sciences Journal, 3(1).

Gade, K. R. (2017). Integrations: ETL/ELT, Data Integration Challenges, Integration Patterns. Innovative Computer Sciences Journal, 3(1).

Komandla, V. Transforming Financial Interactions: Best Practices for Mobile Banking App Design and Functionality to Boost User Engagement and Satisfaction.

Gade, K. R. (2018). Real-Time Analytics: Challenges and Opportunities. Innovative Computer Sciences Journal, 4(1).

Downloads

Published

17-06-2019

How to Cite

[1]
Sarbaree Mishra, Sairamesh Konidala, and Jeevan Manda, “Improving the ETL process through declarative transformation languages”, Distrib Learn Broad Appl Sci Res, vol. 5, Jun. 2019, Accessed: Dec. 27, 2024. [Online]. Available: https://dlabi.org/index.php/journal/article/view/242

Most read articles by the same author(s)

Similar Articles

121-130 of 178

You may also start an advanced similarity search for this article.