Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Code Generation for Accelerating Data Flow: Enhancing Pentaho Data Integration Performance
Umeå University, Faculty of Science and Technology, Department of Physics.
2023 (English)Independent thesis Advanced level (professional degree), 20 credits / 30 HE creditsStudent thesisAlternative title
Kodgenerering för snabbare dataflöden : Prestandaförbättring av Pentaho Data Integrations (Swedish)
Abstract [en]

Pentaho Data Integration, called Kettle, is an ETL tool that functions as a no-code program. The tool, implemented in Java, enables users to create data flow structures via a graphical user interface and store them as XML files, which can be edited or executed. In some applications, the current execution method does not provide satisfactory performance. To speed up execution times, we propose a Java code generator that works by analyzing the existing XML setup and Kettle’s existing source code.We also conduct some exploratory work with Apache Hop, another Kettle-based ETL tool, and provide comparative insights.Our analysis demonstrates the potential for significant speed improvements, with times reduced by 60% or even more. We consider this method’s challenges and limitations and propose solutions to overcome them. Overall, our research contributes to the field of no-code programming by highlighting the potential for using code generation to optimize performance in data engineering processes.

Place, publisher, year, edition, pages
2023. , p. 29
National Category
Computer Systems Communication Systems Other Engineering and Technologies
Identifiers
URN: urn:nbn:se:umu:diva-210335OAI: oai:DiVA.org:umu-210335DiVA, id: diva2:1771458
External cooperation
Omicron Nord AB
Subject / course
Examensarbete i teknisk fysik
Educational program
Master of Science Programme in Engineering Physics
Presentation
2023-06-09, NAT.D.300, Umeå Universitet, Umeå, 15:47 (Swedish)
Supervisors
Examiners
Available from: 2023-06-21 Created: 2023-06-20 Last updated: 2025-02-10Bibliographically approved

Open Access in DiVA

fulltext(427 kB)267 downloads
File information
File name FULLTEXT01.pdfFile size 427 kBChecksum SHA-512
3b23a8d2e1d3d28205215476d38ff0401b5ba961f6fec2b339bd94e17849be51f54e48102220bfa5311a0651cd29b3f50897ef55f8a4083fab456e8b6ef99349
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Svensson, Alexander
By organisation
Department of Physics
Computer SystemsCommunication SystemsOther Engineering and Technologies

Search outside of DiVA

GoogleGoogle Scholar
Total: 267 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 548 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf