Grouping and joining transformations in the data extraction process

Marcin Gorawski, Paweł Marks

Abstract


In this paper we present a method of describing ETL processes (Extraction, Transformation and Loading) using graphs. We focus on implementation aspects such as division of a whole process into threads, communication and data exchange between threads, deadlock prevention. Methods of processing of large data sets using insufficient memory resources are also presented upon examples of joining and grouping nodes. Our solution is compared to the efficiency of the OS-level virtual memory in a few tests. Their results are presented and discussed.

Full Text:

PDF


DOI: http://dx.doi.org/10.17951/ai.2006.4.1.136-147
Date of publication: 2006-01-01 00:00:00
Date of submission: 2016-04-27 10:15:04


Statistics


Total abstract view - 399
Downloads (from 2020-06-17) - PDF - 0

Indicators



Refbacks

  • There are currently no refbacks.


Copyright (c) 2015 Annales UMCS Sectio AI Informatica

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.