Physical and virtual partitioning in OLAP database clusters

  • C. Furtado UFRJ
  • A. A. B. Lima UFRJ
  • E. Pacitti Atlas Group, INRIA and LINA, University of Nantes
  • P. Valduriez Atlas Group, INRIA and LINA, University of Nantes
  • M. Mattoso UFRJ

Resumo


On-line analytical processing (OLAP) applications require high performance database support to achieve good response time (crucial for decision making). Database clusters provide a cost-effective alternative to parallel database systems. For OLAP applications, that typically use heavy weight queries, intra-query parallelism yields better performance as it reduces the execution time of individual queries. Intra-query parallelism is based on processing the same query on different subsets of the query table. Combining physical and virtual partitioning to define table subsets provides flexibility in intra-query parallelism while optimizing disk space usage and data availability. Experiments with our partitioning technique using TPC-H benchmark queries on a 32-dual node cluster gave linear and super-linear speedup, thereby reducing significantly the time of typical OLAP heavy weight queries.
Palavras-chave: Parallel processing, Database systems, Query processing, Performance analysis, Delay, Decision making, Availability, Hardware, Control systems, Middleware
Publicado
24/10/2005
FURTADO, C.; LIMA, A. A. B.; PACITTI, E.; VALDURIEZ, P.; MATTOSO, M.. Physical and virtual partitioning in OLAP database clusters. In: INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 17. , 2005, Rio de Janeiro/RJ. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2005 . p. 143-150.