Paper 2

Fast Loads and Queries

Authors: Goetz Graefe and Harumi Kuno

Volume 2 (2010)

Abstract

For efficient query processing, a relational table should be indexed in multiple ways; for efficient database loading, indexes should be omitted. This research introduces new techniques called zone filters, zone indexes, adaptive merging, and partition filters. The new data structures can be created as side effects of the load process, with all required analyses accomplished while a moderate amount of new data still remains in the buffer pool. Traditional sorting and indexing are not required. Nonetheless, query performance matches that of Netezza’s zone maps where those apply, exceeds it for the many predicates for which zone maps are ineffective, and can be comparable to query processing with traditional indexing, as demonstrated in our simulations.
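
For readers unfamiliar with the zone-map baseline the abstract compares against, the following is a minimal sketch of the general idea: minimum and maximum values are recorded per zone while data is loaded, and a range query skips zones whose value range cannot overlap the predicate. It illustrates zone maps only, not the zone filters, zone indexes, adaptive merging, or partition filters introduced in the paper. The names `Zone`, `load`, `range_query`, and `zone_size` are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass
class Zone:
    rows: List[int]  # values of one column for the rows stored in this zone
    lo: int          # smallest value in the zone
    hi: int          # largest value in the zone


def load(values: Iterable[int], zone_size: int) -> List[Zone]:
    """Append values in arrival order; record min/max per zone as a side effect.

    No sorting and no index build takes place: each zone is summarized
    while its rows are still in memory (an assumption of this sketch).
    """
    zones: List[Zone] = []
    batch: List[int] = []
    for v in values:
        batch.append(v)
        if len(batch) == zone_size:
            zones.append(Zone(batch, min(batch), max(batch)))
            batch = []
    if batch:  # final, partially filled zone
        zones.append(Zone(batch, min(batch), max(batch)))
    return zones


def range_query(zones: List[Zone], low: int, high: int) -> Tuple[List[int], int]:
    """Return the values in [low, high] and the number of zones scanned."""
    hits: List[int] = []
    scanned = 0
    for z in zones:
        # Skip any zone whose [lo, hi] range cannot overlap the predicate.
        if z.hi < low or z.lo > high:
            continue
        scanned += 1
        hits.extend(v for v in z.rows if low <= v <= high)
    return hits, scanned


if __name__ == "__main__":
    zones = load([1, 2, 3, 10, 11, 12, 20, 21, 22, 5], zone_size=3)
    print(range_query(zones, 10, 12))
```

In this sketch, skipping is effective only when a column's values are correlated with load order, so that each zone covers a narrow range. For a column without such correlation, most zones span much of the domain and few can be skipped, which is the kind of predicate for which the abstract describes zone maps as ineffective.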