The state of the art in distributed query processing
- 1 December 2000
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Computing Surveys
- Vol. 32 (4), 422-469
- https://doi.org/10.1145/371578.371598
Abstract
Distributed data processing is becoming a reality. Businesses want to do it for many reasons, and they often must do it in order to stay competitive. While much of the infrastructure for distributed data processing is already there (e.g., modern network technology), a number of issues make distributed data processing still a complex undertaking: (1) distributed systems can become very large, involving thousands of heterogeneous sites including PCs and mainframe server machines; (2) the state of a distributed system changes rapidly because the load of sites varies over time and new sites are added to the system; (3) legacy systems need to be integrated: such legacy systems usually have not been designed for distributed data processing and now need to interact with other (modern) systems in a distributed environment. This paper presents the state of the art of query processing for distributed database and information systems. The paper presents the “textbook” architecture for distributed query processing and a series of techniques that are particularly useful for distributed database systems. These techniques include special join techniques, techniques to exploit intraquery parallelism, techniques to reduce communication costs, and techniques to exploit caching and replication of data. Furthermore, the paper discusses different kinds of distributed systems such as client-server, middleware (multitier), and heterogeneous database systems, and shows how query processing works in these systems.