Parallel Input/Output with Heterogeneous Disks.
Szu-Wen Kuo, Marianne Winslett, Ying Chen, Yong Cho, Mahesh Subramaniam, Kent E. Seamons:
Parallel Input/Output with Heterogeneous Disks.
SSDBM 1997: 79-90@inproceedings{DBLP:conf/ssdbm/KuoWCCSS97,
author = {Szu-Wen Kuo and
Marianne Winslett and
Ying Chen and
Yong Cho and
Mahesh Subramaniam and
Kent E. Seamons},
editor = {Yannis E. Ioannidis and
David M. Hansen},
title = {Parallel Input/Output with Heterogeneous Disks},
booktitle = {Ninth International Conference on Scientific and Statistical
Database Management, Proceedings, August 11-13, 1997, Olympia,
Washington, USA},
publisher = {IEEE Computer Society},
year = {1997},
isbn = {0-8186-7952-2},
pages = {79-90},
ee = {db/conf/ssdbm/KuoWCCSS97.html},
crossref = {DBLP:conf/ssdbm/97},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
Abstract
Panda is a high-performance library for accessing large multidimensional array data on secondary storage of parallel platforms and networks of workstations. When using
Panda as the I/O component of a scientific application, H3expresso, on the IBM SP2 at Cornell Theory Center, we found that some nodes are more powerful with respect to
I/O than others, requiring the introduction of load balancing techniques to maintain high performance. We expect that heterogeneity will also be a big issue for DBMSs or
parallel I/O libraries designed for scientific applications running on networks of workstations, and the methods of allocating data to servers in these environments will need to
be upgraded to take heterogeneity into account, while still allowing users to exert control over data layout.
We propose such an approach to load balancing, under which we respect the user's choice of high-level disk layout, but introduce automatic subchunking. The use of
subchunks allows us to divide the very large chunks typically specified by the user's disk layout into more manageable-size units that can be allocated to I/O nodes in a
manner that fairly distributes the load. We also present two techniques for allocating subchunks to nodes, static and dynamic, and evaluate their performance on the SP2.
Copyright © 1997 by The Institute of
Electrical and Electronic Engineers, Inc. (IEEE).
Abstract used with permission.
CDROM Version: Load the CDROM "Volume 2 Issue 5, SSDBM, DBPL, KRDB, ADBIS, COOPIS, SIGBDP" and ...
DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...
Citation Page
Printed Edition
Yannis E. Ioannidis, David M. Hansen (Eds.):
Ninth International Conference on Scientific and Statistical Database Management, Proceedings, August 11-13, 1997, Olympia, Washington, USA.
IEEE Computer Society 1997, ISBN 0-8186-7952-2
Contents
References
- [1]
- ...
- [2]
- Jean L. Bell:
A Specialized Data Management System For Parallel Execution of Particle Physics Codes.
SIGMOD Conference 1988: 277-285
- [3]
- Rajesh Bordawekar, Alok N. Choudhary, Ken Kennedy, Charles Koelbel, Michael H. Paleczny:
A Model and Compilation Strategy for Out-of-Core Data Parallel Programs.
PPOPP 1995: 1-10
- [4]
- Clemens H. Cap, Volker Strumpen:
Efficient Parallel Computing in Distributed Workstation Environments.
Parallel Computing 19(11): 1221-1234(1993)
- [5]
- Ying Chen, Marianne Winslett, Kent E. Seamons, Szu-Wen Kuo, Yong Cho, Mahesh Subramaniam:
Scalable Message Passing in Panda.
IOPADS 1996: 109-121
- [6]
- Ying Chen, Jarek Nieplocha, Ian T. Foster, Marianne Winslett:
Optimizing Collective I/O Performance on Parallel Computers: A Multisystem Study.
International Conference on Supercomputing 1997: 28-35
- [7]
- ...
- [8]
- David J. DeWitt, Navin Kabra, Jun Luo, Jignesh M. Patel, Jie-Bing Yu:
Client-Server Paradise.
VLDB 1994: 558-569
- [9]
- Derek L. Eager, Edward D. Lazowska, John Zahorjan:
Adaptive Load Sharing in Homogeneous Distributed Systems.
IEEE Trans. Software Eng. 12(5): 662-675(1986)
- [10]
- ...
- [11]
- John F. Karpovich, James C. French, Andrew S. Grimshaw:
High Performance Access to Radio Astronomy Data: A Case Study.
SSDBM 1994: 240-249
- [12]
- ...
- [13]
- ...
- [14]
- Leonid Libkin, Rona Machlin, Limsoon Wong:
A Query Language for Multidimensional Arrays: Design, Implementation, and Optimization Techniques.
SIGMOD Conference 1996: 228-239
- [15]
- ...
- [16]
- Sunita Sarawagi, Michael Stonebraker:
Efficient Organization of Large Multidimensional Arrays.
ICDE 1994: 328-336
- [17]
- Kent E. Seamons, Marianne Winslett:
Physical Schemas for Large Multidimensional Arrays in Scientific Computing Applications.
SSDBM 1994: 218-227
- [18]
- Kent E. Seamons, Ying Chen, Marianne Winslett, Yong Cho, Szu-Wen Kuo, Mahesh Subramaniam:
Persistent Array Access Using Server-Directed I/O.
SSDBM 1996: 98-107
- [19]
- ...
- [20]
- Michael Stonebraker, James Frew, Kenn Gardels, Jeff Meredith:
The Sequoia 2000 Benchmark.
SIGMOD Conference 1993: 2-11
Copyright © Sun Mar 14 23:27:18 2010
by Michael Ley (ley@uni-trier.de)