SEMINAR
DEPARTMENT OF COMPUTER ENGINEERING
ABSTRACT
Mustafa Uysal
Hewlett-Packard Laboratories
March 3, 2000
ACTIVE DISKS: PROGRAMMING MODEL, ALGORITHMS AND EVALUATION
Growth and usage trends for several data-intensive applications indicate that their datasets and processing requirements are growing very quickly. Meeting these rapidly increasing storage and processing demands requires architectures that scale processing power with storage capacity and throughput. Active Disks integrate significant processing power and memory into a disk drive and allow application-specific code to be downloaded and executed in the disk. Active Disks naturally scale the processing power of the system with increasing storage capacity, offer a dramatic reduction in data traffic, and improve price/performance for large installations by off-loading computation to inexpensive embedded disk processors. To program Active Disks, I will describe a stream-based programming model that promotes the safety and efficiency of the disk-resident code (called disklets) while simplifying the programming and restructuring of applications. I will also describe operating system support (called DiskOS) for the stream-based programming model, which manages, coordinates and controls disklet execution in conjunction with the host operating system. I will demonstrate the potential of Active Disks for a wide range of data-intensive applications drawn from data warehousing, data mining, satellite data processing and digital microscopy. Results show that for the same I/O interconnect, disks and number of processors, Active Disks provide better price/performance than both SMP-based conventional disk farms and commodity clusters. Active Disks outperform SMP-based disk farms by up to an order of magnitude in price as well as in performance, and match (and in some cases improve upon) the performance of commodity clusters for less than half the price.
The seminar will be held on March 3, 2000, at 14:00 in EA331.