SC Conference - Activity Details

Using Server-to-Server Communication in Parallel File Systems to Simplify Consistency and Improve Performance

Philip H. Carns  (Argonne National Laboratory)
Bradley W. Settlemyer  (Clemson University)
Walter B. Ligon  (Clemson University)
Papers Session
I/O and File Systems
Tuesday,  11:00AM - 11:30AM
Room Ballroom G
The trend in parallel computing toward clusters running thousands of cooperating processes per application has led to an I/O bottleneck that has only gotten more severe as the CPU density of clusters has increased. Current parallel file systems provide large amounts of aggregate I/O bandwidth; however, they do not achieve the high degrees of metadata scalability required to manage files distributed across hundreds or thousands of storage nodes. In this paper we examine the use of collective communication between the storage servers to improve the scalability of file metadata operations. In particular, we apply server-to-server communication to simplify consistency checking and improve the performance of file creation, file removal, and file stat. Our results indicate that collective communication is an effective scheme for simplifying consistency checks and significantly improving the performance for several real metadata intensive workloads.
The full paper can be found in the IEEE Xplore Digital Library and ACM Digital Library
   IEEE Computer Society  /  ACM     2 0   Y E A R S   -   U N L E A S H I N G   T H E   P O W E R   O F   H P C