Award Finalist/Winner
Student Contribution

SC Conference - Activity Details

Feedback Controlled Resource Sharing for Predictable eScience

Sang-Min Park  (University of Virginia)
Marty Humphrey  (University of Virginia)
Papers Session
Grid Resource Management
Tuesday,  02:00PM - 02:30PM
Room Ballroom G
The emerging class of dynamic data driven applications are a significant problem for today’s HPC systems. In general, it is extremely difficult for queuing-system-controlled HPC resources to guarantee a tightly-bounded prediction regarding the time at which a newly-submitted application will execute. While a reservation-based approach partially addresses the problem, it can create severe resource underutilization. In contrast, this paper presents a fundamentally different approach to guarantee predictable execution. By creating a virtualized application layer called the performance container, and opportunistically multiplexing concurrent performance containers through the application of formal feedback control theory, we regulate the job’s progress such that the job meets its deadline without requiring exclusive access to resources even in the presence of unexpected disturbances. Our evaluation using two widely-used applications, WRF and BLAST, on an 8-core server show our approach is predictable and meets deadlines with 3.4 % of errors on average while achieving high overall utilization.
The full paper can be found in the IEEE Xplore Digital Library and ACM Digital Library
   IEEE Computer Society  /  ACM     2 0   Y E A R S   -   U N L E A S H I N G   T H E   P O W E R   O F   H P C