Skip to Content

Blue Waters Virtual Training - Managing HPC Systems and Centers - July 14-16, 2020

The Blue Waters project at the University of Illinois is offering a three-day virtual training event on Managing HPC Systems and Centers via three sessions of approximately two hours each on July 14-16, 2020.

This training is focused on sharing the lessons learned and best practices from the staff who are managing and operating the Blue Waters Cray system. The sessions are for staff that are managing and operating Cray and HPC systems and related centers.

Participation is by invitation.

Sessions begin daily at 8:30 AM Central / 9:30 AM Eastern.

The topics to be covered each day are listed in the following table. Slides will be posted before the event and video recordings will be posted after the event.

Day Topic Presenter(s) Slides Video
July 14 NCSA-NGA Collaboration
Bill Kramer
July 14
Blue Waters System Overview
Brett Bode
July 14 Job Scheduling
David King
July 14 Containers
Mark Dalton and Maxim Belkin
July 14 System wide & Job level Diagnostic and Performance Monitoring
Mike Showerman, Jeremy Enos, and Greg Bauer
July 14 External services and integration
Brett Bode
July 14 Resiliency
Brett Bode
July 15 Operational processes
Jeremy Enos
July 15 Benchmarking and performance testing
Bill Kramer, Greg Bauer, and Aaron Saxton
July 15 Open science security
Alex Withers and Jeremy Enos
July 15 Filesystem and Data methods
Chris Heller and Justin Davis
July 15 Lustre
Chris Heller and Justin Davis
July 16 Preemptive disk failing and metrics
Brett Bode
July 16 Acceptance and regression testing
Celso Mendes
July 16 Risk management and risk register
Celso Mendes
July 16 User portal, communications, and documentation
Greg Bauer
July 16 Language support Python
Roland Haas and Jeremy Enos
July 16 Community Software Support
Brett Bode
July 16 How Blue Waters supports assistance and consulting, etc.
Greg Bauer
July 16 Wrap-up
Scott Lathrop