Valerio Formicola, Saurabh Jha, Daniel Chen, Fei Deng, Amanda Bonnie, Mike Mason, Jim Brandt, Ann Gentile, Larry Kaplan, Jason Repik, Jeremy Enos, Mike Showerman, Annette Greiner, Zbigniew Kalbarczyk, Ravishankar K. Iyer, William Kramer (2017): Understanding Fault Scenarios and Impacts through Fault Injection Experiments in Cielo
, presented at CUG 2017, Redmond, Washington, U.S.A.
Saurabh Jha, Valerio Formicola, Catello Di Martino, Mark Dalton, William T. Kramer, Zbigniew Kalbarczyk, and Ravishankar K. Iyer (2017): Resiliency of HPC Interconnects: A Case Study of Interconnect Failures and Recovery in Blue Waters
, IEEE Transactions on Dependable and Secure Computing, Institute of Electrical and Electronics Engineers (IEEE), Num 99, pp1--1
Saurabh Jha, Jim Brandt, Ann Gentile, Zbigniew Kalbarczyk, Greg Bauer, Jeremy Enos, Michael Showerman, Larry Kaplan, Brett Bode, Annette Greiner, Amanda Bonnie, Mike Mason, William Kramer and Ravishankar Iyer (2017): Holistic Measurement Driven System Assessment
, IEEE, 2017 IEEE International Conference on Cluster Computing (CLUSTER), pp797-800, Honolulu, Hawai'i, U.S.A.
Phuong Cao, Eric C. Badger, Zbigniew T. Kalbarczyk, and Ravishankar K. Iyer (2016): A Framework for Generation, Replay, and Analysis of Real-World Attack Variants
, Association for Computing Machinery (ACM), Proceedings of the Symposium and Bootcamp on the Science of Security - HotSos '16, pp28--37, Pittsburgh, Pennsylvania, U.S.A.
Subho S. Banerjee, Arjun P. Athreya, Liudmila S. Mainzer, C. Victor Jongeneel, Wen-Mei Hwu, Zbigniew T. Kalbarczyk, and Ravishankar K. Iyer (2016): Efficient and Scalable Workflows for Genomic Analyses
, Association for Computing Machinery (ACM), Proceedings of the ACM International Workshop on Data-Intensive Distributed Computing (DIDC '16), pp27--36, Kyoto, Japan
Catello Di Martino, William Kramer, Zbigniew Kalbarczyk, and Ravishankar Iyer (2015): Measuring and Understanding Extreme-Scale Application Resilience: A Field Study of 5,000,000 HPC Application Runs
, IEEE, 2015 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, pp25-36, Rio de Janeiro, Brazil
Catello Di Martino, Saurabh Jha, William Kramer, Zbigniew Kalbarczyk, and Ravishankar K. Iyer (2015): LogDiver: A Tool for Measuring Resilience of Extreme-Scale Systems and Applications
, Association for Computing Machinery (ACM), Proceedings of the 5th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS '15), pp11--18, Portland, Oregon, U.S.A.
Catello Di Martino, Zbigniew Kalbarczyk, Ravishankar K. Iyer, Fabio Baccanico, Joseph Fullop, and William Kramer (2014): Lessons Learned from the Analysis of System Failures at Petascale: The Case of Blue Waters
, IEEE, 2014 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, pp610-621, Atlanta, Georgia, U.S.A.
Mar 7, 2017
Twenty-six research teams at the University of Illinois at Urbana-Champaign have been allocated computation time on the National Center for Supercomputing Application's (NCSA) sustained-petascale Blue Waters supercomputer after applying in Fall 2016. These allocations range from 25,000 to 600,000 node-hours of compute time over a time span of either six months or one year. The research pursuits of these teams are incredibly diverse, ranging anywhere from physics to political science.Sources: