William T. Kramer

University of Illinois at Urbana-Champaign

Operations Research and Production Systems

RIKEN

(baif)
Jan 2017 - Jun 2018

Swift Workflow Support on Blue Waters

(bahk)
Oct 2016 - Oct 2017

Blue Waters Workload Analysis

(bafq)
Jun 2016 - Feb 2017

Joint Lab - INRIA

(jnn)
Mar 2016 - Sep 2017

Blue Waters Job Accounting data sets

(gkf)
Aug 2015 - Mar 2019

Blue Waters Darshan data sets

(gjn)
Jun 2015 - Apr 2018

Blue Waters System Performance data sets

(gjo)
Jun 2015 - Apr 2018

Blue Waters Darshan data sets

(ahr)
Jun 2015 - Apr 2018

Blue Waters System Performance data sets

(ahs)
Jun 2015 - Apr 2018

Log Analysis

(joa)
Apr 2015 - Dec 2015

BSC - JLESC

(jrj)
Jul 2014 - Dec 2014

HDF5

(jr7)
Jun 2014 - Jun 2015

SPIN (Students Pushing Innovation)

(jpu)
Nov 2013 - Nov 2014

GPU Supercomputing on Blue Waters

(jpt)
Nov 2013 - Dec 2013

IBM HPSS Staff

(jpj)
Oct 2013 - Apr 2018

Sandia National Labs (OVIS)

(jp7)
Aug 2013 - Sep 2016

Mellanox

(jp3)
Jun 2013 - Apr 2018

Blue Waters Reliability Study

(jox)
May 2013 - Sep 2013

Cray Scale Test Time

(joj)
May 2013 - Dec 2018

LSST Scaling Study

(joe)
Apr 2013 - Dec 2013

Adaptive Computing

(jod)
Apr 2013 - Apr 2018

WRF

(fyw)
Apr 2013 - Aug 2013

Allinea

(jnq)
Apr 2013 - Apr 2018

SPP Benchmarks

(jnd)
Apr 2013 - Apr 2018

Staff

(fyy)
Apr 2013 - Apr 2018

Bright Computing

(jnr)
Apr 2013 - Apr 2018

MILC

(fyr)
Apr 2013 - Aug 2013

PARATEC

(fyx)
Apr 2013 - Aug 2013

Xyratex

(jnp)
Apr 2013 - Apr 2018

System Console

(fyq)
Apr 2013 - Apr 2018

UTK

(jne)
Apr 2013 - May 2016

OLCF

(jny)
Apr 2013 - Oct 2013

Application Performance Consistency

(jnx)
Apr 2013 - Apr 2018

NCAR

(jnz)
Apr 2013 - Apr 2014

General System

(jnc)
Apr 2013 - Apr 2018

Computational Libraries

(fyj)
Apr 2013 - Apr 2018

NVIDIA

(jnb)
Apr 2013 - Apr 2018

Workflow

(fyl)
Apr 2013 - Apr 2018

Virtualization

(fyk)
Apr 2013 - Aug 2013

Storage

(fys)
Apr 2013 - Apr 2014

IDE

(fym)
Apr 2013 - Apr 2018

CRAY

(jme)
Apr 2013 - Apr 2018

DNS3D

(fyv)
Apr 2013 - Aug 2013

NAMD

(fyu)
Apr 2013 - Aug 2013

Programming Models

(fyi)
Apr 2013 - Apr 2018

2017

Valerio Formicola, Saurabh Jha, Daniel Chen, Fei Deng, Amanda Bonnie, Mike Mason, Jim Brandt, Ann Gentile, Larry Kaplan, Jason Repik, Jeremy Enos, Mike Showerman, Annette Greiner, Zbigniew Kalbarczyk, Ravishankar K. Iyer, William Kramer (2017): Understanding Fault Scenarios and Impacts through Fault Injection Experiments in Cielo, presented at CUG 2017, Redmond, Washington, U.S.A.
Saurabh Jha, Valerio Formicola, Catello Di Martino, Mark Dalton, William T. Kramer, Zbigniew Kalbarczyk, and Ravishankar K. Iyer (2017): Resiliency of HPC Interconnects: A Case Study of Interconnect Failures and Recovery in Blue Waters, IEEE Transactions on Dependable and Secure Computing, Institute of Electrical and Electronics Engineers (IEEE), Num 99, pp1--1
Saurabh Jha, Jim Brandt, Ann Gentile, Zbigniew Kalbarczyk, Greg Bauer, Jeremy Enos, Michael Showerman, Larry Kaplan, Brett Bode, Annette Greiner, Amanda Bonnie, Mike Mason, William Kramer and Ravishankar Iyer (2017): Holistic Measurement Driven System Assessment, IEEE, 2017 IEEE International Conference on Cluster Computing (CLUSTER), pp797-800, Honolulu, Hawai'i, U.S.A.

2016

Saurabh Jha, Valerio Formicola, Catello Di Martino, Zbigniew Kalbarczyk, William T. Kramer, Ravishankar K. Iyer (2016): Analysis of Gemini Interconnect Recovery Mechanisms: Methods and Observations, presented at CUG 2016, London, England, U.K.

2015

M. Gajbe, K. Chadalavada, G. Bauer, W. Kramer (2015): Benchmarking and Performance Studies of Mapreduce, Hadoop Framework on Blue Waters Supercomputer, presented at WorldComp 2015 - ABDA '15 International Conference on Advances in Big Data Analytics, Las Vegas, Nevada, U.S.A.
Celso L. Mendes, Brett Bode, Gregory H. Bauer, Jeremy Enos, Cristina Beldica, and William T. Kramer (2015): Deployment and Testing of the Sustained Petascale Blue Waters System, Journal of Computational Science, Elsevier BV, Vol 10, pp327--337
Catello Di Martino, William Kramer, Zbigniew Kalbarczyk, and Ravishankar Iyer (2015): Measuring and Understanding Extreme-Scale Application Resilience: A Field Study of 5,000,000 HPC Application Runs, IEEE, 2015 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, pp25-36, Rio de Janeiro, Brazil
Catello Di Martino, Saurabh Jha, William Kramer, Zbigniew Kalbarczyk, and Ravishankar K. Iyer (2015): LogDiver: A Tool for Measuring Resilience of Extreme-Scale Systems and Applications, Association for Computing Machinery (ACM), Proceedings of the 5th Workshop on Fault Tolerance for HPC at eXtreme Scale - FTXS '15, pp11--18, Portland, Oregon, U.S.A.

2014

Bauer, Gregory and Mendes, Celso and Kramer, William and Fiedler, Robert (2014): Expanding Blue Waters with Improved Acceleration Capability, presented at CUG 2014, Lugano, Switzerland
Kramer, William, Butler, Michelle, Bauer, Gregory, Chadalavada, Kalyana and Mendes, Celso (2014): National Center for Supercomputing Applications, Chapman and Hall/CRC, High Performance Parallel I/O, pp17-31
Catello Di Martino, Zbigniew Kalbarczyk, Ravishankar K. Iyer, Fabio Baccanico, Joseph Fullop, and William Kramer (2014): Lessons Learned from the Analysis of System Failures at Petascale: The Case of Blue Waters, IEEE, 2014 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, pp610-621, Atlanta, Georgia, U.S.A.
Celso L. Mendes, Brett Bode, Gregory H. Bauer, Jeremy Enos, Cristina Beldica, and William T. Kramer (2014): Deploying a Large Petascale System: The Blue Waters Experience, Procedia Computer Science (14th Annual International conference on Computational Science, ICCS 2014), Elsevier BV, Vol 29, pp198--209, Cairns, Australia
Anisimov, Victor M. and Bauer, Gregory H. and Chadalavada, Kalyana and Olson, Ryan M. and Glenski, Joseph W. and Kramer, William T. C. and Aprà, Edoardo and Kowalski, Karol (2014): Optimization of the Coupled Cluster Implementation in NWChem on Petascale Parallel Architectures, Journal of Chemical Theory and Computation, American Chemical Society (ACS), Vol 10, Num 10, pp4307--4316
Cappello, Franck and Al, Geist and Gropp, William and Kale, Sanjay and Kramer, Bill and Snir, Marc (2014): Toward Exascale Resilience: 2014 Update, Supercomputing Frontiers and Innovations, FSAEIHE South Ural State University (National Research University), Vol 1, Num 1, pp5--28

2013

Celso L. Mendes, Brett Bode, Gregory H. Bauer, Joseph R. Muggli, Cristina Beldica and William T. Kramer (2013): Blue Waters Acceptance: Challenges and Accomplishments, presented at CUG 2013, Napa, California, U.S.A.
Ana Gainaru, Franck Cappello, Marc Snir, and William Kramer (2013): Failure Prediction for HPC Systems and Applications, The International Journal of High Performance Computing Applications, SAGE Publications, Vol 27, Num 3, pp273--282

2012

Joseph Muggli, Brett Bode, Torsten Hoefler, William Kramer and Celso L. Mendes (2012): Blue Waters Testing Environment, presented at CUG 2012, Stuttgart, Germany
G. H. Bauer, T. Hoefler, W. T. Kramer, R. A. Fiedler (2012): Analyses and Modeling of Applications Used to Demonstrate Sustained Petascale Performance on Blue Waters, presented at CUG 2012, Stuttgart, Germany
Gainaru, Ana and Cappello, Franck and Snir, Marc and Kramer, William (2012): Fault Prediction Under the Microscope: A Closer Look into HPC Systems, IEEE Computer Society Press, Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC '12), pp77:1-77:11, Salt Lake City, Utah, U.S.A.
W. Kramer (2012): Top500 versus sustained performance - the top problems with the TOP500 list - and what to do about them, IEEE, 2012 21st International Conference on Parallel Architectures and Compilation Techniques (PACT), pp223-230, Minneapolis, Minnesota, U.S.A.
Ana Gainaru, Franck Cappello, and William Kramer (2012): Taming of the Shrew: Modeling the Normal and Faulty Behaviour of Large-Scale HPC Systems, IEEE, 2012 IEEE 26th International Parallel and Distributed Processing Symposium, pp1168-1179, Shanghai, China

2011

Ana Gainaru, Franck Cappello, Joshi Fullop, Stefan Trausan-Matu, and William Kramer (2011): Adaptive Event Prediction Strategy with Dynamic Time Window for Large-Scale HPC Systems, ACM Press, Managing Large-scale Systems via the Analysis of System Logs and the Application of Machine Learning Techniques (SLAML '11), pp4:1-4:8, Cascais, Portugal
William Kramer (2011): How to Measure Useful, Sustained Performance, ACM Press, State of the Practice Reports (SC '11), pp2:1-2:18, Seattle, Washington, U.S.A.
Eric Heien, Derrick Kondo, Ana Gainaru, Dan LaPine, Bill Kramer, and Franck Cappello (2011): Modeling and Tolerating Heterogeneous Failures in Large Parallel Systems, ACM Press, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC '11), pp45:1-45:11, Seattle, Washington, U.S.A.
Ana Gainaru, Franck Cappello, Stefan Trausan-Matu, and Bill Kramer (2011): Event Log Mining Tool for Large Scale HPC Systems, Springer Berlin Heidelberg, Euro-Par 2011 Parallel Processing, pp52--64, Bordeaux, France
Jack Dongarra, Pete Beckman, Terry Moore, Patrick Aerts, Giovanni Aloisio, Jean-Claude Andre, David Barkai, Jean-Yves Berthou, Taisuke Boku, Bertrand Braunschweig, Franck Cappello, Barbara Chapman, Xuebin Chi, Alok Choudhary, Sudip Dosanjh, Thom Dunning, Sandro Fiore, Al Geist, Bill Gropp, Robert Harrison, Mark Hereld, Michael Heroux, Adolfy Hoisie, Koh Hotta, Zhong Jin, Yutaka Ishikawa, Fred Johnson, Sanjay Kale, Richard Kenway, David Keyes, Bill Kramer, Jesus Labarta, Alain Lichnewsky, Thomas Lippert, Bob Lucas, Barney Maccabe, Satoshi Matsuoka, Paul Messina, Peter Michielse, Bernd Mohr, Matthias S. Mueller, Wolfgang E. Nagel, Hiroshi Nakashima, Michael E Papka, Dan Reed, Mitsuhisa Sato, Ed Seidel, John Shalf, David Skinner, Marc Snir, Thomas Sterling, Rick Stevens, Fred Streitz, Bob Sugar, Shinji Sumimoto, William Tang, John Taylor, Rajeev Thakur, Anne Trefethen, Mateo Valero, Aad van der Steen, Jeffrey Vetter, Peg Williams, Robert Wisniewski, and Kathy Yelick (2011): The International Exascale Software Project Roadmap, The International Journal of High Performance Computing Applications, SAGE Publications, Vol 25, Num 1, pp3--60

2009

Franck Cappello, Al Geist, Bill Gropp, Laxmikant Kale, Bill Kramer, and Marc Snir (2009): Toward Exascale Resilience, The International Journal of High Performance Computing Applications, SAGE Publications, Vol 23, Num 4, pp374--388