USRC staff and research areas can be found here.
USRC LANL summer intern Alexandra DeLucia, who is working with Lissa Baseman on machine learning and anomaly detection, was featured on the Rollins College Instagram. See more here.
USRC’s Dr. Qiang Guan attended the SIAM 2017 Annual Meeting in Pittsburgh, PA, where he presented on the FSEFI fault injector. Qiang spoke at the Resilient Computation in Large Scale Scientific Computing – Part I of II session.
The talk was entitled DECAF-FSEFI: A Fine-grained, Accountable, Flexible, and Efficient Soft Error Injection Framework for Profiling Application Vulnerability (PFSEFI 2.0) (slides).
USRC’s Lissa Baseman, Dr. Li Tan, and Olena Tkachenko are at The 47th IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2017) where they are presenting on resilience and machine learning research at USRC.
Lissa Baseman presented the paper Automating DRAM Fault Mitigation By Learning From Experience (slides) in Industry Track III: Dependability Data and Security. USRC intern Olena Tkachenko provided much of the analysis for this work, and the paper is a collaboration with AMD and Sandia National Laboratories.
Dr. Tan presented at RADIANCE (the International Workshop on Recent Advances in the DependabIlity AssessmeNt of Complex systEms). His presentation was entitled RSVP: Soft Error Resilient Power Savings at Near-Threshold Voltage using Register Vulnerability (slides) and was co-authored by other USRC members.
USRC’s Dr. Laura Monroe attended The Platform for Advanced Scientific Computing (PASC) Conference where she presented at the session on Recent Advances in Resilience for Extreme Scale Computing. Dr. Monroe’s talk was entitled Inexact Computing and the Interface with Resilience.
This week is The 26th ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC). Opening the conference this week was the 7th Fault Tolerance for HPC at eXtreme Scale (FTXS) Workshop. FTXS is a workshop co-created by USRC’s Dr. Nathan DeBardeleben and has been run by Nathan ever since.
In the International Workshop on Runtime and Operating Systems for Supercomputers (ROSS 2017) on Tuesday, the paper UNITY: Unified Memory and File Space will be presented. This work includes contributions by USRC’s Mike Lang, Latchesar Ionkov, and Doug Otstott.
USRC’s Dr. Qiang Guan and Dr. Nathan DeBardeleben have a paper in the main conference (19% acceptance rate) primarily authored by USRC alumnus Bo Fang, entitled LetGo: A Lightweight Continuous Framework for HPC Applications Under Failures (slides).
The poster session included work by USRC alumnus Song Huang and work in progress by current USRC PhD student, Zongze Li.
USRC members Dr. Nathan DeBardeleben (resilience) and Dr. John Bent (storage) are at ISC HPC 2017 this week.
John Bent (ISC person page) presented at BoF 08: The Virtual Institute for I/O & the IO-500 on Tuesday morning (slides “IO-500”).
Nathan DeBardeleben (ISC person page) presented at Fault Tolerance for Next Generation High Performance Computing on Evaluating Parallel Application Resiliency with the Software Fault Injector, PFSEFI (slides) Wednesday.
Nathan presented on work by Dr. Li Tan on injecting faults into the FleCSALE parallel application.
Summer 2017 is here and USRC has a great group of new and returning interns.
Graduate Student, Carnegie Mellon University
PhD Student, University of California, Merced
Kai is currently a Computer Science PhD student in EECS at the University of California, Merced. Before coming to UC Merced, he received his Master's degree in Computer Science and Engineering from Michigan State University in 2016. His research falls broadly within High Performance Computing (Large-Scale Parallel Systems). Specifically, he focuses on the following areas: (i) Parallel programming models and runtime; (ii) Performance optimization and modeling; (iii) Resilience and Consistency; (iv) Non-volatile memory; (v) Fault Tolerance in Extreme-Scale Parallel Systems. At USRC, Kai is working on building fault models for serial codes and predicting faults in parallel codes.
Post Bachelor, Los Alamos National Laboratory
PhD Student, University of California, Santa Cruz
PhD Student, Computer Science, Illinois Institute of Technology
Graduate Student, Ohio State University
Post Bachelor, Los Alamos National Laboratory
PhD Student, Computer Science, North Carolina State University
Undergraduate Student, Rollins College
Rusty H Davis
Graduate Student, Clemson University
PhD Student, New Mexico State University
Ashley is currently a PhD student studying Computer Science at New Mexico State University. She has a BS and an MS in Electrical Engineering, also from NMSU. Her research is on multivariate time series prediction and segmentation. At USRC she is working on an anomaly detection project focused on detecting anomalies in energy data.
In case you are wondering, USRC has been around since roughly 2010. We will try to add some of that older content to this page over time, but in general we will focus on USRC from this point forward.
Laura Monroe presented at the prestigious Salishan Conference on High Speed Computing. This year’s theme was Perspectives on HPC’s Current Cambrian Explosion. Dr. Monroe’s talk was entitled Probabilistic Computing in the Post-Moore’s Era (slides). The conference program can be found here.