USRC’s university collaborators conduct research on the topics relevant to USRC. They are often collaborating professors and send their students to work at LANL’s USRC site in person. Many of these collaborators come to visit several times a year and present current research.
Dr. Paolo Rech
Associate Professor, UFRGS
Paolo Rech received his master and Ph.D. degrees from Padova University, Padova, Italy, in 2006 and 2009, respectively. He is currently an associate professor at the Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil.His main research interests include the evaluation and mitigation of radiation induced effects in large-scale HPC centers and safety-critical applications. Paolo lead the group that performed the first radiation experiment on GPUs in 2011. Since then, he has been studying the effects of radiation in parallel HPC devices. Now, he is collaborating with NVIDIA and AMD to evaluate and enhance the reliability of modern architectures. In collaboration with LANL and USRC he is designing experimentally-tuned selective hardening strategies to detect critical SDCs without unnecessary overhead. Lately, Paolo has been working on automotive applications reliability, understanding the error propagation in neural networks and designing novel hardening solutions for embedded safety critical applications.
Dr. William M Jones
Associate Professor and Chair, Coastal Carolina University
Will is an associate professor and chair of the Department of Computing Sciences at Coastal Carolina University (CCU). He attended Clemson University where he obtained a BS ('99), MS ('00) and PhD ('05), each in Computer Engineering. Before accepting a position at CCU, Will was an assistant professor in the Department of Electrical and Computer Engineering at the United States Naval Academy, as well as an adjunct professor at Clemson University in the ECE department and at Tri-County Technical College in the Department of Mathematics. His research interests include parallel computing, parallel file systems, computational grids, job scheduling, resilience, fault injection, performance evaluation and modeling, and discrete event simulation. In addition to traditional computer science courses, he also enjoys teaching computer architecture, digital logic design, FPGA programming and AC/DC circuit analysis. Will has been investigating the behavior of ABFT algorithms in the presences of hardware and memory faults through the use of F-SEFI, a soft error fault injector. This work has been in collaboration with Claude Davis, a Clemson University master's student, CCU student Scott Lavigne, a CS undergraduate, along with several members of the HPC-5 group, including Nathan DeBardeleben, Laura Monroe, Sean Blanchard, and Qiang Guan.
Dr. Song Fu
Assistant Professor in Computer Science and Engineering, University of North Texas
Dr. Xin Yuan
Professor, Dept of Computer Science, Florida State University
Dr. Xin Yuan is currently a full Professor in the Department of Computer Science at Florida State University. His research interests include parallel and distributed systems, communication optimizations, interconnection networks, and networking. He obtained his B.S. and M.S degrees in Computer Science from Shanghai Jiaotong University in 1989 and 1992, respectively. He earned his Ph.D degree in Computer Science from the University of Pittsburgh in 1998. He has published more than 80 papers in leading journals and conferences. The STAR-MPI software package that he and his students developed has been incorporated in the MPI stack of the IBM Blue Gene/P system. Dr. Yuan is currently serving on the Editorial Boards of several international journals. He has also served as the Program Chairs and vice-Chairs for several international conferences and workshops such as the International Conference on Parallel Processing (ICPP) and the IEEE International Conference on High Performance Computing (HiPC), and as Program Committee Members for many international conferences and workshops. He is a senior member of ACM and IEEE.
Dr. Yuan worked with Mike Lang, Scott Pakin, and the USRC Systems group on interconnection networks research that covers a spectrum of networking issues including topology, routing, switching, flow control, and congestion control. The goal was to identify the appropriate interconnection network technologies for the next generation supercomputers as well as the future generation exascale supercomputing systems.
Dr. Ioan Raicu
Assistant Professor in Computer Science, Illinois Institute of Technology
Dr. Ioan Raicu is an assistant professor in CS at Illinois Institute of Technology, as well as a guest research faculty in MCS at Argonne National Laboratory. He is also the founder (2011) and director of the Data-Intensive Distributed Systems Laboratory at IIT. His research work and interests are in the general area of distributed systems. His work focuses on a relatively new paradigm of Many-Task Computing (MTC), which aims to bridge the gap between two predominant paradigms from distributed systems, High-Throughput Computing (HTC) and High-Performance Computing (HPC). His work has focused on defining and exploring both the theory and practical aspects of realizing MTC across a wide range of large-scale distributed systems. He is particularly interested in resource management in large scale distributed systems with a focus on many-task computing, data intensive computing, cloud computing, grid computing, and many-core computing. Dr. Raicu has been working with Mike Lang for several years co-mentoring graduate students on exploring extreme-scale simulations in distributed services. The projects include exploring the scalability of distributed NoSQL key/value storage systems with different architectures, and distributed job-launch and scheduling through work-stealing for both MTC and HPC workloads.
Dr. Satyajayant Misra
Assist Professor, Computer Science Dept, New Mexico State University
Dr. Carlos Maltzahn
Associate Adjunct Professor Computer Science Dept, UC Santa Cruz
Dr. Patrick Bridges
Assistant Professor in Computer Science, University of New Mexico
Dr. Dorian Arnold
Professor, UNM Dept of Computer Science, University of New Mexico
Dorian is an assistant professor in the Department of Computer Science at the University of New Mexico. His research focuses on the performance and reliability of extremely large scale systems with tens of thousands, hundreds of thousands or even millions of processing elements.
Dorian is working with Mike Lang, Hugh Greenberg and the USRC Systems Group on the Redfish Project. This group investigates the basic, general computation, communication and storage primitives that underlie HPC system services and provide a library of building blocks that provides a flexible, resilient and scalable implementation of these primitives.