High Performance Computing Intern Showcase
Explore the multiple dimensions of a career at Los Alamos Lab: work with the best minds on the planet in an inclusive environment that is rich in intellectual vitality and opportunities for growth.
Contact
- Intern Liaison
- Julie Wiens
The 2024 HPC - SI - USRC Showcase will take place on Thursday, August 8, 2024 from 8:00 a.m. to 12:00 p.m. at the J.R. Oppenheimer Study Center in the Jemez and Cochiti rooms with a poster session from 12:00pm to 1:15pm in the Santa Clara room. The abstracts are posted below and if you cannot attend in person, you can join via WebEx:
- https://lanl-us.webex.com/meet/jwiens Meeting number: 1337 13 7199
- Join from a video conferencing system or application Dial: jwiens@lanl-us.webex.com
- You can also dial 173.243.2.68 and enter your meeting number.
- Join using Microsoft Skype for Business: jwiens.lanl-us@lync.webex.com
- Join by phone +1-415-655-0002 US Toll Access Code: 1337 13 7199
- Global call-in numbers https://lanl-us.webex.com/lanl-us/globalcallin.php?MTID=m33a66493c0dade30ed0fde35e4f159ac
Agenda (pdf)
2024 Showcase Presentations
Presentation Session, 8AM - 12PM
- Automated OpenCHAMI Integration Testing and Cluster Deployment — Marcos Johnson-Noya, Alana Kihn, Madison Mejia
- Cray Programming Environment Containerization — Ever J. Dominguez, Almond J. Heil
- Cluster Management with Containerization on Switches by Dohyun Lee, Anvitha Ramachandran, Robin Simpson
- Charliecloud as a Kubernetes Runtime by London Bielicke, Angelica A. Loshak
- HPCInfo Improvements with Focus on Fairshare by Matthew Vandeberg
- Communication Performance Assessment of Sapphire Rapids Architecture by Jackson Wesley
- Software is for People: A Pavilion Case Study by Hank Wikle
- Lightning talks: Crafting an FTP Hook for Fishing in MarFS by Jordan Hebert, MUSTANG: An Overview Cluster Care: Reducing Downtime with Automated Node Failure Recovery by Robin E. Preble, and MUSTANG: A Powerful Vehicle for MarFS Object Cataloging and Retrieval by Paul D. Karhnak
- Trusted Platform Provisioning for the OpenCHAMI Cluster Management Stack by Lucas Ritzdorf
- LDMS (Lightweight Distributed Metric Service) Deployment on CSM Systems Abstract by M. Aiden Phillips
- Development of a Capacity ON Demand User Interaction Toolkit (CONDUIT) Web Dashboard by Christa Collins
- Effective Database Design for Efficient Workflow Orchestration by Kabir Vats
- Are we there yet? Predicting the Queue Wait Times and Job Runtimes for HPC Jobs by Christin Whitton
- Leveraging Lustre to Implement Incremental Indexing in GUFI by Migeljan Imeri
- Enhancing Workflow Manager and Resource Manager to Support Elastic Scientific Workflows in HPC Systems by Rajat Bhattarai
- Echo State Networks: An Approach to Non-Intrusive Anomaly Detection in Manufacturing by Kendric Hood
- Edge-Disjoint Spanning Trees on Star-Product Networks by Daniel Hwang
Poster Session, 12PM - 1:15, Lunch provided
- Understanding Cosmological Simulation Data via an Ensemble Visualization Workflow by Chloe Kellers
- Comparison of data engineering tools by Kalem Smith
- Applied Machine Learning for Surrogate Modeling by Warren Graham
- MAIM: MarFS Access Improvements and MIMOSA by Benjamin Schlueter
- Automated OpenCHAMI Integration Testing and Cluster Deployment by Marcos Johnson-Noya, Alana Kihn, and Madison Mejia
- Cray Programming Environment Containerization by Ever J. Dominguez R. and Almond J. Heil
- Cluster Management with Containerization on Switches by Dohyun Lee, Anvitha Ramachandran, and Robin Simpson
- Charliecloud as a Kubernetes Runtime by London Bielicke and Angelica A. Loshak
- Communication Performance Assessment of Sapphire Rapids Architecture by Jackson Wesley
- Trusted Platform Provisioning for the OpenCHAMI Cluster Management Stack by Lucas Ritzdorf
- Are we there yet? Predicting the Queue Wait Times and Job Runtimes for HPC Jobs by Christin Whitton
- Development of a Capacity ON Demand User Interaction Toolkit (CONDUIT)Web Dashboard by Christa Collins
- MUSTANG: A Powerful Tool for MarFS Object Cataloging and Retrieval by Paul D. Karhnak
- Effective Database Design for Efficient Workflow Orchestration by Kabir Vats
- Echo State Networks: An Approach to Non-Intrusive Anomaly Detection in Manufacturing by Kendric Hood
- Edge-Disjoint Spanning Trees on Star-Product Networks by Daniel Hwang
2023 Showcase Archive
An archive of last years presentations and posters are available below.
Presentation Session — 8:30 - 12:00
- Benchmarking Effects of Erasure Scheme and MPI Configuration on MarFS Throughput — Janya Budaraju, Paul Karhnak, Zach Snyder
- Charliecloud’s Successful Prototype Integration with Slurm: A Promising Approach with Some Strings Attached — Layton McCafferty, Nicholas Volpe, Hank Wikle
- Monitoring Clusters Using Extended Berkeley Packet Filter (eBPF) — Weston Cadena, Alexis Ng, M. Aiden Phillips
- Evaluating Lustre Network Performance over InfiniBand and RoCE — David Medin, Benjamin Schuelter, Matthew Vandeberg
- Seeing the trees for the forest: Describing HPC Filesystems with the Grand Unified File-Index (GUFI) — Jenna Kline
- Development of a Capacity ON Demand User Interaction Toolkit (CONDUIT) Job Launch Mechanism — Christa Collins
- Post-Exascale Star Product Networks and Allreduce Spanning Trees — Aleyah Dawkins
- Creating, Debugging, and Optimizing Roles in Shasta Keycloak for Improved Administrator Privilege Separation — Airam Flores
- A year in the life of a charliecloud developer — Lucas Caudill
- Improving MPI with Rust — Jacob Tronge
- Detecting Spatter in Laser Powder Bed Fusion with Computer Vision — Sean Tronsen
- Cray EX40 (Chicoma) Cluster Intrusion Detection Project — Daniel Wild
- Pavilion test searching — Frank Keithley
- Elastic Workflows with PMIx — Rajat Bhattarai
Posters
- Benchmarking Effects of Erasure Scheme and MPI Configuration on MarFS Throughput — Janya Budaraju, Paul Karhnak, Zach Snyder
- Charliecloud’s Successful Prototype Integration with Slurm: A Promising Approach with Some Strings Attached — Layton McCafferty, Nicholas Volpe, Hank Wikle
- Monitoring Clusters Using Extended Berkeley Packet Filter (eBPF) — Weston Cadena, Alexis Ng,
M. Aiden Phillips - Evaluating Lustre Network Performance over InfiniBand and RoCE — David Medin, Benjamin Schuelter, Matthew Vandeberg
- Benefits of Time Series Data Tables for HPCInfo — Kenton Romero
- Seeing the trees for the forest: Describing HPC Filesystems with the Grand Unified File-Index (GUFI) — Jenna Kline
- Development of a Capacity ON Demand User Interaction Toolkit (CONDUIT) Job Launch Mechanism — Christa Collins
- Cray EX40 (Chicoma) Cluster Intrusion Detection Project — Daniel Wild
- Detecting Anomalies in Laser Powder Bed Fusion with Computer Vision — Sean Tronsen
2022 Showcase Archive
The 2022 presentations and posters are below organized by session.
Presentation Session 1 – 9:30 to 11:30
- Writing UMT Pavilion configs for Crossroads Acceptance Testing | Shivam Mehta
- Exploring Rust in High-Performance Computing for Mitigating Errors and Improving Security | Jake Tronge
- Porting the Energy Exascale Earth System Model to the Chicoma LANL HPC Platform | Timothy Goetsch and Franklin Keithley
- Tropical Neural Networks | Jose Ortiz
- Relating Epigenetic Information to the Structure of DNA Using Deep Learning | Vanessa Job
- Evaluating TCP Protocol Performance on High-Speed Networks | Noah Jones, Jerrod Parten and Lucas Ritzdorf
- Charliecloud's Git-based Cache is Competitive with Alternatives | Z. Noah Hounshel, Ashlynn Lee and Ben Stormer
- Performance Analysis of Non-Volatile Memory Express Over Fabrics (NVMeoF) Using Infiniband and Ethernet | Christa Collins, Joseph Sarrao and Zach Wadhams
Poster Session – 11:30 to 1 (Lunch provided)
- Streamlining Machine Learning for Molecular Dynamics by Interfacing Python with LAMMP3 | Steven Anaya
- Performance Analysis of Non-Volatile Memory Express Over Fabrics (NVMeoF) Using Infiniband and Ethernet | Christa Collins, Joseph Sarrao and Zach Wadhams
- Charlieclouds Git-based Cache is Competitive with Alternatives | Z. Noah Hounshel, Ashlynn Lee and Ben Stormer
- Evaluating TCP Protocol Performance on High-Speed Networks | Noah Jones, Jerrod Parten and Lucas Ritzdorf
- HPC Network Security Analytics Using Virtual Appliances | Victoria Sasaoka
- Automating and Customizing the Node Health Check Tool for Support System | Aedan Wells
2021 Showcase Archive
An archive of the abstracts and presentations from last years virtual showcase are below.
ABSTRACTSThursday, July 29, 2021
- Implementing Kexec with Ironic to Reduce HPC System Downtime | Kam Killfirst
- Number Representations and their Applications to Hardware Devices | Andrew Alexander and Matthew Broussard
Wednesday, August 11, 2021
- 10:05am | Machine Architecture Impact on Application Performance | Nicklaus Przybylski
- 10:10am | Relating Epigenetic Information to the Structure of DNA Using Deep Learning | Vanessa Job
- Abstract
- Presentation
- 10:26am | Perils of the One-Size-Fits-All Kernel: A Fast, Secure Search for FileSystem Metadata | Prajwal Challa
- Abstract
- Presentation
- 10:42 am | Robust Architectures for Arithmetic Circuits via Quantum Sampling | Vanessa Job and Nathan Kodama
- Abstract
- Presentation
Thursday, August 12, 2021
Morning Sessions
- 9:02am | Can it scale? : Metadata Performance Testing of Lustre Dynamic Namespace | Megan Booher & Seema Kulkarni
- 9:18am | Exploring the Trusted Platform Module to Establish Mutual Trust in High Performance Computing | Devon Bautista & Rebecca Whitten
- 9:34am | Managing Configuration Secrets from Ansible using Hashicorp Vault | Susan Foster & Raafiul Hossain
- 9:50am | Characterizing the impact of compiler and MPI version differences in Containers with Spack | David Bernado & Martha Dix
- 10:06am | Analyzing Server-side Scalability of Image Filesystems & Attachment Technologies | Timothy Bargo, Aedan Wells, and Michelle Yoon
- Abstract
- Presentation
Early Afternoon Sessions
- 12:02pm | Offloading Calculations to Computational Storage Devices: Spark and HDFS | Cunningham, Goldstein, Hammock, Janz, Liu and Rimerman
- 12:18pm | Using Computational Storage Devices: OpenMP/MPI and Charliecloud | Cunningham, Goldstein, Hammock, Janz, Liu and Rimerman
- 12:34pm | SquashFS & FUSE for Better HPC Containers | Megan Phinney
- 12:50pm | Integration of the PENNANT mini-app into the Pavilion Test Harness | Timothy Goetsch
- 2:32pm | MarFS and libNE Utility Development | Daniel Perry
- 2:48pm | Network Monitoring and Analytics with SFlow | Conner Whitfield
Thursday, August 19, 2021
- 1:02pm | Dust Destruction in Core Collapse Supernovae | Sarah Stangl
- Abstract
- Presentation
- 1:18pm | Performance Analysis of Common Loop Optimizations | Brian Gravelle
- 1:34pm | Machine learning for physics simulation anomaly detection | Adam Good
- 1:50pm | Exploring OpenSNAPI Use Cases and Evolving Requirements | Brody Williams
Thursday, September 30, 2021
- Tropical Matrix Factorization | Jose Ortiz
Projects
SUMMER 2021
2020 Showcase Archive
An archive of the abstracts and presentations from last years virtual showcase are below.
ABSTRACTSAugust 6, 2020
August 12, 2020
- Deploying Machine Learning Workflows into HPC environment | Ragini Gupta
- Managing Dynamic Workflows in BEE | Steven Anaya
- Embracing Open Firmware in HPC for Faster and More Secure Provisioning | Devon Bautista
- Performing Survival Analysis on HPC System Memory Error Data | Stephen Penton
- Memory Trace Analysis using Machine Learning | Braeden Slade
- Exploring the Feasibility of In-Line Compression on HPC Mini-Apps | Dakota Fulp
- No-Cost and Low-Cost Methods of Reducing Floating Point Error in Sums by Vanessa Job
- Data placement and movement in a heterogeneous memory environment | Onkar Patil
- bueno: Benchmarking, Performance, and Provenance | Jacob Dickens
- Parallelization and vectorization of nuDust | Ezra Brooker/Sarah Stangl
- Perils of the One-Size-Fits-All Kernel: A Fast, Secure Search for File System Metadata | Prajwal Challa
- The first virtual Supercomputing Institute | Richard Snyder
- Technical Project Management Migration to the Cloud and Confluence Documentation | Morgan Jones
August 13, 2020
- Easier JupyterLab Instances for HPC Users | Dylan Wallace
- Towards CFD Fault Detection and Resolution Scaling with Machine Learning | Adam Good
- The Future of Stereo 3D Data Analysis and Visualization | John Dermer
- Memory Address Decoding and Fault Analysis | Dylan Wallace
- Investigating Hard Disk Drive Failure Through Disk Torture | Daniel Perry
- Virtualizing the Network for Testing & Development | Conner Whitfield & Robby Rollins
- Analyzing Frameworks for HPC Systems Regression Testing | Berkelly Gonzalez & Sadie Nederveld
- A Virtual Cluster Monitoring Toolkit for Bottleneck Analysis | Natasha Frumkin & Christian Marquardt
- Auto-Mounted SquashFS for Charliecloud Containers | Anna Chernikov & Megan Phinney
- Integration of the ECP Proxy Apps Suite into the Pavilion Test Harness | Christine Kendrick, Yolanda Reyes & Anaira Quezada
- Stay GUFI with Performance Regression Testing | Skylar Hagen
- Evaluating Hardware Compression Offload in a Lustre File System | Mariana Hernandez
- Integration of The Energy Exascale Earth System Model (E3SM) into The Pavilion Test Harness | Timothy Goetsch
- OpenSNAPI: Toward a Unified API for SmartNICs | Brody Williams
- Hunting for Bottlenecks in ZFS Failure Recovery using NVMe Drives | Trevor Bautista
- Comparative Analysis of Metric Collecting Software | David Huff
- Generating HPC Job Profiles and Expectations with Time-Series Data | Brett Layman
- Using Statistical Methods to Validate Hardware Performance Monitors | Brian Gravelle
August 6, 2020
- Enhancing the MPI Sessions Prototype for Use on Exa-Scale Systems | Tom Herschberg
- Survey of Tools to Assess Reduced Precision on Floating Point Applications | Quinn Dibble
- Investigating the Efficacy of Unstructured Text Analysis for Failure Detection in Syslog | Katy Felkner
August 12, 2020
- Deploying Machine Learning Workflows into HPC environment | Ragini Gupta
- Managing Dynamic Workflows in BEE | Steven Anaya
- Embracing Open Firmware in HPC for Faster and More Secure Provisioning | Devon Bautista
- Performing Survival Analysis on HPC System Memory Error Data | Stephen Penton
- Memory Trace Analysis using Machine Learning | Braeden Slade
- Exploring the Feasibility of In-Line Compression on HPC Mini-Apps | Dakota Fulp
- No-Cost and Low-Cost Methods of Reducing Floating Point Error in Sums by Vanessa Job
- Data placement and movement in a heterogeneous memory environment | Onkar Patil
- bueno: Benchmarking, Performance, and Provenance | Jacob Dickens
- Parallelization and vectorization of nuDust | Ezra Brooker/Sarah Stangl
- Perils of the One-Size-Fits-All Kernel: A Fast, Secure Search for File System Metadata | Prajwal Challa
- The first virtual Supercomputing Institute | Richard Snyder
- Technical Project Management Migration to the Cloud and Confluence Documentation | Morgan Jones
August 13, 2020
- Easier JupyterLab Instances for HPC Users | Dylan Wallace
- Towards CFD Fault Detection and Resolution Scaling with Machine Learning | Adam Good
- The Future of Stereo 3D Data Analysis and Visualization | John Dermer
- Memory Address Decoding and Fault Analysis | Dylan Wallace
- Investigating Hard Disk Drive Failure Through Disk Torture | Daniel Perry
- Virtualizing the Network for Testing & Development | Conner Whitfield & Robby Rollins
- Analyzing Frameworks for HPC Systems Regression Testing | Berkelly Gonzalez & Sadie Nederveld
- A Virtual Cluster Monitoring Toolkit for Bottleneck Analysis | Natasha Frumkin & Christian Marquardt
- Auto-Mounted SquashFS for Charliecloud Containers | Anna Chernikov & Megan Phinney
- Integration of the ECP Proxy Apps Suite into the Pavilion Test Harness | Christine Kendrick, Yolanda Reyes & Anaira Quezada
- Stay GUFI with Performance Regression Testing | Skylar Hagen
- Evaluating Hardware Compression Offload in a Lustre File System | Mariana Hernandez
- Integration of The Energy Exascale Earth System Model (E3SM) into The Pavilion Test Harness | Timothy Goetsch
- OpenSNAPI: Toward a Unified API for SmartNICs | Brody Williams
- Hunting for Bottlenecks in ZFS Failure Recovery using NVMe Drives | Trevor Bautista
- Comparative Analysis of Metric Collecting Software | David Huff
- Generating HPC Job Profiles and Expectations with Time-Series Data | Brett Layman
- Using Statistical Methods to Validate Hardware Performance Monitors | Brian Gravelle