May 13 – 15, 2025
Argonne National Laboratory
America/Chicago timezone

Schedule: Tuesday, May 13th

Time Track 1 Track 2 Track 3
 

  ST Apps 1   ST Prog 1   BOS FPGAs
9:15 Multi-Physics Workflow at Exascale with NekRS: Mathis Bode JSC chipStar: CUDA and HIP everywhere: Colleen Bertoni ANL Heterogeneous and reconfigurable architectures for the future of computing: Kentaro Sano, Kazutomo Yoshii, BSC, "Project Update"; Kazutomo Yoshii, "Simulating Custom Accelerators with FireSim"; John Tramm, "Efficient Algorithms for Monte Carlo particle transport on AI accelerator hardware"
9:30 Into The Void - Sparsity and the quest for performance: Ivo Kabadshow      JSC IMProving performance in RISC-V clusters: Juan Miguel de Haro    BSC
9:45 Generative Adversarial Simulation-Based Parameter Inference in Quantum Correlation Function Fitting: Katherine Keegan   R-CCS Performance-aware MPI Malleability: Petter Sandås           BSC
           
  ST Apps 2   ST Prog 2   BOS FPGAs
10:15 Porting and Performance Optimization of Lagrangian Particle Dispersion Models on GPUs: Lars Hoffmann JSC Adapting data-flow programming models for Quantum-classical applications: David Álvarez BSC Heterogeneous and reconfigurable architectures for the future of computing: Tomohiro Ueno, “Customizable Virtual 2D Mesh FPGA Network for Large-Scale Systolic Operations”; Kentaro Sano, “Unleashing CGRA’s Potential for HPC and AI”; Antonio Filgueras, “Scaling up heterogeneous hardware”
10:30 Studying the scaling behaviour of quantum annealing to find the ground-state of the 1-dimensional Hubbard model: Kunal Vyas JSC Unifying the Architecture and Implementation of Task-Aware Libraries: Amadeu Moya Sardà            BSC
10:45 Quantum annealing and its variants: Application to quadratic unconstrained binary optimization: Vrinda Mehta        JSC From Dynamic Data-Parallel Dataflows to Task Graphs: Christian Perez               Inria
           
  ST Apps 3   ST Prog 3   BOS FPGAs
11:15 Reproducibility and: Mario Acosta BSC LCI: a Lightweight Communication Interface for efficient asynchronous multithreaded communication: Jiakun Yan UIUC/ NCSA Heterogeneous and reconfigurable architectures for the future of computing: TBA
11:30 An Autotuning-based Optimization Framework for Mixed-kernel SVM Classifications in Smart Pixel Datasets and Heterojunction Transistors: Xingfu Wu          ANL Accelerating MPI Collective Communication with Lossy Compression: Jiajun Huang        ANL
11:45 Fully Homomorphic Encryption based Inference Engine for Privacy Preserving Machine Learning: Priyam Kalpesh Mehta              BSC A role-based programming approach for the Compute Continuum: Xavier Casas Moreno             BSC
           
  K1
12:00 TBA      Sergio Girona                                                                        BSC
           
  ST Apps 4   ST Other   BOS FPGAs
13:15 Reimagining Performance and Reproducibility in the Post-Moore Era: Innovations in Checkpointing and Workflow Management: Michela Taufer    UTK Leveraging Large Language Models for Code Translation and Software Development in Scientific Computing: Akash Dhruv          ANL Heterogeneous and reconfigurable architectures for the future of computing: TBA           
13:30 Proper Process Affinity/Pinning on HPC: Thomas Breuer                JSC Driving Global Innovation Through International Industrial Supercomputing Partnerships: Brendan McGinty   UIUC/ NCSA
13:45 Building a Molecular Factory: Scaling Drug Discovery with HPC: Isaac Filella-Merce            BSC Supercomputing in a bounded world: Robin Boëzennec                   Inria
           
  ST Num 1   ST I/O 1   PT Data 1
14:15 Unifying nonlinearly constrained optimization: Sven Leyffer ANL Scalable Data Management Techniques for AI workloads: Bogdan Nicolae ANL 14:15 Optimizing CI/CD Workflows on Fugaku: Yoshifumi Nakamura: R-CCS
14:30 Billions of Particles on Millions of Threads: Arjus Lengvenis JSC Improving the Efficiency of Interpolation-Based Scientific Data Compressors with Adaptive Quantization Index Prediction: Sheng Di ANL 14:35 Compression for instruments: Amarjit Singh: R-CCS, Robert Underwood: ANL
14:45 Solving optimal control problems on GPU with Julia: Jean-Baptiste Caillau                     Inria IOBAT: Input/Output Behavior Analysis Toolkit: Jakob Luettgau                   Inria
           
  ST Num 2   ST I/O 2   PT Data 2
15:15 Massively Space-Time Parallel Simulations in Python: Thomas Baumann                JSC Broadening community access to I/O workload information with Darshan: Shane Snyder               ANL 15:15                        VINARCH: An Interactive Visual Analytics Tool for Exploring Neural Network Evolution: Kin Ng: UTK
15:30 Sparsity pattern detection with Tapenade: Alexis Montoison ANL Towards dynamic in-situ workflows - with applications stemming from crystal plasticity: Arthur Jaquard Inria 15:35 In-situ visualization and analysis for large-scale particle-mesh simulations: Jens Henrik Goebbert: JSC
15:45 RAPTOR: Numerical Profiling of Scientific Applications: Jens Domke      R-CCS Free  
           
  Poster Session
16:15 Unifying nonlinearly constrained optimization Sven Leyffer: ANL
  Towards Affordable Reproducibility Using Scalable Capture and Comparison of Intermediate Multi-Run Results Kevin Assogba: Rochester Institute of Technology
  Early Experiences in Building AI Assistants for Improving the Productivity of PETSc Users and Developers Junchao Zhang: ANL
  Fine grain energy consumption Jules Risse: Inria
  VINARCH: An Interactive Visual Analytics Tool for Exploring Neural Network Evolution Kin Ng: UTK
  Hamiltonian simulation for solving Linear PDE via Schrödingerisation Sangwon Kim: R-CCS
  Building performant distributed services with Mochi Shane Snyder: ANL
  Bidirectional Steering of Large-Scale Simulations Using ASCENT and Trame Victor Mateevitsi: ANL
  HPC Interconnect network simulations from benchmark communication patterns Seydou Ba: R-CCS
  Optimizing Number-Theoretic Transform for FPGAs in CKKS Homomorphic Encryption Mohamed Allam: BSC, UPC
  Performance-aware MPI Malleability Petter Sandås: BSC
  Processing-in-Memory for Homomorphically Encrypted Operations Tathagata Barik: BSC, UPC
  Overview of the Quantum-HPC Hybrid Platform Design Tomoya Yuki: RIKEN R-CCS
  A Long-term Operational Data Analysis for the Cogeneration System in RIKEN R-CCS Masaaki Terai: RIKEN
  Billions of Particles on Millions of Threads Arjus Lengvenis: Forschungszentrum Jülich
  Investigate and Explore the Realization of Adaptive Bandwidth Compression Hardware Tomohiro Ueno: R-CCS
  ML-based Visual Analytics Tools for Investigating HPC Operational Log Data Jorji Nonaka: R-CCS
  Into The Void - Sparsity and the quest for GPU performance Ivo Kabadshow: JSC