Time | Track 1 | Track 2 | Track 3 | ||
ST Apps 1 |
ST Prog 1 | BOS FPGAs | |||
9:15 |
Multi-Physics Workflow at Exascale with NekRS: Mathis Bode |
JSC | chipStar:CUDA and HIP everywhere: Colleen Bertoni | ANL | Heterogeneous and reconfigurable architectures for the future of computing: Kentaro Sano, Kazutomo Yoshii, BSC, "Project Update"; Kazutomo Yoshii, "Simulating Custom Accelerators with FireSim"; John Tramm, "Efficient Algorithms for Monte Carlo particle transport on AI accelerator hardware" |
9:30 | Into The Void - Sparsity and the quest for performance: Ivo Kabadshow | JSC | IMProving performance in RISC-V clusters: Juan Miguel de Haro | BSC | |
9:45 | Generative Adversarial Simulation-Based Parameter Inference in Quantum Correlation Function Fitting: Katherine Keegan | R-CCS | Performance-aware MPI Malleability: Petter Sandås | BSC | |
ST Apps 2 | ST Prog 2 | BOS FPGAs | |||
10:15 | Porting and Performance Optimization of Lagrangian Particle Dispersion Models on GPUs: Lars Hoffmann | JSC | Adapting data-flow programming models for Quantum-classical applications: David Álvarez | BSC | Heterogeneous and reconfigurable architectures for the future of computing: Tomohiro Ueno, “Customizable Virtual 2D Mesh FPGA Network for Large-Scale Systolic Operations”; Kentaro Sano, “Unleashing CGRA’s Potential or HPC and AI”; Antonio Filgueras, “Scaling up heterogeneous hardware” |
10:30 | Studying the scaling behaviour of quantum annealing to find the ground-state of the 1-dimensional Hubbard model: Kunal Vyas | JSC | Unifying the Architecture and Implementation of Task-Aware Libraries: Amadeu Moya Sardà | BSC | |
10:45 | Quantum annealing and its variants: Application to quadratic unconstrained binary optimization: Vrinda Mehta | JSC | From Dynamic Data-Parallel Dataflows to Task Graphs: Christian Perez | Inria | |
ST Apps 3 | ST Prog 3 | BOS FPGAs | |||
11:15 | Reproducibility and: Mario Acosta | BSC | LCI: a Lightweight Communication Interface for efficient asynchronous multithreaded communication: Jiakun Yan |
UIUC/ NCSA |
Heterogeneous and reconfigurable architectures for the future of computing: TBA |
11:30 | An Autotuning-based Optimization Framework for Mixed-kernel SVM Classifications in Smart Pixel Datasets and Heterojunction Transistors: Xingfu Wu | ANL | Accelerating MPI Collective Communication with Lossy Compression: Jiajun Huang | ANL | |
11:45 | Fully Homomorphic Encryption based Inference Engine for Privacy Preserving Machine Learning: Priyam Kalpesh Mehta | BSC | A role-based programming approach for the Compute Continuum: Xavier Casas Moreno | BSC | |
K1 | |||||
12:00 | TBA Sergio Girona BSC | ||||
ST Apps 4 | ST Other | BOS FPGAs | |||
13:15 | Reimagining Performance and Reproducibility in the Post-Moore Era: Innovations in Checkpointing and Workflow Management: Michela Taufer | UTK | Leveraging Large Language Models for Code Translation and Software Development in Scientific Computing: Akash Dhruv | ANL | Heterogeneous and reconfigurable architectures for the future of computing: TBA |
13:30 | Proper Process Affinity/Pinning on HPC: Thomas Breuer | JSC | Driving Global Innovation Through International Industrial Supercomputing Partnerships: Brendan McGinty | UIUC/ NCSA | |
13:45 | Building a Molecular Factory: Scaling Drug Discovery with HPC: Isaac Filella-Merce | BSC | Supercomputing in a bounded world: Robin Boëzennec | Inria | |
ST Num 1 | ST I/O 1 | PT Data 1 | |||
14:15 | Unifying nonlinearly constrained optimization: Sven Leyffer | ANL | Scalable Data Management Techniques for AI workloads: Bogdan Nicolae | ANL | 14:15 Optimizing CI/CD Workflows on Fugaku:Yoshifumi Nakamura: R-CCS |
14:30 | Billions of Particles on Millions of Threads: Arjus Lengvenis | JSC | Improving the Efficiency of Interpolation-Based Scientific Data Compressors with Adaptive Quantization Index Prediction: Sheng Di | ANL | 14:35 Compression for instruments: Amarjit Singh: R-CCS, Robert Underwood: ANL |
14:45 | Solving optimal control problems on GPU with Julia: Jean-Baptiste Caillau | Inria | IOBAT: Input/Output Behavior Analysis Toolkit: Jakob Luettgau | Inria | |
ST Num 2 | ST I/O 2 | PT Data 2 | |||
15:15 | Massively Space-Time Parallel Simulations in Python: Thomas Baumann | JSC | Broadening community access to I/O workload information with Darshan: Shane Snyder | ANL | 15:15 VINARCH: An Interactive Visual Analytics Tool for Exploring Neural Network Evolution: Kin Ng: UTK |
15:30 | Sparsity pattern detection with Tapenade: Alexis Montoison | ANL | Towards dynamic in-situ workflows - with applications stemming from crystal plasticity: Arthur Jaquard | Inria15 | 15:35 In-situ visualization and analysis for large-scale particle-mesh simulations: Jens Henrik Goebbert: JSC |
15:45 | RAPTOR: Numerical Profiling of Scientific Applications: Jens Domke | R-CCS | Free | ||
Poster Session | |||||
16:15 | Unifying nonlinearly constrained optimization | Sven Leyffer: ANL | |||
Towards Affordable Reproducibility Using Scalable Capture and Comparison of Intermediate Multi-Run Results | Kevin Assogba: Rochester Institute of Technology | ||||
Early Experiences in Building AI Assistants for Improving the Productivity of PETSc Users and Developers | Junchao Zhang: ANL | ||||
Fine grain energy consumption | Jules Risse: Inria | ||||
VINARCH: An Interactive Visual Analytics Tool for Exploring Neural Network Evolution | Kin Ng: UTK | ||||
Hamiltonian simulation for solving Linear PDE via Schrödingerisation | Sangwon Kim: R-CCS | ||||
Building performant distributed services with Mochi | Shane Snyder: ANL | ||||
Bidirectional Steering of Large-Scale Simulations Using ASCENT and Trame | Victor Mateevitsi: ANL | ||||
HPC Interconnect network simulations from benchmark communication patterns | Seydou Ba: R-CCS | ||||
Optimizing Number-Theoretic Transform for FPGAs in CKKS Homomorphic Encryption | Mohamed Allam: BSC, UPC | ||||
Performance-aware MPI Malleability | Petter Sandås: BSC | ||||
Processing-in-Memory for Homomorphically Encrypted Operations | Tathagata Barik: BSC, UPC | ||||
Overview of the Quantum-HPC Hybrid Platform Design | Tomoya Yuki: RIKEN R-CCS | ||||
A Long-term Operational Data Analysis for the Cogeneration System in RIKEN R-CCS | Masaaki Terai: RIKEN | ||||
Billions of Particles on Millions of Threads | Arjus Lengvenis: Forschungszentrum Jülich | ||||
Investigate and Explore the Realization of Adaptive Bandwidth Compression Hardware |
Tomohiro Ueno: R-CCS |
||||
ML-based Visual Analytics Tools for Investigating HPC Operational Log Data | Jorji Nonaka: R-CCS | ||||
Into The Void - Sparsity and the quest for GPU performance | Ivo Kabadsho: JSC |
Choose timezone
Your profile timezone: