ALCF AI Testbed: Cerebras AI Training Workshop

America/Chicago
Murali Emani (LCF) , Venkatram Vishwanath (LCF)
Description

This training workshop will introduce users to the novel AI accelerators deployed at the ALCF AI Testbed. This workshop is targeted at universities, national labs, and industries working on open science.

This training session will introduce users to the Cerebras CS-2 system architecture and software stack. It will provide hands-on training to get started on the CS-2 system in the ALCF AI testbed. The event will be virtual and the sessions will be recorded and made available.

NOTE: Please register and sign-up/reactivate your ALCF Account by 04/05 if you would like to participate in the hands-on session. 

Day 1 will focus on Cerebras CS-2 hardware and software architecture, running models on the CS-2 system at ALCF AI Testbed.

Day 2 will focus on a deep dive into Large Language Models (LLMs) on CS-2, hands-on session, open source projects, and best practices.

To Access ALCF machines please see the instructions below. Deadline to complete the following is 04/05:

  • New Users: Request an ALCF Computer User Account if you do not currently have one. Specify "aitestbed_training" as the project name.
  • Previous Users: If you have an ALCF Account that is currently inactive, submit an account reactivation request. Specify "aitestbed_training" as the project name.
  • Current Users: If you have an active ALCF account, click Join Project to submit a membership request for the project "aitestbed_training".
Registration
Participants
    • 1:00 PM 1:20 PM
      Cerebras CS-2 Introduction 20m
    • 1:20 PM 1:35 PM
      Hardware and Systems 15m
    • 1:35 PM 1:50 PM
      Software and Programming 15m
    • 1:50 PM 2:00 PM
      Break 10m
    • 2:00 PM 2:30 PM
      How-to: Model porting, layer API, data loaders 30m
    • 2:30 PM 2:45 PM
      Huggingface to CS-2 overview 15m
    • 2:45 PM 3:05 PM
      How-to: Monitoring and profiling 20m
    • 3:05 PM 3:15 PM
      Break 10m
    • 3:15 PM 4:00 PM
      Hands-on session for training at ALCF 45m
    • 4:00 PM 4:30 PM
      Release 2.2.1 highlights 30m
  • Wednesday, 8 May
    • 1:00 PM 1:45 PM
      Efficient training with Cerebras, scaling laws, how to train LLMs 45m
    • 1:45 PM 2:45 PM
      User training: hands-on LLM model 1h
    • 2:45 PM 3:00 PM
      Break 15m
    • 3:00 PM 4:00 PM
      HPC: CS for HPC: SDK, CSL and past examples 1h
    • 4:00 PM 4:20 PM
      Roadmap presentation 20m
    • 4:20 PM 4:30 PM
      Closing, final Q&A 10m