
Research On Algorithms & Data Structures (ROADS) to Mega-AI Models Workshop

Workshop Summary

The state of the art on numerous machine learning (ML) benchmarks comes from training enormous neural network models on expensive, specialized hardware with massive quantities of data. However, this route to success in deep learning is unsustainable. Training a single large transformer model for natural language processing, for instance, can incur a carbon footprint greater than the lifetime emissions of five cars. In addition, these massive models demand immense memory and computing resources during deployment, which limits their practical impact. To realize the full promise and benefits of artificial intelligence, we must solve the scalability challenges that pervade both training and inference and design new algorithms with step-function improvements in efficiency. This workshop brings together computer science researchers and practitioners focused on ML efficiency to share innovative solutions for efficient modeling workflows grounded in principled algorithm design.

Keynote Speakers

  • Prof. Michael Mitzenmacher (Harvard)
  • Dr. Bilge Acun (Meta)
  • Prof. Jonathan Frankle (Harvard)
  • Prof. Furong Huang (UMD)
  • Dr. Chen Luo (Amazon)

Schedule (Thursday, June 8)

  • 8:30am-8:40am - Welcome and Opening Remarks (Workshop Organizers)
  • 8:40am-9:40am - Keynote #1: Prof. Michael Mitzenmacher (Harvard)
  • 9:40am-10:05am - Invited Talk: Hongyi Wang, "Cuttlefish: Low-Rank Model Training without All the Tuning"
  • 10:05am-10:30am - Invited Talk: Daochen Zha, "Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models"
  • 10:30am-10:50am - Coffee Break and Socialize
  • 10:50am-11:30am - Keynote #2: Dr. Bilge Acun (Meta)
  • 11:30am-11:55am - Invited Talk: Trevor Gale, "MegaBlocks: Efficient Sparse Training with Mixture-of-Experts"
  • 11:55am-12:20pm - Invited Talk: Daochen Zha, "RSC: Accelerating Graph Neural Networks Training via Randomized Sparse Computations"
  • 12:20pm-1:20pm - Lunch Break
  • 1:20pm-2:00pm - Keynote #3: Prof. Jonathan Frankle (Harvard)
  • 2:00pm-2:30pm - Keynote #4: Prof. Furong Huang (UMD)
  • 2:30pm-3:10pm - Keynote #5: Dr. Chen Luo (Amazon)
  • 3:10pm-3:30pm - Coffee Break and Socialize
  • 3:30pm-4:30pm - Panel Discussion (Mitzenmacher, Frankle, Acun, Luo, Shrivastava)
  • 4:30pm - Social

Organizing Committee

Call for Papers

This workshop encourages submissions of original research, benchmarks, in-progress research results, and position papers. We also invite submissions of previously accepted papers whose authors had limited presentation opportunities (e.g., due to pandemic-related constraints). We do not impose a strict page limit, but we encourage authors to adhere to a maximum of 10 pages (excluding references) in the MLSys 2023 format. We will manage submissions through OpenReview, but the review process will not be public.

Scope

The technical topics of interest at this workshop include (but are not limited to):

  • Algorithms and data structures to improve the computational efficiency of neural network training and inference
  • Algorithmic solutions for deploying machine learning models on resource-constrained devices
  • Model compression approaches for training and inference, including pruning, quantization, and parameter sharing (a small illustrative sketch follows this list)
  • Data reduction techniques (e.g., sketching, sampling, coresets) for efficient training and inference (a second sketch follows this list)
  • Algorithmic techniques to enable longer sequence language models, higher resolution images for vision models, wide and deep neural networks, and other model architectures of interest to the community
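
To make the compression topic above concrete, here is a minimal, self-contained sketch (not drawn from any workshop submission) of magnitude pruning followed by symmetric 8-bit quantization of a weight matrix, written with NumPy. The function names, the 90% sparsity target, and the matrix size are illustrative assumptions.

```python
import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude entries until roughly `sparsity` fraction is zero."""
    k = int(sparsity * weights.size)
    if k == 0:
        return weights.copy()
    # k-th smallest absolute value over the flattened array serves as the threshold.
    threshold = np.partition(np.abs(weights), k - 1, axis=None)[k - 1]
    pruned = weights.copy()
    pruned[np.abs(pruned) <= threshold] = 0.0
    return pruned

def uniform_quantize(weights: np.ndarray, num_bits: int = 8):
    """Symmetric uniform quantization to signed integers; returns (q, scale)."""
    qmax = 2 ** (num_bits - 1) - 1
    scale = np.abs(weights).max() / qmax
    q = np.clip(np.round(weights / scale), -qmax, qmax).astype(np.int8)
    return q, scale

rng = np.random.default_rng(0)
w = rng.normal(size=(256, 256)).astype(np.float32)
w_pruned = magnitude_prune(w, sparsity=0.9)    # keep only the largest ~10% of weights
w_q, scale = uniform_quantize(w_pruned)        # 8-bit weights plus one fp32 scale factor
w_deq = w_q.astype(np.float32) * scale         # dequantize to measure reconstruction error
print(f"sparsity: {np.mean(w_pruned == 0):.2f}, max abs error: {np.abs(w_deq - w_pruned).max():.4f}")
```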
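
Similarly, for the data reduction topic, the following sketch approximates a Gram matrix X^T X by sampling rows with probability proportional to their squared norms, a standard Monte Carlo sketching idea. The sample size and data dimensions are arbitrary choices made for illustration.

```python
import numpy as np

def sampled_gram(X: np.ndarray, num_samples: int, rng: np.random.Generator) -> np.ndarray:
    """Unbiased estimate of X.T @ X from a row sample, with sampling probabilities
    proportional to squared row norms."""
    row_norms_sq = np.einsum("ij,ij->i", X, X)
    probs = row_norms_sq / row_norms_sq.sum()
    idx = rng.choice(X.shape[0], size=num_samples, replace=True, p=probs)
    # Rescale each sampled row so the estimator stays unbiased.
    S = X[idx] / np.sqrt(num_samples * probs[idx])[:, None]
    return S.T @ S

rng = np.random.default_rng(0)
X = rng.normal(size=(100_000, 64))
exact = X.T @ X
approx = sampled_gram(X, num_samples=2_000, rng=rng)   # uses only 2% of the rows
rel_err = np.linalg.norm(approx - exact) / np.linalg.norm(exact)
print(f"relative Frobenius error: {rel_err:.3f}")
```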

Submission

OpenReview Link

Please format submissions using the MLSys 2023 style files.

The reviewing process will be double-blind. Please submit anonymized papers that omit all information about author identities. There are no formal proceedings generated from this workshop. Accepted papers will be made available on OpenReview.

Contact

Contact the organizers: zx22@rice.edu