Homepage: https://sc23.supercomputing.org/
Paper list: https://dl.acm.org/doi/proceedings/10.1145/3581784
- EasyScale: Elastic Training with Consistent Accuracy and Improved Utilization on GPUs [Paper] [Code]
- BUAA & Alibaba
- Hanayo: Harnessing Wave-like Pipeline Parallelism for Enhanced Large Model Training Efficiency [Paper] [Code]
- NUS
- Interference-aware Multiplexing for Deep Learning in GPU Clusters: A Middleware Approach [Personal Notes] [Paper] [Code]
- UMacau & SIAT, CAS
- IADeep — a cluster scheduler to co-locate DL training tasks