Cited By
View all- Huang JDi SYu XZhai YLiu JHuang YRaffenetti KZhou HZhao KLu XChen ZCappello FGuo YThakur R(2024)gZCCL: Compression-Accelerated Collective Communication Framework for GPU ClustersProceedings of the 38th ACM International Conference on Supercomputing10.1145/3650200.3656636(437-448)Online publication date: 30-May-2024
- Huang JDi SYu XZhai YLiu JHuang YRaffenetti KZhou HZhao KChen ZCappello FGuo YThakur RLee IChabbi MSteuwer M(2024)POSTER: Optimizing Collective Communications with Error-bounded Lossy Compression for GPU ClustersProceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming10.1145/3627535.3638467(454-456)Online publication date: 2-Mar-2024
- Won WRashidi SSrinivasan SKrishna T(2024)LIBRA: Enabling Workload-Aware Multi-Dimensional Network Topology Optimization for Distributed Training of Large AI Models2024 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)10.1109/ISPASS61541.2024.00028(205-216)Online publication date: 5-May-2024
- Show More Cited By