Full Publications
2025
RLHFuse: Efficient RLHF Training for Large Language Models with Inter- and Intra-Stage Fusion
Yinmin Zhong, Zili Zhang, Bingyang Wu, Shengyu Liu, Yukun Chen, Changyi Wan, Hanpeng Hu, Lei Xia, Ranchen Ming, Yibo Zhu, Xin Jin
USENIX Symposium on Networked Systems Design and Implementation (NSDI 2025), Philadelphia, April 28–30, 2025.
[PDF] [Slides]
2024
2023
Ditto: Efficient Serverless Analytics with Elastic Parallelism
Chao Jin, Zili Zhang, Xingyu Xiang, Songyun Zou, Gang Huang, Xuanzhe Liu, Xin Jin
ACM Special Interest Group on Data Communication (SIGCOMM 2023), New York City, September 10-14, 2023.
[PDF] [Slides]
Rise of Distributed Deep Learning Training in the Big Model Era: From A Software Engineering Perspective
Xuanzhe Liu , Diandian Gu, Zhenpeng Chen, Jinfeng Wen, Zili Zhang, Yun Ma, Haoyu Wang, Xin Jin
ACM Transactions on Software Engineering and Methodology (TOSEM 2023), 2023.
[PDF] [Slides]
2022
Optimizing Half Precision Winograd Convolution on ARM Many-Core Processors
Dedong Xie, Zhen Jia, Zili Zhang, Xin Jin
ACM Asia-Pacific Workshop on Systems (APSys 2022), online, August 23-24, 2022.
[PDF] [Slides]