About Me

I am now a first-year Ph.D. student majoring in Computer Science at Computer Systems Research Group, Peking University (2023-Present). I am advised by Prof. Xin Jin. My research interests focus on Machine Learning (LLM) System, Vector Database, and Cloud Computing, with a keen interest in their application to enhance AI technologies in unison.

I received my B.E. from the School of Electronics Engineering and Computer Science (EECS), Peking University (2019-2023). I used to be a research assistant at Software Engineering Institute advised by Prof. Xin Jin and Prof. Xuanzhe Liu (2020-2023).

Email: zzlcs (at) pku (dot) edu (dot) cn

Recent Publications

Fast Distributed Inference Serving for Large Language Models
Bingyang Wu*, Yinmin Zhong*, Zili Zhang*, Gang Huang, Xuanzhe Liu, Xin Jin
(* Equal contribution)
In Preprint.
[PDF] [Slides]

dLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving
Bingyang Wu, Ruidong Zhu, Zili Zhang, Peng Sun, Xuanzhe Liu, Xin Jin
USENIX Symposium on Operating Systems Design and Implementation (OSDI 2024), Santa Clara, July 10–12, 2024 (To appear).
[PDF] [Slides]

Jolteon: Unleashing the Promise of Serverless for Serverless Workflows
Zili Zhang, Chao Jin, Xin Jin
USENIX Symposium on Networked Systems Design and Implementation (NSDI 2024), Santa Clara, April 16–18, 2024 (To appear).
[PDF] [Slides]

Fast Vector Query Processing for Large Datasets Beyond GPU Memory with Reordered Pipelining
Zili Zhang, Fangyue Liu, Gang Huang, Xuanzhe Liu, Xin Jin
USENIX Symposium on Networked Systems Design and Implementation (NSDI 2024), Santa Clara, April 16–18, 2024 (To appear).
[PDF] [Slides]

Ditto: Efficient Serverless Analytics with Elastic Parallelism
Chao Jin, Zili Zhang, Xingyu Xiang, Songyun Zou, Gang Huang, Xuanzhe Liu, Xin Jin
ACM Special Interest Group on Data Communication (SIGCOMM 2023), New York City, September 10-14, 2023.
[PDF] [Slides]

Fast, Approximate Vector Queries on Very Large Unstructured Datasets
Zili Zhang, Chao Jin, Linpeng Tang, Xuanzhe Liu, Xin Jin
USENIX Symposium on Networked Systems Design and Implementation (NSDI 2023), Boston, April 17–19, 2023.
[PDF] [Slides]

Transparent GPU Sharing in Container Clouds for Deep Learning Workloads
Bingyang Wu, Zili Zhang, Zhihao Bai, Xuanzhe Liu, Xin Jin
USENIX Symposium on Networked Systems Design and Implementation (NSDI 2023), Boston, April 17–19, 2023.
[PDF] [Slides]

Rise of Distributed Deep Learning Training in the Big Model Era: From A Software Engineering Perspective
Xuanzhe Liu , Diandian Gu, Zhenpeng Chen, Jinfeng Wen, Zili Zhang, Yun Ma, Haoyu Wang, Xin Jin
ACM Transactions on Software Engineering and Methodology (TOSEM 2023), 2023.
[PDF] [Slides]

Teaching

  • [2024 Spring] Teaching Assistant, Operating System (Honor Track) at PKU.
  • [2022 Fall] Teaching Assistant, Introduction to Computer System (Honor Track) at PKU.
  • [2021 Fall] Teaching Assistant, Introduction to Computer System at PKU.

Internship

  • [2023.06 - Present] Alibaba, Researcher of Serverless Computing.
  • [2021.10 - 2023.02] MOQI, Advised by Linpeng Tang and Xinhui Tian, Researcher of Vector Search Engine.
  • [2021.06 - 2021.09] ByteDance Inc., Advised by Leyuan Wang, Researcher of Deep Learning Compiler.

Interests

  • Amateur runner, Here are my ITRA profile and Running Page.
  • Outdoor traveling, Mountain climbing, and Camping, I believe that only by immersing myself in nature can I find inner peace.
  • Video game enthusiast, I love playing CSGO, Minecraft, RA2 and some VR games.