Liangsheng Yin 「尹良升」

Hello, my name is Liangsheng Yin, and I got my bachelor degree at Shanghai Jiao Tong University, majoring in Computer Science, in ACM Honor Class. I am also a incoming Ph.D. student at UC Berkeley.

Currently, I am fortunate to work with Lianmin Zheng and Ying Sheng from LMSYS.ORG, working as a core developer of SGLang, and advised by Ion Stoica and Joseph E. Gonzalez in Sky Computing. We are committed to developing more efficient and powerful systems for Artificial Intelligence. I am also an applicant for the 2025 Fall Ph.D. program in Computer Science.

Feel free to check out my CV and drop me an e-mail if you want to chat with me!


Jul '25 Super fast SGLang v0.2 (vs. TensorRT, vLLM) is released! Check it out here.
Jul '04 Arrived at UC Berkeley and thrilled to start my journey with Sky Computing. Feel free to reach out!
Feb '05 The compressed FSM, a new feature for Faster JSON/regex decoding is available in SGLang.

Research Assistant | Sky Computing at UC Berkeley
July '24 – Dec '24

Working with Ion Stoica and Joseph E. Gonzalez from Sky Computing and LMSYS Group.

Research Intern | Large Model Systems Organization
Sep '23 – Dec '24

Working with Lianmin Zheng from UC Berkeley and Ying Sheng from Stanford University. We are committed to developing large models and systems that are open, accessible, and scalable.

Undergraduate Student | Shanghai Jiao Tong University
Sep '21 – Jun '25

Working under the supervision of Prof. Yong Yu and majoring in Computer Science, expected to graduate in 2025.


SGLang
SGLang: Efficient Execution of Structured Language Model Programs

Lianmin Zheng*, Liangsheng Yin, Zhiqiang Xie, Jeff Huang, Chuyue Sun, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E. Gonzalez, Clark Barrett, Ying Sheng*


Compressed FSM
Fast JSON Decoding for Local LLMs with Compressed Finite State Machine

Liangsheng Yin, Ying Sheng, Lianmin Zheng

SGLang v0.2
Achieving Faster Open-Source Llama3 Serving with SGLang Runtime (vs. TensorRT-LLM, vLLM)

Liangsheng Yin, Yineng Zhang, Ying Sheng et al. in SGLang team

Fast JSON Decoding for Local LLMs with Compressed FSM
Fast JSON Decoding for Local LLMs with Compressed FSM

February 05, 2024