Be ahead of the curve
Research papers, repositories, and articles about serving
Showing 1 of 1 items
A slimmed-down version of the SGLang runtime aimed at easier experimentation. It focuses on fast text generation pipelines for modern language models in Python. ([github.com](https://github.com/trending))