Multilingual
Research papers, repositories, and articles about multilingual
Showing 4 of 4 items
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground
T-pro 2.0 is an open-weight Russian large language model focused on hybrid reasoning: it can answer directly or emit explicit reasoning traces, and it’s optimized for low-latency inference via speculative decoding. Alongside the model, the authors release a Russian instruction corpus, a math benchmark, and an EAGLE-based inference stack, making it a practical foundation for Russian-language reasoning applications.
EuroLLM-22B: Technical Report
EuroLLM-22B is a 22B-parameter open model focused on European languages, with long-context support and a detailed training recipe. It aims to give EU labs and companies a strong regional alternative to US-centric frontier models.
OpenBMB/VoxCPM
Tokenizer-free speech model for multilingual text-to-speech, creative voice design, and realistic cloning. Targets production-grade voice systems beyond simple demos. If your product talks to users, this is a serious open contender to evaluate. ([github.com](https://github.com/trending/python?since=daily))
BYOL: Bring Your Own Language Into LLMs
BYOL lays out a playbook to lift extremely low-resource languages into modern LLMs. It mixes corpus cleaning, synthetic data, extra training, and translation to build strong models for languages with tiny digital footprints.