Multilingual

Research papers, repositories, and articles about multilingual

Showing 4 of 4 items

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

T-pro 2.0 is an open-weight Russian large language model focused on hybrid reasoning: it can answer directly or emit explicit reasoning traces, and it’s optimized for low-latency inference via speculative decoding. Alongside the model, the authors release a Russian instruction corpus, a math benchmark, and an EAGLE-based inference stack, making it a practical foundation for Russian-language reasoning applications.

Dmitrii Stoianov, Danil Taranets

EuroLLM-22B: Technical Report

EuroLLM-22B is a 22B-parameter open model focused on European languages, with long-context support and a detailed training recipe. It aims to give EU labs and companies a strong regional alternative to US-centric frontier models.

Miguel Moura Ramos, Duarte M. Alves

OpenBMB/VoxCPM

Tokenizer-free speech model for multilingual text-to-speech, creative voice design, and realistic cloning. Targets production-grade voice systems beyond simple demos. If your product talks to users, this is a serious open contender to evaluate. ([github.com](https://github.com/trending/python?since=daily))

17,233

BYOL: Bring Your Own Language Into LLMs

BYOL lays out a playbook to lift extremely low-resource languages into modern LLMs. It mixes corpus cleaning, synthetic data, extra training, and translation to build strong models for languages with tiny digital footprints.

Syed Waqas Zamir, Wassim Hamidouche