TechnologyMonday, December 15, 2025

Study finds Korean LLMs trail global rivals even on home‑turf exam math

Source: Korea JoongAng DailyRead original
Korean AI models lag behind overseas rivals even in domestic exam math tests

Summary

Korea JoongAng Daily reports that a Sogang University research team tested five Korean large language models against leading US and Chinese models on College Scholastic Ability Test (CSAT) questions and advanced essay‑style math problems, finding a stark performance gap. Foreign models like GPT‑5.1, Gemini 3 Pro Preview, Claude Opus 4.5, Grok 4.1 Fast and DeepSeek V3.2 scored between 76 and 92 points, while domestic systems such as Upstage’s Solar Pro‑2, LG AI Research’s Exaone 4.0.1, Naver’s HCX‑007, SK Telecom’s A.X 4.0 and NCSoft’s Llama Varco 8B mostly landed in the 20‑point range, with the weakest model scoring just 2 points. Even when models were allowed to use Python tools and given multiple attempts on a separate EntropyMath benchmark, non‑Korean systems still dominated, suggesting that Korea’s sovereign‑AI push has not yet closed the reasoning gap with frontier models. The authors say they plan to build an international math leaderboard and update tests as newer model versions arrive, but the results reinforce concerns that language‑localized LLMs can lag badly on deeper problem‑solving unless they match global players in both data and algorithmic sophistication.

Companies Mentioned

OpenAI
OpenAI
AI Lab|United States
Valuation: $500.0B
Private company - No stock data
Anthropic
Anthropic
AI Lab|United States
Valuation: $183.0B
Private company - No stock data
xAI
xAI
AI Lab|United States
Valuation: $24.0B
Private company - No stock data
DeepSeek
DeepSeek
AI Lab|China
Valuation: $15.0B
Private company - No stock data
Upstage
AI Company|South Korea
Valuation: $100.0M
Private company - No stock data
Google
Google
Cloud|United States
Valuation: $3790.0B
GOOGLNASDAQMarket Closed
At news: $306.77Now: $306.76

Related Deals

Partnership
Investment
Research
Drag nodes to explore | Featured companies highlighted
Partnership

BBVA and OpenAI entered a strategic partnership to expand ChatGPT Enterprise to BBVA’s global workforce and co-develop AI solutions for banking operations and customer experiences.

BBVAOpenAIOpenAI
Dec 2025
Investment

Disney makes a $1B equity investment in OpenAI alongside a multi-year character-licensing partnership for Sora-generated short videos.

The Walt Disney CompanyOpenAIOpenAI
Dec 2025
Partnership

Disney becomes Sora’s first major content licensing partner and commits a $1B equity investment in OpenAI as part of a three-year AI content and enterprise technology partnership.

The Walt Disney CompanyOpenAIOpenAI
Dec 2025
Research

OpenAI, Anthropic, Block and major cloud providers are co-founding the Agentic AI Foundation under the Linux Foundation to steward open, interoperable standards for AI agents.

OpenAIOpenAIAnthropicAnthropicBlockGoogleGoogleMicrosoftMicrosoftAmazon Web ServicesBloombergCloudflareCiscoAgentic AI Foundation
Dec 2025
Research

Founding members created the Agentic AI Foundation under the Linux Foundation to fund and govern open standards like MCP, goose and AGENTS.md for interoperable agentic AI.

AnthropicAnthropicOpenAIOpenAIBlockGoogleGoogleMicrosoftMicrosoftAmazon Web ServicesBloombergCloudflareAgentic AI Foundation (AAIF)
Dec 2025