TechnologyMonday, December 15, 2025

Study finds Korean LLMs trail global rivals even on home‑turf exam math

Source: Korea JoongAng DailyRead original

Korean AI models lag behind overseas rivals even in domestic exam math tests

Summary

Korea JoongAng Daily reports that a Sogang University research team tested five Korean large language models against leading US and Chinese models on College Scholastic Ability Test (CSAT) questions and advanced essay‑style math problems, finding a stark performance gap. Foreign models like GPT‑5.1, Gemini 3 Pro Preview, Claude Opus 4.5, Grok 4.1 Fast and DeepSeek V3.2 scored between 76 and 92 points, while domestic systems such as Upstage’s Solar Pro‑2, LG AI Research’s Exaone 4.0.1, Naver’s HCX‑007, SK Telecom’s A.X 4.0 and NCSoft’s Llama Varco 8B mostly landed in the 20‑point range, with the weakest model scoring just 2 points. Even when models were allowed to use Python tools and given multiple attempts on a separate EntropyMath benchmark, non‑Korean systems still dominated, suggesting that Korea’s sovereign‑AI push has not yet closed the reasoning gap with frontier models. The authors say they plan to build an international math leaderboard and update tests as newer model versions arrive, but the results reinforce concerns that language‑localized LLMs can lag badly on deeper problem‑solving unless they match global players in both data and algorithmic sophistication.

Companies Mentioned

AI Lab|United States

Valuation: $500.0B

Private company - No stock data

AI Lab|United States

Valuation: $183.0B

Private company - No stock data

AI Lab|United States

Valuation: $24.0B

Private company - No stock data

AI Lab|China

Valuation: $15.0B

Private company - No stock data

AI Company|South Korea

Valuation: $100.0M

Private company - No stock data

Cloud|United States

Valuation: $3790.0B

GOOGL • NASDAQMarket Closed

At news: $306.77Now: $306.76

Related Deals

Partnership

Investment

Research

Drag nodes to explore | Featured companies highlighted

Partnership

BBVA and OpenAI entered a strategic partnership to expand ChatGPT Enterprise to BBVA’s global workforce and co-develop AI solutions for banking operations and customer experiences.

BBVA→

OpenAI

Dec 2025

Investment

Disney makes a $1B equity investment in OpenAI alongside a multi-year character-licensing partnership for Sora-generated short videos.

The Walt Disney Company→

OpenAI

Dec 2025

Partnership

Disney becomes Sora’s first major content licensing partner and commits a $1B equity investment in OpenAI as part of a three-year AI content and enterprise technology partnership.

The Walt Disney Company→

OpenAI

Dec 2025

Research

OpenAI, Anthropic, Block and major cloud providers are co-founding the Agentic AI Foundation under the Linux Foundation to steward open, interoperable standards for AI agents.

OpenAI→

Anthropic→Block→

Google→

Microsoft→Amazon Web Services→Bloomberg→Cloudflare→Cisco→Agentic AI Foundation

Dec 2025

Research

Founding members created the Agentic AI Foundation under the Linux Foundation to fund and govern open standards like MCP, goose and AGENTS.md for interoperable agentic AI.

Anthropic→

OpenAI→Block→

Google→

Microsoft→Amazon Web Services→Bloomberg→Cloudflare→Agentic AI Foundation (AAIF)

Dec 2025

View all AI deals

Related News

CrowdStrike launches Falcon AI Detection and Response to secure the AI prompt layer

Podium launches Jerry 2.0 vertical AI agents, highlighted in new OpenAI case study

Podium launches Jerry 2.0 vertical AI agents, highlighted in new OpenAI case study

Merriam‑Webster picks “slop” as 2025 word of the year, citing AI‑generated junk

Merriam‑Webster picks “slop” as 2025 word of the year, citing AI‑generated junk

Militant groups are experimenting with AI, boosting propaganda and cyber risks

Militant groups are experimenting with AI, boosting propaganda and cyber risks