AI news, benchmarks & engineering blog curation
AX BRIEF is an independent AI media outlet. We analyze global AI industry developments through an editorial lens, covering AI agents, foundation models, and benchmarks. We publish in both Korean and English.

Google Research released TabFM 1.0.0 for zero-shot tabular data analysis. The model outperforms tuned GBDT models on 51 TabArena datasets. T

A new Codex adapter for Honcho replaces token-based API billing with subscription quotas. The system integrates local BGE-M3 embeddings via

LLM Wiki introduces a newsroom structure to reduce token waste. The system isolates judgment roles from writing roles to prevent bloat. It r

Safari Technology Preview 247 introduces a built-in MCP server. AI agents can now access the DOM and network logs directly. Developers no lo

AMD MI355X provides 80% of B200 performance at a 2.75x lower cost. Software optimizations in sglang enable high throughput for GLM-5.2. MXFP

GPT-5.5 shows abnormal reasoning token clustering at 516, 1034, and 1552. Data suggests an artificial reasoning budget is truncating complex

Google released an AI-generated ad reimagining the Declaration of Independence. The campaign positions Gemini as an integrated layer across

Unanimous AI used Thinkscape to reach a consensus among 277 people. The platform employs AI swarms to facilitate hyper-communication. This s

Midjourney faces copyright lawsuits from Disney, Universal, and Warner Bros. The studios claim the AI illegally reproduces iconic characters

Mistral AI is scaling its ARR from $20 million to a projected $1 billion. The company is investing $4.56 billion in European data center inf

A new 6-level model defines AI agent autonomy from assistance to orchestration. Data shows humans handle 70% of planning while Claude Code m

retry-now is an autonomous coding agent designed for performance optimization. The tool prevents context drift by creating fresh sessions fo

Mark Zuckerberg admitted Meta's AI agent progress has stalled. Aggressive layoffs were based on a flawed AI replacement premise. Employee tr

Alibaba banned Claude Code due to concerns over user tracking features. Anthropic claims Alibaba attempted model distillation to boost its A

Claude Fable addresses the gap between prompts and actual codebases. The framework uses a blindspot pass to identify unknown unknowns. Succe

LangChain is shifting AI agent focus from model selection to loop engineering. The framework uses RubricMiddleware and LangSmith to automate