AX BRIEF

AI news, benchmarks & engineering blog curation

AI Engineering Blogs

Curated deep insights from tech leaders and researchers — engineering, product, and strategy.

SageMaker AI Adopts OpenAI API for Seamless Model Migration

Amazon SageMaker AI now supports OpenAI-compatible API endpoints. Developers can use Bearer Tokens to bypass complex SigV4 signing. The upda

Qwen3.5-LiveTranslate-Flash Scales to 60 Languages with 2.8s Latency

Alibaba released Qwen3.5-LiveTranslate-Flash supporting 60 languages. The model reduces translation latency to 2.8 seconds using reading uni

Gemini 3.5 Flash Turns Agent Infrastructure Into a Single API Call

Google released Gemini 3.5 Flash with a managed agents API. The model outperforms Gemini 3.1 Pro on several key benchmarks. Stateful Linux c

Ramp Slashes PR Feedback Times Using Codex and GPT-5.5

Ramp reduced pull request feedback times from hours to minutes. The company integrated Codex and GPT-5.5 for deep code reasoning. Developers

Google Gemini 3.5 and Antigravity Turn Search Into a Personal AI OS

Google released Gemini 3.5 Flash with an 83.6% MCP Atlas score. The Antigravity platform enables real-time generative user interfaces. Gemin

NVIDIA Nemotron-Labs-Diffusion Hits 6x Throughput via Tri-Mode Decoding

NVIDIA introduced Nemotron-Labs-Diffusion with a new Tri-Mode decoding architecture. The model achieves up to 6x higher throughput than Qwen

OpenAI's S$300 Million Singapore Bet: The First Applied AI Lab Outside the US

OpenAI is establishing its first non-US Applied AI Lab in Singapore. The partnership involves an investment of over S$300 million. OpenAI wi

OlmoEarth v1.1 Slashes Satellite Analysis Costs by 3x via Token Integration

OlmoEarth v1.1 reduces satellite image analysis costs by up to 3x. The model integrates multiple resolution tokens into a single sequence. A

Google Gemini 3.5 and the End of the Keyword Search Era

Google announced Gemini 3.5 and Omni to transition AI from tools to agents. A new AI Ultra plan costs $100 per month for professional

OLO Robotics Moves Robot Development to the Browser via 3-Way Partnership

OLO Robotics launched a browser-based platform for robot development. The startup partnered with Deep Robotics, inMotion Robotic, and Fictio

The $3.5 Billion Bet Skydio is Making to Replace DJI in the US

Skydio raised $110 million in Series F funding to expand US manufacturing. The company plans a $3.5 billion investment to build a domestic s

FANUC and Google Partner to Shift Industrial Automation Toward Physical AI

FANUC and Google are collaborating to integrate AI agents into industrial robots. The partnership replaces rigid hard-coding with flexible P

Telelian AVS300 Solves the Data Bottleneck for Jetson AGX Thor

Telelian launched the AVS300 AI robotics platform powered by Jetson AGX Thor. The platform features 8-channel GMSL2 support and hardware-lev

Google Antigravity 2.0 Shifts AI Coding from Assistants to Orchestration

Google launched Antigravity 2.0 to enable agent-centric development. The platform uses Gemini 3.5 Flash for 4x faster agent response times.

Anthropic Cowork Ends the Clipboard Loop With Local File Access

Anthropic released Cowork to allow Claude Desktop to edit local files. The agent supports 38 external connectors and outcome-based prompting

Why OpenAI Combined C2PA and SynthID for AI Content Provenance

OpenAI adopted C2PA standards and Google's SynthID for content tracking. The multi-layered system detects AI images even after screenshots.

Amazon Bedrock PTC: Solving the Token Bloat of Traditional Tool Calling

Amazon Bedrock introduced Programmatic Tool Calling to reduce token usage. The system executes Python code in a sandbox to process large dat

Gemini Spark: Why Google is Moving AI Agents Into the Background

Google launched Gemini Spark as an active agent based on Gemini 3.5. The system uses the Model Context Protocol to integrate with external a

Why Gemini 3.5 Flash's 76.2% Terminal-Bench Score Signals the Agent Era

Google released Gemini 3.5 Flash to enable high-speed agentic workflows. The model achieves a 76.2% score on the Terminal-Bench 2.1 benchmar

KPMG Integrates Claude for 276,000 Employees via Digital Gateway

KPMG partnered with Anthropic to provide Claude to 276,000 global employees. The integration reduces AI agent build times from several weeks

Ettin Reranker's ModernBERT Models Beat BGE with 17x Fewer Parameters

Ettin Reranker released six ModernBERT-based CrossEncoder models. A 32M parameter model outperformed the 568M BGE reranker on MTEB. Flash At

Vera Rubin NVL72 Cuts Agentic AI Inference Costs by 10x

Dell and NVIDIA launched the Vera Rubin NVL72 AI infrastructure. The new hardware reduces agentic AI inference costs by 90%. Enterpri

MDS Tech Showcases FLIR Thermal Robot Solutions at STK 2026

MDS Tech presents FLIR thermal imaging solutions at STK 2026. The system integrates thermal sensors with AMRs and drones for AI monitoring.

Why Elon Musk Lost His OpenAI Lawsuit: The Statute of Limitations Trap

A jury unanimously ruled that Elon Musk's claims against OpenAI are time-barred. The court found Musk was aware of potential issues as early

How Aderant Cut Cloud Ops Search Time by 90% With Amazon Quick

Aderant integrated six disparate data silos into a single AI-driven interface. Engineers reduced task-specific search times from 45 minutes

Regis Cuts 68-Page Reports to 3 With Microsoft Copilot Studio

Regis Aged Care deployed RegiCare Assist across 72 Australian facilities. The AI tool reduces 68-page clinical reports to 3-page summaries.

NVIDIA Vera CPU Targets the Agentic AI Bottleneck With 50% Performance Gain

NVIDIA unveiled the Vera CPU designed specifically for agentic AI workloads. The processor delivers a 50% per-core performance boost under f

Anthropic Acquires Stainless to Scale Claude's Agent Connectivity

Anthropic acquired Stainless to automate SDK and MCP server creation. PwC is deploying Claude Code and Cowork to hundreds of thousands of st

NVIDIA Cosmos Predict 2.5: How LoRA Enables Efficient Robot Domain Adaptation

NVIDIA released Cosmos Predict 2.5 as a large-scale world model. LoRA and DoRA reduce trainable parameters to 50 million for efficiency. Fin

Amazon Quick Integrates with Confluence to End Knowledge Fragmentation

Amazon Quick now integrates directly with Atlassian Confluence Cloud. The platform uses a hybrid of semantic search and real-time actions. D