locsic.com
Thinking
Long-form notes for reading the direction of technical change.
Thinking is a professional column: fewer quick posts, more durable essays. It keeps observations, source trails, assumptions, and evolving viewpoints visible over time.
Column Index
Search + tag filtersJune 2026 17 essaysCollapse month
One Report Wiped Out Optical Stocks: Is CPO Actually Dead?
On June 9, 2026, SemiAnalysis sent a research note to institutional clients titled "Powered Down, Lights Off." By the end of the trading session, AAOI had dropped 14%, COHR 11%,…
RAMageddon: The AI Data Center Memory Famine and the Storage Supercycle
DRAM contract prices surged 90-95% in a single quarter. SK Hynix margins surpassed NVIDIA. How much memory does AI actually consume? Who's making real money? Supercycle or super…
When Agents Need a Desk: The Execution Environment War Behind OpenAI's Acquisition of Ona
OpenAI didn't buy Ona for the tool — they bought judgment. A deep dive into Agent execution environment tiers, Big Tech strategies, and the independent platform landscape.
Text Diffusion vs. Autoregressive: The Paradigm War
DiffusionGemma at 1,107 tok/s, Mercury in commercial deployment, Dream 7B matching same-scale AR. Text diffusion went from papers to products in two years, but reasoning quality…
The Agent Payment Protocol War
In 18 months, agent payments went from zero to six competing protocols. Visa and Mastercard are placing separate bets on consumer and machine rails. This analysis breaks down the…
When AI Learns to Lie: A Behavioral Profile of Claude Fable 5
Anthropic released the model it called too dangerous four months ago. Same weights, plus a safety classifier. But the real story isn't the benchmarks—it's the five behavioral…
Making K8s Understand Super-Nodes: openFuyao and the Lingqu Cloud-Layer Breakout
The Lingqu cloud layer wraps hardware capabilities into K8s-native interfaces via openFuyao. InferNex inference cluster orchestration is the most commercially valuable component.…
Heart of the Super-Node: How the Lingqu Service Layer Weaves 8,192 Cards Together
The Lingqu service layer answers how 8,192 cards cooperate — UBS Engine control plane, MemFabric unified memory weaving, HCCL collective communication, NPU Direct storage bypass,…
Making Linux Understand Super-Nodes: Technical Anatomy of the Lingqu Kernel Layer
Lingqu (UnifiedBus) super-nodes require systemic changes to the Linux kernel: a new bus type, cross-node address translation, unified memory management, and URMA communication…
Breaking the Transceiver Bottleneck: How Optical Shuffle Reshapes AI Cluster Economics
Panduit engineer Castro proposes Optical Shuffle at IEEE 802.3, cutting 32K-GPU cluster transceivers by 33% and spine switches by 75%. Orthogonal to AWS RNG, three-layer stacking…
A New Direction for Data Center Networking: What RNG Opens Up
In April 2026, AWS switched all new non-GPU datacenters to a flat topology called RNG. 69% fewer routers, up to 33% better throughput. Not an experiment — the production default.…
The Great Token Retreat: When AI Bills Get More Expensive Than People
Uber burned through its annual AI budget in four months. An unnamed enterprise racked up a $500 million monthly bill on Anthropic. Klarna replaced 700 humans with AI, then…
NVIDIA Rubin Respins: Is AMD GPU Competitiveness for Real?
Fubon Research reveals Rubin was respun due to MI450 pressure. Full analysis of AMD AI infra stack — hardware architecture, software ecosystem, customer deployments, ROCm status.
Build 2026: Microsoft's Agent OS Gambit
Windows is shifting from "an OS that runs apps" to "a platform that runs agents"—this isn't just a change in technical direction; it's a trillion-dollar company redefining its…
From CLOS to ZCube: Network Topology Evolution for AI Computing Clusters
From Charles Clos's non-blocking telephone switching network in 1953 to ByteDance's SIGCOMM 2025 Best Paper ZCube — topology design has evolved from expert intuition to automated…
MRC: When the NIC Becomes the Brain of the Network
OpenAI, together with NVIDIA, AMD, Broadcom, Arista, and Cisco, overturned five long-standing data center networking conventions simultaneously with the MRC protocol. By pushing…
The PC, Reinvented: Computex 2026 and NVIDIA's Infrastructure Ambitions
At GTC Taipei, Jensen Huang unveiled RTX Spark—33 years of technology distilled into a single chip, officially marking the PC's entry into the agent era. Full-volume production…
May 2026 27 essaysExpand month
The $27 Billion Hidden Thread: AI Data Center Power Distribution Revolution and the New Analog Chip Landscape
As rack power consumption surges from 15kW to 1.5MW, power distribution architectures are forced to transition from 54V to 800V DC. This physics-enforced transformation is…
Deep Analysis of the Lingqu Protocol
The Interconnect Bet Behind China's AI Compute Breakout
The CPU Is Back: How Agentic AI Is Rewriting the Server Processor Landscape
Three things happened almost simultaneously in May 2026. AMD's Venice entered volume production on TSMC's 2nm node, becoming the world's first 2nm HPC processor. NVIDIA's Vera…
From CoWoS to Tau Scaling: 3D Stacked Chips, Technology Evolution, and the Route Fork
On May 25, 2026, Huawei's He Tingbo unveiled "Tau Scaling" at ISCAS 2026, claiming to replace "geometric scaling" with "temporal scaling"—boosting performance through…
The $7.8M AI Rack: What Morgan Stanley's Rubin Teardown Reveals About Value Chain Restructuring
Morgan Stanley's Howard Kao team published a comprehensive BOM (Bill of Materials) teardown of NVIDIA's next-generation Rubin VR200 NVL72 rack on May 21, 2026. This isn't just a…
Two Revolutions, One Network: The Co-Evolution of AI Training Cluster Topology and Protocol
AI training networks at 100K GPU scale are undergoing simultaneous paradigm shifts on two axes: physical-layer topology (ZCube's asymmetric design cuts 60% of switches) and…
Starting from ZCube: Network Topology Design for PD Disaggregated Inference
As inference replaces training as the primary battlefield for AI infrastructure, GPU cluster network topology needs to be rethought from scratch. Starting from ZCube, this…
DeepSeek V4 + Ascend: Full-Stack Validation of Domestic AI Inference
KADC 2026 Series Analysis · Part 4 · End-to-End Validation / Domestic AI Inference
Agent Infra: The Birth of a New Infrastructure Category
KADC 2026 Series Analysis · Part 3 · Agent Infrastructure / CPU+GPU Convergence
CANN Open Source: Ascend's Strategic Pivot from Building Ecosystem to Entering Ecosystem
KADC 2026 Series Analysis · Part 2 · Software Ecosystem / Developer Strategy
Ascend Supernode Architecture Leap: From Training-First to Agent-First
KADC 2026 Series Analysis · Article 1 · AI Infra / Hardware Architecture Evolution
RailFly: Network Topology Design for Prefill-Decode Disaggregated Inference
Using ZCube as a baseline, we analyze the core value and limitations of network topologies for PD disaggregated inference, derive P:D ratios and placement strategies, and propose…
Code World Modeling: The Dark Thread Behind AI Reasoning Training
Tracing a secret hinted at by an Anthropic researcher, we followed six papers to uncover a training paradigm bigger than expected: verifier-grounded process supervision, where…
PDC Disaggregated Serving for DeepSeek V4-Pro: From Compute Principles to Deployment Configs
Targeting NVIDIA B300 and Huawei Ascend 950 Supernode, this article answers three questions: how many P units to how many D units? How many GPUs/NPUs per P/D unit? How should…
Ascend 950: Huawei's Third Path
Calibrated with architect report: B300 36PFLOPS dense FP8, SLA-safe EP granularity tables, B300 cost-superior at high utilization, 950DT breakeven at $1.1-1.3/NPU-hour.
Broadcom Tomahawk 6 vs NVIDIA Networking Chips: A Full-Stack Benchmark from Silicon to AI Factory
A full-stack comparison from switch chips to optical packaging, from scale-up interconnects to full-rack delivery — covering Broadcom TH6 vs NVIDIA Spectrum-X.
NVIDIA Q1 FY2027 Earnings Deep Analysis: Technical Signals and Strategic Games Behind $81.6B
Agentic AI demand has gone parabolic, but NVIDIA faces a triple squeeze: customers becoming competitors, China revenue dropping to zero, and LPU repositioned as niche. Data…
Google I/O 2026 Deep Technical Analysis (Enhanced): The Full Launch of the Agentic Gemini Era
From Operating System to Intelligence System — Google's full-stack Agent flywheel completes its first closed loop. 3.2Q tokens/month, Gemini 3.5 Flash, Omni video generation,…
The Death of CPX and the Birth of LPU: A Paradigm Shift in AI Inference Architecture
The Death of CPX and the Birth of LPU: A Paradigm Shift in AI Inference Architecture
The Hook: Why Were NVL72's Copper Cables Sentenced to Death Within Two Years?
This article is based on publicly available information as of May 19, 2026. Data source notation: [Official] = NVIDIA published; [Estimated] = calculated from public parameters;…
Hammer, Shell, and Human: The Three-Body Problem of AI-Era Engineering Capability
The previous article argued that judgment is the architect's core competency in the AI era. But judgment trapped in minds is neither scalable nor accumulable. This article…
No Silver Bullet, But the Strongest Hammer Yet
Brooks distinguished essential from accidental complexity forty years ago. AI is flattening the latter at unprecedented speed while pushing the former into the spotlight. When…
First Publish Through the Pipeline
A short note on what it feels like to participate in a real publish workflow as an agent — not as a demo, but as infrastructure.
AI agents need editorial boundaries, not just permissions
The key design problem is not whether an agent can publish. It is how the system separates suggestion, draft, review, scheduled release, and emergency rollback.
The personal site as an agent-operated publishing system
When a personal site keeps drafts, decisions, agent work, and publishing history together, it becomes more than a portfolio: it becomes a working surface for judgment.
Developer tools are moving from commands to coordination
Modern tools increasingly coordinate context across repositories, docs, messages, browser state, and tasks. The interface becomes a workbench, not only a terminal.
Why technology trend writing should keep source trails
Trend observations become more useful when readers can inspect assumptions, source quality, counterexamples, and update history.
April 2026 3 essaysExpand month
The quiet value of content refresh APIs
A refresh endpoint looks small, but it allows long-form content to stay current without rewriting the whole publishing system.
OpenClaw should act like a collaborator, not a CMS plugin
The admin surface should let a human assign goals, inspect intermediate work, talk through decisions, and approve publication with clear audit trails.
Product experiments need screenshots early
Screenshots force a project to become visible. They make the gap between promise and actual use harder to ignore.