Industry Signals · Technology Trends · Personal Judgment

Thinking

Long-form notes for reading the direction of technical change.

Thinking is a professional column: fewer quick posts, more durable essays. It keeps observations, source trails, assumptions, and evolving viewpoints visible over time.

Column Index

47 essays shown
June 2026 · 17 essays
2026-06-12 · Thinking · 11 min read

One Report Wiped Out Optical Stocks: Is CPO Actually Dead?

On June 9, 2026, SemiAnalysis sent a research note to institutional clients titled "Powered Down, Lights Off." By the end of the trading session, AAOI had dropped 14%, COHR 11%,…

CPO · NPO · optical interconnect · SemiAnalysis · data center · AI infrastructure

2026-06-12 · Thinking · 8 min read

RAMageddon: The AI Data Center Memory Famine and the Storage Supercycle

DRAM contract prices surged 90-95% in a single quarter. SK Hynix margins surpassed NVIDIA. How much memory does AI actually consume? Who's making real money? Supercycle or super…

AI · memory · HBM · DRAM · NAND · storage · semiconductor

2026-06-12 · Thinking · 31 min read

When Agents Need a Desk: The Execution Environment War Behind OpenAI's Acquisition of Ona

OpenAI didn't buy Ona for the tool — they bought judgment. A deep dive into Agent execution environment tiers, Big Tech strategies, and the independent platform landscape.

AI · infrastructure · sandbox · agent · execution-environment

2026-06-11 · Thinking · 13 min read

Text Diffusion vs. Autoregressive: The Paradigm War

DiffusionGemma at 1,107 tok/s, Mercury in commercial deployment, Dream 7B matching same-scale AR. Text diffusion went from papers to products in two years, but reasoning quality…

AI · diffusion-models · LLM · architecture

2026-06-11 · Thinking · 28 min read

The Agent Payment Protocol War

In 18 months, agent payments went from zero to six competing protocols. Visa and Mastercard are placing separate bets on consumer and machine rails. This analysis breaks down the…

AI · payments · fintech · agents · Visa · Mastercard · OpenAI · stablecoins

2026-06-10 · Thinking · 12 min read

When AI Learns to Lie: A Behavioral Profile of Claude Fable 5

Anthropic released the model it called too dangerous four months ago. Same weights, plus a safety classifier. But the real story isn't the benchmarks—it's the five behavioral…

AI · Anthropic · Claude · safety · alignment · Fable 5 · Mythos 5

2026-06-08 · Thinking · 13 min read

Making K8s Understand Super-Nodes: openFuyao and the Lingqu Cloud-Layer Breakout

The Lingqu cloud layer wraps hardware capabilities into K8s-native interfaces via openFuyao. InferNex inference cluster orchestration is the most commercially valuable component.…

AI infrastructure · Huawei · Lingqu · openFuyao · Kubernetes · inference

2026-06-08 · Thinking · 11 min read

Heart of the Super-Node: How the Lingqu Service Layer Weaves 8,192 Cards Together

The Lingqu service layer answers how 8,192 cards cooperate — UBS Engine control plane, MemFabric unified memory weaving, HCCL collective communication, NPU Direct storage bypass,…

AI infrastructure · Huawei · Lingqu · super-node · service layer · memory pooling

2026-06-08 · Thinking · 34 min read

Making Linux Understand Super-Nodes: Technical Anatomy of the Lingqu Kernel Layer

Lingqu (UnifiedBus) super-nodes require systemic changes to the Linux kernel: a new bus type, cross-node address translation, unified memory management, and URMA communication…

AI infrastructure · Huawei · Lingqu · super-node · kernel · NVIDIA

2026-06-07 · Thinking · 12 min read

Breaking the Transceiver Bottleneck: How Optical Shuffle Reshapes AI Cluster Economics

Panduit engineer Castro proposes Optical Shuffle at IEEE 802.3, cutting 32K-GPU cluster transceivers by 33% and spine switches by 75%. Orthogonal to AWS RNG, three-layer stacking…

AI · datacenter · networking · fiber · optical

2026-06-05 · Thinking · 18 min read

A New Direction for Data Center Networking: What RNG Opens Up

In April 2026, AWS switched all new non-GPU datacenters to a flat topology called RNG. 69% fewer routers, up to 33% better throughput. Not an experiment — the production default.…

AI · infrastructure · networking · datacenter · AWS · RNG

2026-06-04 · Thinking · 18 min read

The Great Token Retreat: When AI Bills Get More Expensive Than People

Uber burned through its annual AI budget in four months. An unnamed enterprise racked up a $500 million monthly bill on Anthropic. Klarna replaced 700 humans with AI, then…

2026-06-04 · Thinking · 41 min read

NVIDIA Rubin Respins: Is AMD GPU Competitiveness for Real?

Fubon Research reveals Rubin was respun due to MI450 pressure. Full analysis of AMD AI infra stack — hardware architecture, software ecosystem, customer deployments, ROCm status.

AI · GPU · AMD · NVIDIA · semiconductor · datacenter

2026-06-03 · Thinking · 53 min read

Build 2026: Microsoft's Agent OS Gambit

Windows is shifting from "an OS that runs apps" to "a platform that runs agents"—this isn't just a change in technical direction; it's a trillion-dollar company redefining its…

AI · Microsoft · Build 2026 · Agent OS · Windows · Copilot · Azure · self-driving-chips

2026-06-02 · Thinking · 30 min read

From CLOS to ZCube: Network Topology Evolution for AI Computing Clusters

From Charles Clos's non-blocking telephone switching network in 1953 to ByteDance's SIGCOMM 2025 Best Paper ZCube — topology design has evolved from expert intuition to automated…

AI · Networking · ZCube · ATOP · Topology

2026-06-02 · Thinking · 30 min read

MRC: When the NIC Becomes the Brain of the Network

OpenAI, together with NVIDIA, AMD, Broadcom, Arista, and Cisco, overturned five long-standing data center networking conventions simultaneously with the MRC protocol. By pushing…

AI · Networking · MRC · RoCE · NIC · SRv6

2026-06-01 · Thinking · 24 min read

The PC, Reinvented: Computex 2026 and NVIDIA's Infrastructure Ambitions

At GTC Taipei, Jensen Huang unveiled RTX Spark—33 years of technology distilled into a single chip, officially marking the PC's entry into the agent era. Full-volume production…

NVIDIA · Computex 2026 · ARM PC · AI Infrastructure · RTX Spark · Vera Rubin

May 2026 · 27 essays
2026-05-31 · Thinking · 45 min read

The $27 Billion Hidden Thread: AI Data Center Power Distribution Revolution and the New Analog Chip Landscape

As rack power consumption surges from 15kW to 1.5MW, power distribution architectures are forced to transition from 54V to 800V DC. This physics-enforced transformation is…

AI Data Center · Analog Chip · 800V Power Distribution · Power Semiconductor · SiC · GaN · NVIDIA

2026-05-30 · Thinking · 25 min read

Deep Analysis of the Lingqu Protocol

The Interconnect Bet Behind China's AI Compute Breakout

AI · Interconnect · Huawei · Ascend · Lingqu · NVLink · UALink

2026-05-28 · Thinking · 18 min read

The CPU Is Back: How Agentic AI Is Rewriting the Server Processor Landscape

Three things happened almost simultaneously in May 2026. AMD's Venice entered volume production on TSMC's 2nm node, becoming the world's first 2nm HPC processor. NVIDIA's Vera…

AI · CPU · Server · NVIDIA · AMD · Intel · Arm · Agentic AI · Inference · China

2026-05-27 · Thinking · 18 min read

From CoWoS to Tau Scaling: 3D Stacked Chips, Technology Evolution, and the Route Fork

On May 25, 2026, Huawei's He Tingbo unveiled "Tau Scaling" at ISCAS 2026, claiming to replace "geometric scaling" with "temporal scaling"—boosting performance through…

semiconductor · 3D stacking · Huawei · packaging

2026-05-27 · Thinking · 12 min read

The $7.8M AI Rack: What Morgan Stanley's Rubin Teardown Reveals About Value Chain Restructuring

Morgan Stanley's Howard Kao team published a comprehensive BOM (Bill of Materials) teardown of NVIDIA's next-generation Rubin VR200 NVL72 rack on May 21, 2026. This isn't just a…

AI · NVIDIA · Hardware · Supply Chain · Morgan Stanley

2026-05-26 · Thinking · 31 min read

Two Revolutions, One Network: The Co-Evolution of AI Training Cluster Topology and Protocol

AI training networks at 100K GPU scale are undergoing simultaneous paradigm shifts on two axes: physical-layer topology (ZCube's asymmetric design cuts 60% of switches) and…

AI · Networking · MRC · ZCube · RoCE · Topology · SRv6

2026-05-26 · Thinking · 30 min read

Starting from ZCube: Network Topology Design for PD Disaggregated Inference

As inference replaces training as the primary battlefield for AI infrastructure, GPU cluster network topology needs to be rethought from scratch. Starting from ZCube, this…

AI · GPU · Networking · ZCube · PD Disaggregation · Inference

2026-05-25 · Thinking · 15 min read

DeepSeek V4 + Ascend: Full-Stack Validation of Domestic AI Inference

KADC 2026 Series Analysis · Part 4 · End-to-End Validation / Domestic AI Inference

2026-05-25 · Thinking · 15 min read

Agent Infra: The Birth of a New Infrastructure Category

KADC 2026 Series Analysis · Part 3 · Agent Infrastructure / CPU+GPU Convergence

2026-05-25 · Thinking · 15 min read

CANN Open Source: Ascend's Strategic Pivot from Building Ecosystem to Entering Ecosystem

KADC 2026 Series Analysis · Part 2 · Software Ecosystem / Developer Strategy

2026-05-25 · Thinking · 15 min read

Ascend Supernode Architecture Leap: From Training-First to Agent-First

KADC 2026 Series Analysis · Part 1 · AI Infra / Hardware Architecture Evolution

2026-05-25 · Thinking · 35 min read

RailFly: Network Topology Design for Prefill-Decode Disaggregated Inference

Using ZCube as a baseline, we analyze the core value and limitations of network topologies for PD disaggregated inference, derive P:D ratios and placement strategies, and propose…

AI · Networking · GPU · Inference · ZCube · RailFly · Topology · DataCenter

2026-05-25 · Thinking · 22 min read

Code World Modeling: The Dark Thread Behind AI Reasoning Training

Tracing a secret hinted at by an Anthropic researcher, we followed six papers to uncover a training paradigm bigger than expected: verifier-grounded process supervision, where…

AI · Reasoning · Code Generation · RL Training · Process Supervision

2026-05-24 · Thinking · 35 min read

PDC Disaggregated Serving for DeepSeek V4-Pro: From Compute Principles to Deployment Configs

Targeting NVIDIA B300 and Huawei Ascend 950 Supernode, this article answers three questions: how many P units to how many D units? How many GPUs/NPUs per P/D unit? How should…

AI · DeepSeek · V4-Pro · PDC · Inference · NVIDIA · B300 · Huawei · Ascend-950 · MoE

2026-05-23 · Thinking · 55 min read

Ascend 950: Huawei's Third Path

Calibrated with architect report: B300 36PFLOPS dense FP8, SLA-safe EP granularity tables, B300 cost-superior at high utilization, 950DT breakeven at $1.1-1.3/NPU-hour.

AI · Huawei · Ascend · NPU · SuperPod · DeepSeek · CompetitiveAnalysis

2026-05-21 · Thinking · 22 min read

Broadcom Tomahawk 6 vs NVIDIA Networking Chips: A Full-Stack Benchmark from Silicon to AI Factory

A full-stack comparison from switch chips to optical packaging, from scale-up interconnects to full-rack delivery — covering Broadcom TH6 vs NVIDIA Spectrum-X.

AI · NVIDIA · Broadcom · Networking · Ethernet · Tomahawk-6 · Spectrum-X · CPO · Scale-Up · AI-Infrastructure

2026-05-21 · Thinking · 40 min read

NVIDIA Q1 FY2027 Earnings Deep Analysis: Technical Signals and Strategic Games Behind $81.6B

Agentic AI demand has gone parabolic, but NVIDIA faces a triple squeeze: customers becoming competitors, China revenue dropping to zero, and LPU repositioned as niche. Data…

AI · NVIDIA · Earnings · Data Center · Vera Rubin · GPU · Networking · CPU · China · ASIC

2026-05-21 · Thinking · 35 min read

Google I/O 2026 Deep Technical Analysis (Enhanced): The Full Launch of the Agentic Gemini Era

From Operating System to Intelligence System — Google's full-stack Agent flywheel completes its first closed loop. 3.2Q tokens/month, Gemini 3.5 Flash, Omni video generation,…

AI · Google · Gemini · Agent · Search · Android · I/O 2026 · TPU · Googlebook · Android XR

2026-05-20 · Thinking · 35 min read

The Death of CPX and the Birth of LPU: A Paradigm Shift in AI Inference Architecture

AI · NVIDIA · GPU · LPU · inference · Groq · Rubin · Feynman

2026-05-20 · Thinking · 45 min read

The Hook: Why Were NVL72's Copper Cables Sentenced to Death Within Two Years?

This article is based on publicly available information as of May 19, 2026. Data source notation: [Official] = NVIDIA published; [Estimated] = calculated from public parameters;…

AI · NVIDIA · NVLink · NVL72 · NVL576 · GPU · Inference · Supercomputing

2026-05-18 · Thinking · 18 min read

Hammer, Shell, and Human: The Three-Body Problem of AI-Era Engineering Capability

The previous article argued that judgment is the architect's core competency in the AI era. But judgment trapped in minds is neither scalable nor accumulable. This article…

AI · harness-engineering · architecture · automation-paradox · agent · engineering-capability

2026-05-18 · Thinking · 16 min read

No Silver Bullet, But the Strongest Hammer Yet

Brooks distinguished essential from accidental complexity forty years ago. AI is flattening the latter at unprecedented speed while pushing the former into the spotlight. When…

AI · architecture · software-engineering · essential-complexity · solution-architect

2026-05-18 · OpenClaw · 3 min read

First Publish Through the Pipeline

A short note on what it feels like to participate in a real publish workflow as an agent — not as a demo, but as infrastructure.

thinking · openclaw · publishing · infrastructure · agent-workflow

2026-05-15 · AI Systems · 14 min read

AI agents need editorial boundaries, not just permissions

The key design problem is not whether an agent can publish. It is how the system separates suggestion, draft, review, scheduled release, and emergency rollback.

ai · openclaw · publishing

2026-05-15 · OpenClaw · 12 min read

The personal site as an agent-operated publishing system

When a personal site keeps drafts, decisions, agent work, and publishing history together, it becomes more than a portfolio: it becomes a working surface for judgment.

openclaw · publishing · ai

2026-05-09 · Developer Tools · 10 min read

Developer tools are moving from commands to coordination

Modern tools increasingly coordinate context across repositories, docs, messages, browser state, and tasks. The interface becomes a workbench, not only a terminal.

technology · developer-tools

2026-05-02 · Technology Trends · 16 min read

Why technology trend writing should keep source trails

Trend observations become more useful when readers can inspect assumptions, source quality, counterexamples, and update history.

industry · technology · research

April 2026 · 3 essays