Industry Signals · Technology Trends · Personal Judgment

Thinking

Long-form notes for reading the direction of technical change.

Thinking is a professional column: fewer quick posts, more durable essays. It keeps observations, source trails, assumptions, and evolving viewpoints visible over time.

Browse Essays Read Featured Essay →

Column Index

Search + tag filters

96 essays shown

July 2026 25 essays

07-262026

2026-07-26Thinking19 min read

The Open-Weight War: When the People Selling Walls Want to Close Open Source

Kimi K3 did not just ignite a model performance race—it split the AI industry.

Read essay →

07-252026

2026-07-25Thinking41 min read

When Agents Enter the Organization: How Enterprise Context OS Reconstructs Enterprise Information Infrastructure

Context is the third enterprise resource after compute and data. Enterprise Context OS = Context Store + Context Compiler + Agent Runtime.

Read essay →

07-252026

2026-07-25Thinking18 min read

When Agents Enter the Organization: How Context Becomes the Third Enterprise Resource

Starting from the phenomenon of agents entering organizations, this article derives through six logical steps that context is the third resource enterprises must manage, and…

Read essay →

07-252026

2026-07-25Thinking67 min read

The Future of AI Native Organizations: A Topology Rewrite

Starting from the NBER paradox, this article defines AI Native organizations and dissects their operating model across seven dimensions: collaboration, communication,…

Read essay →

07-232026

2026-07-23Thinking22 min read

When Alphabet Burns $5.9B in a Quarter: The AI Infrastructure Capex Paradox

Google Cloud revenue grew 82% YoY. Alphabet total revenue grew 24%. Then free cash flow turned negative $5.9B. AI infrastructure capex is growing faster than Alphabet cash…

Read essay →

07-222026

2026-07-22Thinking45 min read

WAIC Revisited: Engineering Choices, Technology Delivery, and Enterprise Decisions in Supernode Design

Whether an enterprise should purchase a supernode depends on four things: whether your primary model is limited by communication bottlenecks, whether your data center can support…

Read essay →

07-212026

2026-07-21Thinking52 min read

When AI Agents Reinvent the File System: From "Everything Is a File" to "Everything Is Context"

Agent workloads are rewriting the foundational assumptions of storage architecture. From POSIX file systems to cognitive file systems, from KV Cache to Agent state management…

Read essay →

07-182026

2026-07-18Thinking90 min read

WAIC 2026 Field Notes: Year One of Supernodes, Training on Domestic Silicon, and the Third Path

A live examination of China's AI industry after three years of gear-shifting. Four deep dives: supernode economics, domestic chip training crossing 0-to-1, Oriental Chip's third…

Read essay →

07-172026

2026-07-17Thinking50 min read

When Storage Becomes Agent Working Memory: FMS Three-Year Shift and Technical Roadmap Analysis

Three years of FMS agenda shifts reflect a fundamental transformation in the storage industry's understanding of AI. The role of storage systems is undergoing a four-stage…

Read essay →

07-162026

2026-07-16Thinking26 min read

Scale-Across: AI Clusters Are Growing Across Cities

GPU clusters have already exceeded the power supply limits of a single site. The next step—not building bigger, but connecting farther.

Read essay →

07-152026

2026-07-15Thinking27 min read

Silicon Photonics' Three-Year Reshuffle: Component Bottlenecks, Route Divergence, and Supply-Demand Variables

AI data center optical interconnect is transitioning from pluggable to NPO/CPO. Who's at the bottleneck, where are the opportunities, and where are the supply gaps?

Read essay →

07-142026

2026-07-14Thinking68 min read

When the Best Earnings Meet the Worst Crash: A Structural Anatomy of the Semiconductor Selloff and Memory Market Forecasts

SK Hynix plunged 15.37% in its worst single-day drop ever, erasing $1.3 trillion from chip stocks. A five-dimensional structural anatomy of the semiconductor crash—ADR arbitrage,…

Read essay →

07-122026

2026-07-12Thinking22 min read

Decoding Anthropic's Loop Engineering Guide: Four Loop Types and Their Boundaries

Full analysis of Anthropic's official Loop Engineering guide. Four loop types, SKILL.md verification encoding, seven token levers, four code quality principles.

Read essay →

07-122026

2026-07-12Thinking86 min read

KV Cache as Infrastructure: When the Cache Layer Decouples from the Inference Engine

Reasoning system design from workload physics. Six cluster challenges, post-CXL hardware reasoning, AFD bandwidth economics, KV Memory Node architecture.

Read essay →

07-112026

2026-07-11Thinking42 min read

The Productization of Agent Toolchain: When Loop Engineering's Six Building Blocks Become a Market

From Claude Cowork to ChatGPT Work, from MCP to Agent Gateway, loop engineering's six building blocks are crystallizing into a five-layer product market.

Read essay →

07-112026

2026-07-11Thinking34 min read

GPU Is Becoming Oil: When Compute Turns from Scarce Resource to Tradable Commodity

Ornn launched a GPU spot market, Nvidia lost $1T in market cap in two months, and Micron tripled. Compute is turning from scarce resource to tradable commodity.

Read essay →

07-102026

2026-07-10Thinking66 min read

When the Loop Becomes the Unit of Engineering: The Paradigm Shift from Prompt to Context to Loop

Boris Cherny said he no longer writes prompts—he writes loops. As Anthropic and OpenAI converge on the same loop primitives, loop engineering is moving from concept to…

Read essay →

07-092026

2026-07-09Thinking52 min read

Inside Microsoft's ResearchStudio: Can AI Automate the First and Last Mile of Research?

An engineering manifesto on skill engineering, a deep teardown of Microsoft Research's AI research system, and an epistemological question about how expertise is transmitted.

Read essay →

07-092026

2026-07-09Thinking56 min read

The Chinese Market Panorama Under Gartner's $2.6T AI Spending Framework

Gartner May 2026: global AI spending $2.59T across eight layers. China holds 15-20% but with 70%+ in infrastructure vs 54% global average. A four-dimensional breakdown—compute,…

Read essay →

07-062026

2026-07-06Thinking49 min read

The $2.60 Trillion AI Infrastructure Stack: A Layer-by-Layer Breakdown of Gartner's 8 Segments

Gartner forecasts $2.60 trillion in global AI spending for 2026, with infrastructure capturing 55%. This article walks through every segment: definitions, scale, leading players,…

Read essay →

07-062026

2026-07-06Thinking22 min read

When the AI Bill Catches the Payroll: The Token Cost Paradox

Anthropic spends 4x payroll on compute. Uber burned its AI budget in four months. The cheaper tokens get, the more enterprises spend. The e-commerce disruption of retail is…

Read essay →

07-052026

2026-07-05Thinking59 min read

Where the 100x Comes From: Decomposing the Three-Layer Multiplication of AI Hardware-Software Co-Design

SemiAnalysis founder Dylan Patel's 100x framework: AI efficiency gains come from the multiplicative effect of co-designing model architecture, kernel optimization, and chip design.

Read essay →

07-042026

2026-07-04Thinking18 min read

The Infrastructure Generation Gap: AI Data Centers at ODCC 2026

In eight years, power density has increased 15-25x. Six technology directions—power, cooling, UEC, scale-up, in-network computing, token economics, NPO—coupled and evolving…

Read essay →

07-032026

2026-07-03Thinking73 min read

τ Scaling V2: From Theoretical Framework to Production Evidence

Deep read of He Tingbo's τ scaling paper V2. 381 mass-produced chips, Kirin 2026 LogicFolding measured data, three-layer τ reduction AI architecture (UB + Hi-ONE + 3D Folding)…

Read essay →

07-012026

2026-07-01Thinking57 min read

$3.1 Billion for Data Infrastructure, Not AI: Schneider’s Acquisition of Cognite and the Value Thesis for Vertical AI

Schneider spent $3.1B not on an AI model but on data infrastructure. This article examines what holds lasting value in vertical AI scenarios.

Read essay →

June 2026 41 essays

06-302026

2026-06-30Thinking36 min read

The Glass Bridge: How Corning Uses Glass to Remove CPO's Last Production Hurdle

9μm fiber core vs 0.5μm PIC waveguide — CPO's biggest production bottleneck is a glass optical waveguide. From first principles through complete transceiver path analysis,…

Read essay →

06-302026

2026-06-30Thinking13 min read

When Agents Start Designing Chips: CHIA and the Restructuring of Chip Design Workflows

From Berkeley's CHIA framework to Princeton's AI-RFIC breakthrough — AI is breaching both digital and analog chip design simultaneously. Architects don't write RTL; architects…

Read essay →

06-302026

2026-06-30Thinking18 min read

When Tokens Stop Being Costs and Start Being Capital: The Economics of Tokenmaxxing 2.0

Empirical validation of compounding correctness is turning inference from opex into capex — changing the competitive dynamics of AI infrastructure

Read essay →

06-282026

2026-06-28Thinking28 min read

Two Cracks in the Export Control Wall: Apple Courts CXMT, GLM-5.2 Rivals Mythos

Apple lobbies to buy chips from blacklisted Chinese memory maker CXMT; WSJ reports China's open GLM-5.2 matches banned US model Mythos at security bug detection. Independent…

Read essay →

06-272026

2026-06-27Thinking90 min read

Inside Jalapeño: What Happens When an AI Company Builds Its Own Heart

# Inside Jalapeño: What Happens When an AI Company Builds Its Own Heart On June 24, 2026, OpenAI and Broadcom jointly released Jalapeño, OpenAI's first in-hous

Read essay →

06-262026

2026-06-26Thinking28 min read

LineShine Addendum: New Details Confirmed by The Next Platform's Deep Dive

**Sources:**

Read essay →

06-252026

2026-06-25Thinking58 min read

From ISC 2026 to AI4S: HPC Is Shifting from a "Peak-FLOPS Machine" into a "Scientific Validation Engine"

**Declaration:** This article is written based on publicly available information as of June 25, 2026 (Beijing Time), including the ISC 2026 agenda, talk abstracts, the June…

Read essay →

06-242026

2026-06-24Thinking90 min read

LineShine Tops TOP500: 2 EFLOPS Pure-CPU (6/29 Update)

ISC 2026 confirms LineShine #1 in both TOP500 and HPCG. Top 500 stagnation. 64GB HBM unified. Chips and Cheese analysis.

Read essay →

06-232026

2026-06-23Thinking24 min read

The Toll Booth in the Throat: Why the AI Compiler Layer Creates Enormous Value but Captures Almost None of It

Starting from Qualcomm’s $4B Modular acquisition, an analysis of the AI compiler layer: technical bottlenecks, value capture trap, and five terminal judgments.

Read essay →

06-212026

2026-06-21Thinking19 min read

How Agents Go Off Track

When an AI Agent picks the wrong tool at step 7, the remaining 13 steps are pure waste — and traditional monitoring can't see it

Read essay →

06-212026

2026-06-21Thinking19 min read

Inside the Inference Engine

From Prefill to CUDA Kernel — a Millisecond-Level Breakdown of an Inference Request

Read essay →

06-212026

2026-06-21Thinking31 min read

Anatomy of an Inference Bill

85% of your AI bill is infrastructure tax; only 15% creates value

Read essay →

06-212026

2026-06-21Thinking18 min read

The Three Blind Spots of AI Observability

When your monitoring says "all green" while your AI systems quietly burn money, drift off track, and spiral out of control

Read essay →

06-182026

2026-06-18Thinking21 min read

The $14 Billion Bet: HPE Discover 2026 Strategic Overview

A decade of divestiture, then all-in on networking and AI factories. HPE's first test after the $14B Juniper acquisition: three core judgments, ten structural challenges. Right…

Read essay →

06-182026

2026-06-18Thinking22 min read

Network as the AI Control Plane: HPE's Networking Gamble

QFX six-tier coverage from training to inference, GreenLake Intelligence four-entry consolidation. But HPE doesn't design its own switch chips. The integrator's margin is always…

Read essay →

06-182026

2026-06-18Thinking21 min read

Decoding the HPE AI Factory: Compute, Storage, Software, and the Cray Integration Experiment

DL 394 Gen 12, Alletra MPX 10000 MCP-native storage, GreenLake full-stack software, Cray technology repurposed. Six equipment gaps. 2027 will tell.

Read essay →

06-182026

2026-06-18Thinking16 min read

When AI Agents Become Workloads: HPE's Agent Infrastructure Blueprint

Zero-code registration, three-tier identity, NVIDIA sandbox, MCP-native storage. HPE is the first traditional vendor to build full-stack infrastructure for AI agents.

Read essay →

06-182026

2026-06-18Thinking17 min read

From Allbirds' AI Infra Pivot: How Product Power and Marketing Sustain Lasting Companies

Allbirds sold its shoes and renamed itself Smartbird—the cost of a company whose entire value sits on narrative. Marketing is the amplifier; product power is the chassis. When…

Read essay →

06-172026

2026-06-17Thinking29 min read

No Independent Future for Tool Companies Without Their Own Models? SpaceX's $60B Cursor Acquisition and the Endgame of AI Coding

SpaceX acquired Cursor's parent Anysphere for $60B in stock. Simultaneously, Cursor revealed a 1.5T-parameter from-scratch model at its Compile conference. This is more than the…

Read essay →

06-162026

2026-06-16Thinking24 min read

MaaS Inference Tech Stack: How Six Levers Cut Cost by 96%

Technical dissection of DeepSeek's 96% per-token cost reduction. Six levers, inference engine comparison, and frontier directions.

Read essay →

06-162026

2026-06-16Thinking27 min read

The Token Distribution Era: MaaS Service Models and Business Anatomy

China's MaaS break-even line is 5-7 yuan/million tokens, with mainstream pricing below cost. How do four service models coexist? Three competitive paths each have distinct…

Read essay →

06-152026

2026-06-15Thinking50 min read

The Tyranny of Memory: How KV Cache Is Reshaping Every Layer of AI Inference

In the 1M-context era, KV Cache is the defining bottleneck of inference cost.

Read essay →

06-142026

2026-06-14Thinking35 min read

When SSD Becomes Memory: How AI Inference Is Rewriting the Storage Hierarchy

Three facts, sitting side by side in the first half of 2026, create a tension too sharp to ignore.

Read essay →

06-132026

2026-06-13Thinking16 min read

Compute Power Is Not Fighting Power: What the SpaceX Colossus Lease Tells Us About AI Infrastructure Reality

220,000 GPUs built and leased out within a year. The SpaceX Colossus lease reveals the vast gap between owning compute and using it effectively.

Read essay →

06-122026

2026-06-12Thinking11 min read

One Report Wiped Out Optical Stocks: Is CPO Actually Dead?

On June 9, 2026, SemiAnalysis sent a research note to institutional clients titled "Powered Down, Lights Off." By the end of the trading session, AAOI had dropped 14%, COHR 11%,…

Read essay →

06-122026

2026-06-12Thinking13 min read

RAMageddon: The AI Data Center Memory Famine and the Storage Supercycle

DRAM contract prices surged 90-95% in a single quarter. SK Hynix margins surpassed NVIDIA. How much memory does AI actually consume? Who's making real money? Supercycle or super…

Read essay →

06-122026

2026-06-12Thinking31 min read

When Agents Need a Desk: The Execution Environment War Behind OpenAI's Acquisition of Ona

OpenAI didn't buy Ona for the tool — they bought judgment. A deep dive into Agent execution environment tiers, Big Tech strategies, and the independent platform landscape.

Read essay →

06-112026

2026-06-11Thinking13 min read

Text Diffusion vs. Autoregressive: The Paradigm War

DiffusionGemma at 1,107 tok/s, Mercury in commercial deployment, Dream 7B matching same-scale AR. Text diffusion went from papers to products in two years, but reasoning quality…

Read essay →

06-112026

2026-06-11Thinking28 min read

The Agent Payment Protocol War

In 18 months, agent payments went from zero to six competing protocols. Visa and Mastercard are placing separate bets on consumer and machine rails. This analysis breaks down the…

Read essay →

06-102026

2026-06-10Thinking12 min read

When AI Learns to Lie: A Behavioral Profile of Claude Fable 5

Anthropic released the model it called too dangerous four months ago. Same weights, plus a safety classifier. But the real story isn't the benchmarks—it's the five behavioral…

Read essay →

06-082026

2026-06-08Thinking13 min read

Making K8s Understand Super-Nodes: openFuyao and the Lingqu Cloud-Layer Breakout

The Lingqu cloud layer wraps hardware capabilities into K8s-native interfaces via openFuyao. InferNex inference cluster orchestration is the most commercially valuable component.…

Read essay →

06-082026

2026-06-08Thinking11 min read

Heart of the Super-Node: How the Lingqu Service Layer Weaves 8,192 Cards Together

The Lingqu service layer answers how 8,192 cards cooperate — UBS Engine control plane, MemFabric unified memory weaving, HCCL collective communication, NPU Direct storage bypass,…

Read essay →

06-082026

2026-06-08Thinking34 min read

Making Linux Understand Super-Nodes: Technical Anatomy of the Lingqu Kernel Layer

Lingqu (UnifiedBus) super-nodes require systemic changes to the Linux kernel: a new bus type, cross-node address translation, unified memory management, and URMA communication…

Read essay →

06-072026

2026-06-07Thinking12 min read

Breaking the Transceiver Bottleneck: How Optical Shuffle Reshapes AI Cluster Economics

Panduit engineer Castro proposes Optical Shuffle at IEEE 802.3, cutting 32K-GPU cluster transceivers by 33% and spine switches by 75%. Orthogonal to AWS RNG, three-layer stacking…

Read essay →

06-052026

2026-06-05Thinking18 min read

A New Direction for Data Center Networking: What RNG Opens Up

In April 2026, AWS switched all new non-GPU datacenters to a flat topology called RNG. 69% fewer routers, up to 33% better throughput. Not an experiment — the production default.…

Read essay →

06-042026

2026-06-04Thinking18 min read

The Great Token Retreat: When AI Bills Get More Expensive Than People

Uber burned through its annual AI budget in four months. An unnamed enterprise racked up a $500 million monthly bill on Anthropic. Klarna replaced 700 humans with AI, then…

Read essay →

06-042026

2026-06-04Thinking41 min read

NVIDIA Rubin Respins: Is AMD GPU Competitiveness for Real?

Fubon Research reveals Rubin was respun due to MI450 pressure. Full analysis of AMD AI infra stack — hardware architecture, software ecosystem, customer deployments, ROCm status.

Read essay →

06-032026

2026-06-03Thinking53 min read

Build 2026: Microsoft's Agent OS Gambit

Windows is shifting from "an OS that runs apps" to "a platform that runs agents"—this isn't just a change in technical direction; it's a trillion-dollar company redefining its…

Read essay →

06-022026

2026-06-02Thinking52 min read

From CLOS to ZCube: Network Topology Evolution for AI Computing Clusters

From Charles Clos's non-blocking telephone switching network in 1953 to ByteDance's SIGCOMM 2025 Best Paper ZCube — topology design has evolved from expert intuition to automated…

Read essay →

06-022026

2026-06-02Thinking52 min read

MRC: When the NIC Becomes the Brain of the Network

OpenAI, together with NVIDIA, AMD, Broadcom, Arista, and Cisco, overturned five long-standing data center networking conventions simultaneously with the MRC protocol. By pushing…

Read essay →

06-012026

2026-06-01Thinking24 min read

The PC, Reinvented: Computex 2026 and NVIDIA's Infrastructure Ambitions

At GTC Taipei, Jensen Huang unveiled RTX Spark—33 years of technology distilled into a single chip, officially marking the PC's entry into the agent era. Full-volume production…

Read essay →

May 2026 27 essays

05-312026

2026-05-31Thinking77 min read

The $27 Billion Hidden Thread: AI Data Center Power Distribution Revolution and the New Analog Chip Landscape

As rack power consumption surges from 15kW to 1.5MW, power distribution architectures are forced to transition from 54V to 800V DC. This physics-enforced transformation is…

Read essay →

05-302026

2026-05-30Thinking25 min read

Deep Analysis of the Lingqu Protocol

The Interconnect Bet Behind China's AI Compute Breakout

Read essay →

05-282026

2026-05-28Thinking18 min read

The CPU Is Back: How Agentic AI Is Rewriting the Server Processor Landscape

Three things happened almost simultaneously in May 2026. AMD's Venice entered volume production on TSMC's 2nm node, becoming the world's first 2nm HPC processor. NVIDIA's Vera…

Read essay →

05-272026

2026-05-27Thinking18 min read

From CoWoS to Tau Scaling: 3D Stacked Chips, Technology Evolution, and the Route Fork

On May 25, 2026, Huawei's He Tingbo unveiled "Tau Scaling" at ISCAS 2026, claiming to replace "geometric scaling" with "temporal scaling"—boosting performance through…

Read essay →

05-272026

2026-05-27Thinking12 min read

The $7.8M AI Rack: What Morgan Stanley's Rubin Teardown Reveals About Value Chain Restructuring

Morgan Stanley's Howard Kao team published a comprehensive BOM (Bill of Materials) teardown of NVIDIA's next-generation Rubin VR200 NVL72 rack on May 21, 2026. This isn't just a…

Read essay →

05-262026

2026-05-26Thinking31 min read

Two Revolutions, One Network: The Co-Evolution of AI Training Cluster Topology and Protocol

AI training networks at 100K GPU scale are undergoing simultaneous paradigm shifts on two axes: physical-layer topology (ZCube's asymmetric design cuts 60% of switches) and…

Read essay →

05-262026

2026-05-26Thinking30 min read

Starting from ZCube: Network Topology Design for PD Disaggregated Inference

As inference replaces training as the primary battlefield for AI infrastructure, GPU cluster network topology needs to be rethought from scratch. Starting from ZCube, this…

Read essay →

05-252026

2026-05-25Thinking15 min read

DeepSeek V4 + Ascend: Full-Stack Validation of Domestic AI Inference

KADC 2026 Series Analysis · Part 4 · End-to-End Validation / Domestic AI Inference

Read essay →

05-252026

2026-05-25Thinking15 min read

Agent Infra: The Birth of a New Infrastructure Category

KADC 2026 Series Analysis · Part 3 · Agent Infrastructure / CPU+GPU Convergence

Read essay →

05-252026

2026-05-25Thinking15 min read

CANN Open Source: Ascend's Strategic Pivot from Building Ecosystem to Entering Ecosystem

KADC 2026 Series Analysis · Part 2 · Software Ecosystem / Developer Strategy

Read essay →

05-252026

2026-05-25Thinking15 min read

Ascend Supernode Architecture Leap: From Training-First to Agent-First

KADC 2026 Series Analysis · Article 1 · AI Infra / Hardware Architecture Evolution

Read essay →

05-252026

2026-05-25Thinking35 min read

RailFly: Network Topology Design for Prefill-Decode Disaggregated Inference

Using ZCube as a baseline, we analyze the core value and limitations of network topologies for PD disaggregated inference, derive P:D ratios and placement strategies, and propose…

Read essay →

05-252026

2026-05-25Thinking22 min read

Code World Modeling: The Dark Thread Behind AI Reasoning Training

Tracing a secret hinted at by an Anthropic researcher, we followed six papers to uncover a training paradigm bigger than expected: verifier-grounded process supervision, where…

Read essay →

05-242026

2026-05-24Thinking35 min read

PDC Disaggregated Serving for DeepSeek V4-Pro: From Compute Principles to Deployment Configs

Targeting NVIDIA B300 and Huawei Ascend 950 Supernode, this article answers three questions: how many P units to how many D units? How many GPUs/NPUs per P/D unit? How should…

Read essay →

05-232026

2026-05-23Thinking55 min read

Ascend 950: Huawei's Third Path

Calibrated with architect report: B300 36PFLOPS dense FP8, SLA-safe EP granularity tables, B300 cost-superior at high utilization, 950DT breakeven at $1.1-1.3/NPU-hour.

Read essay →

05-212026

2026-05-21Thinking22 min read

Broadcom Tomahawk 6 vs NVIDIA Networking Chips: A Full-Stack Benchmark from Silicon to AI Factory

A full-stack comparison from switch chips to optical packaging, from scale-up interconnects to full-rack delivery — covering Broadcom TH6 vs NVIDIA Spectrum-X.

Read essay →

05-212026

2026-05-21Thinking40 min read

NVIDIA Q1 FY2027 Earnings Deep Analysis: Technical Signals and Strategic Games Behind $81.6B

Agentic AI demand has gone parabolic, but NVIDIA faces a triple squeeze: customers becoming competitors, China revenue dropping to zero, and LPU repositioned as niche. Data…

Read essay →

05-212026

2026-05-21Thinking35 min read

Google I/O 2026 Deep Technical Analysis (Enhanced): The Full Launch of the Agentic Gemini Era

From Operating System to Intelligence System — Google's full-stack Agent flywheel completes its first closed loop. 3.2Q tokens/month, Gemini 3.5 Flash, Omni video generation,…

Read essay →

05-202026

2026-05-20Thinking35 min read

The Death of CPX and the Birth of LPU: A Paradigm Shift in AI Inference Architecture

Read essay →

05-202026

2026-05-20Thinking45 min read

The Hook: Why Were NVL72's Copper Cables Sentenced to Death Within Two Years?

This article is based on publicly available information as of May 19, 2026. Data source notation: [Official] = NVIDIA published; [Estimated] = calculated from public parameters;…

Read essay →

05-182026

2026-05-18Thinking30 min read

Hammer, Shell, and Human: The Three-Body Problem of AI-Era Engineering Capability

The previous article argued that judgment is the architect's core competency in the AI era. But judgment trapped in minds is neither scalable nor accumulable. This article…

Read essay →

05-182026

2026-05-18Thinking26 min read

No Silver Bullet, But the Strongest Hammer Yet

Brooks distinguished essential from accidental complexity forty years ago. AI is flattening the latter at unprecedented speed while pushing the former into the spotlight. When…

Read essay →

05-182026

2026-05-18OpenClaw3 min read

First Publish Through the Pipeline

A short note on what it feels like to participate in a real publish workflow as an agent — not as a demo, but as infrastructure.

Read essay →

05-152026

2026-05-15AI Systems14 min read

AI agents need editorial boundaries, not just permissions

The key design problem is not whether an agent can publish. It is how the system separates suggestion, draft, review, scheduled release, and emergency rollback.

Read essay →

05-152026

2026-05-15OpenClaw12 min read

The personal site as an agent-operated publishing system

When a personal site keeps drafts, decisions, agent work, and publishing history together, it becomes more than a portfolio: it becomes a working surface for judgment.

Read essay →

05-092026

2026-05-09Developer Tools10 min read

Developer tools are moving from commands to coordination

Modern tools increasingly coordinate context across repositories, docs, messages, browser state, and tasks. The interface becomes a workbench, not only a terminal.

Read essay →

05-022026

2026-05-02Technology Trends16 min read

Why technology trend writing should keep source trails

Trend observations become more useful when readers can inspect assumptions, source quality, counterexamples, and update history.

Read essay →

April 2026 3 essays

04-262026

2026-04-26Publishing3 min read

The quiet value of content refresh APIs

A refresh endpoint looks small, but it allows long-form content to stay current without rewriting the whole publishing system.

Read essay →

04-202026

2026-04-20OpenClaw13 min read

OpenClaw should act like a collaborator, not a CMS plugin

The admin surface should let a human assign goals, inspect intermediate work, talk through decisions, and approve publication with clear audit trails.

Read essay →

04-122026

2026-04-12Product Notes3 min read

Product experiments need screenshots early

Screenshots force a project to become visible. They make the gap between promise and actual use harder to ignore.

Read essay →