Think in Context: AWS re:Invent 2025 Keynote with Dr. Swami Sivasubramanian

Post Title Image (Illustration: AWS re:Invent 2025 Keynote with Dr. Swami Sivasubramanian. Image source: AWS.)

Dr. Swami Sivasubramanian, AWS VP of Agentic AI, delivered the developer-focused keynote at re:Invent 2025, showcasing how AI agents are transforming the way we build software. From Blue Origin using agentic AI to design lunar infrastructure 75% faster, to Vercel powering 11 million customers with self-driving infrastructure, this keynote demonstrated that the age of autonomous AI agents is here. The announcement of Nova Act with 90% reliability for enterprise workflows marks a turning point for production-ready automation.

✳️ tl;dr

One theme “Agentic AI” runs throughout, with five sections:

  • Understanding Agents: Agents sense environments, convert objectives into executable steps, and continuously learn; unlike chatbots that give advice, agents investigate and initiate solutions.
  • Building Agents: Strands Agent SDK (5M+ downloads) and Amazon Bedrock AgentCore provide model-driven development with Identity, Policy, Evals, and Episodic Memory capabilities.
  • Customizing Models: Reinforcement Fine-Tuning in Amazon Bedrock delivers 66% accuracy gains; Nova Forge enables custom frontier models by mixing proprietary data during mid-training.
  • Ensuring Trust: Amazon Nova Act achieves 90% reliability through neuro-symbolic AI combining automated reasoning with LLMs; Cedar policy language provides deterministic controls.
  • Reimagining Work: Kiro autonomous agent, AWS Security Agent, AWS DevOps Agent, plus Amazon Connect with 8 new agentic capabilities and Nova Sonic integration.

✳️ Live Experience

(Caption: A few nights before the Dr. Swami Keynote, on the second floor of The Venetian in Las Vegas, we luckily caught Dr. Swami in the wild. We introduced ourselves as AWS Heroes from Taiwan and got an enthusiastic handshake from this super approachable leader — absolutely thrilled.)

Read More

Think in Context: AWS re:Invent 2025 Keynote with CEO Matt Garman

Post Title Image (Illustration: AWS re:Invent 2025 Keynote with CEO Matt Garman. Image source: AWS.)

AWS CEO Matt Garman opened the 14th annual re:Invent to over 60,000 attendees and nearly 2 million online viewers, framing AWS as a $132 billion business accelerating at 20% year over year. Under the theme “Freedom to Invent,” Garman walked through five interconnected areas: AI infrastructure (Trainium3 GA, Trainium4 preview, P6e-GB300, AI Factories), models and intelligence (Nova 2 family with Lite/Pro/Sonic/Omni, Nova Forge open training models), agents at scale (AgentCore Policy and Evaluations), developer transformation (AWS Transform Custom, Kiro IDE, three frontier agents), and a rapid-fire round of 25 core service launches across compute, Lambda, storage, EMR, security, and databases.

✳️ tl;dr

One theme “Freedom to Invent” runs throughout, with five sections:

  • AI Infrastructure: AWS leads GPU reliability, launches P6e-GB300 with Nvidia, announces Trainium3 GA (first 3nm AI chip in AWS Cloud), previews Trainium4 (6x FP4 compute), and introduces AWS AI Factories for dedicated on-premises AI infrastructure.
  • Models & Intelligence: Amazon Bedrock surpasses 100K customers with 50+ trillion-token accounts (by comparison, when I compiled my 2025 Year in Review, feeding an entire year of Heptabase journal notes only came to 200k tokens); new open-weights models from Mistral, Google, MiniMax, Nvidia; Nova 2 family (Lite, Pro, Sonic, Omni) delivers frontier-level intelligence at optimized cost; Nova Forge introduces open training models for custom Novellas.
  • Agents at Scale: Amazon Bedrock AgentCore adds Policy (Cedar-based deterministic controls) and Evaluations (continuous quality inspection with 13 pre-built evaluators); guest speakers from Sony, Adobe, and Writer showcase enterprise adoption.
  • Developer Transformation: AWS Transform Custom supports any code modernization; Kiro becomes Amazon’s official AI IDE; three frontier agents launched: Kiro autonomous agent, AWS Security Agent, AWS DevOps Agent.
  • 25 Core Service Launches: Rapid-fire announcements including new EC2 instance families (X, C8a, C8ine, M8azn, Mac), Lambda durable functions, S3 50TB max object size, S3 Vectors GA, EMR serverless storage, GuardDuty for ECS, Security Hub GA, RDS 256TB capacity, and database savings plans.

✳️ Live Experience

(Caption: On the day of the CEO Keynote, I woke up early in Las Vegas, lined up to get in, and for the first time challenged myself to take notes live on site using only a reMarkable Paper Pro for handwritten notes — quickly jotting down keywords, rapidly categorizing and capturing the structure, while simultaneously mapping out how I could help brief teammates if they were pulled into other tasks on short notice. Grateful for my experience leading camps as a kid and the brain-intensive training from a disciplined morning routine.)

(Caption: After the CEO Keynote ended, we had only an hour and a half to regroup, move locations, grab food, and brief each other before heading straight into the re:Invent Podcast recording studio. Every year we challenge ourselves with different tasks that push the bar even higher.)

Read More

The Unbearable Lightness of Being Focused - Unbox Ricoh GR IV (61P)

Ricoh GR IV (Caption: Ricoh GR IV, the latest chapter of the street photography god machine.)

Oh oh oh! The “Ginkgo Viewing Set” that I’ve been waiting for two and a half months has finally leisurely left the warehouse! These two and a half months were too long, so long that I felt I needed Ginkgo to supplement my brain power.

  • Ricoh released GR IV on 2025-08-20,
  • I rushed to B&H to pre-order one (and related accessories) as soon as possible on 2025-08-21,
  • Waiting and hoping, waiting and waiting, I received two Backorder Status updates in between.
  • In mid-October, I couldn’t help but ask customer service if there was an estimated delivery date, because I really wanted to take the GR IV on my upcoming business trips and street sweeping.
  • Luckily, I received the Order Shipped notification on 2025-10-31,
  • But encountering Double 11 and African Swine Fever, the entire customs was jammed,
  • It wasn’t until 2025-11-09 that I received the import duty and tax notification. Fortunately, I received the box immediately after payment, and I can happily unbox it.

My last Ricoh was the R3 from 2005 (it even lacked a G!), and in a blink of an eye, it’s been twenty years. This time, I even bought the camera from B&H. Is it true love or destiny? Am I really destined with these two letters? Although the Ginkgo camera (plus batteries, flash, and other accessories) was stuck in customs for quite a few days, and for the first time I didn’t receive any EZ WAY notification (maybe because I manually filled out the paper appointment letter first?), I finally got it before Thanksgiving, giving me time to learn how to set it up (play with it?!) before my business trip.

Today, GR IV has finally arrived. Used some other bricks to exchange for these bricks in front of me. Tomorrow I’ll tell the kids this is also a type of Minecraft?

Okay, okay, no more chatting, let’s hurry up and unbox this year’s “Unbearable Lightness of Being Focused”. (At least compared to last year’s “Unbearable Lightness of Being Focused”, it is really light!)

Read More

Summary of the Amazon DynamoDB Service Disruption in the Northern Virginia (US-EAST-1) Region, 2025-10

Post Title Image (Illustration: Screenshot of AWS Health Dashboard at 2025-10-20 12:51 PDT. Image source: Ernest.)

✳️ tl;dr

  • The following content is from an official AWS report 1, segmented and highlighted by AWS Community Hero Ernest 2 from the perspective of a developer and technical manager, aiming to stay close to the facts and conduct reasoning and extended learning based on these facts.
  • Through studying this report, we hope that both parties (AWS and us as AWS customers) can accumulate experience and continue to improve together, whether in the cloud or on-premises.
  • Unless otherwise specified, all times below are in Pacific Daylight Time (PDT) from AWS Seattle headquarters on the West Coast.
  • This note will begin with a knowledge graph, followed by a breakdown of the original official report content, divided into four sections: Amazon DynamoDB, Amazon EC2, Network Load Balancer (NLB), Other AWS Services
  • If you have the budget to adjust your architecture for cross-region high availability but don’t have enough time for major architectural changes, it is recommended to take a look at AWS services with “global” in their name. For example, “Amazon DynamoDB Global Tables” from the same DynamoDB family was almost unaffected during this incident.

  • We wanted to provide you with some additional information about the service disruption that occurred
    • in the N. Virginia (us-east-1) Region 3
    • on October 19 and 20, 2025.
    • While the event started at 11:48 PM PDT on October 19 (Taipei Timezone UTC+8, 2025-10-20 14:48)
    • and ended at 2:20 PM PDT on October 20 (Taipei Timezone UTC+8, 2025-10-21 05:20),
    • there were three distinct periods of impact to customer applications.
      • First, between 11:48 PM on October 19 and 2:40 AM on October 20, Amazon DynamoDB experienced increased API error rates in the N. Virginia (us-east-1) Region.
      • Second, between 5:30 AM and 2:09 PM on October 20, Network Load Balancer (NLB) experienced increased connection errors for some load balancers in the N. Virginia (us-east-1) Region.
        • This was caused by health check failures in the NLB fleet, which resulted in increased connection errors on some NLBs.
      • Third, between 2:25 AM and 10:36 AM on October 20, new EC2 instance launches failed and, while instance launches began to succeed from 10:37 AM, some newly launched instances experienced connectivity issues which were resolved by 1:50 PM.

Read More

Sönke Ahrens' How to Take Smart Notes: Modern Systematization of the Zettelkasten Method

Post Title Image (Hand-drawn by Ernest Chiang. You might also be interested in his Ernest PKM workflow.)


1️⃣ Introduction: Making Zettelkasten Learnable and Replicable

In 2017, German scholar Sönke Ahrens published a book that transformed the knowledge management field: How to Take Smart Notes. 1

This book accomplished something important: systematizing, proceduralizing, and making actionable Niklas Luhmann's Zettelkasten method.

Before Ahrens, Zettelkasten was more like a “legend”—we knew Luhmann wrote 70 books using this method, but weren’t quite clear how ordinary people could replicate this system. Luhmann’s own 1981 paper “Communicating with Slip Boxes” was more philosophical reflection than operational manual. (I personally prefer reflection, but many friends have been asking about methods, so I compiled this note.)

“Writing is not what happens after thinking.
Writing is the medium of thinking."

— Sönke Ahrens, How to Take Smart Notes (2017)

Read More

Niklas Luhmann's Original Zettelkasten: Two Slip Boxes, Fixed Numbering, and Communication Partner

Zettelkasten slip box note-taking system (Hand-drawn by Ernest Chiang. You might also be interested in his Ernest PKM workflow.)

1️⃣ Introduction: A Sociologist and His Thinking Machine

Niklas Luhmann (1927–1998) was a German sociologist renowned for his systems theory. During his academic career, he achieved astonishing productivity: 70 books and over 400 scholarly articles.

But even more remarkable, he attributed all of this to what seemed like a simple tool: the Zettelkasten (slip box).

  • This was not an ordinary note-taking system.
    • Luhmann began building this system in the 1950s, eventually accumulating over 90,000 index cards.
  • He called this system his communication partner
    • An external brain capable of dialoguing with him, facilitating thinking, and even “surprising” himself. 1
    • Doesn’t this sound like a manual, century-old version of an AI Agent or knowledge assistant?!

In his famous 1981 paper “Communicating with Slip Boxes” (Kommunikation mit Zettelkästen), Luhmann described in detail how this system worked. But interestingly, much of the modern understanding of the Zettelkasten method is actually mixed with interpretations and adaptations by later scholars.

This note explores the original method that Luhmann himself actually used.


“Without writing, one cannot think;
at least not in a sophisticated, connectable manner.”

— Niklas Luhmann, Kommunikation mit Zettelkästen (1981)1

Read More

KKR and ECP's $50B AI Infrastructure Play

Post Title Image (Illustration: KKR HQ locates at 30 hudson yards in New York. Image source: Photo by Illya Goloborodko.)

✳️ tl;dr

  • News tracking 2024-10-30 1 → 2025-07-30 2
  • KKR is no stranger to the tech industry, having acquired an 80% stake in Philips’ semiconductor division in 2006, which was later renamed NXP Semiconductors. This was a client Ernest once served, leaving a particularly deep impression.

  • 【2024-10-30】KKR and Energy Capital Partners announced a $50 billion strategic partnership focused on accelerating data center, power generation, and transmission infrastructure development to support global AI and cloud computing expansion 1
  • The partnership combines over 8GW of existing data center pipeline and 100GW of operating and development-ready power generation capacity, with KKR owning over 100 data center facilities worldwide 1
  • ECP invests in clean energy asset base, owning and operating over 83GW of power generation capacity in the U.S. market, spanning five asset classes including power generation, renewables, and storage 1
  • The partnership aims to collaborate with utilities, power producers, and data center developers to rapidly and responsibly develop large data center campuses for hyperscalers 1
  • Context: BlackRock launched a $30 billion AI infrastructure fund in the same month, backed by Microsoft and Nvidia, demonstrating capital’s rush into AI infrastructure 1

  • 【2025-07-30】First investment lands: 190MW hyperscale data center campus in Bosque County, Texas, marking 9 months from strategic announcement to first project announcement 2
  • Innovative co-location model: Data center adjacent to Calpine’s Thad Hill Energy Center natural gas power plant, representing the first such dedicated power agreement with a hyperscaler 2
  • Constructed through a joint venture between CyrusOne and ECP, expected to be operational in Q4 2026, with total investment approaching $4 billion, initial IT capacity of 144MW, spanning over 700,000 square feet 2
  • Behind-the-Meter model: Calpine provides 190MW of dedicated power, which can be redirected to support system reliability and local demand during ERCOT grid emergencies 2

  • AI-driven power demand: Goldman Sachs predicts global data center power demand will grow 165% by 2030 compared to 2023, with AI data center power density surging from traditional 5-10kW/rack to 50-200kW/rack 34
  • Grid bottleneck severity: New data center grid connection delays have reached 5 years, while Behind-the-Meter natural gas generation can be deployed within 18-24 months, becoming a pragmatic choice 5
  • Texas regulatory environment: SB6 new legislation (signed June 2025) requires large loads over 75MW to bear grid costs, accept emergency curtailment, and install ERCOT-controlled “kill switches” 67
  • Private equity continues to double down: 2024 data center investment reached $108 billion (triple that of 2023), with KKR's 2021 acquisition of CyrusOne for $15 billion laying the foundation for this partnership 89

Read More

BlackRock-Led Consortium Acquires Aligned Data Centers for $40 Billion in Record AI Infrastructure Deal

Post Title Image (Illustration: Aligned Data Centers. Image source: Aligned Data Centers.)

✳️ tl;dr

  • AIP, MGX, and GIP formed a consortium to acquire Aligned Data Centers for $40 billion 12
  • This marks AIP’s (AI Infrastructure Partnership) first investment since its establishment

  • AIP was founded by BlackRock, Microsoft, NVIDIA, and MGX in September 2024 3
  • AIP targets mobilizing $30 billion in equity capital, with potential to reach $100 billion including debt financing
  • Aligned owns 50 campuses with over 5 GW of operational and planned capacity across key digital hubs in the US and Latin America

  • The hyperscale data center market is projected to reach $167.3 billion in 2025
  • Growing at a 23.58% CAGR to $602.4 billion by 2030 4
  • Global data center electricity consumption is expected to double from 415 TWh in 2024 to 945 TWh in 2030, accounting for 3% of global electricity usage 5
  • Aligned holds over 50 patented cooling technologies, including air, liquid, and hybrid cooling systems designed specifically for high-density AI workloads 16

  • NVIDIA’s latest GB200 chip requires power density up to 120 kilowatts per rack, making liquid cooling essential for racks above 20 kilowatts 78
  • Kuwait Investment Authority and Singapore’s Temasek serve as anchor investors in AIP, demonstrating sovereign wealth funds’ long-term commitment to AI infrastructure 23
  • Notably, Macquarie Asset Management first invested in Aligned in 2018, expanding it from 2 facilities to 50 campuses over 7 years, achieving an exit valuation of approximately $40 billion 910

  • Major tech companies are expected to invest $400 billion in AI infrastructure in 2025, with OpenAI’s Stargate initiative alone reaching $500 billion 211
  • US data center power demand is projected to double from 35 GW in 2024 to 78 GW by 2035, with average hourly electricity consumption tripling 12

  • For you as a manager: Macquarie entered investments (in multiple data center companies) in 2018 and exited 7 years later. As we approach 2026, have you activated your radar to identify targets for the next 3-5 years?

Read More

Off-Balance Sheet AI: How SPVs Are Financing the Data Center Boom While Hiding Leverage

Post Title Image (Photo by Ray Hennessy on Unsplash)

✳️ tl;dr

  • Meta completed nearly $300 billion in financing through SPV structure
  • Building the Hyperion data center in Louisiana, setting a record for the largest private equity transaction in history 1
  • Hyperion data center covers 4 million square feet,
  • When fully operational, it will consume 5 gigawatts of electricity, equivalent to the power consumption of 4 million American households

  • Meta retains only 20% equity yet maintains full operational control,
  • This “control without consolidation” accounting technique keeps $270 billion in debt off the balance sheet 2
  • Equity accounts for only 8.5% of total financing ($2.5 billion/$29.5 billion)
  • Insurance companies invest heavily in such projects through private credit,
  • But face asset-liability mismatch risks and may be forced to liquidate investments during economic downturns 3
  • Historical lesson: In the 1990s, telecom companies laid 80 million miles of fiber optic cables,
  • Four years after the bubble burst, 85%-95% remained unused, earning the nickname “dark fiber4
  • Meta, Amazon, Google, Microsoft committed to a record $320 billion in capital expenditure this year, mostly for AI infrastructure, yet Meta’s 10-K admits: “there can be no assurance that the usage of AI will enhance our products or services” 5

  • Power infrastructure becomes one of the bottlenecks
  • Grid Strategies estimates data centers will need an additional 60 gigawatts of electricity by 2030, equivalent to Italy’s national peak demand 6
  • Cooling technology is also crucial: from air cooling to direct liquid cooling to immersion cooling, affecting long-term operating costs 7

  • Morgan Stanley serves as the exclusive underwriter, while also providing financing advisory for multiple similar projects 1
  • Bonds are issued in 144A format private placement, with a spread of 225 basis points above Treasury bonds 1

  • What can technology managers do? (1) Quantify the actual ROI timeline for AI investments (2) Assess power supply chain risks (3) Maintain appropriate financial leverage ratios

8 9

Read More

The Geography of AI: Anthropic's Economic Index Tracks AI's Real-World Impact Across 150 Countries

Post Title Image (Photo by The New York Public Library on Unsplash)

✳️ tl;dr

  • On 2025-09-15, Anthropic released its third Economic Index (approaching from different dimensions), tracking Claude usage patterns across 150+ countries and all US states for the first time. 1
  • (Possibly the first comprehensive geographic distribution data of AI adoption in the model industry?!)
  • Enterprise API customers show an automation rate of 77%, significantly higher than consumer users’ 50%, indicating that enterprises are actively shifting AI from collaborative tools to productivity replacement solutions.
  • Directive automation jumped from 27% to 39% within 8 months, marking the first time automation (49.1%) surpassed augmentation (47%), reflecting growing user confidence driven by improved model capabilities.

  • API usage shows only 3% price sensitivity (each 1% increase in cost index reduces usage by only 0.29%), with enterprises prioritizing capability and value over cost,
  • The speculated reason is that hidden infrastructure costs far exceed model fees (every $1 in model fees requires an additional $5-10 to deploy and reach production-ready status).
  • Ernest’s field observations align with this: those who complain about token costs typically lack sound organizational operational systems or workflows. Conversely, those who see the overall value created are bold in adopting AI.
  • Approximately 5% of API traffic is dedicated to developing and evaluating AI systems, forming a recursive improvement loop of “AI developing AI,” which is speculated to accelerate capability advancement but also require stronger safety oversight.
  • US interstate GDP elasticity (1.8) is significantly higher than cross-country (0.7), yet income has lower explanatory power, indicating that industry composition and economic structure are stronger adoption drivers.

  • AUI = Anthropic AI Usage Index
  • Washington DC has the highest AUI (3.82), primarily for document editing and information search; California (third) focuses on programming; New York (fourth) prefers financial tasks, with local economic structures directly mapping to AI usage patterns.
  • Educational instruction tasks grew 40% (9% → 13%), scientific research grew 33% (6% → 8%), showing rapid adoption in knowledge-intensive fields, suggesting that high-skilled workers are leveraging AI to enhance professional capabilities.
  • Business management tasks declined 40% (5% → 3%), financial operations tasks halved (6% → 3%), suggesting these fields may be undergoing automation or users are shifting to more specialized tools.
  • Wealthy countries tend to use AI for augmentation, while poorer countries prefer automation, with each 1% increase in population-adjusted usage corresponding to approximately 3% reduction in automation after controlling for task mix.

  • The research uses privacy-preserving classification methods combining the O*NET database (19,498 task descriptions) and Claude’s proprietary classification system for dual verification, ensuring data anonymization.
  • However, its static nature and coarse-grained classification may fail to capture emerging tasks created by AI and programming work of varying complexity. 23
  • The true cost of enterprise AI includes data engineering, security compliance, continuous monitoring, and integration architecture, far exceeding surface-level API fees, which is speculated to explain why enterprises are price-insensitive. 45
  • Model capability improvements (Sonnet 3.6 → 4.x series) directly drive behavioral changes, with better output quality reducing iteration needs, suggesting that future more powerful models may further increase automation ratios and transform human-AI collaboration patterns.

1 4 5 2 6 7 3 8 9

Read More