<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
  <title>AI Architect News</title>
  <subtitle>A magazine for the architecture of intelligence — inference, models, agents, hardware, and the systems between the layers.</subtitle>
  <link href="https://ai-architect.news/feed.xml" rel="self" />
  <link href="https://ai-architect.news/" />
  <id>https://ai-architect.news/</id>
  <updated>2026-06-14T00:00:00Z</updated>
  <author><name>AI Architect News</name></author>
  <entry>
    <title>A frontier without an ecosystem is not stable</title>
    <link href="https://x.com/satyanadella/status/2066182223213293753" />
    <id>https://x.com/satyanadella/status/2066182223213293753</id>
    <published>2026-06-14T00:00:00Z</published>
    <updated>2026-06-14T00:00:00Z</updated>
    <author><name>Satya Nadella</name></author>
    <category term="Strategy" />
    <summary>Every firm now compounds two assets — the human capital of its people and the token capital of the AI it owns — joined in a learning loop that turns institutional knowledge into IP no model can commoditize. The durable bet is a frontier ecosystem where value flows broadly, not one frontier model that eats every industry.</summary>
  </entry>
  <entry>
    <title>Introducing the Fusion API, the smartest compound model in the market</title>
    <link href="https://x.com/OpenRouter/status/2065856853989270011" />
    <id>https://x.com/OpenRouter/status/2065856853989270011</id>
    <published>2026-06-13T00:00:00Z</published>
    <updated>2026-06-13T00:00:00Z</updated>
    <author><name>OpenRouter</name></author>
    <category term="Models" />
    <summary>Fusion achieves Fable-level intelligence at half the price.</summary>
  </entry>
  <entry>
    <title>Moonshot&apos;s Kimi K2.7-Code cuts thinking tokens 30%</title>
    <link href="https://www.marktechpost.com/2026/06/12/moonshot-ai-releases-kimi-k2-7-code-a-coding-model-reporting-21-8-on-kimi-code-bench-v2-over-k2-6/" />
    <id>https://www.marktechpost.com/2026/06/12/moonshot-ai-releases-kimi-k2-7-code-a-coding-model-reporting-21-8-on-kimi-code-bench-v2-over-k2-6/</id>
    <published>2026-06-12T00:00:00Z</published>
    <updated>2026-06-12T00:00:00Z</updated>
    <author><name>Asif Razzaq</name></author>
    <category term="Models" />
    <summary>An open-weight coding model built on K2.6 with a 256K context — Moonshot reports a 21.8% gain on Kimi Code Bench v2 and roughly 30% lower reasoning-token usage.</summary>
  </entry>
  <entry>
    <title>Durable Agent Workflows Are Just Data Pipelines</title>
    <link href="https://urmzd.com/blog/durable-agent-workflows-are-just-data-pipelines/" />
    <id>https://urmzd.com/blog/durable-agent-workflows-are-just-data-pipelines/</id>
    <published>2026-06-12T00:00:00Z</published>
    <updated>2026-06-12T00:00:00Z</updated>
    <author><name>Urmzd</name></author>
    <category term="Agents" />
    <summary>Orchestrating agents at scale isn&apos;t a new problem — it&apos;s the same problem every system has: define a unit of work, persist it, and resume from the log. Durability is a logging discipline; scale and governance are different problem classes, and knowing the difference is the whole game.</summary>
  </entry>
  <entry>
    <title>The Rise of the Agent Runtime</title>
    <link href="https://golem.cloud/blog/the-rise-of-the-agent-runtime/" />
    <id>https://golem.cloud/blog/the-rise-of-the-agent-runtime/</id>
    <published>2026-06-10T00:00:00Z</published>
    <updated>2026-06-10T00:00:00Z</updated>
    <author><name>John A. De Goes</name></author>
    <category term="Industry Articles" />
    <summary>Every major vendor is quietly rebuilding fragments of the same missing runtime layer, and none ships the whole thing.</summary>
  </entry>
  <entry>
    <title>Introducing Claude Fable 5 and Mythos 5</title>
    <link href="https://www.anthropic.com/news/claude-fable-5-mythos-5" />
    <id>https://www.anthropic.com/news/claude-fable-5-mythos-5</id>
    <published>2026-06-09T00:00:00Z</published>
    <updated>2026-06-09T00:00:00Z</updated>
    <author><name>AI Architect News</name></author>
    <category term="Announcements" />
    <summary>A Mythos-class model made safe for general use, with capabilities exceeding any Claude model Anthropic has made generally available.</summary>
  </entry>
  <entry>
    <title>Introducing FrontierCode</title>
    <link href="https://x.com/cognition/status/2064061031912288715" />
    <id>https://x.com/cognition/status/2064061031912288715</id>
    <published>2026-06-08T00:00:00Z</published>
    <updated>2026-06-08T00:00:00Z</updated>
    <author><name>Cognition</name></author>
    <category term="Agents" />
    <summary>A coding eval built from 40+ hour tasks by open-source maintainers to test whether model-written code is actually mergeable.</summary>
  </entry>
  <entry>
    <title>Serverless agents and the durable execution problem</title>
    <link href="https://electric.ax/blog/2026/06/04/serverless-agents" />
    <id>https://electric.ax/blog/2026/06/04/serverless-agents</id>
    <published>2026-06-04T00:00:00Z</published>
    <updated>2026-06-04T00:00:00Z</updated>
    <author><name>James Arthur</name></author>
    <category term="Agents" />
    <summary>Managed agents scale better when their loop runs as stateless serverless functions, with durable state in the data layer and tool execution handled by backend systems. This architecture avoids sandbox overhead, fragmented artifacts, and brittle coordination while letting agents integrate into business workflows.</summary>
  </entry>
  <entry>
    <title>Modern Engineering Values</title>
    <link href="https://cpojer.net/posts/modern-engineering-values" />
    <id>https://cpojer.net/posts/modern-engineering-values</id>
    <published>2026-06-03T00:00:00Z</published>
    <updated>2026-06-03T00:00:00Z</updated>
    <author><name>Christoph Nakazawa</name></author>
    <category term="engineering" />
    <summary>Coding agents make software execution dramatically faster, moving engineering work toward ownership, taste, guardrails, repo-local context, stack control, and option value. Teams that keep feedback loops tight and technical judgment close to the code can ship more without treating AI-generated code as a substitute for engineering direction.</summary>
  </entry>
  <entry>
    <title>Introducing MAI-Thinking-1</title>
    <link href="https://microsoft.ai/news/introducing-mai-thinking-1/" />
    <id>https://microsoft.ai/news/introducing-mai-thinking-1/</id>
    <published>2026-06-02T00:00:00Z</published>
    <updated>2026-06-02T00:00:00Z</updated>
    <author><name>Superintelligence team</name></author>
    <category term="Models" />
    <summary>Microsoft AI introduced MAI-Thinking-1, a medium-sized reasoning model for complex problems with strong coding and math performance, trained on clean, traceable enterprise-grade data.</summary>
  </entry>
  <entry>
    <title>How LLMs Actually Work</title>
    <link href="https://www.0xkato.xyz/how-llms-actually-work/" />
    <id>https://www.0xkato.xyz/how-llms-actually-work/</id>
    <published>2026-06-01T00:00:00Z</published>
    <updated>2026-06-01T00:00:00Z</updated>
    <author><name>0xkato</name></author>
    <category term="Machine Learning" />
    <summary>A from-the-ground-up walkthrough of how modern LLMs work, from tokens to transformer blocks to the next-token loop</summary>
  </entry>
  <entry>
    <title>Anthropic eyes Microsoft&apos;s Maia 200 for Claude inference</title>
    <link href="https://www.cnbc.com/2026/05/21/anthropic-microsoft-maia-200-ai-chip.html" />
    <id>https://www.cnbc.com/2026/05/21/anthropic-microsoft-maia-200-ai-chip.html</id>
    <published>2026-05-21T00:00:00Z</published>
    <updated>2026-05-21T00:00:00Z</updated>
    <author><name>Jordan Novet</name></author>
    <category term="Infrastructure" />
    <summary>Anthropic is in talks to run Claude on Microsoft&apos;s Maia 200 chips — a hedge on inference capacity, and the first frontier test of Microsoft&apos;s custom silicon.</summary>
  </entry>
  <entry>
    <title>OpenAI makes GPT-5.5 Instant the new ChatGPT default</title>
    <link href="https://techcrunch.com/2026/05/05/openai-releases-gpt-5-5-instant-a-new-default-model-for-chatgpt/" />
    <id>https://techcrunch.com/2026/05/05/openai-releases-gpt-5-5-instant-a-new-default-model-for-chatgpt/</id>
    <published>2026-05-05T00:00:00Z</published>
    <updated>2026-05-05T00:00:00Z</updated>
    <author><name>Ivan Mehta</name></author>
    <category term="Models" />
    <summary>GPT-5.5 Instant replaces GPT-5.3 Instant as ChatGPT&apos;s default model, with better benchmarks and fewer hallucinations in law, medicine, and finance — plus richer context management and memory-source controls.</summary>
  </entry>
  <entry>
    <title>AMD&apos;s MI355X clears 1M tokens/sec on MLPerf 6.0</title>
    <link href="https://www.amd.com/en/blogs/2026/amd-delivers-breakthrough-mlperf-inference-6-0-results.html" />
    <id>https://www.amd.com/en/blogs/2026/amd-delivers-breakthrough-mlperf-inference-6-0-results.html</id>
    <published>2026-04-01T00:00:00Z</published>
    <updated>2026-04-01T00:00:00Z</updated>
    <author><name>Chris Raymond</name></author>
    <category term="Hardware" />
    <summary>AMD&apos;s MLPerf Inference 6.0 run puts MI355X GPUs past 1 million tokens/sec at multinode scale, with single-node results competitive against NVIDIA&apos;s B200 and B300.</summary>
  </entry>
  <entry>
    <title>Tracing the thoughts of a language model</title>
    <link href="https://www.anthropic.com/research/tracing-thoughts-language-model" />
    <id>https://www.anthropic.com/research/tracing-thoughts-language-model</id>
    <published>2025-03-27T00:00:00Z</published>
    <updated>2025-03-27T00:00:00Z</updated>
    <author><name>AI Architect News</name></author>
    <category term="Research" />
    <summary>Anthropic&apos;s circuit tracing maps the internal pathways of Claude — surfacing advance planning, shared cross-language concepts, and moments where a model&apos;s stated reasoning diverges from what it actually computes.</summary>
  </entry>
</feed>
