From the team

Insights on AI agents, reliability engineering, and the future of production operations.

SRE Engineer on lawn chair watching production melt down. Credit: ChatGPT

Could Your AI-Generated Code Destroy Your Company?

When everyone can build software, someone still has to keep it running. A reliability engineering leader with two decades at the companies that defined how modern infrastructure runs.

Ben Sigelman playing cello with code and past experiences flowing out from the instrument

The Code Nobody Read Is Already in Production

Ben Sigelman argues that AI-generated code is a reliability crisis in slow motion and explains what it means for how we observe production systems.

Cannon metaphorically firing new code to production, adorably anthropomorphized by a jocular thrill-seeking circus bear.

The Future of Software is Production

Ship every piece of code you write directly into production.

Why LLM-Over-Logs Is the Wrong Abstraction.

Dumping logs into an LLM causes high variance and latency. Learn the data engineering approach for AI SRE that prioritizes signal over context.

I Don’t Care if AI Wrote the Code. You Own It.

SREcon Chair Heinrich Hartmann on why the age of AI-assisted engineering demands a radical return to design rigor.

The SDLC is Dead. Long Live the SDLC.

You cannot review your way out of the new glut of AI code. Winning teams will learn faster from what reaches production.

The On-Call Problem AI Can Actually Solve

Heinrich Hartmann argues AI’s most valuable role isn’t autonomous remediation. It’s ensuring on-call engineers have the context they need to fix incidents fast.

AI-Created Code Is Putting Us in Debt

The velocity trap is real. Here is the new engineering framework for surviving the age of AI-generated code.

Can AI Spot Outages Faster Than Your Customers?

How AI shortens detection time and prevents trust-eroding surprises

The End of SRE Tribal Knowledge

How AI turns expert intuition into operational infrastructure

Never Let a Good Incident Go to Waste

How AI turns firefighting into continuous learning

Why SREs Need an AI Teammate

AI that clears the path so on-call engineers move faster

Respecting Control by Design

Principles for adding an AI teammate to incident response

The Glass Box AI SRE

Why Transparency Wins in Incident Response

AI Reduces Alert Fatigue in Incident Response

More Needle, Less Haystack: Solving the AI SRE Trust Gap

How AI-assisted incident response separates signal from noise to slash MTTR and alert fatigue.

MTTR: The Emergency Room Metric for SRE

Is Vibe Coding Rewriting Software Development?

When AI writes 95% of the code, the engineer's job shifts from production to direction.

Your Top Engineer Just Gave Notice

From tribal knowledge to antifragility: How to stop a talent exodus from torching your engineering know-how.

Why Most Enterprise AI Projects Fail Before They Even Start

Stop asking "how to use AI" and start asking which business problems are finally solvable.

Beyond the Model Wars: The Real AI Race Begins

Applications, not models, will increasingly define the next phase of AI innovation

AI's Last Mile Problem

Bridging the gap between out-of-the-box LLMs and practical production value.

How Corelight Saved 30% of Technical Support Time with RunLLM’s AI Support Engineer

RunLLM’s AI Support Engineer helped Corelight’s team save time, respond faster, and maintain quality without adding headcount.

How vLLM Uses RunLLM's AI Support Engineer to Deflect 99% of All Technical Questions

Deflecting Massive Volume, Freeing Maintainers, Scaling Effortlessly.

Beyond AGI: Why Specialization Is the Real AI Breakthrough

How DataHub Saved $1MM with RunLLM's AI Support Engineer

RunLLM saved DataHub $1 million in engineering cost, increased question capacity 6X, and improved ticket deflection by 90%.

Arize AI Transforms Technical Support with RunLLM

50% Faster Resolutions, 25% Less Engineering Work, and a 15% Boost in Customer Retention.

The Hard Thing About Building AI Applications

How we moved beyond the hype to define the 4 core principles of great AI-native design.

So you want to buy your first AI product

DeepSeek, o3, and AI applications

AI is yet another platform shift

One month of using Devin

The end of scaling laws doesn't matter

Your AI strategy is a waste of time

A theory of the AI market

In defense of vibes-based evaluations

LLMs are becoming commodities

You can't build a moat with AI

OpenAI is too cheap to beat

RLHF and LLM evaluations
