This $1.5 Billion AI Startup Steps In When Software Breaks

Spiros Xanthos has spent more than two decades building systems to help engineers monitor and troubleshoot complex software. Yet the most punishing part of the job never changes: When something breaks, on-call engineers are on the hook to fix it, even if it’s the middle of the night.

While overseeing cybersecurity and data platform Splunk’s software monitoring teams, Xanthos saw this strain firsthand as his teams integrated newly acquired tools while trying to keep its existing systems running.

“[It] was so hard on our site reliability engineers,” Xanthos says. “Over a period of six months, we lost 90% of them. It was a complete burnout.”

That experience led Xanthos and his cofounder Mayank Agarwal to launch Resolve AI in 2024 to automate the painful process of responding to production issues. Traditionally, teams rotate who has to be on call, where developers are paged when systems fail. With Resolve’s system, multiple agents investigate when a problem is flagged. They orchestrate a host of AI models and tools to analyze logs, metrics and test different hypotheses like traffic spikes or faulty code. If Resolve can automatically fix the issue, it will. If it’s more complex, the system will present a recommended solution to a human engineer, who can review and approve it.

“We're still in this human-in-the-loop process,” Xanthos says. “But we're moving more to this concept of human on the loop – a human can review what Resolve does, but doesn’t have to pause and decide.”

Resolve’s customers, like Coinbase, DoorDash and Salesforce, have seen dramatic improvement in the time it takes to fix an incident and investigate why it occurred. According to Resolve, DoorDash reduced investigation........

© Forbes