close

Why Platform Teams Are Burning Out: How MyCoCo Escaped the Developer Experience Trap

When platform teams become overloaded bottlenecks instead of enablers, the solution isn't more tooling—it's strategic constraints and clear boundaries that preserve team sanity while accelerating delivery.

Elevator Pitch

Platform engineering promised to solve developer productivity challenges by creating self-service infrastructure. But at many organizations, platform teams are drowning in complexity they've absorbed from developers. MyCoCo discovered their platform team became the new bottleneck—working 60-hour weeks while being blamed for slowing everyone down. Here's how they transformed platform engineering from a burnout factory into a sustainable practice.

TL;DR

The Problem: Platform teams absorb all infrastructure complexity while becoming the scapegoat for deployment delays

The Solution: Golden paths, Team Topologies, clear SLAs, and project inception frameworks

The Impact: MyCoCo reduced platform team stress by 50% and increased deployment velocity 3x

Key Implementation: Pre-approved Terraform patterns covering 80% of use cases with architectural education

Bottom Line: Stop trying to be everything to everyone—constrain flexibility to preserve sanity

Click to enlarge
Platform Engineering Golden Paths Architecture Diagram

From chaos to sustainable platform engineering: The golden paths transformation

The Challenge: MyCoCo's Platform Team Meltdown

Jordan had been leading MyCoCo's platform engineering initiative for nine months when they finally admitted defeat. "We're not building a platform," they told Alex during their one-on-one. "We're running an IT helpdesk that happens to use Terraform."

The numbers told the story. MyCoCo's platform team of three engineers supported 50 developers across five product teams. They'd built what seemed like a modern setup: GitHub Actions for CI/CD, an LLM-based code reviewer for infrastructure PRs, and standardized Terraform modules for AWS deployments. Yet the team was drowning.

Every infrastructure PR became a potential disaster. One developer's "simple fix" to update a security group rule would have deleted and recreated the entire security group, causing a 20-minute production outage. Another's "minor IAM adjustment" affected 15 other services. The AI code reviewer caught syntax errors but couldn't understand business context or blast radius.

Worse, developers constantly bypassed the intake process. Jordan's Slack was a stream of "URGENT: Need this deployed NOW!" messages. Sprint planning became fiction—the team spent every day firefighting requests from DMs, hallway conversations, and executive escalations.

The breaking point came when a developer complained: "I can deploy a Lambda in 5 minutes on my personal AWS account. Why does it take your team 3 days?" They didn't see that their "simple" Lambda required updating three Terraform modules, modifying VPC configurations, creating new IAM patterns, updating monitoring dashboards, and committing to support this variation forever.

The Solution: From Chaos to Sustainable Platform Engineering

MyCoCo's transformation started with a hard truth: unlimited flexibility means unlimited support burden. Jordan's team implemented five key changes:

1. Golden Paths Over Infinite Options

Instead of supporting every possible infrastructure pattern, MyCoCo created pre-approved Terraform modules for common use cases. These "golden paths" covered 80% of developer needs while encoding all enterprise requirements automatically.

Example: Rather than letting developers architect networking and compute from scratch, MyCoCo offered three standardized deployment patterns:

  • Standard VPC and Networking: Multi-AZ setup with public/private subnets, NAT gateways, VPC endpoints, and enterprise-compliant security groups
  • Standard Linux EC2: Application-ready instances with SSM access, CloudWatch agent, security patching, backup policies, and proper IAM roles
  • Standard ECS Serverless: Fargate deployments with ALB integration, service discovery, auto-scaling policies, and centralized logging

Each golden path included cost allocation tags, security scanning, compliance controls, and monitoring—all pre-configured. Developers got consistent, secure infrastructure while the platform team eliminated architectural debates and one-off configurations entirely.

2. Team Topologies for Clear Boundaries

Following Team Topologies principles, MyCoCo established clear interaction modes:

X-as-a-Service: Golden path modules with no customization
Collaboration: Scheduled pairing for complex requirements
Facilitation: Office hours for questions and guidance

No more random Slack interruptions. Developers knew exactly how and when to engage the platform team.

3. Transparent Service Level Expectations

Published SLAs set clear expectations:

Standard patterns (using golden paths): 2 business days
Custom configurations: 5 business days
New patterns requiring platform changes: 10 business days

"Urgent" requests dropped 80% once teams realized proper planning yielded faster results.

4. Project Inception Requirements

Borrowing from agile methodologies, any project requiring infrastructure now needed a kick-off with platform team involvement. This revealed infrastructure needs early, preventing mid-sprint surprises.

A simple template captured requirements:

  • What are you building?
  • Which golden paths fit your needs?
  • What's unique about your requirements?
  • When do you need infrastructure ready?

5. Infrastructure Architecture Reviews

The platform team instituted monthly "Infrastructure Deep Dives" where they walked through real production incidents and changes. Instead of abstract documentation, they showed actual scenarios:

Case Study Session Example:
"Here's last week's 'simple' Lambda request. Let me show you what actually happened behind the scenes..."

Step 1: New subnet allocation in three VPCs (dev, staging, prod)
Step 2: IAM role creation with 14 policy attachments for scoped permissions
Step 3: Updating shared security groups across 6 availability zones
Step 4: Modifying 3 Terraform modules used by 20 other services
Step 5: Cost allocation tag updates affecting monthly billing reports

Developers finally understood why their personal AWS experience didn't translate. One developer commented: "I had no idea changing one Lambda meant touching 30 other resources. Now I get why you push golden paths so hard."

These sessions became so popular that developers started attending voluntarily, leading to better infrastructure proposals and fewer "urgent" requests.

Results: MyCoCo's Platform Renaissance

The transformation took three months, but the results were dramatic:

Quantitative improvements:

  • Platform team overtime dropped from 60 to 40 hours weekly
  • Standard deployments accelerated from 3 days to 4 hours
  • Urgent requests decreased by 80%
  • Developer satisfaction scores increased 35%

Qualitative changes:

  • Platform engineers felt valued instead of blamed
  • Developers understood infrastructure complexity better
  • Executive pressure shifted from "work faster" to "work smarter"

Most importantly, the platform team could finally focus on innovation instead of firefighting. They had time to improve golden paths, automate more processes, and even tackle technical debt from the early "build everything custom" days.

Sam, who'd been skeptical of platform engineering after years of manual DevOps work, admitted: "I finally have time to think about improvements instead of just surviving each day."

Key Takeaways

Platform engineering isn't about building a portal or implementing the latest tools. It's about sustainable practices that balance developer autonomy with operational reality.

Start with constraints: Less flexibility means less support burden. Your platform team can't be everything to everyone.

Invest in education: Developers aren't trying to make your life difficult—they genuinely don't understand enterprise infrastructure complexity.

Set boundaries: Clear interaction patterns and SLAs prevent platform teams from becoming an always-on helpdesk.

Measure what matters: Track platform team health alongside developer productivity metrics.

Ready to rescue your platform team from burnout? Start by auditing how many infrastructure patterns you're supporting and ask: could 80% of these use a golden path instead?

← Back to Logs

Explore more articles and projects