We’re hiring a Tech Lead to build the system that catches what everyone else misses.
Not dashboards. Not surface-level monitoring.
We’re talking about the stuff that silently breaks: missing events, failed handoffs, transactions that never complete. And about building the infrastructure that proves, end-to-end, that everything actually happened.
If you’ve ever looked at a distributed system and thought “we’re blind here,” this is your chance to fix it.
12-month day-rate contract. Small team. Zero passengers.
The work:
• Build “CCTV for data” across event-driven systems
• Trace transactions across multiple services and prove they completed correctly
• Detect what didn’t happen (not just what failed)
• Turn messy, ambiguous requirements into working systems
• Design it, spec it, build it — no throwing it over the fence
The stack:
AWS (EventBridge, Step Functions, CloudWatch, Aurora/RDS)
Kotlin / Java
Event-driven architecture (SNS, SQS, EventBridge)
SQL + reconciliation patterns
You:
• You’ve built real observability / tracing / reconciliation systems
• You actually understand event-driven architecture (not just pub/sub basics)
• You’re comfortable with ambiguity; you define the solution
• You’re hands-on and stay that way
Why this is different:
Most teams monitor what they can see. This work is about what they can’t. You’ll be building the layer that stops silent failures, the kind that cause real damage because no one notices until it’s too late. If you want clean requirements and tidy roadmaps, this isn’t it. If you want to solve hard, undefined problems that actually matter, it is.