Loading
Loading
Most CTOs facing delivery problems reach for a talent answer: hire more, hire better. The problem is usually the system — the processes, structures, and feedback loops around the talent they already have.
Author
Pavel Siddique
Published
21 May 2026
Reading time
9 min read
Topics
nordic-tech, architecture, scaling
We were brought in to assess a 14-person engineering team that had missed three consecutive product milestones. The CTO had already concluded they needed to hire three more senior engineers. Before the hiring started, we spent two weeks with the team. The engineers were good. The system they were working in was not. Three new senior engineers in that system would have slowed things down further. Here's what we found and what it means for how CTOs should diagnose delivery problems.
When delivery slips, the instinctive response is a talent hypothesis: the team isn't good enough, fast enough, or big enough. This hypothesis leads to a hiring response. It's often wrong — and it's expensive to test at SEK 1.8M per hiring cycle. The talent hypothesis is attractive because it's concrete and actionable. "We need to hire two senior engineers" is a clear instruction that can be given in a board meeting. "Our system has structural quality problems that are generating waste in review cycles" is harder to explain and harder to act on, so it often doesn't get said.
The diagnostic we ran with this 14-person team took two weeks. We reviewed 90 days of PR data, sat in on sprint reviews, and interviewed six engineers individually. The picture that emerged was consistent: the engineers were competent. The system was generating rework, review friction, and context-switching that consumed the capacity that should have been going to features.
These are the patterns we see consistently when delivery problems are systemic rather than individual. They're observable from data — you don't need a two-week diagnostic to spot them.
1
High review cycles mean code is reaching review in a state that requires multiple rounds of correction. This is a standards and guardrails problem, not a developer quality problem. In this team: 3.8 average. The PRs weren't bad code — they were code without automated checks, so reviewers caught issues that a pre-commit hook would have blocked.
2
Low coverage means developers are writing code faster than tests — which means they're incurring future rework debt. When coverage is below 75%, bug rates in the first week after deployment tend to be 3–5× higher than in well-covered codebases. In this team: 62%, declining from 71% six months prior.
3
When unplanned work (bug fixes, production incidents, ad-hoc requests) consistently consumes more than 30% of sprint capacity, it means the codebase is fragile and the production environment is unstable. This team: unplanned work averaged 38% of capacity across the last three sprints — which directly explained why all three milestones slipped.
4
When new engineers take more than three weeks to make their first meaningful commit, the codebase and its conventions aren't documented well enough. This doesn't show up as a metric anywhere — but it's visible in hiring timelines and onboarding friction. Hiring three more engineers into a poorly documented system would have tripled the onboarding cost.
5
When you look at engineer calendars and see multiple daily context switches — from feature work to bug triage to review requests to architecture questions — the system is not protecting deep work time. Senior engineers are being used as human routing layers rather than force multipliers. The fix is structural (clearer ownership, async-first communication norms, pull-not-push review requests), not headcount.
"Adding engineers to a broken system doesn't scale output. It scales the system's problems. We've seen teams of eight outperform teams of 20 because the eight had a system that compounded their work. The 20 were fighting the system." — Pavel Siddique, CEO, Indpro AB
In the 14-person team: we introduced three changes. First, we implemented pre-commit hooks that blocked commits with lint errors or type errors — this reduced review cycles from 3.8 to 1.9 in four weeks because reviewers no longer needed to flag mechanical issues. Second, we enforced a coverage floor of 80% via CI gates — this reduced post-deploy bug rates by 60% in the first sprint after implementation. Third, we wrote the first three SKILL.md domain files for the codebase — this cut onboarding time for two new engineers (the CTO still hired two, but fewer than the original three planned) from the team's typical five-week ramp to three weeks.
We didn't hire. We didn't fire anyone. We didn't reorganise reporting structures. Three changes to the system, measurable in four weeks. The fourth sprint after implementation hit the product milestone that had been missed for three consecutive sprints.
Before you open the next engineering headcount request, run this diagnostic. It takes 90 minutes and gives you a clearer picture than a two-week external assessment in most cases. Pull 90 days of PR data. Calculate: average review cycles per PR, test coverage trend (improving or declining), percentage of sprint capacity consumed by unplanned work, and post-deploy defect count in the first seven days. If review cycles are above 2.5, coverage is below 75%, unplanned work is above 30%, or post-deploy defects are high and rising — fix the system before adding people.
Hiring into a broken system is the most expensive form of this mistake. It's expensive at SEK 1.8M per mis-hire, and it's expensive in the subtler sense that it legitimises the system's problems by treating them as a capacity issue rather than a structural one. New hires learn the broken system. They learn to work around its friction. They become part of the system's gravity rather than a correction to it.
Run the 90-day PR diagnostic on your team and compare it to the benchmarks in this post. If two or more metrics are in the problem zone, the system — not the talent — is the priority.
Book a System DiagnosticCode Quality: 62 → 91 in 30 Days →This post argues for a system-first diagnostic, not a system-only one. There are genuine talent problems: a team that lacks the domain expertise for a strategic shift (moving from a monolith to a distributed data architecture, for example, requires skills the existing team may not have), a team too junior for the architectural complexity of the next growth phase, or a specific senior leadership gap where the team has strong execution capability but no technical direction-setting. These are talent problems — and the answer is hiring, not guardrails.
The distinction: a talent problem shows up as "we can't do this class of work, even with the right conditions." A system problem shows up as "we can do this class of work, but the conditions are creating waste." Most of what presents as the first is actually the second. Run the diagnostic before you decide.
The diagnostic shortcut: Ask your best engineer: "If we fixed the three things that frustrate you most about how we work, how much faster would we ship?" If the answer is "a lot faster" — the problem is the system. If the answer is "it wouldn't change much — we need people who know X" — the problem might be talent.
How do you distinguish a system problem from a talent problem quickly?
The 90-day PR diagnostic is the fastest quantitative test. Qualitatively, ask your best engineer the question in the highlight box above. The most reliable signal: if your engineers are producing good code in greenfield or solo contexts but struggling in the team system, the system is the problem. If the same individuals produce inconsistent quality regardless of context, that's a talent signal.
How long does it take to see results from system changes?
Guardrail and hook changes (pre-commit, CI gates) show results within the first sprint — typically within two weeks. Coverage improvements take longer: the coverage floor stops the decline immediately, but building up from 62% to 84% took four sprints (eight weeks) in this case. SKILL.md adoption and review cycle reduction typically follow a similar four-to-eight week curve.
Does Indpro help with system diagnosis even if we don't need an augmented team?
Yes. System diagnostic engagements are a standalone service. We run the 90-day data review, interview key engineers, and produce a prioritised finding with specific recommendations. Some clients implement the changes themselves. Others bring us in for implementation. Either path is available — the diagnostic value is independent of what comes next.

CEO & Co-Founder
Pavel founded Indpro in 2010 with a vision to bridge Nordic engineering culture with India's deep tech talent pool. Based in Stockholm, he oversees strategy and client relationships.
Connect on LinkedIn →Data audit case study: Indpro found 47 data quality issues in a single audit. Three of them were costing the client €200,000 per year. Full breakdown and methodology.
10 pages of practical insight on operating models, compensation benchmarks, and a hiring playbook. Free PDF.
Download the Free GuideOr reach us directly: sales@indpro.se · +46 73 932 21 38