The Executive Visibility Gap
Infrastructure teams have tools. Grafana dashboards, PagerDuty alerts, CloudWatch metrics — they have real-time visibility into what's happening. But when a VP of Engineering asks "how healthy is our infrastructure?" the answer is usually a shrug or a rambling explanation of this quarter's uptime.
The problem isn't a lack of data. It's a lack of synthesis. Executives don't need to know that CPU utilization on 12 instances spiked at 3:14 PM. They need to know:
- Is our infrastructure secure? (CISO)
- Are we spending efficiently? (CFO)
- Can we ship faster without breaking things? (CTO)
- Are we compliant with our commitments? (Board)
Each of these requires a different view of the same data. Knowledge Tree's Executive Health Score provides a single, synthesized metric — and persona-based dashboards that deliver the context each leader needs.
The Health Score: A Single Number That Tells the Story
The Executive Health Score is a 0-100 composite metric calculated from four pillars:
Health Score = (Security × 0.30) + (Reliability × 0.30) + (Cost Efficiency × 0.20) + (Compliance × 0.20)
1. Security Score (30% weighting)
Measures the security posture of your infrastructure:
| Metric | Weight | What It Measures |
|---|---|---|
| Exposure surface | 25% | Publicly exposed resources, open ports, unrestricted IAM policies |
| Encryption coverage | 20% | Percentage of data-at-rest and in-transit encrypted resources |
| IAM hygiene | 20% | Overly permissive roles, unused credentials, root account activity |
| Vulnerability posture | 20% | Known vulnerabilities in container images, AMIs, and dependencies |
| Security incidents | 15% | Recent security events and their resolution status |
2. Reliability Score (30% weighting)
Measures the operational health of your infrastructure:
| Metric | Weight | What It Measures |
|---|---|---|
| Uptime compliance | 25% | Actual uptime vs SLO targets across all services |
| Incident frequency | 20% | Number and severity of incidents per week (inverse scoring) |
| MTTR | 20% | Mean time to resolve (lower is better) |
| Change failure rate | 20% | Percentage of changes that cause incidents |
| Backup coverage | 15% | Resources with verified backups vs total critical resources |
3. Cost Efficiency Score (20% weighting)
Measures financial health of infrastructure operations:
| Metric | Weight | What It Measures |
|---|---|---|
| Waste percentage | 30% | Spend on orphaned/underutilized resources |
| Reserved instance coverage | 20% | Percentage of eligible compute covered by RIs/Commitments |
| Cost per resource | 20% | Average cost per managed resource (trending) |
| Budget adherence | 15% | Actual spend vs budget by team/environment |
| Orphaned resource ratio | 15% | Resources without active dependencies or owners |
4. Compliance Score (20% weighting)
Measures adherence to regulatory and internal policy frameworks:
| Metric | Weight | What It Measures |
|---|---|---|
| Control pass rate | 30% | Percentage of controls passing automated checks |
| Documentation coverage | 25% | Percentage of resources with current documentation |
| Evidence freshness | 25% | Age of most recent compliance evidence |
| Remediation velocity | 20% | Average time to remediate compliance findings |
Persona Dashboards
The Health Score is the headline number. Each leader gets a dashboard tailored to their perspective.
CTO / VP Engineering Dashboard
The CTO needs to know: "Can we ship features without breaking things?"
Infrastructure Health: 87/100 ▲ (+3 from last week)
Reliability:
├── SLO attainment: 99.94% (target: 99.9%) ✓
├── Incidents this week: 3 P2, 0 P1 (trending down)
├── MTTR: 24 min (target: <30 min) ✓
└── Change failure rate: 2.1% (target: <5%) ✓
Change Velocity:
├── Deployments this week: 47
├── Resources changed: 1,234
└── Change-related incidents: 1 (2.1% failure rate)
CISO / Security VP Dashboard
The CISO needs to know: "Are we secure?"
Security Posture: 82/100 ▲ (+2 from last week)
Critical Findings:
├── 3 security groups with unrestricted SSH (trending down from 12)
├── 7 unencrypted RDS instances (2 critical)
└── 12 container images with critical CVEs
Compliance:
├── SOC 2 control pass rate: 94%
├── Open security findings: 23 (avg age: 14 days)
└── Last penetration test: 12 days ago
Key Risks:
- IAM policy "admin-backup" grants full access to 3 service accounts
- 2 production S3 buckets missing encryption
CFO / FinOps Dashboard
The CFO needs to know: "Are we spending efficiently?"
Cost Efficiency: 79/100 → (no change from last week)
Monthly Run Rate: $247,000
├── Compute: $142,000 (57%)
├── Storage: $48,000 (19%)
├── Networking: $32,000 (13%)
└── Services: $25,000 (10%)
Waste Analysis:
├── Orphaned resources: $12,000/month (4.9%)
├── Underutilized instances: $8,000/month (3.2%)
└── Unattached storage: $3,000/month (1.2%)
Cost by Team:
├── Platform: $98,000 (40%)
├── Payments: $52,000 (21%)
├── Data: $42,000 (17%)
└── Other: $55,000 (22%)
Board / Investor Dashboard
The board needs to know: "Is this a well-run operation?"
Infrastructure Health Score: 84/100 ▲ (+5 YoY)
Year-over-Year Trends:
├── Security posture: 67 → 84 (+17 pts)
├── Cost efficiency: 71 → 79 (+8 pts)
├── Compliance: 73 → 88 (+15 pts)
└── Reliability: 85 → 87 (+2 pts)
Key Achievements:
├── SOC 2 Type II completed with zero findings
├── Cloud spend reduced 12% YoY despite 30% resource growth
├── P1 incidents down 60% from last year
└── 100% encryption coverage achieved on production data
Risk Areas:
└── 6 critical vulnerabilities (CVSS > 9.0) awaiting patches
The Health Score API
The Health Score is accessible programmatically for integration with executive dashboards and reporting tools:
$ curl -H "Authorization: Bearer ${KT_API_KEY}" \
https://api.knowledgetree.dev/v1/health-score
{
"overall": 84,
"components": {
"security": 82,
"reliability": 87,
"cost_efficiency": 79,
"compliance": 88
},
"trend": {
"direction": "up",
"change": 3,
"period": "weekly"
},
"top_issues": [
{"severity": "high", "category": "security", "title": "3 security groups with unrestricted SSH"},
{"severity": "medium", "category": "cost", "title": "$12K/month in orphaned resources"}
]
}
Persona Views for Every Stakeholder
Beyond the executive team, Knowledge Tree provides tailored views for:
| Persona | Focus | Key Metric |
|---|---|---|
| Engineering Manager | Team velocity and stability | Change failure rate, deployment frequency |
| SRE | Operational health | MTTR, SLO attainment, incident trends |
| Platform Engineer | Resource utilization | Infrastructure coverage, drift detection |
| Security Analyst | Threat posture | Open vulnerabilities, exposure surface |
| FinOps Analyst | Cost optimization | Waste percentage, RI coverage |
| Compliance Officer | Audit readiness | Control pass rate, evidence age |
| Product Manager | Feature delivery velocity | Infrastructure change lead time |
Real Scenario: The Board Meeting
A Knowledge Tree customer was preparing for a quarterly board presentation. Previously, the CTO would spend 2 days compiling infrastructure metrics from multiple sources. With the Executive Dashboard, they generated the report in 10 minutes:
- Opened the Executive Dashboard
- Selected the "Board Report" persona
- Set the date range to the last quarter
- Exported the report as a PDF
The board saw:
- Infrastructure Health Score trending from 72 to 84 over the quarter
- Security posture improving 17 points after a focused remediation initiative
- Cost efficiency improving despite 30% infrastructure growth
- SOC 2 compliance achieved with no major findings
The conversation shifted from "why is our infrastructure so messy?" to "how do we maintain this trajectory?"
Getting Started
Executive dashboards are available on all plans:
| Feature | Team | Business | Enterprise |
|---|---|---|---|
| Health Score | Weekly snapshot | Real-time | Real-time + trends |
| Persona dashboards | CTO view | All personas | Custom personas |
| Board reports | -- | Quarterly PDF | Automated monthly |
| API access | Read-only | Full access | Full + webhooks |
| Trend analysis | -- | 90 days | Custom date ranges |
What's your Infrastructure Health Score? Book a demo to see your score, or start a free trial to get your first report.