Demonstrate Engineering Progress
Quarterly review is due and the only data is ticket counts -- singling out individuals, not illuminating the system. This guide walks you through preparing a quarterly presentation with Landmark: system-level trends, marker evidence, and engineer voice that show direction without naming individuals.
Prerequisites
- Getting Started: Map for Leaders -- install Map, migrate the activity schema, load your roster, and sync GetDX data.
-
Getting Started: Landmark for Leaders
-- install Landmark and confirm you can run
npx fit-landmark org show. -
Authoring Agent-Aligned Engineering Standards
-- define drivers and markers in your standard data.
Landmark's health, evidence, and readiness views require
drivers in
drivers.yamland markers in your capability YAML files.
The rest of this guide assumes Map's activity layer is running and populated. If you want to explore with synthetic data first, see Trying the activity layer with synthetic data in the Map guide.
Confirm your data is ready
Before building views for a quarterly review, confirm that the standard data, roster, and snapshots are in place.
Validate your standard data against the schema:
npx fit-map validate
Expected output (your counts will reflect your installation's standard):
Validation passed
Data Summary
Skills — 12
Behaviours — 6
Disciplines — 3
Tracks — 2
Levels — 5
Drivers — 4
If any errors appear, resolve them using the guidance in Authoring Agent-Aligned Engineering Standards.
Confirm that your roster is loaded and the team hierarchy is visible:
npx fit-landmark org team --manager alice@example.com
Team under alice@example.com
Alice Smith alice@example.com Software Engineering / J080 (manager)
Bob Chen bob@example.com Software Engineering / J060
Carol Davis carol@example.com Software Engineering / J070
Dan Park dan@example.com Data Engineering / J060
If the output is empty, re-run
npx fit-map people push roster.csv with your current
roster file.
Confirm that snapshot data is available:
npx fit-landmark snapshot list
GetDX Snapshots
MjUyNbaY 2025-03-15 completed
NzE4MmRk 2025-06-14 completed
The third column is the snapshot status (completed or
pending); the date is the snapshot's
scheduled_for value.
If the output is empty, run
npx fit-map getdx sync followed by
npx fit-map activity transform to ingest the latest
GetDX data.
See system-level trends across snapshots
Quarterly reviews need context: is a score improving, declining, or flat? Before diving into the health view, check how a specific driver has moved over time.
Track a driver's trend across snapshots, scoped to your team:
npx fit-landmark snapshot trend --item code_review --manager alice@example.com
Trend for code_review
2025-03-15 72
2025-06-14 78
2025-09-13 81
The output shows the driver's score at each snapshot date,
making the direction visible. Replace code_review with
any driver ID from your drivers.yaml -- the starter
data includes code_review,
incident_response, and deep_work.
Compare the latest snapshot against organizational benchmarks:
npx fit-landmark snapshot compare --snapshot MjUyNbaY --manager alice@example.com # ID from 'snapshot list'
Snapshot snap_2025_Q3
Code Review 78 vs_prev: +6, vs_org: +8, vs_50th: +8, vs_75th: -2, vs_90th: -10
Incident Response 65 vs_prev: -3, vs_org: -3, vs_50th: -3, vs_75th: -11, vs_90th: -19
Deep Work 82 vs_prev: +1, vs_org: +10, vs_50th: +10, vs_75th: +1, vs_90th: -8
Each row shows the team's score followed by signed deltas
against the previous snapshot (vs_prev), the
organization median (vs_org), and the 50th/75th/90th
percentiles. Use the snapshot ID from
npx fit-landmark snapshot list.
Build the health view
The health view is the centerpiece of Landmark's quarterly presentation. It joins driver scores, contributing-skill evidence, engineer voice comments, and growth recommendations into a single picture scoped to a manager's team.
Run the health view for your team:
npx fit-landmark health --manager alice@example.com
alice@example.com team — health view
Drivers (2)
────────────────────────────────────────────────────────────
# Driver Percentile vs_org More
1 code_review 72nd +5 -
2 incident_response 48th -3 -
Recommendations (1 unique)
────────────────────────────────────────────────────────────
- Carol Davis (working) could develop planning — for code_review (high)
The default output is a compact table organized by driver, followed by deduped growth recommendations. Each row shows:
-
Driver name -- the driver ID from your
drivers.yaml. -
Percentile -- the team's GetDX score position
relative to the organization (e.g.
72nd). -
vs_org -- the signed delta against the org median
(e.g.
+5). -
More -- a hint when additional per-driver anchors
are available via
--verbose.
The trailing Recommendations table is populated when Summit is
installed and deduped per (candidate, skill) — each
line names the individual who could develop the skill, their current
proficiency, and the driver the development serves.
Pass --verbose to switch to a per-driver paragraph
layout that discloses all percentile anchors, contributing skills,
evidence counts, and the two most recent GetDX comments per driver:
npx fit-landmark health --manager alice@example.com --verbose
alice@example.com team — health view
Driver: code_review (72nd percentile)
Anchors: percentile=72, vs_org=+5
Contributing skills: task_completion, planning
Evidence: 12 artifacts for task_completion, 8 artifacts for planning
GetDX comments: "We've been catching more issues in review lately"
"Design docs are getting better but still inconsistent"
⮕ Recommendation: Carol Davis (working) could develop planning.
(Summit growth alignment: high)
Understanding what the health view shows
The health view is designed for conversations about the system, not about individuals. Driver scores are team-level aggregates from GetDX. Evidence counts show how many artifacts across the team match a skill's markers -- not which individual produced them. Comments are surfaced by keyword relevance to the driver, not attributed to specific respondents.
When presenting health data in a quarterly review, the narrative is: "Here is where the system is strong, here is where it is trending, and here is what engineers are saying about it." The data supports that narrative without requiring anyone to name names.
Hear what engineers are saying
GetDX snapshot comments contain direct engineer feedback. Landmark surfaces these comments in two modes, both useful for quarterly preparation.
See comments themed by topic across your team:
npx fit-landmark voice --manager alice@example.com
alice@example.com team — engineer voice
Most discussed themes:
incident 3 comments "On-call handoffs are still rough", "Runbook coverage is improving but gaps remain"
planning 2 comments "Sprint planning feels more realistic this quarter", "Design docs are getting better but still inconsistent"
testing 1 comments "Integration tests saved us twice this month"
Below-50th driver alignment:
incident_response (48th percentile) — 3 incident comments
The manager view buckets comments by theme and counts how many mention each, showing the two most recent snippets inline per theme. It also highlights drivers scoring below the 50th percentile where engineer comments align -- where sentiment matches the quantitative data.
This is valuable for quarterly reviews because it grounds numerical
scores in the team's own words. A low
incident_response score paired with three incident
comments tells a clearer story than the score alone.
Check where evidence supports the standard
Evidence coverage shows whether the team's actual work produces artifacts that match your standard's markers. Two views help here: practice patterns across the team and the gap between derived and evidenced capability.
See practice patterns for your team:
npx fit-landmark practice --manager alice@example.com
Practice patterns
task_completion matched: 12 unmatched: 4 total: 16
planning matched: 8 unmatched: 2 total: 10
incident_response matched: 4 unmatched: 6 total: 10
sre_practices matched: 2 unmatched: 5 total: 7
Each row shows how many marker-matched artifacts exist for the
skill, how many unmatched candidates remain, and the total
considered. Skills with high matched: counts have
strong evidence; rows with low matched and high unmatched signal
where the evidence pipeline is light. Filter to a specific skill for
detail:
npx fit-landmark practice --skill task_completion --manager alice@example.com
Compare what the standard predicts the team should be capable of against what evidence actually shows:
npx fit-landmark practiced --manager alice@example.com
Practiced capability — alice@example.com (4 members)
Task Completion derived: practitioner evidenced: 18 evidence rows
Planning derived: working evidenced: 7 evidence rows
Incident Response derived: working evidenced: 0 ← on paper only
SRE Practices derived: working evidenced: 0 ← on paper only
Architecture Design derived: practitioner evidenced: 0 ← on paper only
Each row aggregates across the team: derived: is the
highest proficiency the team's role definitions imply for the
skill, and evidenced: counts the marker-matched
evidence rows backing it. Rows trailing
← on paper only flag skills the standard predicts but
evidence has not yet caught (the inverse,
← evidenced beyond role, marks skills whose evidence
outruns the derived role profile). This can mean the evidence
pipeline has a gap, or it can highlight a coaching opportunity.
Either way, it is information worth surfacing in a quarterly review
-- it shows where the organization's definitions and actual
practice diverge.
Verify
You have demonstrated engineering progress without surveillance when:
-
Health view renders with data.
npx fit-landmark health --manager alice@example.comshows at least one driver with a score, contributing skills, and evidence counts. No "No GetDX snapshot data available" messages. -
Trends show direction.
npx fit-landmark snapshot trend --item code_review --manager alice@example.comshows scores across multiple snapshots, making the trajectory visible. -
Engineer voice is surfaced.
npx fit-landmark voice --manager alice@example.comshows themed comments with counts. Comments align to drivers without attributing them to specific individuals. -
Evidence backs the story.
npx fit-landmark practiced --manager alice@example.comshows where the team's actual work matches the standard and where it does not -- system-level insight, not individual performance data.
All commands accept --format text,
--format json, or --format markdown. Use
--format markdown to produce output suitable for
sharing in documents and presentations.
What's next
Tell Whether Culture Investments Are Working
Track an initiative's impact on engineering outcomes by reading driver-score trends across the snapshots that straddle its completion date — and assemble a readout that holds up under VP scrutiny.
Get Career Guidance Grounded in the Standard
When a promotion conversation ends with 'not yet' and no specifics, use Guide and Landmark to find what's missing and show concrete evidence of growth.