Every forward-looking statement management makes gets captured at the time and scored when the period arrives. The result is a durable record of promises vs. delivery, comparable across your coverage.
Score Management Against What They Said They'd Do
A 40–60-name coverage list produces hundreds of forward-looking statements every quarter. Logging each one, then re-checking it when its period closes, is the part that defaults to "I'll do it next time" and never quite gets done. Guidance Tracking does the logging and the re-checking continuously, and presents the result as a comparable record across every name you cover.

The defensibility of that record is what makes it useful. Baik, Farber, and Lee (2011) found that managers who issue frequent and accurate forecasts run firms that outperform peers on both operating and stock performance. Scoring management against their own commitments survives an IC challenge in a way that "their CEO is impressive" does not.
Official Guidance Matters, but the Edge Is in the Informal Forecasts
Official guidance is carefully managed and widely tracked. Revenue ranges, EPS bands, and the formal outlook are already covered by every major data vendor. The differentiated signal lives in the operational commentary around it: production volumes, user growth, factory openings, customer adoption, milestone timing. Executives speak more freely on operations, and that is where their actual confidence shows up.
Guidance Tracking covers official guidance, but the work nobody else does is the rest: every informal forward-looking statement an executive makes across earnings calls, press releases, investor presentations, interviews, and capital market days, all picked out of the automated ingestion pipeline the moment the document is published.
Two Statement Types, Both Testable
- Metric-based: quantitative targets for financial or operational measures (revenue, margins, production output, unit economics), at the company level or by segment, product line, or geography.
- Event-based: milestone predictions, externally visible (product launches, factory openings, regulatory approvals) or internal but strategically important (reorganizations, hiring plans, technology rollouts, cost-reduction initiatives).
Each statement is standardized with a clear metric or event, timeframe, scope, and target.
Vague Language, Made Testable
The hard part of this work is translating loose phrasing without losing what management actually said. Guidance Tracking does it with documented assumptions and the original quote attached to each interpretation:
- "low to mid single digits" becomes a 1–5% range
- "flat q/q" becomes a tolerance band around the prior period
- "later in the year" becomes a date range with documented assumptions
- "similar to last quarter" becomes a relational test against the prior result
Reviewers can see how each translation was resolved and challenge it if they disagree.
A Score You Can Take to an IC Meeting
When the period arrives, each statement is automatically labeled "met", "exceeded", "missed", or "dropped" (when there is insufficient disclosure to evaluate). Metric-based scores read against the reported result on the basis management guided to, with adjusted vs. as-reported preserved. Event-based scores are validated against company communications and external reporting, since launches and factory openings are often visible publicly without a formal filing.
A management team that hits 80% of operational forecasts over six quarters reads differently from one that hits 50%. That difference is the part of management quality that holds up in an IC memo.
Start your free evaluation
Analyze 15 leading companies immediately. No registration, no credit card, no sales call.
How It Shows Up in an Earnings Review
The most useful integration is the simplest one. When a Deep Research Agent produces an earnings review, last quarter's guidance evaluations appear in a sidebar against the current call commentary. Reading the call against the prior promises, rather than in isolation, is most of the analytical work already done.
The same evaluations are queryable in AI Analyst Chat for ad-hoc questions, and the full statement-level history is browsable as structured tables in the Marvin Labs platform. For the end-to-end management-quality workflow, see the Management Quality Assessment solution.

