Short Definition
Proxy metrics are indirect measurements used to approximate true objectives or outcomes that are difficult, delayed, or costly to observe.
Definition
A proxy metric is a measurable signal used in place of a true target metric when the true metric is unavailable in real time, delayed, expensive to compute, or difficult to measure directly. Proxy metrics are commonly used in training, evaluation, monitoring, and online experimentation to provide faster feedback.
Proxies stand in for what really matters.
Why It Matters
Many real-world objectives—such as long-term revenue, user satisfaction, safety outcomes, or default risk—cannot be measured immediately. Proxy metrics enable iteration and monitoring, but if poorly chosen, they can misrepresent true performance and drive harmful optimization.
Optimizing the proxy does not guarantee optimizing the goal.
Common Reasons for Using Proxy Metrics
Proxy metrics are used when:
- true outcomes are delayed (label latency)
- business impact is long-term
- labels require human verification
- experimentation needs rapid feedback
- deployment constraints limit observability
Proxies trade fidelity for speed.
Examples of Proxy Metrics
Depending on the domain, proxy metrics may include:
- click-through rate as a proxy for engagement
- short-term conversion as a proxy for lifetime value
- prediction confidence as a proxy for correctness
- early fraud flags as a proxy for confirmed fraud
- surrogate loss as a proxy for business cost
Each proxy encodes assumptions.
Minimal Conceptual Illustration
Decision → Proxy Metric → (time) → True Outcome
Relationship to Offline Metrics vs Business Metrics
Offline metrics are often proxies for business outcomes. The farther a proxy is from the real objective, the greater the risk of misalignment between technical performance and real-world impact.
All offline metrics are proxies by design.
Proxy Metrics and Delayed Feedback
Proxy metrics are frequently used to bridge delayed feedback loops. However, proxies may be biased toward short-term signals and systematically misrepresent long-term outcomes.
Delay amplifies proxy risk.
Risks of Proxy Metrics
Poorly chosen proxies can lead to:
- Goodhart’s Law effects (“when a measure becomes a target…”)
- optimization toward unintended behavior
- overfitting to short-term signals
- neglect of rare but costly failures
- erosion of long-term performance
Proxies shape behavior.
Proxy Drift
Over time, the relationship between a proxy metric and the true objective may change due to:
- distribution shift
- user adaptation
- policy changes
- feedback loops
- model updates
Proxy drift breaks assumed correlations.
Evaluation Implications
When using proxy metrics:
- validate correlation with true outcomes
- monitor divergence over time
- avoid treating proxies as ground truth
- re-evaluate proxies after model updates
- complement proxies with delayed outcome audits
Proxies require continuous validation.
Relationship to Online Evaluation
Online experiments often rely on proxy metrics due to latency constraints. Decisions based solely on proxies must be revisited once true outcomes become available.
Online success may be temporary.
Common Pitfalls
- optimizing proxies without validating alignment
- treating proxy improvements as business success
- ignoring proxy drift
- using single proxies for multi-objective systems
- reporting proxies without uncertainty or caveats
Proxies simplify reality—sometimes too much.
Summary Characteristics
| Aspect | Proxy Metrics |
|---|---|
| Purpose | Approximate true objectives |
| Availability | Immediate or early |
| Fidelity | Lower than true metrics |
| Risk | Misalignment |
| Monitoring need | High |