agent context window cost

Agent Context Window Cost

Every eager MCP schema has an operational cost. The cost is small in one request, then meaningful across thousands of agent sessions.

Open scanner preview

Operating steps

  1. Estimate eager catalog tokens per session.
  2. Multiply by monthly sessions, retries, and model price assumptions.
  3. Compare latency between full schema loading and progressive loading.
  4. Set savings targets that CI can enforce.

Common risks

  • Cost dashboards can undercount retries caused by ambiguous routing.
  • Latency savings depend on client transport and cache behavior.
  • Context savings should not hide security or permission metadata.