Vantage Integration
Connect Vantage so Autoheal can query your cost data and root-cause anomalies across your cloud providers. When a Vantage cost or anomaly alert fires, the agent pulls the underlying cost time series and pivots to AWS / GCP / Kubernetes tools to identify the workload responsible for the spike.
Capabilities
| Capability | Description |
|---|---|
| Query Cost Data | Pull cost time series for any CostReport or VQL filter |
| List Anomalies & Recommendations | Surface active anomalies and cost-saving recommendations from Vantage |
| Cross-Provider RCA | Combine Vantage data with AWS / GCP / Kubernetes tools to locate the workload driving the spend |
| Trigger Investigations from Vantage Alerts | Relay Vantage cost / anomaly alerts into Autoheal via Jira or Slack (see below) |
Prerequisites
- A Vantage account with API access
- A Vantage API token (a read-only token is sufficient)
Setup
- In Vantage, go to Settings → API Tokens
- Create a new token (read-only is fine) and copy the value
The token is sent as Authorization: Bearer credentials; no OAuth client-credentials exchange is performed, despite what the public OAS suggests.
- Go to Integrations in Autoheal
- Click Vantage
- Enter a name (e.g., "Production Vantage")
- Paste your API Token
- Click Save
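Before pasting the token into Autoheal, it can help to confirm that it works as a plain bearer credential. The sketch below only builds the request. The base URL https://api.vantage.sh/v2 and the /workspaces path are assumptions that should be checked against the current Vantage API reference, and the helper name build_vantage_request is hypothetical.

```python
import urllib.request

# Assumed base URL for the Vantage REST API; verify against the current API reference.
VANTAGE_API_BASE = "https://api.vantage.sh/v2"


def build_vantage_request(api_token: str, path: str = "/workspaces") -> urllib.request.Request:
    """Build a GET request that sends the token as a plain Bearer credential."""
    if not api_token:
        raise ValueError("api_token must be a non-empty string")
    return urllib.request.Request(
        VANTAGE_API_BASE + path,  # the default path is an assumption
        headers={
            "Authorization": f"Bearer {api_token}",
            "Accept": "application/json",
        },
        method="GET",
    )
```

Passing the result to urllib.request.urlopen sends the request; a revoked or mistyped token would be expected to surface as an HTTPError with status 401 or 403.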
Configuration Fields
| Field | Required | Description |
|---|---|---|
| API Token | Yes | Vantage API token from Settings → API Tokens. Read-only is sufficient. |
Triggering Investigations on Vantage Alerts
Vantage's Cost Alerts and Anomaly Alerts only deliver notifications to Email, Slack / Microsoft Teams, and Jira — there's no native webhook destination. To get Autoheal to auto-investigate when Vantage fires an alert, use one of these relays:
Recommended: Jira relay
If you already use the Jira integration with Autoheal:
- In Vantage, open the Cost Alert (or Anomaly Alert) you want Autoheal to investigate
- In Setup App Alerts, add a Jira Alerts destination pointing at the same Jira project that's wired into Autoheal
- Include the anomaly token in the Jira ticket description (e.g., Vantage anomaly: anml_AbCdEfGh1234) so the agent can call vantage_get_anomaly_alert to drill in
- When Vantage fires, it creates a Jira ticket → Jira webhook hits Autoheal → Autoheal opens an investigation with the Vantage anomaly token in the body
The agent will recognise the Vantage anomaly token in the ticket body and pivot directly into Vantage MCP tools for the cost detail, then cross-platform tools (gcloud / aws) for resource-level RCA.
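To illustrate what recognising the token in a ticket body can look like, here is a minimal sketch. The pattern is inferred only from the example token anml_AbCdEfGh1234 (an anml_ prefix followed by letters and digits), so treat it as an assumption about the token format and not as Autoheal's actual matching logic. The function name extract_anomaly_token is hypothetical.

```python
import re
from typing import Optional

# Assumed token shape, inferred from the example "anml_AbCdEfGh1234":
# the "anml_" prefix followed by letters and digits.
ANOMALY_TOKEN_RE = re.compile(r"\banml_[A-Za-z0-9]+")


def extract_anomaly_token(ticket_text: str) -> Optional[str]:
    """Return the first Vantage anomaly token found in the text, or None."""
    match = ANOMALY_TOKEN_RE.search(ticket_text)
    return match.group(0) if match else None
```

When no token is found the function returns None, which corresponds to the case where the agent has to search for the anomaly instead of fetching it directly.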
Slack mention
Have Vantage post the alert to Slack, then @autoheal mention the message in-thread:
@autoheal investigate this Vantage cost anomaly
Manual / agent-on-demand
You can always trigger an investigation by asking the agent directly:
Investigate the Vantage anomaly anml_AbCdEfGh1234
What caused our AWS EC2 spike yesterday? Check Vantage and AWS.
MCP Tools
When agents have access to this integration, they can call:
| Tool | What it does |
|---|---|
| vantage_list_anomaly_alerts | List cost anomaly alerts with filters (provider, service, status, date range) |
| vantage_get_anomaly_alert | Full detail for a single anomaly: amount, baseline, resources, resource_tokens |
| vantage_list_cost_reports | List CostReports in the workspace |
| vantage_get_costs | Pull cost time series for a CostReport or VQL filter, with date_bin + groupings |
| vantage_list_recommendations | List cost-saving recommendations (rightsizing, idle, commitment, …) |
| vantage_list_workspaces | List workspaces the token can access |
| vantage_list_cost_providers | List cloud providers visible in a workspace |
| vantage_list_cost_services | List services per provider (for VQL filter discovery) |
| vantage_test_connection | Verify the API token (debug tool) |
vantage_get_costs is response-size aware: it defaults to a 30-day window, day-level binning, and grouping by service. If a response would exceed the soft size cap, the tool refuses with a structured error telling the agent to narrow the window, coarsen date_bin, or drop groupings — rather than silently returning a partial time series.
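The refuse-rather-than-truncate behaviour can be sketched as a simple size guard. This is an illustration of the idea, not the tool's implementation: the cap value, the error shape, and the name guard_cost_response are all hypothetical.

```python
import json
from typing import Any

# Hypothetical soft cap; the real limit used by the tool is not documented here.
SOFT_CAP_BYTES = 50_000


def guard_cost_response(
    rows: list[dict[str, Any]], soft_cap_bytes: int = SOFT_CAP_BYTES
) -> dict[str, Any]:
    """Return the full series, or a structured refusal when it is too large."""
    size = len(json.dumps(rows).encode("utf-8"))
    if size <= soft_cap_bytes:
        return {"ok": True, "rows": rows}
    # Refuse outright instead of returning a silently shortened series.
    return {
        "ok": False,
        "error": "response_too_large",  # hypothetical error code
        "size_bytes": size,
        "soft_cap_bytes": soft_cap_bytes,
        "suggestions": [
            "narrow the start_date / end_date window",
            "coarsen date_bin",
            "drop groupings",
        ],
    }
```

The refusal carries no rows at all, so the agent cannot mistake a partial series for the complete one.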
How It Works
- A Vantage Cost / Anomaly Alert fires and lands in your Slack or Jira (via Vantage's native delivery)
- The Slack mention or Jira-webhook relay opens an Autoheal investigation
- The agent calls vantage_get_anomaly_alert for full detail, then vantage_get_costs on the originating CostReport to confirm the spike timing
- Using the resources list, the agent pivots to AWS / GCP tools to identify the workload behind the spend
- Findings land on the investigation; you see the root cause without manually digging through Vantage
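The middle steps of this flow can be sketched as follows. The tool names come from the MCP Tools table, but the call_tool interface, the argument names (anomaly_alert_token, cost_report_token), and the cost_report_token response field are assumptions made for illustration; check each tool's argument schema for the real names.

```python
from typing import Any, Callable

# Hypothetical tool-calling interface: call_tool(name, arguments) -> dict.
ToolCaller = Callable[[str, dict[str, Any]], dict[str, Any]]


def investigate_anomaly(call_tool: ToolCaller, anomaly_token: str) -> dict[str, Any]:
    """Fetch anomaly detail, pull the cost series, and collect resources to pivot on."""
    # Step 1: full detail for the anomaly (argument name is an assumption).
    anomaly = call_tool("vantage_get_anomaly_alert", {"anomaly_alert_token": anomaly_token})

    # Step 2: cost time series on the originating CostReport, if the anomaly names one.
    costs = None
    report_token = anomaly.get("cost_report_token")  # assumed field name
    if report_token:
        costs = call_tool(
            "vantage_get_costs",
            {"cost_report_token": report_token, "date_bin": "day"},
        )

    # Step 3: the resources list is what gets handed to AWS / GCP tooling.
    return {
        "anomaly_token": anomaly_token,
        "costs": costs,
        "resources_to_inspect": list(anomaly.get("resources", [])),
    }
```

Injecting call_tool keeps the sketch independent of any particular MCP client, and makes it easy to exercise with a stub.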
Add a context form in Autoheal (Vantage → Configure context) so the Knowledge Agent can pin your most-watched CostReports and provider/service tuples. This lets the agent skip discovery and go straight to root-causing.
Example Queries
Show me the active cost anomalies from the last 7 days
Why did our AWS EC2 spend spike yesterday?
What are the largest cost-saving recommendations Vantage has for our GCP account?
Troubleshooting
vantage_get_costs Returns _truncated
Cost time-series responses can dwarf the agent's context window. When you see _truncated:
- Narrow start_date / end_date
- Coarsen date_bin (use month for windows over 60 days)
- Reduce the number of groupings
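One way to apply these adjustments step by step is sketched below. It assumes start_date and end_date are ISO date strings and groupings is a list, which may not match the tool's actual argument schema; the function name narrow_cost_query is hypothetical.

```python
from datetime import date
from typing import Any


def narrow_cost_query(args: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of the query arguments adjusted to produce a smaller response."""
    narrowed = dict(args)
    start = date.fromisoformat(args["start_date"])
    end = date.fromisoformat(args["end_date"])

    # Windows over 60 days: switch to month-level bins first.
    if (end - start).days > 60 and narrowed.get("date_bin") != "month":
        narrowed["date_bin"] = "month"
        return narrowed

    # Otherwise drop the last grouping, if any remain.
    groupings = list(narrowed.get("groupings", []))
    if groupings:
        narrowed["groupings"] = groupings[:-1]
    return narrowed
```

Each call makes a single adjustment, so the query can be retried after every step and stops shrinking as soon as the response fits.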
API Returns 401 / 403
- Confirm the API Token is current (not revoked) and has access to the workspaces you expect
- If the token has access to multiple workspaces, several tools require you to pass workspace_token; check the tool's argument schema
Agent Can't Find the Anomaly from a Jira Relay
Make sure the Vantage → Jira alert template includes the anomaly token (e.g., anml_AbCdEfGh1234) in the ticket title or description. Without the token, the agent can't call vantage_get_anomaly_alert directly and has to fall back to vantage_list_anomaly_alerts.