dexcost Documentation

Complete reference for every public symbol in the dexcost Python SDK, grouped by function.

Parameter	Type	Default	Description
`api_key`	`str \| None`	`None`	API key. Falls back to `DEXCOST_API_KEY` env var.
`storage`	`str \| None`	`None`	Force `"local"` to disable cloud push.
`buffer_path`	`str \| None`	`None`	Path to local SQLite buffer.
`batch_size`	`int`	`100`	Events per sync batch.
`flush_interval`	`float`	`5.0`	Seconds between background sync pushes.
`auto_instrument`	`list[str] \| None`	`None` (all installed)	LLM SDKs to auto-patch. Pass `[]` to disable.
`redact_fields`	`list[str] \| None`	`None`	Field names removed from event details before storage.
`hash_customer_id`	`bool`	`False`	SHA-256 hash `customer_id` before storage.
`track_http`	`bool`	`True`	Patch HTTP libraries for non-LLM cost capture.
`service_catalog_url`	`str \| None`	`None`	URL to fetch an updated service catalog at startup.
`environment`	`str \| None`	`None`	`"development"` enables dev console mode, no cloud push. Falls back to `DEXCOST_ENV`.
`enable_retry_heuristics`	`bool`	`False`	Enable the `RetryHeuristicEngine` for automatic retry detection.
`retry_heuristic_window`	`float \| None`	`None`	Sliding-window size in seconds for heuristic detection.
`retry_heuristic_threshold`	`float \| None`	`None`	Confidence threshold (0.0–1.0) for heuristic flagging.

Parameter	Type	Description
`customer_id`	`str \| None`	Customer identifier propagated to every event.
`project_id`	`str \| None`	Project identifier propagated to every event.
`metadata`	`dict \| None`	Arbitrary key-value metadata.
`agent`	`str \| None`	Agent name used as `task_type` for auto-created session tasks.

Tasks

`dexcost.task()`

@contextmanager
def task(
    task_type: str = "",
    metadata: dict[str, Any] | None = None,
) -> Generator[TrackedTask, None, None]

Context manager for grouping costs into a single business task. Supports both with and async with. Reads customer_id and project_id from set_context().

Parameter	Type	Description
`task_type`	`str`	Identifier for the kind of task (e.g. `"resolve_ticket"`).
`metadata`	`dict \| None`	Optional metadata attached to the task.

Yields a TrackedTask. On clean exit, status is set to "success". On exception, status is "failed" and events are still persisted.

Raises RuntimeError if dexcost.init() has not been called.

`dexcost.get_current_task()`

def get_current_task() -> Task | None

Return the Task currently active in this context, or None.

`dexcost.set_current_task()`

def set_current_task(task: Task | None) -> contextvars.Token[Task | None]

Set the active task and return a token for later restoration via ContextVar.reset().

`dexcost.task_context()` / `dexcost.async_task_context()`

@contextmanager
def task_context(task: Task) -> Generator[None, None, None]

@asynccontextmanager
async def async_task_context(task: Task) -> AsyncGenerator[None, None]

Low-level context managers used internally by CostTracker. Set and restore _current_task for the duration of the block.

Recording costs

`dexcost.record_cost()`

def record_cost(
    service: str,
    cost_usd: Decimal | str,
    *,
    event_type: str = "external_cost",
    cost_confidence: str = "exact",
    pricing_source: str = "manual",
    pricing_version: str | None = None,
    details: dict[str, Any] | None = None,
) -> Event

Record a non-LLM cost against the current active task. Use cost_usd as Decimal or string — never float.

Parameter	Type	Default	Description
`service`	`str`	—	Service name (e.g. `"google_maps_api"`).
`cost_usd`	`Decimal \| str`	—	Cost in USD.
`event_type`	`str`	`"external_cost"`	`"external_cost"` or `"compute_cost"`.
`cost_confidence`	`str`	`"exact"`	One of `exact`, `computed`, `estimated`, `unknown`.
`pricing_source`	`str`	`"manual"`	Source label for the pricing data.
`pricing_version`	`str \| None`	`None`	Hash referencing the rate snapshot.
`details`	`dict \| None`	`None`	Arbitrary extra metadata.

Raises RuntimeError if init() has not been called or no active task exists.

`dexcost.flush()`

def flush() -> None

Force an immediate sync of buffered events to the Control Layer. No-op in local-only mode or if init() has not been called.

`dexcost.close()`

def close() -> None

Stop the SyncWorker and flush pending events. Safe to call even if init() has not been called (no-op).

Tasks — `TrackedTask`

`TrackedTask.record_cost()`

def record_cost(
    self,
    service: str,
    cost_usd: Decimal | str,
    *,
    event_type: str = "external_cost",
    cost_confidence: str = "exact",
    pricing_source: str = "manual",
    pricing_version: str | None = None,
    details: dict[str, Any] | None = None,
) -> Event

Record a non-LLM cost event for this task. Same signature as the module-level record_cost().

Raises ValueError if event_type is not "external_cost" or "compute_cost".

`TrackedTask.record_usage()`

def record_usage(
    self,
    service: str,
    units: int | float = 1,
    *,
    details: dict[str, Any] | None = None,
) -> Event

Compute cost from the rate registry (units × registered_rate) and record the event.

Parameter	Type	Default	Description
`service`	`str`	—	Service name matching a registered rate.
`units`	`int \| float`	`1`	Number of units consumed.
`details`	`dict \| None`	`None`	Arbitrary extra metadata.

Raises ValueError if no rate is registered for service.

`TrackedTask.record_llm_call()`

def record_llm_call(
    self,
    provider: str,
    model: str,
    input_tokens: int,
    output_tokens: int,
    cost_usd: Decimal | str | None = None,
    *,
    cost_confidence: str | None = None,
    pricing_source: str | None = None,
    cached_tokens: int = 0,
    latency_ms: int | None = None,
    details: dict[str, Any] | None = None,
    error_type: str | None = None,
) -> Event

Manually record an LLM call event. When cost_usd is None, cost is auto-computed via PricingEngine.

Parameter	Type	Default	Description
`provider`	`str`	—	Provider name (e.g. `"openai"`).
`model`	`str`	—	Model identifier (e.g. `"gpt-4o"`).
`input_tokens`	`int`	—	Number of input tokens.
`output_tokens`	`int`	—	Number of output tokens.
`cost_usd`	`Decimal \| str \| None`	`None`	Cost in USD. `None` → auto-computed.
`cost_confidence`	`str \| None`	Auto-set	Confidence level.
`pricing_source`	`str \| None`	Auto-set	Source of pricing data.
`cached_tokens`	`int`	`0`	Cached input tokens (receive a discount).
`latency_ms`	`int \| None`	`None`	Response latency in milliseconds.
`details`	`dict \| None`	`None`	Extra metadata.
`error_type`	`str \| None`	`None`	Transient error type (`"rate_limit"`, `"timeout"`, `"5xx"`).

`TrackedTask.mark_retry()`

def mark_retry(
    self,
    reason: str,
    *,
    cost_usd: Decimal | str = Decimal("0"),
    retry_of: uuid.UUID | None = None,
) -> Event

Explicitly flag the current operation as a retry.

Parameter	Type	Default	Description
`reason`	`str`	—	Why the retry occurred (e.g. `"rate_limit"`, `"timeout"`).
`cost_usd`	`Decimal \| str`	`Decimal("0")`	Additional cost incurred by the retry.
`retry_of`	`uuid.UUID \| None`	`None`	UUID of the original event this retries.

`TrackedTask.mark_not_retry()`

def mark_not_retry(self, event_id: uuid.UUID | None = None) -> Event | None

Override a false-positive retry detection. When event_id is None, the most recent retry event for this task is un-flagged. Returns the updated Event, or None if no retry event was found.

`TrackedTask.link_trace()`

def link_trace(self, provider: str, trace_id: str) -> None

Link an external trace to this task. Stored in task.metadata["_trace_links"]. Multiple traces from different providers can be linked.

Parameter	Type	Description
`provider`	`str`	Observability platform name (e.g. `"langfuse"`, `"langsmith"`).
`trace_id`	`str`	Trace or run identifier from the external platform.

`TrackedTask.get_trace_links()`

def get_trace_links(self) -> list[dict[str, str]]

Return all linked traces as a list of {"provider": ..., "trace_id": ...} dicts.

`TrackedTask.end()`

def end(self, status: str = "success") -> None

Close a manually-started task, compute aggregates, and persist. Must be called exactly once for tasks created via CostTracker.start_task().

Raises RuntimeError if called more than once.

`TrackedTask.task_id`

uuid.UUID — the unique identifier for this task.

Instrumentation

`dexcost.instrument_openai()` / `dexcost.uninstrument_openai()`

def instrument_openai(tracker: CostTracker) -> None
def uninstrument_openai() -> None

Patch / restore openai.resources.chat.completions.Completions.create (sync + async).

`dexcost.instrument_anthropic()` / `dexcost.uninstrument_anthropic()`

def instrument_anthropic(tracker: CostTracker) -> None
def uninstrument_anthropic() -> None

Patch / restore anthropic.resources.messages.Messages.create (sync + async).

`dexcost.instrument_litellm()` / `dexcost.uninstrument_litellm()`

def instrument_litellm(tracker: CostTracker) -> None
def uninstrument_litellm() -> None

Patch / restore litellm.completion (sync) and litellm.acompletion (async).

`dexcost.instrument_gemini()` / `dexcost.uninstrument_gemini()`

def instrument_gemini(tracker: CostTracker) -> None
def uninstrument_gemini() -> None

Patch / restore google.genai.models.Models.generate_content (sync + streaming).

`dexcost.instrument_bedrock()` / `dexcost.uninstrument_bedrock()`

def instrument_bedrock(tracker: CostTracker) -> None
def uninstrument_bedrock() -> None

Patch / restore botocore.client.BaseClient._make_api_call — intercepts bedrock-runtime InvokeModel calls only.

`dexcost.instrument_cohere()` / `dexcost.uninstrument_cohere()`

def instrument_cohere(tracker: CostTracker) -> None
def uninstrument_cohere() -> None

Patch / restore cohere.Client.chat and cohere.AsyncClient.chat, plus chat_stream when present.

`dexcost.instrument_mcp()` / `dexcost.uninstrument_mcp()`

def instrument_mcp(tracker: CostTracker) -> None
def uninstrument_mcp() -> None

Patch / restore mcp.ClientSession.call_tool.

`ALL_SUPPORTED_INSTRUMENTS`

ALL_SUPPORTED_INSTRUMENTS: list[str] = [
    "openai", "anthropic", "litellm", "gemini", "bedrock", "cohere", "mcp"
]

The full list of SDK names accepted by CostTracker.instrument() and the auto_instrument parameter.

Pricing & rates

`PricingEngine`

class PricingEngine:
    def __init__(
        self,
        data_path: str | Path | None = None,
        *,
        auto_update: bool = False,
        api_key: str | None = None,
    ) -> None

Calculates LLM costs from model name and token counts. Loads bundled LiteLLM pricing data on construction.

`PricingEngine.get_cost()`

def get_cost(
    self,
    model: str,
    input_tokens: int,
    output_tokens: int,
    cached_tokens: int = 0,
    cache_creation_tokens: int = 0,
) -> CostResult

Calculate the cost for an LLM call. Returns a CostResult.

`PricingEngine.set_custom_pricing()`

def set_custom_pricing(
    self,
    model: str,
    input_per_1k: Decimal | str | float,
    output_per_1k: Decimal | str | float,
) -> None

`PricingEngine.pricing_version`

str — hash of the currently loaded pricing data, for reproducibility.

`PricingEngine.close()`

Cancel the background update timer, if running.

`CostResult`

@dataclass(frozen=True)
class CostResult:
    cost_usd: Decimal
    cost_confidence: str
    pricing_source: str
    pricing_version: str

Result of a PricingEngine.get_cost() call.

Field	Description
`cost_usd`	Calculated cost in USD.
`cost_confidence`	`"computed"` or `"unknown"`.
`pricing_source`	`"litellm"`, `"custom"`, or `"unknown"`.
`pricing_version`	Hash of the pricing data used.

`RateRegistry`

Registry of per-service cost rates for non-LLM services.

`RateRegistry.register()`

def register(self, service: str, per: str, cost_usd: Decimal | str) -> None

`RateRegistry.get()`

def get(self, service: str) -> RateEntry | None

Return the RateEntry for service, or None.

`RateRegistry.load()`

def load(self, path: str | Path) -> None

Load rates from a YAML file. Expected keys: rates.<service>.per and rates.<service>.cost_usd.

`RateRegistry.export()`

def export(self, path: str | Path) -> None

Export current rates to a YAML file, sorted by service name.

`RateRegistry.pricing_version`

str — deterministic SHA-256 hash of all registered rates.

`RateEntry`

@dataclass(frozen=True)
class RateEntry:
    service: str
    per: str
    cost_usd: Decimal

Types

`DexcostConfig`

@dataclass
class DexcostConfig:
    api_key: str | None = None
    storage: str | None = None
    batch_size: int = 100
    flush_interval_seconds: float = 5.0
    buffer_path: str | None = None
    redact_fields: list[str] = field(default_factory=list)
    hash_customer_id: bool = False
    environment: str | None = None

Global SDK configuration. Created by init() and stored globally.

Property	Returns	Description
`storage_mode`	`str`	`"cloud"` if API key is set and `storage != "local"`, otherwise `"local"`.
`key_type`	`str \| None`	`"live"`, `"test"`, or `None`.
`is_sandbox`	`bool`	`True` when using a `dx_test_` key.
`endpoint`	`str`	Control Layer URL. Reads `DEXCOST_ENDPOINT` env var, defaults to `https://api.dexcost.io`.
`is_dev`	`bool`	`True` when `environment == "development"`.

`DexcostContext`

@dataclass
class DexcostContext:
    customer_id: str | None = None
    project_id: str | None = None
    metadata: dict[str, Any] = field(default_factory=dict)
    agent: str | None = None

Attribution context stored in a ContextVar by set_context().

`Task`

The dataclass persisted to SQLite for each tracked business task.

Field	Type	Description
`task_id`	`uuid.UUID`	Unique task identifier.
`task_type`	`str`	Kind of task (e.g. `"resolve_ticket"`).
`customer_id`	`str \| None`	Customer attribution.
`project_id`	`str \| None`	Project attribution.
`parent_task_id`	`uuid.UUID \| None`	Set automatically for nested tasks.
`status`	`str`	`"success"` or `"failed"`.
`started_at`	`datetime`	UTC timestamp when the task was created.
`ended_at`	`datetime \| None`	UTC timestamp when the task was closed.
`total_cost_usd`	`Decimal`	Sum of all event costs.
`llm_cost_usd`	`Decimal`	Sum of `llm_call` event costs.
`external_cost_usd`	`Decimal`	Sum of `external_cost` event costs.
`compute_cost_usd`	`Decimal`	Sum of `compute_cost` event costs.
`total_input_tokens`	`int`	Aggregate input tokens across all LLM calls.
`total_output_tokens`	`int`	Aggregate output tokens across all LLM calls.
`total_cached_tokens`	`int`	Aggregate cached tokens across all LLM calls.
`retry_count`	`int`	Number of events flagged as retries.
`retry_cost_usd`	`Decimal`	Sum of retry event costs.
`metadata`	`dict`	Arbitrary metadata, including `_trace_links`.

`Event`

The dataclass persisted for each individual cost event within a task.

Field	Type	Description
`event_id`	`uuid.UUID`	Unique event identifier.
`task_id`	`uuid.UUID`	Task this event belongs to.
`event_type`	`str`	`"llm_call"`, `"external_cost"`, `"compute_cost"`, or `"retry_marker"`.
`cost_usd`	`Decimal`	Cost in USD.
`cost_confidence`	`str`	`exact`, `computed`, `estimated`, or `unknown`.
`pricing_source`	`str`	Source of pricing data.
`pricing_version`	`str \| None`	Hash of the pricing snapshot.
`service_name`	`str \| None`	Service name for non-LLM events.
`provider`	`str \| None`	LLM provider name for `llm_call` events.
`model`	`str \| None`	Model identifier for `llm_call` events.
`input_tokens`	`int \| None`	Input tokens (LLM calls).
`output_tokens`	`int \| None`	Output tokens (LLM calls).
`cached_tokens`	`int \| None`	Cached tokens (LLM calls).
`latency_ms`	`int \| None`	Response latency in milliseconds.
`is_retry`	`bool`	Whether this event is flagged as a retry.
`retry_reason`	`str \| None`	Why the retry occurred.
`retry_of`	`uuid.UUID \| None`	UUID of the original event this retries.
`occurred_at`	`datetime`	UTC timestamp of the event.
`details`	`dict`	Arbitrary extra metadata.

`EventType`

Enum of valid event_type values: llm_call, external_cost, compute_cost, retry_marker.

`TaskStatus`

Enum of valid task status values: success, failed.

`CostConfidence`

Enum of valid cost_confidence values: exact, computed, estimated, unknown.

`PricingSource`

Enum of valid pricing_source values: litellm, custom, manual, rate_registry, service_catalog, unknown.

`InvalidAPIKeyError`

class InvalidAPIKeyError(ValueError)

Raised by validate_api_key() when an API key does not start with dx_live_ or dx_test_.

`validate_api_key()`

def validate_api_key(key: str | None) -> str | None

Validate API key format. Returns "live", "test", or None if key is None. Raises InvalidAPIKeyError for invalid formats.

`TrackedOpenAI` / `TrackedAnthropic`

Thin wrappers around the OpenAI and Anthropic client classes that ensure a CostTracker is wired before the first call. Useful when you instantiate a provider client before calling dexcost.init().

`ServiceCatalog`

Manages the bundled 163-service catalog. ServiceCatalog.lookup(url) returns a catalog entry for a URL's hostname, or None. ServiceCatalog.extract_cost(entry, headers, body) returns a CostExtractionResult with amount, confidence, pricing_source, and service_name.

`SessionManager`

Manages auto-created "session" tasks for LLM calls that occur outside an explicit dexcost.task() context. get_session_manager() returns the global instance.

`SyncWorker`

Background worker that flushes buffered Event and Task records to the Control Layer. Started automatically by init() when an API key is present and storage != "local". Exposes start(), stop(), and flush().

`CostTracker`

High-level tracker class. The global instance is managed by dexcost.init(). Use directly when you need multiple independent trackers:

from dexcost import CostTracker

tracker = CostTracker(auto_instrument=["openai"])
tracker.register_rate("maps.googleapis.com", per="request", cost_usd="0.005")

with tracker.task(task_type="route_plan") as t:
    t.record_usage("maps.googleapis.com", units=4)

Key methods: instrument(name), uninstrument(name), task(...), start_task(...), track_task(...) (decorator), register_rate(...), load_rates(path), export_rates(path), get_cost(...), set_custom_pricing(...).

tracker.instrumented returns a frozenset[str] of currently patched SDK names.

API Reference

On this page