
LLM Observability + Guardrails

Ship safer LLM features with a dashboard your team actually uses.

Sigmoda logs every LLM call, estimates cost, flags risky outputs, and gives you fast filters to debug, triage, and control spend.

Start free · View docs
No credit card. Checkout is handled by Stripe.
Live snapshot

  • Calls today: 12,840 (+14% vs yesterday)
  • Spend today: $82.14 (gpt-4o-heavy routes)
  • Flagged outputs (auto-detected): too long 8 • banned phrases 3 • errors 17
Why Sigmoda

Observability and guardrails in one place

Stop hopping between logs, notebooks, and billing exports. Sigmoda keeps cost and quality checks together.

Full-fidelity logging

Capture token usage, cost, latency, and metadata for every LLM call. Prompt/response content capture is optional.

Guardrails that stick

Set max output tokens and banned phrases per project; Sigmoda flags risky responses automatically.

Cost + performance clarity

Calls-per-day and cost-per-day charts plus a per-model breakdown, so you know what to tune first.

How it works

Three steps to ship with confidence

Integrate in minutes with a small Python helper and a project key.

1

Install the tiny Python SDK and add your project key.

2

Send events from your LLM client; Sigmoda ingests them and applies your guardrails.

3

Use the dashboard to slice by route, model, status, or text search and triage flagged events.
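As a sketch, the first two steps might look like this in Python. Everything here is illustrative, not the real SDK: the `SigmodaClient` class, the endpoint URL, and the event field names are stand-ins.

```python
import json
import os
import time
import urllib.request

# Hypothetical ingest endpoint -- the real SDK ships its own.
SIGMODA_URL = "https://ingest.sigmoda.example/v1/events"

class SigmodaClient:
    def __init__(self, project_key=None):
        # Step 1: configure the client with your project key.
        self.project_key = project_key or os.environ.get("SIGMODA_PROJECT_KEY", "")

    def build_event(self, route, model, prompt_tokens, completion_tokens,
                    latency_ms, status="ok"):
        # Metadata-only event: token usage, latency, and status -- no content.
        return {
            "route": route,
            "model": model,
            "usage": {"prompt": prompt_tokens, "completion": completion_tokens},
            "latency_ms": latency_ms,
            "status": status,
            "ts": time.time(),
        }

    def send(self, event):
        # Step 2: ship the event over HTTPS; guardrails are applied on ingest.
        req = urllib.request.Request(
            SIGMODA_URL,
            data=json.dumps(event).encode("utf-8"),
            headers={"Authorization": f"Bearer {self.project_key}",
                     "Content-Type": "application/json"},
            method="POST",
        )
        with urllib.request.urlopen(req, timeout=5) as resp:
            return resp.status

client = SigmodaClient(project_key="pk_test_123")
event = client.build_event("chat", "gpt-4o-mini",
                           prompt_tokens=812, completion_tokens=96, latency_ms=430)
# client.send(event)  # not called here: there is no live endpoint in this sketch
```

Step 3 then happens in the dashboard: slice by route, model, or status and triage whatever got flagged.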

Pricing

Start free, upgrade when ready

Free (Beta): 10k events/mo • 7-day retention • 3 projects. Pro (Early Access): 200k events/mo • 30-day retention • 10 projects.

Free (Beta)

Great for prototypes and early internal usage.

$0
  • 10k events/month
  • 7-day retention
  • 1 user • 3 projects
  • Basic dashboards + filtering
  • Content capture off by default

Pro (Early Access)

For teams shipping customer-facing LLM features.

$49/mo
  • 200k events/month
  • 30-day retention
  • Up to 10 projects
  • Higher ingest rate limits
  • Priority support email
Upgrade to Pro (Early Access)
FAQ

Answers for teams shipping AI

Quick details on guardrails, cost tracking, and deployment.

How does Sigmoda flag risky LLM outputs?

Every event is checked against your project's guardrails: a max-output-tokens limit and a banned-phrase list. Anything that crosses them is auto-flagged so you can triage before it reaches users.
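As an illustration only (not Sigmoda's actual implementation), the two per-project checks amount to something like:

```python
def flag_output(text, completion_tokens, max_output_tokens, banned_phrases):
    """Return the guardrail flags an output would trip (illustrative logic)."""
    flags = []
    # Check 1: output length against the project's token budget.
    if completion_tokens > max_output_tokens:
        flags.append("too_long")
    # Check 2: case-insensitive banned-phrase match.
    lowered = text.lower()
    if any(phrase.lower() in lowered for phrase in banned_phrases):
        flags.append("banned_phrase")
    return flags

# A response that is both over budget and contains a banned phrase:
flags = flag_output("...as your lawyer, I advise...", completion_tokens=640,
                    max_output_tokens=512, banned_phrases=["as your lawyer"])
# flags == ["too_long", "banned_phrase"]
```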

Can I break down spend by model and route?

Yes. Cost and call charts are grouped by day with model-level breakdowns, so you can see which routes use gpt-4o, gpt-4o-mini, or models from other providers and tune the expensive ones first.

Will this work with my existing stack?

Sigmoda accepts plain HTTPS events, so you can plug in any language. The quickstart shows a Python helper, but the API works from JS/TS, Go, or your favorite SDK.
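Because ingestion is plain HTTPS + JSON, any HTTP client can post an event. A minimal Python sketch using only the standard library (the endpoint URL and Bearer-token header are assumptions, not the documented API):

```python
import json
import urllib.request

def build_event_request(url, project_key, event):
    """Assemble a POST for one JSON event; any language's HTTP client does the same."""
    return urllib.request.Request(
        url,
        data=json.dumps(event).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {project_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = build_event_request(
    "https://ingest.sigmoda.example/v1/events",  # hypothetical endpoint
    "pk_live_abc",
    {"route": "search", "model": "gpt-4o",
     "usage": {"prompt": 210, "completion": 48}},
)
# urllib.request.urlopen(req)  # actually send it
```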

Is there a free tier?

Yes. Free (Beta) includes 10k events/month with 7-day retention and up to 3 projects. Upgrade to Pro (Early Access) for higher volume, longer retention, and team support.

Do you store prompts and responses?

Sigmoda can store prompt/response content when you enable content capture. In production, content capture is off by default — you can still log metadata and token usage for filtering and cost visibility.
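For example, the same call logged both ways (field names illustrative): metadata-only by default, with the prompt/response bodies included only when content capture is enabled.

```python
# Default (content capture off): metadata only -- still filterable and costable.
metadata_only = {
    "route": "support-bot",
    "model": "gpt-4o-mini",
    "usage": {"prompt": 640, "completion": 120},
    "latency_ms": 510,
    "status": "ok",
}

# With content capture enabled: the same event plus the actual text.
with_content = dict(metadata_only, content={
    "prompt": "Summarize this ticket...",
    "response": "The customer reports...",
})
```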