Stack · Zyphh

The Tools We Use to Build AI Apps and Agents at Zyphh

Our opinionated, proven stack across models, evals, monitoring, data layer, and deployment—what we use in production for clients.

11 min read Tooling breakdown Safe by design
5Core layers we standardize
3Eval gates before go-live
24/7Monitoring coverage
2–6 moPayback we target

Models

Retrieval & data layer

Indexes: pgvector, Weaviate, or native embeddings in Postgres.
ETL: Airbyte/Fivetran for sync; dbt for modeling.
Storage: Postgres/BigQuery; S3/GCS for blobs.

Tools & actions

Evals and quality

Monitoring & ops

Tracing: Structured logs and spans across every call.
Alerts: Latency, cost, refusal, and error thresholds.
Replay: Record-and-replay for failures and audits.
Approvals: Human-in-the-loop for risky actions and PII.

Deployment

Opinionated, flexible, and proven—the stack we deploy depends on your risk, data, and goals, not hype.
Design your stack See more insights

FAQ

Can you work on-prem?

Yes. We deploy local models and isolated services when data residency is required.

Do you support SOC2/GDPR?

We design for compliance: access controls, logging, data minimization, and regional routing.

How do you handle drift?

Automated evals, alerts, and staged rollouts with rollback paths.

Can we use our vendor list?

Yes. We adapt to your approved stack while maintaining our guardrails and evals.