$ tensorgen agents eval --suite prod-workflows
◆ Client: InsurTech, claims processing automation
◆ Architecture: multimodal orchestrator + tool-use
◆ Inputs: scanned docs, images, structured claims data
◆ Agents: triage → doc-parser → verifier → HITL-routing
◆ Guardrails: compute bounds, hallucination detection, audit log
✓ STP Rate: 68.4% (+22 pts over baseline agent stack)
✓ Running on client GPUs — no third-party API
$ tensorgen llm deploy --model GLM-5.1 --env on-prem
◆ Client: Healthcare provider, PHI compliance required
◆ Constraint: zero external API calls, full audit trail
◆ Stack: vLLM + GLM-5.1 (371B) + 8× H100 (on-prem)
◆ Optimizations: FP8 quantization, speculative decoding
◆ Benchmarks: frontier-class parity on clinical tasks
✓ Deployed: 180 tok/s, fully air-gapped network
✓ Data never leaves. You own it all.
$ tensorgen sd deploy --pipeline ltx2.3 --env on-prem
◆ Client: Media studio, brand-controlled video gen
◆ Stack: LTX2.3 + custom brand LoRAs
◆ Pipeline: API → batch queue → asset store
◆ Training: fine-tuned on 12K in-house brand assets
✓ Output: 5K clips/day, <12s per generation (H100 cluster)
✓ Models + weights stay fully in-house