Skip to main content
Group43227(2) Pn

👋 Welcome to Repello Guard Docs

Repello Guard is a real-time firewall for LLM models and applications. Drop it in front of your LLM call - or behind it - and get instant checks to prevent injections, jailbreaks, data leaks and more.

Why we built Repello Guard

  1. LLMs hallucinate & leak
    Modern LLM models can be jail-broken, reveal hidden prompts, or emit hateful text. Shipping without a guardrail is a brand-risk waiting to happen.
  2. General safety APIs are black boxes
    Most “moderation” services return a single boolean flag. Repello Guard exposes fine-grained violation types & calibrated risk levels so you can auto-block, soft-block, rewrite, or just log.
  3. Latency matters
    End-users bounce when responses crawl. Our P99 latency is < 100 ms from us-east-1, under heavy load.
  4. Vendor freedom
    Works with OpenAI®, Anthropic®, Google Gemini®, Mistral®, local GGUF models, or anything you spin up tomorrow.

Key capabilities at a glance

CapabilityWhat it catchesSample violation enums
Jailbreak / prompt-injection detection”Ignore previous instructions …” attemptsPROMPT_INJECTION,UNSAFE_PROMPT
Toxicity & hate speech, Bias & stereotypingHarassment, Slurs, Threats, Demographic Prejudice, Political BiasTOXIC_PROMPT,UNSAFE_PROMPT
Competitor vetoMentions or defamation of specified brandsCOMPETITOR_MENTION
Banned topicsAnything you blacklist (e.g. self-harm, medical)BANNED_TOPICS
System-prompt leakageSemantic overlap score 0-1, checks for system prompt leak in the responseSYSTEM_PROMPT_LEAK
Policy CheckViolations of set policies and guidelinesPOLICY_VIOLATION

How it fits in your stack

Use the API as a proxy to your LLM calls. Pass the input prompts through Repello Guard to detect any of the above violations, pass the outputs from the LLMs to prevent leaks, moderate content and prevent brand reputation damage.
Contact us for an on-prem deployment in your Virtual Private Cloud.

Supported languages & scripts

Repello Guard accepts any UTF-8 string and identifies 100+ natural languages (Latin, Cyrillic, CJK, RTL, etc.).

Performance and Uptime

MetricSLA / Typical
P99 latency< 100 ms (us-east-1)
Throughput3 k req/s per tenant (burstable)
Uptime99.9 % rolling 30 day