Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.protectorplus.cloudsine.tech/llms.txt

Use this file to discover all available pages before exploring further.

The table below documents every guardrail surfaced by input-check and output-check, the engine class behind it, and the phase(s) in which it operates.
GuardrailPhaseEngineDescription
LLMInput + OutputLLM-as-judgeScores prompt-injection likelihood 0–1 using a model-based judge.
KeywordInput + OutputRule-basedExact keyword/phrase blocklist match.
RegexInput + OutputRule-basedRegex pattern match against operator-configured patterns.
PII DetectionInput + OutputNER modelDetects names, emails, phone numbers, NRIC/passport, credit cards, etc.
VectorInputSemantic similarity over TVDBSimilarity search against the proprietary Threat Vector Database. Catches paraphrased injection.
Content ModerationInput + OutputPurpose-built classifierClassifies harmful content (violence, hate speech, unethical, etc.).
System Prompt ProtectionOutputIn-line detectionDetects when the LLM is leaking its system prompt in its response.

When each guardrail fires

injection_detected is true in the API response if any active guardrail flags the input or output. Use the per-guardrail block within checks to identify which layer produced the decision and tune accordingly.

Configuring sensitivity

GuardrailTunable
LLMThreshold (per security profile)
VectorSensitivity (Low / Medium / High)
Content ModerationCategory-level enable/disable
System Prompt ProtectionAutomatic in Forwarding mode (no application-side configuration required)
Keyword / RegexOperator-defined rule sets
PIIPer-category toggles (NRIC/FIN, Phone, Email, Person, Credit Card, etc.)