Each guardrail is enabled per profile and may operate in block or monitor mode (in-line deployments). Block drops the connection on violation; Monitor passes the prompt through but records the detection on the Alerts page. Worked examples below follow the patterns formalised in CloudsineAI’s internal UAT Test Plans.Documentation Index
Fetch the complete documentation index at: https://docs.protectorplus.cloudsine.tech/llms.txt
Use this file to discover all available pages before exploring further.
Standard Protection
Keyword Blocklist
Inspects both inputs and outputs for words that may be sensitive or violate company policies.Add a keyword rule
Click Add New Keyword and supply:
- Phase: Input or Output
- Title: short label (e.g.,
bomb-block) - Description: free text
- Keywords: the literal strings to match
Regex Pattern Matching
Inspects inputs and outputs for text matching a regular expression. Example: detect confidentiality markers.- Phase: Input (or Output)
- Name:
Sensitive doc pattern - Description: any
- Regular Expression: pattern above
Please share the confidential roadmap, top secret playbook, internal use only memo.
PII Detection
Enabling the PII guardrail activates detection for all supported categories by default. Individual categories can be toggled. Supported categories: Singapore NRIC/FIN, Phone Number, Person, Email Address, Credit Card, Passport Number, and similar identifiers. Worked SG-context test patterns (from UAT Test Plans §2.3):- Credit card:
4716-3018-9386-3240 - SG NRIC/FIN:
S9334193G
ShieldPrompt™
LLM Guardrail (CyberLLM)
Enable: ShieldPrompt → LLM Guardrail → ON. The threshold is configurable per security profile; defaults are tuned for production. Increasing the threshold reduces false positives at the cost of recall. Test prompts:Vector Guardrail (TVDB)
Enable: ShieldPrompt → Vector Guardrail → ON. Sensitivity: Low / Medium / High.- Medium — recommended default.
- High — recommended when stacking with the LLM guardrail and accepting higher false-positive rates for stronger coverage of paraphrased attacks.
Content Moderation
Enable: ShieldPrompt → Content Moderation → ON. Inspects both inputs and outputs; returns a category label when content is flagged unsafe (VIOLENCE, UNETHICAL, HATE, etc.).
System Prompt Protection
Enable: ShieldPrompt → System Prompt Protection → ON. No additional configuration required — Protector Plus detects system-prompt leakage in LLM responses using an in-line LLM-based check.Block vs Monitor
| Mode | Behaviour |
|---|---|
| Block | Connection dropped on detection (inline deployments). Alert recorded as Blocked. |
| Monitor | Prompt passes through to the LLM. Alert recorded as Detected. |

