cyberneticlibrary

Audit AI agent security configurations

ecc-agentshieldskillsetup L20
affaan-m/agentshield
What it does

Detect prompt injection and jailbreak attempts in agent inputs

Best for

Production agents where defense-in-depth against prompt injection reduces exploit risk without manual review latency.

Inputs
  • · agent input text
  • · rule set
  • · threat model
Outputs
  • · threat score
  • · injection pattern matches
  • · sanitized input
Requires
  • · pattern matching engine
  • · LLM classifiers
Preconditions

Input text available; rule catalog loaded

Failure modes
  • · false positives blocking legitimate input
  • · adversarial prompt bypass
  • · rule set drift
Trust signals
  • · threat scoring rubric
  • · example injection patterns
  • · false-positive tuning guidance