cyberneticlibrary

Audit AI systems for security risks

ai-securityskillsetup L217,464
alirezarezvani/claude-skills
What it does

Assess AI/ML system for prompt injection and jailbreak risks

Best for

LLM agents or classifiers needing ATLAS-mapped vulnerability assessment pre-deployment.

Inputs
  • · test prompts (JSON array)
  • · target type (llm/classifier/embedding)
Outputs
  • · injection signature matches
  • · risk score (0.0-1.0)
  • · MITRE ATLAS technique mapping
Requires
  • · Python: ai_threat_scanner.py
  • · MITRE ATLAS framework
Preconditions

Test prompts available, authorization for gray-box/white-box access

Failure modes

Skipping authorization, ignoring indirect RAG injection, conflating with app security

Trust signals
  • · Static signature matching
  • · MITRE ATLAS IDs in output
  • · Access-level gating