-
Notifications
You must be signed in to change notification settings - Fork 1
Home
SkillFortify is the first security tool that formally verifies what your AI agent skills can actually do -- instead of guessing with heuristic pattern matching. Grounded in five mathematical theorems with full proofs, SkillFortify provides soundness-guaranteed analysis across 22 agent frameworks and 23 IDE configurations. One command scans every AI tool on your system. Zero false positives on the benchmark suite. Open source under MIT License.
In January 2026, the ClawHavoc campaign infiltrated 1,200+ malicious skills into the largest AI agent marketplace. Researchers catalogued 6,487 malicious agent tools that conventional scanners cannot detect. CVE-2026-25253 demonstrated remote code execution through a single compromised skill.
The industry responded with heuristic scanning tools -- YARA rules, LLM-as-judge scoring, regex patterns. Every one shares the same limitation: absence of findings does not mean absence of risk.
SkillFortify solves this with formal verification. It constructs a mathematical model of what each skill can and cannot do -- and verifies that model against declared capabilities. The same category of guarantee used to verify cryptographic protocols and flight control software, applied to agent skills for the first time.
# Install
pip install skillfortify
# Scan your entire system -- auto-discovers ALL AI tools
skillfortify scan
# Scan a specific project
skillfortify scan ./my-agent-project
# Generate an interactive HTML security dashboard
skillfortify dashboard
# List all 22 supported frameworks
skillfortify frameworksRequirements: Python 3.11+ | Works on macOS, Linux, Windows | Runs entirely offline | MIT License
SkillFortify analyzes skills, tools, and configurations across the entire agent ecosystem:
| Tier | Frameworks |
|---|---|
| Major Platforms | Claude Code, MCP, OpenClaw, OpenAI Agents SDK, Google ADK, Anthropic Agent SDK |
| Orchestration | LangChain, CrewAI, AutoGen, Semantic Kernel, LlamaIndex, PydanticAI, Haystack |
| Agent Frameworks | Agno (Phidata), CAMEL-AI, MetaGPT, Composio, Mastra |
| Workflow Builders | n8n, Flowise, Dify Plugins |
Run skillfortify frameworks for the full list with detection patterns.
| Command | Purpose |
|---|---|
skillfortify scan |
Auto-discover all AI tools on your system and analyze every skill found |
skillfortify scan <path> |
Scan a specific project directory |
skillfortify verify <skill> |
Formally verify a single skill file |
skillfortify lock <path> |
Generate skill-lock.json for reproducible configurations |
skillfortify trust <skill> |
Compute multi-signal trust score for a skill |
skillfortify sbom <path> |
Generate CycloneDX 1.6 Agent Skill Bill of Materials |
skillfortify frameworks |
List all 22 supported frameworks with detection patterns |
skillfortify dashboard |
Generate a standalone HTML security report |
skillfortify registry-scan |
Scan remote registries (MCP, PyPI, npm) for supply chain risks |
| Metric | Value |
|---|---|
| Tests | 1,818 passing |
| Supported Frameworks | 22 |
| IDE Profiles | 23 (auto-discovery) |
| Benchmark Size | 540 skills (270 malicious, 270 benign) |
| F1 Score | 96.95% |
| Precision | 100% (zero false positives) |
| Recall | 94.12% |
| Analysis Speed | ~2.5 ms per skill |
| Paper | 31 pages, 5 theorems with proofs |
- Why SkillFortify -- The problem, the gap in current tools, and what formal analysis provides that heuristics cannot
- Formal Foundations -- The five theorems, DY-Skill model, capability lattice, trust algebra, and SAT-based resolution
- Trust Levels -- L0 through L3 trust levels, how trust scores are computed, and trust propagation
- Getting Started -- Installation, system scan walkthrough, dashboard generation
- CLI Reference -- All nine commands with full options, examples, and exit codes
- Supported Formats -- All 22 agent frameworks with detection patterns
- Skill Lock JSON -- The lockfile format for reproducible agent configurations
- ASBOM Guide -- Agent Skill Bill of Materials, CycloneDX 1.6, compliance
- SkillFortifyBench -- The 540-skill benchmark: construction, results, reproduction
| Resource | URL |
|---|---|
| GitHub Repository | github.com/varun369/skillfortify |
| PyPI Package | pypi.org/project/skillfortify |
| Research Paper | arXiv:2603.00195 | Zenodo |
If you use SkillFortify in your research, please cite:
@article{bhardwaj2026skillfortify,
author = {Bhardwaj, Varun Pratap},
title = {Formal Analysis and Supply Chain Security for Agentic AI Skills},
journal = {arXiv preprint arXiv:2603.00195},
year = {2026},
url = {https://arxiv.org/abs/2603.00195}
}Varun Pratap Bhardwaj -- Solution Architect with 15+ years in enterprise technology. Dual qualifications in technology and law (LL.B.), with a focus on formal methods for AI safety and regulatory compliance for autonomous systems.
- ORCID: 0009-0002-8726-4289
- Email: varun.pratap.bhardwaj@gmail.com
MIT License. See LICENSE for details.