eval-harness
Formal evaluation framework for Claude Code sessions implementing eval-driven development principles.
SkillJury keeps community verdicts, source metadata, and external repository signals in separate lanes so ranking data never pretends to be a review.
No approved reviews yet
Waiting on enough review volume
Weekly or total install activity from catalog data
npx skills add https://github.com/affaan-m/everything-claude-code --skill eval-harness
As of May 1, 2026, eval-harness has 3 weekly installs, 0 community reviews on SkillJury. Community votes currently stand at 0 upvotes and 0 downvotes. Source: affaan-m/everything-claude-code. Canonical URL: https://skills.sh/affaan-m/everything-claude-code/eval-harness.
Source description provided by the upstream listing. Community review signal and install context stay separate from this narrative layer.
Latest reviews
No community reviews yet. Be the first to review.
What does eval-harness do?
Formal evaluation framework for Claude Code sessions implementing eval-driven development principles.
Is eval-harness good?
eval-harness does not have approved reviews yet, so SkillJury cannot publish a community verdict.
Which AI agents support eval-harness?
eval-harness currently lists compatibility with Claude Code, Skills CLI.
Is eval-harness safe to install?
eval-harness has been scanned by security audit providers tracked on SkillJury. Check the security audits section on this page for detailed results from Socket.dev and Snyk.
What are alternatives to eval-harness?
Skills in the same category include review-management, conversation-memory, coverage, grimoire-aave.
How do I install eval-harness?
Run the following command to install eval-harness: npx skills add https://github.com/affaan-m/everything-claude-code --skill eval-harness
More from affaan-m/everything-claude-code
gateguard
A PreToolUse hook that forces Claude to investigate before editing. Instead of self-evaluation ("are you sure?"), it demands concrete facts. The act of investigation creates awareness that self-evaluation never did.
golang-patterns
Idiomatic Go patterns, best practices, and conventions for building robust applications.
security-review
Comprehensive security checklist and patterns for authentication, input validation, secrets management, and sensitive operations.
backend-patterns
Architectural patterns, API design, and database optimization for Node.js, Express, and Next.js backends.
Alternatives in Software Engineering
review-management
Source details, install context, and public review data are available on the full page.
conversation-memory
Persistent memory systems for LLM conversations with tiered storage and intelligent retrieval.
coverage
Map all testable surfaces in the application and identify what's tested vs. what's missing.
grimoire-aave
Query Aave V3 market data, reserve snapshots, and health metrics across supported chains.