ai-evals
Systematic evaluation framework for AI products using practitioner-driven methodologies.
SkillJury keeps community verdicts, source metadata, and external repository signals in separate lanes so ranking data never pretends to be a review.
npx skills add https://github.com/refoundai/lenny-skills --skill ai-evals
As of May 1, 2026, ai-evals has 1 weekly install and 0 community reviews on SkillJury. Community votes currently stand at 0 upvotes and 0 downvotes. Source: refoundai/lenny-skills. Canonical URL: https://skills.sh/refoundai/lenny-skills/ai-evals.
Source description provided by the upstream listing. Community review signal and install context stay separate from this narrative layer.
Latest reviews
No community reviews yet.
What does ai-evals do?
Systematic evaluation framework for AI products using practitioner-driven methodologies.
Is ai-evals good?
ai-evals does not have approved reviews yet, so SkillJury cannot publish a community verdict.
Which AI agents support ai-evals?
ai-evals currently lists compatibility with Claude Code, Codex, and Skills CLI.
Is ai-evals safe to install?
ai-evals has been scanned by security audit providers tracked on SkillJury. Check the security audits section on this page for detailed results from Socket.dev and Snyk.
What are alternatives to ai-evals?
Skills in the same category include review-management, conversation-memory, coverage, and grimoire-aave.
How do I install ai-evals?
Run the following command to install ai-evals: npx skills add https://github.com/refoundai/lenny-skills --skill ai-evals
More from refoundai/lenny-skills
engineering-culture
Build strong engineering culture using frameworks from 19 product leaders.
managing-up
Strategies for working effectively with managers and executives drawn from 35 product leaders.
negotiating-offers
Negotiate job offers and compensation using strategies from product leaders.
organizational-transformation
Guide organizations toward modern product practices through structural, cultural, and process change.
Alternatives in Software Engineering
review-management
Source details, install context, and public review data are available on the full page.
conversation-memory
Persistent memory systems for LLM conversations with tiered storage and intelligent retrieval.
coverage
Map all testable surfaces in the application and identify what's tested vs. what's missing.
grimoire-aave
Query Aave V3 market data, reserve snapshots, and health metrics across supported chains.