Skip to main content
Back to the directory
wshobson/agentsSoftware EngineeringFrontend and Design

evaluation-methodology

This document is the authoritative reference for how PluginEval measures plugin and skill quality. It covers the three evaluation layers, all ten scoring dimensions, the composite formula, badge thresholds, anti-pattern flags, Elo ranking, and actionable improvement tips.

SkillJury keeps community verdicts, source metadata, and external repository signals in separate lanes so ranking data never pretends to be a review.

SkillJury verdict
Pending

No approved reviews yet

Would recommend
Pending

Waiting on enough review volume

Install signal
2

Weekly or total install activity from catalog data

Sign in to review
0 review requests
Install command
npx skills add https://github.com/wshobson/agents --skill evaluation-methodology
SkillJury does not have enough approved reviews to publish a community verdict yet. Source metadata and repository proof are still available above.
SkillJury Signal Summary

As of Apr 30, 2026, evaluation-methodology has 2 weekly installs, 0 community reviews on SkillJury. Community votes currently stand at 0 upvotes and 0 downvotes. Source: wshobson/agents. Canonical URL: https://skills.sh/wshobson/agents/evaluation-methodology.

Security audits
Gen Agent Trust HubPASS
SocketPASS
SnykPASS
About this skill
This document is the authoritative reference for how PluginEval measures plugin and skill quality. It covers the three evaluation layers, all ten scoring dimensions, the composite formula, badge thresholds, anti-pattern flags, Elo ranking, and actionable improvement tips. Related: Full rubric anchors PluginEval stacks three complementary layers. Each layer produces a score between 0.0 and 1.0 for each applicable dimension, and later layers override or blend with earlier ones according to per-dimension blend weights. Speed: 1 , scores are averaged and Cohen's kappa is reported as an inter-judge agreement metric. Speed: 5–20 minutes. N=50 simulated Agent SDK invocations (default). Statistical.

Source description provided by the upstream listing. Community review signal and install context stay separate from this narrative layer.

Community reviews

Latest reviews

No community reviews yet. Be the first to review.

Browse this skill in context
FAQ
What does evaluation-methodology do?

This document is the authoritative reference for how PluginEval measures plugin and skill quality. It covers the three evaluation layers, all ten scoring dimensions, the composite formula, badge thresholds, anti-pattern flags, Elo ranking, and actionable improvement tips.

Is evaluation-methodology good?

evaluation-methodology does not have approved reviews yet, so SkillJury cannot publish a community verdict.

Which AI agents support evaluation-methodology?

evaluation-methodology currently lists compatibility with Claude Code, Skills CLI.

Is evaluation-methodology safe to install?

evaluation-methodology has been scanned by security audit providers tracked on SkillJury. Check the security audits section on this page for detailed results from Socket.dev and Snyk.

What are alternatives to evaluation-methodology?

Skills in the same category include grimoire-morpho-blue, conversation-memory, second-brain-ingest, zai-tts.

How do I install evaluation-methodology?

Run the following command to install evaluation-methodology: npx skills add https://github.com/wshobson/agents --skill evaluation-methodology

Related skills

More from wshobson/agents

Related skills

Alternatives in Software Engineering