Skip to main content
Back to registry

model-pruning

davila7/claude-code-templates

Installs175
Install command
npx skills add https://github.com/davila7/claude-code-templates --skill model-pruning
Security audits
Gen Agent Trust HubFAIL
SocketFAIL
SnykWARN
About this skill
Use Model Pruning when you need to: Key Techniques : Wanda (weights × activations), SparseGPT (second-order), structured pruning, N:M sparsity Papers : Wanda ICLR 2024 (arXiv 2306.11695), SparseGPT (arXiv 2301.00774) Source : ICLR 2024 (arXiv 2306.11695) Source : arXiv 2301.00774 Magnitude Pruning (baseline): Wanda (weights × activations): SparseGPT (second-order): Unstructured (fine-grained): Structured (coarse-grained): Semi-structured (N:M) : Pruning methods at 50% sparsity (LLaMA-7B): Source : Wanda paper (ICLR 2024), SparseGPT paper - Reduce model size by 40-60% with <1% accuracy loss - Accelerate inference using hardware-friendly sparsity (2-4× speedup) - Deploy on constrained hardware (mobile, edge devices) - Compress without retraining using one-shot methods - Enable efficient serving with reduced memory footprint - Prune individual weights - Higher quality (better accuracy) - No hardware speedup (irregular sparsity) - Prune entire neurons, heads, or layers - Lower quality (more accuracy loss) - Hardware speedup (regular sparsity) - Best of both worlds - 50% sparsity (2:4) → 2× speedup on NVIDIA GPUs - Minimal accuracy loss - Wanda Paper (ICLR 2024) : - Wanda GitHub : - SparseGPT Paper : - SparseGPT GitHub : - NVIDIA Sparse Tensor Cores :

Source description provided by the upstream skill listing. Community reviews and install context appear in the sections below.

Community Reviews

Latest reviews

Sign in to review

No community reviews yet. Be the first to review.

Browse this skill in context
FAQ
What does model-pruning do?

model-pruning is listed in SkillJury, but the source summary is still sparse.

Is model-pruning good?

model-pruning does not have approved reviews yet, so SkillJury cannot publish a community verdict.

What agent does model-pruning work with?

model-pruning currently lists compatibility with codex, gemini-cli, opencode, cursor, github-copilot, claude-code.

What are alternatives to model-pruning?

Skills in the same category include telegram-bot-builder, flutter-app-size, sharp-edges, iterative-retrieval.

How do I install model-pruning?

npx skills add https://github.com/davila7/claude-code-templates --skill model-pruning

Related skills

More from davila7/claude-code-templates

Related skills

Alternatives in Software Engineering