model-pruning

Community

Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50% sparsity with minimal accuracy loss, or enabling faster inference on hardware accelerators. Covers unstructured pruning, structured pruning, N:M sparsity, magnitude pruning, and one-shot methods.
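
As a rough illustration of the N:M sparsity and magnitude pruning mentioned above, the sketch below keeps the 2 largest-magnitude weights in every group of 4 (a 2:4 pattern), which gives 50% sparsity. It is not taken from this skill and is not Wanda or SparseGPT: those methods also use activation statistics or second-order information when scoring weights, while this example ranks by absolute value only. The function names `prune_n_m` and `sparsity` and the sample weights are hypothetical.

```python
def prune_n_m(row, n=2, m=4):
    """Zero all but the n largest-magnitude weights in each group of m.

    Illustrative magnitude-only criterion; real one-shot pruners
    typically score weights with additional calibration data.
    """
    if len(row) % m != 0:
        raise ValueError("row length must be a multiple of m")
    pruned = []
    for start in range(0, len(row), m):
        group = row[start:start + m]
        # Indices of the n largest-magnitude entries in this group.
        order = sorted(range(m), key=lambda i: abs(group[i]), reverse=True)
        keep = set(order[:n])
        pruned.extend(w if i in keep else 0.0 for i, w in enumerate(group))
    return pruned


def sparsity(row):
    """Fraction of entries that are exactly zero."""
    return sum(1 for w in row if w == 0) / len(row)


# Hypothetical weight row, two groups of four.
weights = [0.9, -0.1, 0.05, -0.7, 0.2, 0.3, -0.4, 0.01]
pruned = prune_n_m(weights)
print(pruned)
print(sparsity(pruned))
```

Because exactly 2 of every 4 weights survive, the zero pattern is regular, which is what lets accelerators with sparse kernels skip the pruned entries.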

Install

skillpm install model-pruning

Format score

95/100

Spec

v1.0

Installs

0

Published

April 1, 2026