← Registry

miles-rl-training

Community

Provides guidance for enterprise-grade RL training using miles, a production-ready fork of slime. Use when training large MoE models with FP8/INT4, needing train-inference alignment, or requiring speculative RL for maximum throughput.

Install

skillpm install miles-rl-training

Format score

100/100

Spec

v1.0

Installs

0

Published

April 1, 2026