Skip to main content
Back to registry

stable-baselines3

davila7/claude-code-templates

Stable Baselines3 (SB3) is a PyTorch-based library providing reliable implementations of reinforcement learning algorithms. This skill provides comprehensive guidance for training RL agents, creating custom environments, implementing callbacks, and optimizing training workflows using SB3's unified API.

Installs138
Install command
npx skills add https://github.com/davila7/claude-code-templates --skill stable-baselines3
About this skill
Stable Baselines3 (SB3) is a PyTorch-based library providing reliable implementations of reinforcement learning algorithms. This skill provides comprehensive guidance for training RL agents, creating custom environments, implementing callbacks, and optimizing training workflows using SB3's unified API. Basic Training Pattern: Important Notes: Algorithm Selection: Use references/algorithms.md for detailed algorithm characteristics and selection guidance. Quick reference: See scripts/train_rl_agent.py for a complete training template with best practices. Requirements: Custom environments must inherit from gymnasium.Env and implement: Key Constraints: Validation: See scripts/custom_env_template.py for a complete custom environment template and references/custom_environments.md for comprehensive guidance. Purpose: Vectorized environments run multiple environment instances in parallel, accelerating training and enabling certain wrappers (frame-stacking, normalization). Types: Quick Setup: Off-Policy Optimization: When using multiple environments with off-policy algorithms (SAC, TD3, DQN), set gradient_steps=-1 to perform one gradient update per environment step, balancing wall-clock time and sample efficiency. API Differences: See references/vectorized_envs.md for detailed information on wrappers and advanced usage.

Source description provided by the upstream skill listing. Community reviews and install context appear in the sections below.

Community Reviews

Latest reviews

Sign in to review

No community reviews yet. Be the first to review.

Browse this skill in context
FAQ
What does stable-baselines3 do?

Stable Baselines3 (SB3) is a PyTorch-based library providing reliable implementations of reinforcement learning algorithms. This skill provides comprehensive guidance for training RL agents, creating custom environments, implementing callbacks, and optimizing training workflows using SB3's unified API.

Is stable-baselines3 good?

stable-baselines3 does not have approved reviews yet, so SkillJury cannot publish a community verdict.

What agent does stable-baselines3 work with?

stable-baselines3 currently lists compatibility with codex, gemini-cli, opencode, cursor, claude-code, antigravity.

What are alternatives to stable-baselines3?

Skills in the same category include telegram-bot-builder, flutter-app-size, sharp-edges, iterative-retrieval.

How do I install stable-baselines3?

npx skills add https://github.com/davila7/claude-code-templates --skill stable-baselines3

Related skills

More from davila7/claude-code-templates

Related skills

Alternatives in Software Engineering