0
Episode
0
Current Score
0
Avg (Last 20)
0
Best Score
Ready - Select an algorithm and click Start
CartPole Environment
Training Progress
1x

Hand-Coded Rules

The simplest approach: if the pole tilts right, push right; if it tilts left, push left. This combines the pole angle with its angular velocity for smoother control.

Formula: action = (angle + 0.1 * angular_velocity) > 0 ? RIGHT : LEFT

No Learning Deterministic Instant

Configuration (Click to Expand)

Higher = faster but unstable
How much to value future rewards
ε = ε * decay after each episode
reward = 1.0
Same seed = same results (for comparing settings)
Add randomness to physics (harder task)