0
Episode
0
Current Score
0
Avg (Last 20)
0
Best Score
CartPole Environment
Training Progress
1x
Hand-Coded Rules
The simplest approach: if the pole tilts right, push right; if it tilts left, push left. This combines the pole angle with its angular velocity for smoother control.
Formula: action = (angle + 0.1 * angular_velocity) > 0 ? RIGHT : LEFT
No Learning
Deterministic
Instant
Configuration (Click to Expand)
▼Higher = faster but unstable
How much to value future rewards
ε = ε * decay after each episode
reward = 1.0
Same seed = same results (for comparing settings)
Add randomness to physics (harder task)