Robot Arm

Mode: Human Control

Epsilon: 1.0

Episode: 0

Reward: 0

Robot Arm Control Modes

Human Control Mode

In this mode, you can control the robot arm directly:

Your demonstrations are recorded and used to help train the AI agent.

RL Agent Mode

Watch the AI agent learn to control the arm through trial and error:

Training Process

The agent learns through these steps:

  1. Collects experiences (state, action, reward, next state)
  2. Stores experiences in a replay buffer (mixed with human demos)
  3. Periodically updates its neural network to predict better actions
  4. Gradually reduces random exploration as it improves

The epsilon value shows how often the agent takes random actions (1.0 = always, 0.0 = never).