AI & ML interests
Offline RL datasets
farama-minari/HumanoidStandup-v5-SAC-simple
Reinforcement Learning
• Updated • 7
farama-minari/HumanoidStandup-v5-SAC-medium
Reinforcement Learning
• Updated • 6
farama-minari/HumanoidStandup-v5-SAC-expert
Reinforcement Learning
• Updated • 6
farama-minari/Ant-v5-SAC-expert-fine-tuned
Updated
farama-minari/Humanoid-v5-TQC-simple
Reinforcement Learning
• Updated • 6
farama-minari/Humanoid-v5-TQC-medium
Reinforcement Learning
• Updated • 10
farama-minari/Humanoid-v5-TQC-expert
Reinforcement Learning
• Updated • 12
farama-minari/Swimmer-v5-PPO-medium
Reinforcement Learning
• Updated • 7
farama-minari/Swimmer-v5-PPO-expert
Reinforcement Learning
• Updated • 7
farama-minari/Ant-v5-SAC-simple
Reinforcement Learning
• Updated • 19
farama-minari/HalfCheetah-v5-TQC-simple
Reinforcement Learning
• Updated • 2
farama-minari/Hopper-v5-SAC-simple
Reinforcement Learning
• Updated • 4
farama-minari/Hopper-v5-SAC-medium
Reinforcement Learning
• Updated • 22
farama-minari/Hopper-v5-SAC-expert
Reinforcement Learning
• Updated • 28
farama-minari/HalfCheetah-v5-TQC-medium
Reinforcement Learning
• Updated • 2
farama-minari/HalfCheetah-v5-TQC-expert
Reinforcement Learning
• Updated • 9
farama-minari/Hopper-v5-TQC-expert
Reinforcement Learning
• Updated • 2
farama-minari/Pusher-v5-SAC-medium
Reinforcement Learning
• Updated • 13
farama-minari/Pusher-v5-SAC-expert
Reinforcement Learning
• Updated • 81
farama-minari/Reacher-v5-SAC-medium
Reinforcement Learning
• Updated • 4
farama-minari/Reacher-v5-SAC-expert
Reinforcement Learning
• Updated • 18
farama-minari/Ant-v5-SAC-medium
Reinforcement Learning
• Updated • 32
farama-minari/Walker2d-v5-SAC-simple
Reinforcement Learning
• Updated • 2
farama-minari/Walker2d-v5-SAC-medium
Reinforcement Learning
• Updated • 29
farama-minari/Walker2d-v5-SAC-expert
Reinforcement Learning
• Updated • 127
farama-minari/Reacher-v5-SAC-simple
Reinforcement Learning
• Updated • 4
farama-minari/HumanoidStandup-v5-PPO-medium
Reinforcement Learning
• Updated • 1
farama-minari/HumanoidStandup-v5-PPO-simple
Reinforcement Learning
• Updated • 1
farama-minari/Humanoid-v5-SAC-medium
Reinforcement Learning
• Updated • 9
farama-minari/Humanoid-v5-SAC-simple
Reinforcement Learning
• Updated • 15