Solved classic RL environments
Nitish Pandey
nitishpandey04
AI & ML interests
LLMs, Translation
Recent Activity
upvoted an article about 2 months ago
Deriving the PPO Loss from First Principles
updated a collection 2 months ago
Classic Reinforcement Learning
updated a model 2 months ago
nitishpandey04/CarRacing-v3