dllmpg commited on
Commit
8626014
·
verified ·
1 Parent(s): d5e67f2

Push Q-Learning agent for CliffWalking-v0

Browse files
Files changed (4) hide show
  1. README.md +52 -0
  2. q-learning.pkl +3 -0
  3. replay.mp4 +0 -0
  4. results.json +1 -0
README.md ADDED
@@ -0,0 +1,52 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - CliffWalking-v0
4
+ - q-learning
5
+ - reinforcement-learning
6
+ - custom-implementation
7
+ model-index:
8
+ - name: qlearning
9
+ results:
10
+ - task:
11
+ type: reinforcement-learning
12
+ name: reinforcement-learning
13
+ dataset:
14
+ name: CliffWalking-v0
15
+ type: CliffWalking-v0
16
+ metrics:
17
+ - type: mean_reward
18
+ value: -13.00 +/- 0.00
19
+ name: mean_reward
20
+ verified: false
21
+ ---
22
+
23
+ # Q-Learning Agent playing CliffWalking-v0
24
+
25
+ This is a trained model of a Q-Learning agent playing **CliffWalking-v0**.
26
+ The agent was trained for 100000 episodes.
27
+
28
+ ## Evaluation Results
29
+ - Mean Reward: -13.00 +/- 0.00
30
+
31
+ ## Usage
32
+ ```python
33
+ import gymnasium as gym
34
+ import pickle
35
+ from huggingface_hub import hf_hub_download
36
+
37
+ def load_from_hub(repo_id, filename):
38
+ pickle_model = hf_hub_download(repo_id=repo_id, filename=filename)
39
+ with open(pickle_model, 'rb') as f:
40
+ downloaded_model_file = pickle.load(f)
41
+ return downloaded_model_file
42
+
43
+ model_data = load_from_hub(repo_id="dllmpg/qlearning", filename="q-learning.pkl")
44
+ q_table = model_data["qtable"]
45
+ env_id = model_data["env_id"]
46
+
47
+ # Example of running the loaded agent
48
+ env = gym.make(env_id)
49
+ raw_state, info = env.reset()
50
+ state_idx = raw_state # CliffWalking uses direct state indexing
51
+ # ... run agent using greedy_policy(q_table, state_idx) ...
52
+ ```
q-learning.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec4e75d164a79a5f852ff518b01223b5b959cbdb6e6a1cfd07d139763182db64
3
+ size 2399
replay.mp4 ADDED
Binary file (79.3 kB). View file
 
results.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"env_id": "CliffWalking-v0", "mean_reward": -13.0, "std_reward": 0.0, "n_eval_episodes": 100, "eval_datetime": "2025-06-15T22:29:06.965639"}