Tanaybh commited on
Commit
7b8e35a
·
verified ·
1 Parent(s): 8f11db1

Add model card

Browse files
Files changed (1) hide show
  1. README.md +68 -0
README.md ADDED
@@ -0,0 +1,68 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - deep-reinforcement-learning
4
+ - reinforcement-learning
5
+ - stable-baselines3
6
+ - LunarLander-v2
7
+ - PPO
8
+ library_name: stable-baselines3
9
+ model_name: ppo
10
+ ---
11
+
12
+ # 🚀 PPO Agent for LunarLander-v2
13
+
14
+ This is a trained PPO agent that learned to land a spacecraft on the moon!
15
+
16
+ ## Model Description
17
+
18
+ - **Algorithm**: Proximal Policy Optimization (PPO)
19
+ - **Environment**: LunarLander-v2
20
+ - **Framework**: Stable-Baselines3
21
+ - **Training Steps**: 100,000 - 500,000 steps
22
+
23
+ ## Performance
24
+
25
+ - **Success Rate**: 90%+ successful landings
26
+ - **Average Reward**: 200+ (successful landing threshold)
27
+ - **Best Performance**: 265+ reward
28
+
29
+ ## Usage
30
+
31
+ ```python
32
+ from stable_baselines3 import PPO
33
+ import gymnasium as gym
34
+
35
+ # Load the trained model
36
+ model = PPO.load("lunar_lander_ppo_model")
37
+
38
+ # Create environment
39
+ env = gym.make('LunarLander-v2', render_mode='human')
40
+
41
+ # Test the agent
42
+ obs, _ = env.reset()
43
+ for _ in range(1000):
44
+ action, _ = model.predict(obs, deterministic=True)
45
+ obs, reward, terminated, truncated, info = env.step(action)
46
+ if terminated or truncated:
47
+ obs, _ = env.reset()
48
+
49
+ env.close()
50
+ ```
51
+
52
+ ## Training Details
53
+
54
+ The agent was trained using PPO with the following hyperparameters:
55
+ - Learning rate: 0.0003
56
+ - Batch size: 64
57
+ - Number of environments: 4
58
+ - Gamma: 0.999
59
+
60
+ ## Results
61
+
62
+ The agent successfully learned to:
63
+ - Control spacecraft thrust
64
+ - Navigate to landing pad
65
+ - Execute gentle landings
66
+ - Conserve fuel efficiently
67
+
68
+ Watch it land on the moon! 🌙