·
AI & ML interests
NLP, DLA, DNN
Organizations
malanevans/SmolLM3-3B-Jobs-SFT
Text Generation
•
3B
•
Updated
malanevans/smollm3-jobs-sft
Updated
malanevans/a2c-PandaPickAndPlace-v3
Reinforcement Learning
•
Updated
•
2
malanevans/poca-SoccerTwos
Reinforcement Learning
•
Updated
•
3
malanevans/ppo-LunarLander-v2-tuned-2
Reinforcement Learning
•
Updated
•
3
malanevans/ppo-LunarLander-v2-tuned
Reinforcement Learning
•
Updated
•
1
malanevans/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
•
1
malanevans/ppo-PyramidsRND
Reinforcement Learning
•
Updated
•
3
malanevans/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
8
malanevans/pixelcopter-v5
Reinforcement Learning
•
Updated
malanevans/pixelcopter-v4
Reinforcement Learning
•
Updated
malanevans/pixelcopter-v3
Reinforcement Learning
•
Updated
malanevans/pixelcopter-v2
Reinforcement Learning
•
Updated
malanevans/pixelcopter-v1
Reinforcement Learning
•
Updated
malanevans/Reinforce-CartPole1
Reinforcement Learning
•
Updated
malanevans/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
1
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
malanevans/q-FrozenLake-v1-4x4-noSlippery_v2
Reinforcement Learning
•
Updated
malanevans/PPO-LunarLander-v2_v2
Reinforcement Learning
•
Updated
•
1
Reinforcement Learning
•
Updated
malanevans/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
•
1
malanevans/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
2