Reinforcement Learning Engineer
£80,000 - £125,000 / year (£6,667/month) - Negotiable
الوصف الوظيفي
Develop reinforcement learning systems for training and aligning AI models. Work on RLHF and advanced RL techniques.
المتطلبات
- MSc/PhD in ML, Robotics, or related
- 3+ years RL experience
- Strong knowledge of policy optimization
- Experience with RLHF, PPO, DPO
- Proficient in Python and RL frameworks
- Research publications a plus
- 3+ years RL experience
- Strong knowledge of policy optimization
- Experience with RLHF, PPO, DPO
- Proficient in Python and RL frameworks
- Research publications a plus
المسؤوليات
- Implement RL training pipelines
- Develop reward modeling systems
- Optimize RL algorithms
- Collaborate with safety team
- Research new RL techniques
- Document and share learnings
- Develop reward modeling systems
- Optimize RL algorithms
- Collaborate with safety team
- Research new RL techniques
- Document and share learnings
المزايا
- Salary £80,000 - £125,000
- Research-focused role
- Conference attendance
- Stock options
- Premium benefits
- Gym membership
- Research-focused role
- Conference attendance
- Stock options
- Premium benefits
- Gym membership
نظرة عامة على الوظيفة
نوع التوظيف
دوام كامل
مستوى الخبرة
متوسط
الموقع
London, England, United Kingdom
الشواغر
2
تقدم الآن