SearchLondonJobs.co.uk

🏛️ London's Premier Job Portal

← Back to London Jobs

Reinforcement Learning Engineer

Company: Appit LLC

Location: montreal (administrative region), London

Posted: May 24, 2026

Apply for This Position

Submit Application

Position Details

APPIT Software Solutions is hiring a Reinforcement Learning Engineer in Montreal, Canada . Design reinforcement learning systems at APPIT Software in Montreal, building adaptive AI agents for optimization, autonomous decision-making, and RLHF alignment of large language models.

Responsibilities

  • Design and implement reinforcement learning algorithms for enterprise optimization problems
  • Build RLHF and reward modeling pipelines for LLM alignment and fine-tuning
  • Develop simulation environments for training and evaluating RL agents
  • Implement multi-agent reinforcement learning systems for complex coordination tasks
  • Optimize RL training stability and sample efficiency using state-of-the-art techniques
  • Collaborate with research teams to translate RL advances into production applications

Requirements

  • 5+ years of ML experience with 2+ years focused on reinforceme...