Learning Algorithms for the Design of Undeceivable Policies to Foster Sustainable Behavior

Company: Télécom SudParis

Location: Évry, France

Posted: June 13, 2026

Position Details

Topic description

Personalized Demand-Side Mitigation Strategies (PDSMS) encourage agents to make sustainable choices.[IE21] A regulator learns agents' preferences by observing their choices and adapts signals, e.g., incentives or prices, accordingly.[Ar18] State-of-the-art PDSMS[Ar23,As21] are based on Random Utility Theory (RUT), which assumes that agents are honest, i.e., that they make choices to maximize their utility.[Be19,§3.1] We instead study the case where agents may be deceptive, making choices to manipulate the regulator and obtain favorable signals. Our objective is to answer the following research questions: Under which conditions do deceptive agents cancel out the benefits of PDSMS? How can PDSMS be made robust to them? These remain open questions, which highlights the novelty of our project. We build our approach on recent advances in AI and Game Theory,[Gan20,Xu21] but our originality is that we will explicitly model the regulator's learning process (missing so far) and show that it can deter agents f...