
MetaAI_Official OP t1_izfpggs wrote

CICERO always tries to maximize its own score. However, there is a regularizer that penalizes it for deviating from a human-like policy. When all actions have the same expected value (e.g., when it's guaranteed to lose no matter what), it will simply play in a human-like way, which may involve retaliating against those who attacked it. -NB
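
A minimal sketch of the general idea being described (not Meta's actual code): a policy of the form pi(a) ∝ anchor(a) · exp(Q(a) / lam), where `anchor` is a human-imitation policy, `Q` holds expected values, and `lam` controls the strength of the regularizer. The function and variable names here are illustrative assumptions. Note that when every action has the same expected value, the value term cancels and the policy falls back to the human-like anchor, which is the behavior the answer describes.

```python
import numpy as np

def regularized_policy(q_values: np.ndarray, anchor: np.ndarray, lam: float = 1.0) -> np.ndarray:
    """Blend value maximization with a human-like anchor policy.

    Larger `lam` keeps play closer to the anchor (more human-like);
    smaller `lam` prioritizes expected value. Illustrative only.
    """
    logits = np.log(anchor) + q_values / lam
    logits -= logits.max()            # numerical stability
    probs = np.exp(logits)
    return probs / probs.sum()

# Example: all actions have equal expected value (e.g., a lost position),
# so the policy reduces to the human-like anchor distribution.
q = np.array([0.0, 0.0, 0.0])
anchor = np.array([0.7, 0.2, 0.1])
print(regularized_policy(q, anchor))  # ≈ [0.7, 0.2, 0.1]
```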

3