Yes, it’s simply a way to maximize the benefit of cooperation for an agent. Cooperate until the other agent steals then punish immediately but don’t hold a grudge about it and go back to cooperation. It’s relatively simple to simulate and it’s the most fruitful strategy.
I agree, from the quotes I saw they were really mismanaging the response in the discord.