It also seems that if you get the agent a score, it will keep trying to increase the score without appreciation of which part of the score is meaningful