Logical Reasoning via SAT-based Puzzle Solving | OpenReward