You can just do it generation for generation. The only thing hard about it is that it's two explained concepts you need to combine. A model which aces math Olympiad problems shouldn't have any trouble with this whatsoever - unless it's overfitting on them somehow.