The new paper didn't say that the problem wasn't correct, just that their original solution (test) didn't work out. It's true that it seems to be hard to come up with a reliable test for this sort of thing, but that doesn't mean that it's impossible.