There's a situation where GPT4 is actually (more than) reasonably good at math and science, which signals ability to create novel, reliable algorithms. Granted, the prompter has to recognize errors and prompt for correction, however this can also be accomplished via adding "double check your work" at the end of the first prompt. Akin to "show your work" when working on proofs.