It doesn't imply that at all. That's like saying openai is creating a human level intelligence with chatgpt. Emulating a single function a human is able to perform really well is not the same as aiming for human level intelligence.
This has nothing to do with ChatGPT. They claim they only need vision because a human only needs vision. Although that statement in of itself is false because humans have other senses, but they can't know how much complexity of the human brain is required to make only vision image processing work. If they can't at least replicate that level of intelligence then they have no business making such a claim.