I don't think I'm the one being presumptuous or demanding. I've actually tried to help you make a stronger argument. Shooting a hundred or even a thousand queries to 3 or 4 LLMs and shoving the results through established sentiment analysis algorithms is something ChatGPT can one-shot in just about any language. You demand people agree with your opinion and refuse to spend 20 minutes supporting it with facts. Not my problem, I tried to help. You may not see it that way. That's fine.
You can't just run a few queries and base conclusion off that, you need to run tens of thousands of different ones and then somehow evaluate the responses. It's a huge amount of work.
Demanding empirical data and then coming up with shoddy half-arsed methodology is unserious.
20 lines of code and some data would really bolster your case, but I don't see them.