My understanding is that Google "owns" Reddit in the sense that it paid to use it as a source of training data. And Google paid Reddit so much that it has exclusive rights to that data.
This is probably the reason all of Reddit's free public APIs are gone: to block scraping.
By that logic, GitHub, Stack Overflow, and the rest of the internet are also "only" training data.
X just produces extra valuable training data as a byproduct, much as power plants create certain byproducts that can be sold. Good to see it going to Grok primarily, as other LLMs are far from being truth seeking with their built-in, documented, extreme bias.
That is irrelevant to the invalidity of your original statement. LLMs clearly have no problem with their training data being scraped from all the sites mentioned, regardless of who owns them.
No, it isn't OK to patronize X users with a false precedent. X and Grok work very well together: one can ask a question and get relevant and RECENT posts by X users answering that query, something other LLMs can't really do.
Content created by X users is for X users to find, whether through their feed, basic search, or Grok. There's no foul play here, and how Grok uses data on X is not hard to defend even from a basic "better search" angle. Your "empathize" comment sounds like a "will someone think of the African children" kind of detached waste of breath, something the Chinese call "Baizuo".
It's not patronizing, it's a statement of fact: X is the only social network owned by an AI company (xAI), one that has only one product (Grok), which is trained on data from X, which is user-generated data.