Problems with data: - as a representation of something else, it can be incorrect...

virgilp · on Aug 23, 2024

The "problems of data" are not really problems with data, I feel that's what Rich Hickey was alluding to in that discussion (and no, I didn't feel that him & Alan Kay were talking past each other)

> as a representation of something else, it can be incorrect (meaning error)

- So here, you're saying that "Knowledge" may be incorrect. "Sun observed at this position in the sky during various times of day" is data, whereas "Sun moves around the Earth" is (wrong) knowledge. Yes data can contain errors (e.g. incorrect measurements). But Rich Hickey was saying that the fact that data doesn't contain the "interpretation" too is a feature, not a bug!

> as a domain of decision input, it can be misleading (sampling error)

- Right. But at least, it gives you the tools to validate the decision process and identify errors, or potential weaknesses. If you include the interpreter with the data and give direct access to the decision - any error with the interpreter will automatically invalidate all the data (and really it will make it hard to tell whether it's a sampling error, interpretation error, or simply error in the original measurements)

> it gives agents the illusion that they understand, leading to overconfident actions

- On the contrary, KNOWLEDGE does that.

dgb23 · on Aug 23, 2024

I like this rebuttal. It disentangles data from interpretation and knowledge. This distinction helps us to solve problems associated with data and is a core tenet of science and problem solving.

Increasing the amount of generated data and not jumping to conclusions at the same time is how we avoid getting stuck in misconceptions or plain ignorance.