I worked on the Semantic Web, designing core data infrastructure, back when it was still hot. It disappeared because of two fatal flaws, both forms of intrinsic non-scalability: one conceptual, one technical.
First, there is no universal "semantic". The meaning of things is ambiguous, and given a broad enough pool of people and cultural contexts, it becomes nigh impossible to converge on a single consistent model for individual terms and concepts in practice. A weak form of this is very evident in global data interchange systems and implementations, and the Semantic Web took this bug factory and dialed it up to eleven. In short, the idea of the Semantic Web requires semantics to be axiomatic and in the real world they are inductive and deeply contextual. (Hence the data model rule of "store the physics, not the interpretation of the physics.") That said, it is often possible to build an adequate semantic model in sufficiently narrow domains -- you just can't generalize it to everything and everybody.
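A minimal sketch of what "store the physics, not the interpretation of the physics" looks like in practice (the names, fields, and thresholds here are hypothetical, just for illustration): persist the raw, unambiguous observation, and derive the contested semantic label at query time, where each consumer can supply its own context.

```python
# Hypothetical sketch: store the measurement itself, not the label.
from dataclasses import dataclass

@dataclass
class Observation:
    sensor_id: str
    quantity: str      # what was physically measured
    value: float
    unit: str
    timestamp: str     # ISO 8601

# The "physics": nobody disagrees about what was recorded.
obs = Observation("thermo-17", "air_temperature", 31.5, "degC", "2018-07-12T14:00:00Z")

# The "interpretation": contextual and contested, so it is computed
# per consumer rather than baked into the stored model.
def is_hot(o: Observation, threshold_c: float = 30.0) -> bool:
    return o.quantity == "air_temperature" and o.unit == "degC" and o.value > threshold_c

print(is_hot(obs))                    # True for this consumer's threshold
print(is_hot(obs, threshold_c=35.0))  # False for a consumer with a different context
```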
Second, implementation at scale requires an extremely large graph database, and graph database architectures tend to be extremely slow and non-scalable. They were back then and still are today. This is actually what killed the Semantic Web companies -- their systems became unusable at 10-100B edges but it was clear that you needed to have semantic graphs in the many trillions of edges before the idea even started to become interesting. Without an appropriate data infrastructure technology, the Semantic Web was just a nice idea. Organizations using semantic models today carefully restrict the models to keep the number of edges small enough that performance will be reasonable on the platforms available.
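To make the scale gap concrete, a rough back-of-envelope sketch; the ~50 bytes per edge figure is my assumption for a lean triple encoding plus indexes, not a number from the comment above. Raw storage is actually the easy part; it's traversal and join performance over a graph this size that falls over.

```python
# Assumed cost per edge: subject/predicate/object ids plus index overhead.
BYTES_PER_EDGE = 50

for edges in (100e9, 1e12, 10e12):
    tb = edges * BYTES_PER_EDGE / 1e12
    print(f"{edges:,.0f} edges -> ~{tb:,.0f} TB")

# 100,000,000,000 edges  -> ~5 TB
# 1,000,000,000,000 edges -> ~50 TB
# 10,000,000,000,000 edges -> ~500 TB
```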
The Semantic Web disappeared because it is an AI-Complete problem in the abstract. This was not well understood by its proponents, and the systems they designed to implement it were very, very far from being able to solve one.
Third, you can't force people to use the correct semantics. They'll use them wrong on purpose for fun and profit: mark disturbing content as wholesome, mark it as whatever is popular at the moment to get it in front of more eyeballs, mark it as something only tangentially related in the hope of a crossover market, or simply mark it wrong because they don't actually know any better.
Yes, I think this is the biggest issue. Since many websites are funded by advertising, they do not want people to be able to extract data from their pages; they want people to "visit" the site and view the ads.
Also, rights. People "own" things like sports fixture lists... they don't want you extracting that data without paying to use it.
The semantic web could be perfect technically, but it was never going to apply to content that people were attempting to "monetise"... which seems to be most of the web's content.
I don’t really understand this argument, because there already are lies published on the internet. What difference does it make if those lies are published in a standardized machine readable format or not?
A human-targeted web structure that contains some lies is still useful for humans because humans can filter out those lies with somewhat satisfactory efficiency.
A machine-targeted web structure that contains some lies is not useful for machines because they can't filter out those lies yet. It might become useful when they can (but that might be a hard-AI problem), but it's simply not usable until that point.
We as humans have a lot of intuitive tools for knowing whether a source of data is trustworthy. AI could possibly approach this ability given enough training... we'd need to do something like add a "trust" score to every node in the graph.
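A toy sketch of that idea (all names and scores below are made up for illustration): treat every published triple as a claim, weight it by the publisher's trust score, and only accept claims with enough trusted support. Where the trust scores themselves come from is, of course, exactly the hard part.

```python
# Hypothetical trust-weighted acceptance of published claims.
from collections import defaultdict

publisher_trust = {              # assumed scores in [0, 1]
    "bbc.example":        0.9,
    "randomblog.example": 0.2,
    "spamfarm.example":   0.05,
}

# (subject, predicate, object) triples with their publisher
assertions = [
    (("MovieX", "genre", "Documentary"), "bbc.example"),
    (("MovieX", "genre", "Wholesome"),   "spamfarm.example"),
    (("MovieX", "genre", "Wholesome"),   "randomblog.example"),
]

support = defaultdict(float)
for triple, source in assertions:
    support[triple] += publisher_trust.get(source, 0.0)

THRESHOLD = 0.5
accepted = [t for t, s in support.items() if s >= THRESHOLD]
print(accepted)  # only the claim with enough trusted backing survives
```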
One being that there is no graph store you and I can practically use as of 2018.
Another being about monetizing the Semantic Web when playing the role of the data/ontology provider. You provide all the data while the consumers (Siri, Alexa and Google Home) get the glory: https://news.ycombinator.com/item?id=18036041
> First, there is no universal "semantic". The meaning of things is ambiguous, and given a broad enough pool of people and cultural contexts, it becomes nigh impossible to converge on a single consistent model for individual terms and concepts in practice
It sounds like the Semantic Web failed because we tried treating a longstanding (and possibly unresolvable) ontological problem as a straightforwardly technical one.
I don’t really follow academic philosophy, but is it known these days if such categorisation problems are even “solvable” in the general case?
Your first point is being worked on, and there are a few upper ontologies in use. I find BFO (Basic Formal Ontology) [1] quite promising.
I believe that we actually can generalize modelling. We have dictionaries filled with definitions; given enough time and discipline, I don't see why we couldn't make them formal. It's not an engineering problem though.
This is why category theory is so important! It allows us to move between axiomatic systems by looking at the structure they have in common. I am convinced that the "semantic web" will be accomplished via some easy-to-use "version control meets functors" GUI program.
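For what it's worth, a toy sketch of what such a structure-preserving translation might look like (every vocabulary and mapping here is invented): a functor in spirit, given as an object map plus a relation map that carries one schema's statements into another's.

```python
# Facts expressed in ontology A
facts_a = [
    ("Author", "wrote", "Novel"),
    ("Novel",  "hasPart", "Chapter"),
]

# A structure-preserving translation into ontology B's vocabulary
object_map   = {"Author": "Creator", "Novel": "Work", "Chapter": "Section"}
relation_map = {"wrote": "produced", "hasPart": "contains"}

def translate(triples, obj_map, rel_map):
    """Map each (subject, relation, object) triple into the target vocabulary."""
    return [(obj_map[s], rel_map[r], obj_map[o]) for s, r, o in triples]

print(translate(facts_a, object_map, relation_map))
# [('Creator', 'produced', 'Work'), ('Work', 'contains', 'Section')]
```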