This has actually happened already, and it's part of why LLMs are so smart. I haven't tested this, but I'd venture a guess that without Wikipedia, Wikidata, Wikipedia clones, and stolen articles, LLMs would be quite a lot dumber. You can only get so far with Reddit threads and the basic background knowledge embedded in higher-level articles.
My guess is that when fine-tuning and adjusting weights, the lowest-hanging fruit is to overweight Wikipedia-derived sources and down-weight sources like Reddit.
Only a relatively small part of Wikipedia has semantic markup, though. If an article says "_Bob_ was born in _France_ in 1950", where the underlined words are Wikipedia links, you get some semantic info from the links (Bob is a person, France is a country), but you'd still miss the "born" relationship and the "1950" date, since those exist only as raw text.
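For contrast, here's a sketch of what fully semantic markup of that sentence could look like, using RDFa with schema.org vocabulary. This is purely illustrative (Wikipedia doesn't mark up body text this way); the `#bob` identifier is made up:

```html
<!-- Illustrative RDFa sketch, not actual Wikipedia markup: the "born"
     relationship and the date become machine-readable properties
     instead of raw text between two links. -->
<p vocab="https://schema.org/" typeof="Person" resource="#bob">
  <span property="name">Bob</span> was born in
  <span property="birthPlace" typeof="Country"><span property="name">France</span></span>
  in <span property="birthDate" content="1950">1950</span>.
</p>
```

Every sentence of every article would need this treatment for the relationships to be queryable, which is the scale problem in a nutshell.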
The same goes for the rest of the article, with much more complex relationships that would probably be daunting even for experts to mark up in an objective and unambiguous way.
I can see how the semantic web might work for products and services, like ordering food and booking flights, but not for more complex information like the above. Nor is it clear how semantic markup is going to get added to the books, research articles, news stories, etc. that are constantly coming out.
> The semantic information is first present not in markup but in natural language.
Accurate natural language processing is a very hard problem though, and today it's best handled by AI/LLMs. But doesn't that go against the article's point, which was that we shouldn't need AI if the semantic web had been done properly?
Complex NLP is also the opposite of what the semantic web was advocating. Imagine asking the computer to buy a certain product and it orders the wrong thing because the natural language it parsed was ambiguous.
> Additionally infoboxes also hold relationships, you might find when a person was born in an infobox, or where they studied.
That's not a lot of semantic information compared to the contents of a Wikipedia article that runs several pages. Imagine a version of Wikipedia that included only the infoboxes and the links within them.
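To make the contrast concrete, here's a minimal sketch of infobox-style data as subject/predicate/object triples. All the facts here are hypothetical examples, not pulled from any real infobox:

```python
# Hypothetical facts an infobox might yield, as (subject, predicate, object) triples.
infobox_triples = [
    ("Bob", "birthDate", "1950"),
    ("Bob", "birthPlace", "France"),
    ("Bob", "almaMater", "Some University"),
]

# A relationship the article body states in prose but no infobox field captures.
article_claim = "Bob's early work in France influenced a generation of researchers."

def facts_about(subject, triples):
    """Return all (predicate, object) pairs recorded for a subject."""
    return [(p, o) for s, p, o in triples if s == subject]

# The structured view knows only a few isolated facts about Bob...
print(facts_about("Bob", infobox_triples))
# ...while the "influenced" relationship exists only as unparsed text.
print("influenced" in article_claim)
```

The handful of triples is easy to query, but everything the article actually argues or narrates stays locked in prose.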