Golden raises $14.5M Series A led by a16z and Marc Andreessen joins board

theYipster · on Sept 30, 2020

Having worked with the "original" Watson, I saw first hand how the system stumbled upon a particularly stupid but hard problem as it tried to scale.

In 2014, I saw a demo of the original Discovery Advisor, which was at the time the closest commercial equivalent to the "Jeopardy system." This demo took in Wikipedia as a corpus, and a question was asked: "what country produced the greatest amount of wheat in 2012?" The system returned a list of countries as answers, so it wasn't quite nonsensical, but it was clear the answers were incorrect. The answers were countries like "England," "Norway," or "Zimbabwe." This system also returned passages from Wikipedia as supporting evidence, but the passages weren't about wheat production. Instead, they were about quotes that contained the word wheat... such as "let's cut the wheat from the chaff."

So of course, some smart-alec in the room Googles the same question, and this was before Google had the ability to return factual answers to factual questions, so instead we got a list of web results. The top result, interestingly, was a Wikipedia article titled "Wheat Production by Country." Opening that article presented a table that clearly showed that China produced the greatest amount of wheat in 2012.

Unfortunately, that Watson system at the time didn't read information from tables. I'm not sure if it does now, but I do know that reading data from tables in a manner that can be easily integrated and scaled within a broader semantic processing system is quite difficult. I'm not as focused on the space as I once was, so I'm not sure if the problem has been well solved yet. If not, I'd say it's a worthy area to invest in a solution.

mattmcknight · on Sept 30, 2020

> I do know that reading data from tables in a manner that can be easily integrated and scaled within a broader semantic processing system is quite difficult. I'm not as focused on the space as I once was, so I'm not sure if the problem has been well solved yet.

I saw a presentation on this paper at SIGKDD this year. https://dl.acm.org/doi/10.1145/3394486.3406468 "Multi-modal Information Extraction from Text, Semi-structured, and Tabular Data on the Web"

nl · on Oct 1, 2020

This isn't a solved problem but is something that is very actively researched.

Google's TAPAS system deals with natural language queries on tabular data:

https://ai.googleblog.com/2020/04/using-neural-networks-to-f...

There are other strands of research too - just finding which tables are relevant to a query is a real problem.

dllthomas · on Oct 1, 2020

On the topic of Watson... I just really want Chef Watson back.

tinus_hn · on Sept 30, 2020

WolframAlpha actually has this data but doesn’t understand the question

what country produced the greatest amount of wheat in 2012

If you ask the suggested

country produced most wheat

You do get the table you talk about.

tough · on Oct 1, 2020

Wikipedia just released an API for Tables, that should help?

teruakohatu · on Oct 1, 2020

Do you have a link to that? Google is failing me.

tough · on Oct 2, 2020

Maybe it was this? https://news.ycombinator.com/item?id=24533808

koosha · on Sept 30, 2020

I agree.

Cactus2018 · on Sept 30, 2020

> I'm not sure if it does now, but I do know that reading data from tables in a manner that can be easily integrated and scaled within a broader semantic processing system is quite difficult. I'm not as focused on the space as I once was, so I'm not sure if the problem has been well solved yet. If not, I'd say it's a worthy area to invest in a solution.

In R you can read data from tables like this:

    df<-htmltab::htmltab("http://en.wikipedia.org/wiki/Upper_Peninsula_of_Michigan",3)

In google sheets

    =ImportHtml("http://en.wikipedia.org/wiki/Upper_Peninsula_of_Michigan","table",3)

In Python+Pandas

    df=pandas.read_html('https://en.wikipedia.org/wiki/List_of_postal_codes_of_Canada:_M', header=0)[0]

zaroth · on Sept 30, 2020

In the problem space of “reading data from tables in a manner that can be easily integrated and scaled within a broader semantic processing system”... I would assume that “reading data from tables” isn’t the hard part.

theYipster · on Oct 1, 2020

You would assume correctly. The core issue is that one can't interpret meaning from a table and its values from semantics alone. A table's layout conveys a great deal of meaning.

I remember looking at a couple of systems that would try to do a visual-based zonal tagging of a table, but I think the challenge there was how to logically integrate the zonal tagging into the broader semantic processing of the surrounding text.

Not being able to construe information from tables is a huge stumbling block for semantic and NLP systems for a large number of use cases that incorporate technical content. Automating patent research is one I looked at 6 or 7 years ago and tables tanked the concept. Semantic search over digitized maintenance manuals is another use-case I've wrestled with that's a tough nut to crack if the underlying manuals aren't available in a structured schema.

gerbler · on Sept 30, 2020

Pretty slick but I think calling it "The most comprehensive knowledge platform" is a slight exaggeration. I searched for a few random topics ("urban planning", "pianos" and "bitcoin") and bitcoin was the only one to have a (lengthy) article.

I also don't understand the incentive for users to contribute to a knowledge base that is then being sold: https://golden.com/pricing

ordinaryradical · on Sept 30, 2020

Unfortunately this was created because Wikipedia refused to keep articles on obscure altcoins so it’s very crypto heavy. That’s why a16z has jumped in I’d imagine—they very much drank the crypto koolaid and like the synergy.

This was also why I could never really see this becoming a serious product. It’s an SEO trick for ICOs looking to hype themselves, a far cry from a knowledge base. Mismatched incentives will screw up the value prop.

gerbler · on Sept 30, 2020

Interesting - didn't realize Wikipedia didn't allow such articles.

But in general, Golden seems to have a very strong tech bias - which is fine, but limits what it can be used for.

troymc · on Sept 30, 2020

"On Wikipedia, notability is a test used by editors to decide whether a given topic warrants its own article."

From: https://en.wikipedia.org/wiki/Wikipedia:Notability

Most altcoins are not notable by the Wikipedia standard of notability.

huac · on Sept 30, 2020

> But in general, Golden seems to have a very strong tech bias - which is fine, but limits what it can be used for.

Today, yes, but not necessarily in the future. Facebook started on elite college campuses, built its network, and expanded.

AndrewUnmuted · on Sept 30, 2020

It makes little sense for someone to contribute to a knowledge base like this one.

One of the quotes from the home page - supposedly written by a person who's tried it out - is as follows:

"18 years later, a startup to take on @Wikipedia --> @golden" [0]

This really confuses me. What exactly would it mean to take on Wikipedia in a meaningful sense? You'd have to build the community of Wikipedia with the same kind of values and ethics governing it, then you'd have to get the novel software (the "AI" thing they claim to be using) implemented in a way that doesn't violate said ethics, nor lowers the quality of the information.

Seems like it'd be much better just to work towards putting the AI into use at Wikipedia. Whatever innovation does end up resulting from this company, it will likely get lost if/when it gets acquired, so in the end few if any people will be able to enjoy its benefits.

[0] https://golden.com/home

fab1an · on Sept 30, 2020

Really great to see some actual non incremental innovation happening in the search space.

When looking at Golden's value prop, it becomes clear that Google has actually been somewhat, ehm, lazy when it comes to making search better, relying almost 100% on UGC to provide answers instead of trying to structure them in a concise way.

Very curious to see where this leads!

newguy1234 · on Sept 30, 2020

I agree as well with google. When google first showed up, it was a breakthrough since the quality of the results were so good compared to what we are used to. Fast forward to today - more data, more websites and so on has resulted in new problems and I think Google has not kept up. Just because a website has been around for a decade or has 50,000 backlinks doesn't mean it is still good today. Information might be obsolete. Websites like this usually rank high in the search results page while being low-value. Meanwhile, higher-value websites that are more recent get lower rankings even though the quality is superior. Google is not able to make these connections. It seems like AI might be able to improve the quality of search results if applied correctly.

sethherr · on Oct 1, 2020

To the contrary, new websites with garbage content, swamped by ads - but published recently - often rank above actual quality sites with well written content from 10 years ago.

Not saying what you described doesn’t happen, but both are problems.

paxys · on Sept 30, 2020

Google has had various products and efforts over the years to do exactly that, so I'm not sure if "lazy" is the right word. Knol was doomed from the start because it foolishly went head-to-head with Wikipedia. Other efforts, like Freebase, were fairly successful, and knowledge graph today is pretty great for what it does, both as part of search results but even more so when powering their ML/vision/NLP APIs.

Looking through Golden's website they seem to want to do all of the same, but using their own (also user-contributed) content, aiming to make it accessible and valuable enough that companies will pay $1000 per month per employee (!) for it. I know almost nothing about the product so will hold off too much judgement, but that sounds like a pipe dream.

xoxoy · on Sept 30, 2020

it’s more like a proprietary wikipedia than a google. at least that’s my impression playing around with it.

newguy1234 · on Sept 30, 2020

I could see this being used more for research type of work. Wikipedia is good for the high-level/popular topics but for very specific fields, there won't be anything significant. Think of areas like drug research.

troymc · on Sept 30, 2020

I gather that the knowledge in Golden is more structured, making it more like a database than a wiki, i.e. more like Wikidata or Semantic MediaWiki than Wikipedia.

fab1an · on Sept 30, 2020

it looks like a mix of Wikipedia, Google and a Bloomberg terminal

treelovinhippie · on Sept 30, 2020

> Very curious to see where this leads!

Well they took VC. So it won't lead anywhere interesting other than value enclosure and exit to surveillance capitalism.

memossy · on Oct 1, 2020

Lots of promise and a nice front end, but I'm struggling to see where after three years an example of high value knowledge in this platform is.

For example on the top topic AI most of the sections are unfilled / or lightweight aside from a list of companies with minimal contributions over the last few months: https://golden.com/wiki/Cluster%3A_Artificial_intelligence-J...

There is nothing on AI ethics (indeed this is the only thing on ethics: https://golden.com/wiki/Ethics-BJW8)

Looking at some comments above that maybe crypto was the angle I looked at a few articles and under Ethereum it says that the Constantinople release date was still to be determined (actual release date was 28th February 2019).

It seems there is a decent amount of company data in there (a la pitchbook, crunchbase), but in terms of practical, useful knowledge that is authoritative per the front page, what are some good examples now there?

narrationbox · on Sept 30, 2020

This particular field is difficult. Other than Google (through Search and also its acquisition of Metaweb), no other company has managed to achieve tremendous revenue with a knowledge base product alone. Cyc, Wolfram Alpha, the original IBM Watson (the expert system, not the ML APIs borrowing the brand) are all surviving but not thriving.

smitty1110 · on Oct 1, 2020

Don't forget the original knowledge base product - Bloomberg. They still bring in a lot of cash each year, but as a private company we can't get exact amounts.

Liron · on Sept 30, 2020

I didn't understand Golden's value prop when they raised last year, and wrote a post explaining why [1]. I don't understand it now either.

[1] https://medium.com/bloated-mvp/golden-is-a-bloated-mvp-27971...

Traster · on Sept 30, 2020

I feel I might be too jaded to comment on this, so let me just preface this as the view from outside silicon valley

> our vision to build an extensive database and graph of knowledge for humanity, including practical commercial tools and community features to aid discovery and decisions.

So you, and your, what? half dozen phds? want to produce a graph of human knowledge. Okay fine. That's a lofty goal, let's set you up like Harry Seldon and check in on you in a thousand years. Oh! You're going to do the almost impossible and have practical commercial tools in our life time. Ok.

Look, I'm not saying that Golden is going to be unsuccessful, it's probably going to be very succesful, they've got those guys that backed that misogynistic online frat house behind them, so there's a certain level of assured successs. I just question why blatantly lying about your goals is a pre-requisit for funding in silicon valley.

hazz99 · on Sept 30, 2020

Most companies have a large vision. The creation of a far reaching vision/mission is a standard management technique that allows people to judge actions based on whatever it falls in line with the (long term) vision.

This is taught in business school. It’s not a Silicon Valley thing.

FanaHOVA · on Sept 30, 2020

Good for Jude, he's one of the few people that is always about knowledge and not the flavor of day discussion.

_ugfj · on Oct 1, 2020

Are we in the year yet where Marc Andreessen joining a board is seen as a negative thing for bro culture problems? Not yet?

llarsson · on Sept 30, 2020

Congrats!

Now what, without marketing speak, does it do?

texasbigdata · on Sept 30, 2020

Consume investor money? :)

m3kw9 · on Oct 1, 2020

A corporate knowledge database. I don’t think it is trying to replace google right now.

bfieidhbrjr · on Sept 30, 2020

Serious question, are they building another FreeBase?

electriclove · on Sept 30, 2020

But monetized ala Bloomberg

contingencies · on Oct 1, 2020

... with infinite pockets at the helm and cronies to spin product to which can support the notion of growth far enough in to the hype cycle to secure returns.

Perhaps their CEO will have the moral fiber to sign a commitment guaranteeing a nontrivial (eg. double digit percentage) of annual expenses (not earnings, I'd wager this is a long way from generating profit) going directly to open source database projects they draw from. I'd wager not.

some-people · on Oct 1, 2020

Brings back the question: What problem does it solve? Content? Because it's really lacking. It seems great for a company search, but we have crunchbase for that. I could get a far more detailed explanation on any technical topic from Wikipedia.

frakkingcylons · on Sept 30, 2020

Tried searching for Bach, first result besides a bunch of companies (?) is a Canadian product designer. Johann Sebastian Bach isn’t even in the first page of search results somehow.

dubcanada · on Sept 30, 2020

I am confused as how it can call itself a "knowledge database" when it doesn't even have basic knowledge.

Searched for Falcon, apparently it's a company in the AI industry, has a website falcon.com.cn and is a genus of birds.

Also searched "Apple", got the company, good knowledge base. The fruit Apple is a page that says it's a fruit tree, with the CEO Tim Cook, former CEO Tim Cook, and Timothy Cook.

It seems to just be completely wrong, minus maybe a 100 articles.

I feel like basic encyclopedia information should have at least been pre-populated.

nradov · on Oct 1, 2020

Right Tim Cook is the former CEO. According to an authoritative source the current CEO is Tim Apple.

thesimon · on Sept 30, 2020

Mozart not much better, but more recent artist seem to score better, though the pages don't contain any content and the meta-information is also not great.

A search for Merkel delivers an arms company before Angela Merkel, GDP delivers some companies.

Relationships between persons don't seem to be present.

A search for Mercedes Benz doesnt deliver anything too great. Snowden requires an Edward to find him, NSA is a company, Chrome has no info.

Maybe I'm looking for the wrong terms, but it seems like they basically just imported companies from public domain and some random stuff on the side, which mostly is just the title of something.

tonystubblebine · on Sept 30, 2020

Reminds me of metaweb.

adventured · on Sept 30, 2020

An attempt at an AI enhanced, suped up Wikipedia. Definitely in the model of Freebase.

It'll end in a sell-and-bury exactly as Freebase did, for exactly the same reason: venture capital + knowledge service = only one possible eventual outcome. It's always just a matter of time before the money corrupts the service. The demand for an exit / return (outsized at that, typically) by the owners who have put up a large amount of money forces the matter. Now that big venture capital controls them, they have to pursue revenue and profit as their long-term primary goal for existing, rather than knowledge being at the center of the mission (initially they'll pretend knowledge is at the center of their mission, that will pivot as the return pressure builds on them over time).

When's the IPO? But but but we're a knowledge service, we're here to help humanity. Where's my return? When do I get a 1,000% return on my $10m? But but but we're a knowledge service, we just want to spread knowledge for the betterment of all. Breaking news, July 2024: Golden purchased by Verizon Media [insert big corporate swamp monster here] for $586 million in a fire sale. July 2026, Verizon Media quietly buries Golden.

Andreessen in particular seems bent on driving as many interesting knowledge concepts into the ground as he can. His magic knowledge service touch was all over Rap Genius as well (with dreams of annotate-everything going back to the Netscape days [1]).

There hasn't been a single prominent knowledge service in the history of the Web that has escaped destruction once they've taken big venture capital, except for Stack Exchange and they're starting to teeter on the edge where the owners start to push it in a way that begins the rolling corruption phase (with Stack that inevitable process was delayed for a long time by the influence of its founders and the decisions they made, but eventually papa VC wants his fat return).

The only for-profit knowledge services that survive with their soul intact, are slim independent operations like wikiHow that are not commanded by venture capital and the never-ending need to force an exit.

[1] https://genius.com/Marc-andreessen-why-andreessen-horowitz-i...

"But that's just the start. It turns out that Rap Genius has a much bigger idea and a much broader mission than that. Which is: Generalize out to many other categories of text... annotate the world... be the knowledge about the knowledge... create the Internet Talmud."

"Back in 1993, when Eric Bina and I were first building Mosaic, it seemed obvious to us that users would want to annotate all text on the web"

Bullshit.

breck · on Sept 30, 2020

I think this is a reasonable prediction. To paraphrase something I saw on here recently: "in the long run, business model trumps culture". I think there are hundreds of directions the business could go (Freebase being one, the next Bloomberg another, a simple shutdown being the most likely route in VC startups). But I agree with your skepticism that in the long-run the "we're here to help humanity" ethos will take a back seat to the profit motive.

But all that being said, there's always at least a chance that the organization somehow bucks the trend. Or, even if the organization eventually becomes dominated by the profit motive in the long run, that's not to say that it won't build really beneficial things before that happens. Freebase eventually sold and stopped maintaining it, but it built a free database that anyone in the world could use (and still could use). It pioneered a concept. I don't know what Rap Genius is up to these days, but I thought their annotations ux was really innovative and I'm sure pioneered a whole lot of other sites. So even if an organization's mission eventually takes a back seat to profit, it can create ton of value along the way.

Personally I find this startup very interesting and am excited to see where they go.

tonystubblebine · on Sept 30, 2020

> The only for-profit knowledge services that survive with their soul intact

I agree about the corrupting influence of VC. The following isn't a super popular opinion on HN lately, but this is exactly why I've been a believer in Medium since they launched their subscription service. It's the rare startup where I could see their financial incentives and also think those incentives would be good for me as a reader. They made the knowledge the product and removed the incentive to use the knowledge as a sales pitch for some other product, i.e. content marketing. And they have to constantly push for articles that qualify as subscription worthy. That means focus on quality. I don't think they've tipped over yet, but what I've seen so far is that the more subscribers Medium gets the more they spend to get better and better articles. And as the payouts to authors get better, better authors come on board.

Barrin92 · on Sept 30, 2020

hadn't heard of it. Sounds like the semantic web or Wolfram Alpha. It's very ambitious and I think almost like an AGI type problem, because parsing human queries to the point where the system can actually reason about semantics, and on its own create ontologies and relationships of everything you find on the web that are actually useful and accurate is difficult.

koosha · on Sept 30, 2020

Congrats Jude! Proud of you.

pbronez · on Sept 30, 2020

I wish the pricing/business model supported niche wiki creation. I want to put together a broad public knowledge base about a niche product segment, that connects common data elements for companies and products with deep technical models of the products themselves. Golden's tooling looks super useful, but too expensive for this use case.

miket · on Oct 1, 2020

MediaWiki, the software that Wikipedia uses is open source and Wikibase, the software that WikiData uses is also open source

jamesmishra · on Oct 1, 2020

Sometimes I wonder how critical it is for an early-stage startup to have a great .com domain name.

If Golden had a harder-to-type domain name, would they get the same level of momentum and SEO juice?

wilfredk · on Oct 1, 2020

I am also curious.

Is there is any data on this?

freediver · on Sept 30, 2020

The lack of clear use case reminds me of Qwiki 10 years ago.

prepend · on Sept 30, 2020

Is the name a reference to Dune’s “Golden Path?”

jedc · on Oct 1, 2020

More the "Eternal Golden Braid" - https://en.wikipedia.org/wiki/G%C3%B6del,_Escher,_Bach

xoxoy · on Sept 30, 2020

Isn’t this just a proprietary wikipedia?

brokensegue · on Sept 30, 2020

yeah.

_ugfj · on Sept 30, 2020

Is this another Cyc? https://en.wikipedia.org/wiki/Cyc

ffggvv · on Sept 30, 2020

tried searching "election" and didn't really get any useful information

anxman · on Sept 30, 2020

Congrats Jude! Super star CEO.

tagami · on Sept 30, 2020

Congrats Jude & Team!

KingFelix · on Oct 1, 2020

I just wrote a similar accolade, I've only recently discovered Jude and his work. He seems like a pretty awesome guy, stoked for him.

Glamdring137 · on Sept 30, 2020

Very cool!

KingFelix · on Oct 1, 2020

Congrats Jude and co.!