The Age of PageRank Is Over

widdershins · on Nov 9, 2022

I've been using Kagi the last few months. I've never had a reason to complain about its search results - they seem to work plenty well enough day to day that I don't really 'notice' them. Like most, I now search reflexively, as an extension of the mind, so I only 'notice' search when it's bad.

What really excites me is that is that I'm paying them. That sounds odd, but seriously. It's incredibly refreshing to know that the company providing my search results has an incentive to make things better for _me_ and not a legion of advertisers. With Google I can't help thinking about every keypress being logged to optimize sales pitches at me. I just don't feel that with Kagi, because I'm paying them.

Sure, they might be logging every keypress (I don't actually think they are, but you never can tell) but even if they were, I could be reasonably certain they were doing it to retain my subscription, which probably means making my search better, not selling me other stuff.

It's a priveleged position to be in, and the economic argument isn't watertight, but in the "search as a brain extension" space it still _feels_ premium, because it creates trust. And that frees up brain space for other things - like where the hell was that article I was looking for?

fardo · on Nov 10, 2022

>It's incredibly refreshing to know that the company providing my search results has an incentive to make things better for _me_ and not a legion of advertisers

Cable television and netflix have made it quite clear that payment does not mean “no ads”, it just means “no ads, yet”.

It still might be a better alternative in the short term, but the moment growth starts to peak, companies follow where the incentives lead them.

andyfleming · on Nov 10, 2022

I'm a little tired of the rhetoric about Netflix adding ads. It's only on an entry-level account that is subsidized from the normal pricing (IIUC, correct me if I'm wrong). You could argue that they've artificially inflated the price of the normal plan or that it's a slippery slope, but it's not like they've forced ads on the existing plans.

I think what's more interesting/concerning/insidious is hidden ad content like product placement. It's getting to the point where personalized product placement could be embedded in shows dynamically.

vintermann · on Nov 10, 2022

It's not really about Netflix. The point is merely that "just because you pay them, doesn't mean you're not also a product to them". Companies love nothing more than to charge both ends of a deal.

Take Spotify as another example. I pay them, along with 188 million other people. Does that mean they won't turn around and ask artists, record companies and rights-owning conglomerates for money (or rebates) for putting their stuff in front of me? Of course not. Paying them means they have some interest in pleasing me, but it's far from the only way.

On the off chance that Spotify was being scrupulous about not taking payola in any form, it would be impossible for me to verify. Which is in itself a reason for them to cheat; they don't get much economic benefit from goodwill if no one can actually see them being honest.

This is not a reason to not use Kagi, it's just a reminder of what forces we're up against. Kagi will need an unusual amount of transparency in everything they do, in order to stand a chance in the long term.

And it IS a reason to not get warm fuzzy feelings merely from the fact that you're paying them.

FrontierPsych · on Nov 13, 2022

What you say is true.

However, for sure that will get out eventually.

If that happens, I would then look at other search engines as well.

Most people wouldn't once they get used to doing it one way, so that's all to the good for Kagi. But people like me wouldn't then use it, which is what life is about - your own personal choices.

I prefer to pay for services because at least there is a slight chance that they will follow their business plan.

But nothing stays the same for forever. As a consumer, we must also change to the changing environment. I'm completely weened off Google, for quite a while now, for example. I'm paying for everything that I use. But it is so little as to be laughable. For example, I use tutanota for my email, first account is free and subsequent ones are $1/month. Big whoop. $12 per year for private emails. I have a number of emails through them because I segregate all my emails - one for just friends, one for business, one for education, etc.

falsenapkin · on Nov 10, 2022

I would consider the Netflix UI to be ad laden for some years now with hundreds of originals shoved in your face in a non-customizable fashion and autoplaying etc. Also personalized thumbnails based on past watching behavior. You could argue that it's a good UX for most people, but for me it's always felt a bit more like user hostile marketing.

dmitriid · on Nov 10, 2022

Netflix doesn't have enough content, so they resort to these tactics.

All major content providers left them years ago and started their own Netflixes. So they're left with a few movies from ten years ago, a couple of recent-ish releases, and their own content.

wardedVibe · on Nov 10, 2022

really, content creators should be banned from creating markets like Netflix (and eventually the other way around), but that would require the regulation to not be asleep at the wheel.

scarface74 · on Nov 10, 2022

Are you really saying that creators shouldn’t be able to publish their own content on their own infrastructure?

wardedVibe · on Nov 10, 2022

Should've specified I meant movies and other large conglomerates that tend towards oligopoly. I mean things like Disney+ shouldn't exist, its extending the concentration of market power vertically too.

scarface74 · on Nov 10, 2022

I real wish that HN wannabe lawyers would stop throwing *poly words around with no legal justification.

In the streaming space in the US there is: Netflix, Disney+, AppleTV, Amazon Prime Video, HBO/Discovery, Paramount Plus, Peacock, STARZ, and a few other players. There is no “opoly” in streaming video.

eskaytwo · on Nov 10, 2022

That’s still a small number of players dominating the market. That’s the definition of an Oligopoly, and the Disney/HBO offerings very much fall into that.

They certainly have the ability to uniformly raise prices (tacit collusion) with no viable competition to enter the market and fill the gap (as even the vast sums others have thrown into it have shown how hard it is to produce good original content).

This is probably due to the characteristic that producing goods (decent original content) in this market is a big barrier to entry - which naturally leads to a small number of players. Natural monopolies and oligopolies are common - but do require closer regulatory attention to ensure desirable consumer outcomes than just letting the free market decide.

It may not entirely fit all definitions, but the general economic theory and applications/implications are relevant to consider this market.

The original post that you questioned was related to vertical integration - you could effectively find and replace it to “so you’re saying producers of operating systems shouldn’t be able to make their own web browser?”

scarface74 · on Nov 10, 2022

> That’s still a small number of players dominating the market. That’s the definition of an Oligopoly, and the Disney/HBO offerings very much fall into that.

The market is “content”. Netflix competes with YouTube content producers, TikTok and it even said that one of its biggest competitors is Fortnite.

> This is probably due to the characteristic that producing goods (decent original content)

This goes back to YouTube. You and I may not think that YouTubers and TikTokkers are producing “decent original content”. But there is a generation that spends hours on both.

Besides that, there were over 550 original series being produced last year (https://collider.com/too-many-tv-shows-550-series-2021/). Competition is much fiercer for your attention than it was when you only had the three major networks producing content and everyone else buying rights to show reruns.

There are bidding wars between all of the streaming services for new content from producers. Competition is more fierce than ever.

The price of streaming before was never sustainable. Netflix was borrowing billions a year for years to produce and obtain content. Disney+ was never going to be profitable selling its service at the introductory price. It’s not “collusion”. Every company has to turn a profit eventually.

Yes I realize that Netflix was “profitable” by GAAP standards. But it was getting deeper in debt every year.

dmitriid · on Nov 10, 2022

> Netflix competes with YouTube content producers, TikTok and it even said that one of its biggest competitors is Fortnite.

It competes with these for screen time, not for content.

Meanwhile Disney owns how much content (movies, series and related IP)?

scarface74 · on Nov 11, 2022

Very little as a percentage of all the professional content that’s in the world.

And the value of IP without execution capability is overrated. Warner also has iconic content. But Disney has been able to make successful movies out of its 3rd tier IP while Warner has struggled with its first tier.

Sony isn’t doing too well with most of its Marvel content except Spider-Man and that’s produced by Disney.

bryanrasmussen · on Nov 10, 2022

>The original post that you questioned was related to vertical integration - you could effectively find and replace it to “so you’re saying producers of operating systems shouldn’t be able to make their own web browser?”

I think time has shown MS was correct in making their own browser, but of course incorrect in all the corrupt tactics they used to make their browser succeed over Netscape's.

scarface74 · on Nov 11, 2022

This is severe rose colored glasses. Netscape was always a horrible application and crash prone on every operating system it ran on. There were geek wars back in the day bragging about how well our operating systems handle Netscape crashes - classic MacOS failed miserably.

IE was a much better browser. Especially the Mac version that when it was introduced, was the most CSS compliant browser that existed.

Netscape starting over from scratch was cited as “Things you should never do” in an article written by Joel Spolsky (StackOverflow cofounder) two decades ago.

https://www.joelonsoftware.com/2000/04/06/things-you-should-...

asiachick · on Nov 10, 2022

It's not just netflix. I hate the my Google TV pops up with ads for media. My PS5 boots straight into the store. At least my Apple Tv mostly doesn't shove ads in my face though my iPhone seems try try to shove Apple Music and/or Apple TV+ at me now and then

Gareth321 · on Nov 10, 2022

1. Increase the price of the normal plans.

2. Create a new plan at the price of the old plan. With ads.

I'm sorry, but pretending that they're not adding ads to their "regular" pricing plan is semantic at best. This isn't something they dreamed up overnight. They've been planning this for years and increasing their prices accordingly.

UberFly · on Nov 10, 2022

I agree with your product placement comment. I feel gross when I notice it. I also wonder what happens when Netflix decides it makes way more money on people watching the ads - ie. Google initially calling ads a detriment to search quality but not being able to resist the $. I could see a day when they remove the no-ads plan.

DoingIsLearning · on Nov 10, 2022

Chromecast screensavers _are_ an add for other shows in Netflix.

I am paying and have no way to disable that visual pollution nonsense.

Extremely exasperating when you are trying to choose something adequate for toddlers and they keep seeing flashy stuff on the screen and saying they want to watch that.

Double_a_92 · on Nov 10, 2022

It's not about the user having to see ads. It's about Netflix trying to sell ad spots to advertisers, which might affect all (even paying) users. I.e. by getting pressured into censoring or promiting certain shows, to close the advertising deal.

irrational · on Nov 10, 2022

> It's only on an entry-level account

So far. I imagine they will be expanding it over time.

themitigating · on Nov 10, 2022

HBO started broadcasting 50 years ago, charging a monthly fee for commercial free movies (often released much earlier than on cable) and tv shows.

With the exception of filler ads for their own content that occurs when they need to wait for the next quarter time, ex movie ended at 12:55pm. There have been no ads

lozenge · on Nov 10, 2022

It's the exception that proves the rule. HBO has always marketed as a premium service, while Netflix's goal is to reach every home.

HWR_14 · on Nov 10, 2022

I mean, also Showtime, Starz, Cinemax...

To say nothing of the fact that Hulu has maintained an "ad" and "ad-free" tier for their original programming.

ghayes · on Nov 10, 2022

Aren't those called "promos" since they aren't ads, specifically as they aren't paid for?

Eddy_Viscosity2 · on Nov 11, 2022

> It's only on an entry-level account

for now....

mtsr · on Nov 10, 2022

I think it’s important to consider whether this happens with all companies, or whether venture capital seeking to maximize returns to the detriment of both the business and the user is a factor here.

wardedVibe · on Nov 10, 2022

If they plan to go public ever, its pretty much an inevitability, at least if US market incentives stay as they are. Cable isn't venture capital funded, and they're rife with profit maximization.

api · on Nov 10, 2022

Having the actual user pay is a necessary but not sufficient condition for a user centered product or service. It doesn’t guarantee the company won’t double dip but without a direct economic model nothing is even possible but ads and surveillance.

toomanyusers · on Nov 10, 2022

It strikes me that a company like Kagi should be able to craft a legally enforcable agreement with its customers which expressly forbids the company from selling ads and conducting surveillance.

The agreement could be carefully written by a skilled lawyer to define the things Kagi cannot do, the proof customers must present in order to proceed with a valid lawsuit, and even the maximum damages that the customer may sue for.

In that case, if Kagi was found at some point to be using customer data for these purposes, it could be sued very easily and by many parties.

People are calling for regulation for data privacy. In the meantime, Kagi can create its own regulation it will hold itself to for the benefit of its users, can it not?

acover · on Nov 10, 2022

Can you add a poison pill to a company where a select group of people can decide you broke one of your founding principles and hand the company over to someone else?

No VC startup would do that but if possible would allow trust.

tomcam · on Nov 10, 2022

Sure you could. Of course no serious investor would make the acquisition.

danuker · on Nov 10, 2022

Also smartphones and smart TVs.

dehrmann · on Nov 10, 2022

Even Amazon

ReptileMan · on Nov 10, 2022

Well ... I also made Netflix clear that my subscription means "no torrents, yet"

ur-whale · on Nov 10, 2022

Counter-powers is what makes the world go round.

rrwo · on Nov 10, 2022

Yes, we can't emphasize this enough.

Companies have an incentive to make money off of their customers' data, and technology just means that they can make money at every level (your ISP, your smart TV, the apps you use on it and the websites your browse).

We're so accustomed to entities gathering data about us that it's become part of the background.

Why is it the default that my home is on Google Maps/Google Earth? I should have to opt in, not opt out.

Just because there is "public" information about me as a registered voter or home owner doesn't mean that anyone should be able get a copy of this data, put it online and connect it to other data?

scarface74 · on Nov 10, 2022

Cable was never intended to be ad free. It was originally a method to rebroadcast over the air content from the broadcast stations in areas that couldn’t get a signal. You’re paying for the infrastructure to make that possible - not the content.

Complaining that cable TV isn’t ad free because you pay for it is like saying all content over the internet isn’t Ad free because you pay for your internet connection.

mrjin · on Nov 10, 2022

As long as there are good alternatives, it's fairly easy to switch, especially for something like search engine or streaming service.

D_Alex · on Nov 10, 2022

>Cable television and netflix have made it quite clear that payment does not mean “no ads”, it just means “no ads, yet”.

'It's better to be a millionaire hero than a billionaire asshole' (Ben Elton, Gridlock)

Of course at $10/month even a 1% share of the search engine market would make the company worth billions.

crocwrestler · on Nov 10, 2022

Perhaps, but the millionaire hero will likely be outcompeted by the billionaire "asshole"

sph · on Nov 10, 2022

If the only metric of success is yearly revenue, yes.

If that is the only metric, the best business model is to extract as much money out of your user, while simultaneously offering less for the same amount of money, until you reach the asymptotic ideal of 100% profit.

If you were to start a company, would you be willing to do that? Would you choose profit over everything else? Some would, but not everybody.

HWR_14 · on Nov 10, 2022

Netflix is still ad-free for me (product placements in content aside). Multiple tiers is reasonable.

lesuorac · on Nov 9, 2022

The economic argument not being watertight is the problem. This is why tech companies keep pivoting to ads, they just didn't make enough money otherwise.

Afaik [1], there's about 5 employees and the revenue only covers server expenses while they're still trying to get more headcount. Not an expert on bootstrapping but I'm pretty sure you don't want to expand faster than your revenue does otherwise you stop being able to make all the decisions for the company.

[1]: https://blog.kagi.com/status-update-first-three-months

mikepurvis · on Nov 10, 2022

"This is why tech companies keep pivoting to ads"

I think another factor is the paradox that the user who has enough money to be willing to pay to not see another ad is exactly the most valuable user to advertisers— they've pre-qualified themselves on multiple axes.

So there is always going to be an enormous moral hazard here with the ad-driven-freemium model— always a temptation to run an occasional high-value ad to the paying users, or to disguise the ad as a "recommendation" or position it instead as a "sponsorship", or whatever else. Other than users reacting swiftly and decisively against this kind of thing, it's hard to know how else to keep companies away from it.

random314 · on Nov 10, 2022

This showed up at an internal Twitter meeting this week. Mr. Musk proposed making Twitter ad free for 8$ accounts before being politely informed that 8$ accounts actually produce 50$ per month in ad revenue.

An angry bird informed me about this :)

auggierose · on Nov 10, 2022

How do they know that, given that $8 accounts must be still pretty rare given they were just introduced?

mikepurvis · on Nov 10, 2022

Because it's a pattern that manifests everywhere and can be extrapolated.

Though in this case, it's likely that Twitter has repeatedly run internal studies about what a paid-tier would look like, and each time concluded exactly what the GP is passing on: that it would be a net loss because the most-valuable-to-advertiser accounts would be the first ones to pay.

Even just psychologically, it would be a major blow to their sales pitch, going from "pay $X to put your name in front of the top movers and shakers in the world" to "pay $Y to put your name in front of all the Twitter users too cheap to cough up $8/mo to not have to see you." Not hard to see that $Y is going to be less than $X... probably a lot less.

random314 · on Nov 10, 2022

I think the analysis was for pre existing blue check accounts.

meeka · on Nov 10, 2022

One of the problems is price point. They are charging $10/month. While increasing the price will certainly decrease the number of users, a much higher price point could select for "power users" that will churn less and pay substantially more. In the long run that could lead to greater revenue.

Of course this depends on how the supply demand curve looks like for their specific business.

nottorp · on Nov 10, 2022

Catering to the whales, in casino speak.

TimTheTinker · on Nov 10, 2022

The problem is that ad revenue so often becomes the escape hatch from an economically hard problem.

Need more money? Economic model not quite working? Companies face this all the time. Some solve it innovatively, survive, and show everyone else the way forward. Others fold or pivot.

But ad revenue short-circuits the hard process of economic innovation by tempting tech companies to take an easy way out, even when the problem is solvable without it.

(All of this wouldn't really be a problem if the ads business model weren't inherently adversarial against users.)

matheusmoreira · on Nov 10, 2022

> This is why tech companies keep pivoting to ads

The reason they advertise is because they can. It doesn't matter how much money they're making now, they can always make more by advertising. Paying money just makes your attention more valuable. The only way to stop them is to make it impossible or illegal to advertise. Blocking ads everywhere and making no exceptions.

wardedVibe · on Nov 10, 2022

When google started, they were having difficulty staying solvent until they added ads, even though they had a better product than the other search engines. Its a space that's ill-suited to market solutions.

matheusmoreira · on Nov 11, 2022

If their solution to insolvency is advertising, then let them be insolvent. We're not gonna keep enabling this behavior.

BasilPH · on Nov 9, 2022

They mention that users can invest in them through SAFEs. This only makes sense if they plan an exit, either by selling or through an IPO, or am I missing something?

simonbw · on Nov 9, 2022

I think it implies that they intend to take VC money at some point in the future, like a series A, and they’re trying to avoid VC money now while they can get better terms. I don’t know if that necessarily means they plan an exit, or if they’d be happy just being profitable, but I do think it implies they plan to grow significantly.

EDIT: Their website says that they don’t plan to take VC money, so I have no idea what a SAFE means in this case. The point of a SAFE is that the investor is guaranteed the best terms when a priced round of investment happens.

freediver · on Nov 9, 2022

Other options include going public or paying out dividends.

lmeyerov · on Nov 10, 2022

Going public is a qualified event for a SAFE to convert -- it is a financing -- but curious about the dividends

SAFEs are often capped, so taking awhile doesn't matter so much, except maybe interest calcs

freediver · on Nov 10, 2022

If a company is doing sustainably well, supposedly you could have a mutually agreeable mechanism to convert SAFEs to equity (at cap for example) and treat shaleholders with dividends.

wardedVibe · on Nov 10, 2022

Is this some sort of hack towards a co-op? I couldn't find anything about it on their website

pixodaros · on Nov 10, 2022

"Enough money" is usually defined by venture capitalists who want ALL THE MONEY and not by normal investors who want a modest predictable return. Patreon, for example, could have been a modestly profitable business which owned itself, but instead they keep flailing and spending lots of other people's money to satisfy the next batch of investors that one day they will be as rich as Google.

twobitshifter · on Nov 10, 2022

The sad truth is on average advertisers will pay more for your attention than you’ll pay not to have ads. So people continually complain about YouTube ads but ignore their option to pay to have them removed.

widdershins · on Nov 10, 2022

I pay for YouTube too. Between that and Kagi search, it's ~$20 per month. Which seems crazy in today's environment of 'everything free'. But seriously, I use these both all day for both work and play, but I only watch an hour of Netflix a day. So which is really more valuable?

I agree that on the whole, people will continue to go for the free option, but I'm just glad that some options are appearing to pay, which rebalances the incentives a bit.

t0bia_s · on Nov 10, 2022

Why wasting money on "feature" that can be done with same result for free? uBlock, NextDNS, pihole, etc. No-one is asking for ads by default, ever.

syntaxfree · on Nov 10, 2022

With YouTube premium you can listen to videos on the iPhone with the app in second plane or even with the phone locked.

Yeah, I should use a hacked rooted Android yadda yadda. But I just spent a week hacking and hawking for my life in an ICU, and I certainly didn’t want sanity-saving podcasts to shut down whilst on an hour-long stay at the breathing machine. Life is short.

cardanome · on Nov 9, 2022

While I am very willing to pay money for a user-centric search engine the requirement to do so directly conflicts with the greatest must-have feature that I need from one: Anonymity.

Now the issues is not just privacy but making sure it does NOT optimize search results based on prior searches or other facts it knows about me.

Now the big problem is that one would think delivering the exact search results that I am looking for is a good thing. No. It is super dangerous. It can causes me to live in a bubble, blinding me of possible opposing opinions. It is feeding my ignorance.

So there is a goal conflict here. I don't know how to solve it. I wish there would be something like Wikipedia for search engines but even the wiki models has its problems and biases.

wruza · on Nov 10, 2022

It can causes me to live in a bubble, blinding me of possible opposing opinions. It is feeding my ignorance.

This made internet completely unusable for discovery. A big shiny window into another tiny room. Absolute BS.

Recently I discussed it here on HN and was advised to like, dislike, subscribe and so on on youtube (things I rarely have done on my old account) to break out of this bubble. Well, it doesn’t work. It just suggests the content I immediately dislike or ban entire channels and it all settles to the same echo chamber as before.

kqr · on Nov 10, 2022

I think many platforms optimise for interactions, with no regard for whether they are positive or negative.

wruza · on Nov 10, 2022

As a result, I often find myself closing youtube tab because there is not only nothing to watch, but nothing to scroll through even. On a platform with billions of videos, at least thousands of which I might like to watch if given a chance. I find this incredibly stupid.

Same for google search, really. It turned from “I’ll find you that if it exists” into “Look, if you change your query to ‘cool products if at all barely relevant to …’, I have literally billions of results for that”.

memen · on Nov 9, 2022

I like the idea. Maybe a 'Search for me' button instead of the current 'I'm feeling lucky'. Or two columns or tabs for personalised and general results. Sounds easy, but I am not aware of any search engines that offer this, unfortunately.

On the other hand, a lot of the search results bias stems from the search query itself. "Does X cause Y" for example will show results that echo the search, and not show opposite results. Simply because you did not ask the opposite or a more open/unbiased query like "X Y".

g_p · on Nov 10, 2022

I think if you can assume some level of trust in your search provider (i.e. through payment - let's get to that in a minute), without which you should probably be looking for a new search engine, then you can trust the search engine to not optimise results for you. If they tell you they don't, then do, then you likely have deeper-rooted issues with the search engine you use.

On the point about payments, there are some potential approaches to handling this, but they tend to be a bit complex and technical. I"m not sur ethis is really a technical problem though (or at least one where a technical solution is what's needed).

A challenge of taking traditional payments is that the identity of the payer is inherently "linked" (to some extent) to the transaction. You could maybe use a privacy.com type intermediary or similar.

What you could do however, is buy a "kagi voucher" or similar, which would be an RSA-blind-signed token that you generated client-side in JS. You'd send it to the server during the payment transaction, and be given back a signed version of the token, which you unblind client-side, then give a backup copy to the user to save, and set it as a cookie to authenticate payment has been made.

That should, in theory, at least un-link the payment from the user. Ultimately though, a bad actor can still trivially correlate your searches (and payment, if you used same IP or browser fingerprint), and all your searches are linked by a common cookie.

So you decide to create some kind of token fountain, which you can auth to with a blind-signed token, and get 50 "search passes" from. Those could be unique per-search, and just be "bearer tokens" you can use. You'd have to trust that this token fountain doesn't record which search passes were issued to which "blind token".

Ultimately, it feels like this problem then becomes "do you trust your search engine?" - I'm not sure this is a problem that technology needs to solve, as ultimately your search provider is seeing your queries and IP, and most users won't realistically change this. A provider that isn't incentized to act against your interests (i.e. something like Kagi which isn't advertising to you or selling data) seems to address this for most user scenarios.

cardanome · on Nov 10, 2022

I don't think there is much cause for worry that Kagi themselves might use the data for anything bad BUT data leaks do happen and more importantly Kagi will need to play ball with whatever government that exists in the markets they want to have users in, be it the USA but also China, Russia or whatever.

We already know that Google is in fact cooperating with these entities. Not only by giving out user data but also by actively censoring certain things. So there is a precedent.

libraryatnight · on Nov 10, 2022

It sounds like you use the internet way different than me. I can't remember the last time I searched something where the end result wasn't mostly objective. Documentation, directions, tour information, movie times, programming questions, patch notes. Maybe seek contrary views out in the world. The internet is a terrible place to form a world view and to really communicate in general.

t0bia_s · on Nov 10, 2022

Agree, however we are already in trap of algorithms. Look how few years of using social networks spread ideologies that would be laughable back then, ie "there is more than two genders" or "earth is flat".

Fire-Dragon-DoL · on Nov 9, 2022

I thought: couldn't the details about the user live on the client? And the user could delete, modify, decide not to use them.

This way you get anonimity and custom results

Arnavion · on Nov 10, 2022

That doesn't work unless the server sends all 1 million results (say) to the client up-front. Otherwise, if you prefer results from example.com but the server only sends example.com results on page 3, your client has no way of showing you those results on page 1.

Fire-Dragon-DoL · on Nov 10, 2022

I think I've been misunderstood, in my mind the client sends all the details about themselves to the server, the server builds up the search results and send them back.

Yes, the server has access to the information the client has, but it doesn't have any way to confirm that's true. It also could avoid storing that information all together if the client sends it again every time

ukd1 · on Nov 9, 2022

I'm with you; I'm a paid Kagi user for a while now, and unlike various attempts at switching to duck duck go (et al), Kagi is actually fully working for me. I do not switch back to Google more than once a month, and each time I've gotten "worse" results so far.

I'm sold, literally.

prox · on Nov 10, 2022

Do you stay under 30 searches a day? On their pricing plan it seems users searched a lot more in their beta program than anticipated. 240 searches cost them around 3 dollars.

jongjong · on Nov 10, 2022

I just tried Kagi and was surprised that my open source project shows up in the results for relevant keywords. All the other search engines I tried bury it completely out of view no matter how specific the keywords are. It makes me wonder how many other indie projects like mine are excluded from mainstream search engine results.

Semaphor · on Nov 10, 2022

See if the founder’s side project Teclis [0] (non-commercial search) finds it. Kagi uses the project’s index in addition to Google and Bing.

[0]: https://teclis.com/

jongjong · on Nov 10, 2022

Yes, it finds it. It ranks quite well for most keywords I tried. It even got #1 spot for one of the keywords I tried. I also noticed some other related open source projects show up (by other people) which I hadn't heard about for several years.

Google search only shows paid SaaS platforms in the first page of results.

ipaddr · on Nov 10, 2022

"At USD $10/month, the price does not even cover our cost for average use"

240 searches cost them over $10.00. They will need to find additional revenue or reduce costs to even reach a break even point.

They will have to raise prices or make you pay per search or sell your data or better yet all three.

function_seven · on Nov 10, 2022

I wonder how much the per-search cost is marginal cost vs. amortized fixed costs.

If their paying user base grows by 10x, do the searches drop from 4¢ each to 2¢ each?

Tumblewood · on Nov 10, 2022

Most of their cost comes from API accesses to other services (e.g., Bing), so unless they can replace those API calls the price cannot go down very much.

Gareth321 · on Nov 10, 2022

API access is negotiated by volume, so as they grow, they can negotiate preferential pricing.

loonster · on Nov 13, 2022

> Our proposed price is dictated by the fact that search has a non zero cost. With other search engines, advertisers cover this cost. But it costs us about $1 to process 80 searches.

.0125 per search. The teams rate is twice that.

prox · on Nov 10, 2022

Or give notice to power users that they are exceeding quotas by a lot. Have a power user subscription perhaps. Many people don’t download a lot and ISPs are good, but you have outliers, datahoarders perhaps who take a lot of bandwith.

Invictus0 · on Nov 9, 2022

Google's revenue is over 280B annualized from last quarter's numbers. Let's just assume that every person on earth is using Google at an equal rate. That's $35 worth of revenue per user per year. There is no way that you can come close to that without doing ads; the median global income is less than 4000 international dollars per year. So naturally, there is a strong pressure for companies to pivot to ads once they run out of other ideas to grow the business.

leoh · on Nov 9, 2022

I very much disagree with your premise

* one does not need $200-300B ARR to run a search engine; and merely making a ton of money does not mean that the way that one has been making that money will continue forever

* using Kagi, one realizes how much the web has been perverted, in my opinion, by the obsession with the ad model; there are many nice, novel things that Kagi could introduce which would reduce ad revenue for a company like Google, but would be useful and novel for Kagi without loss in revenue (on the contrary — could help); ie google is incentivized to keep people on the site through various irritating means; a search engine that isn’t distracting and gives me what I want — I would be more than glad to pay for (and already do pay Kagi $10/month)

Other thoughts:

* Supposing $10/month (honestly I would consider $20) with 100M paying users; that is $1B MRR which is not bad at all and is more than plenty for a meaningfully sized team with meaningful salaries.

* Just like how lowered energy prices during and after the Industrial Revolution made manufacturing at scale feasible, the incredible amount of high quality OSS and infrastructure these days is making it increasingly feasible to do things Google did 20 years ago — something unthinkable for even the best engineers of that era. Not to mention the relative ease of collecting capital with payment services (let alone VCs, etc.) today — even with a looming recession factored in.

gbear605 · on Nov 9, 2022

> $10/month (honestly I would consider $20) with 100M paying users

That would actually be $12B ARR, since that price is per month while ARR is usually per year.

leoh · on Nov 10, 2022

Oops, yup, just came back to fix this; ha!

idlehand · on Nov 10, 2022

Looking at the global median income and coloring it with the mean revenue per user paints an incorrect picture. Advertisers are not paying even $35/yr for the attention of people making the global median wage.

US attention is the most valuable because it is the richest large nation and the largest rich nation.

European attention comes a distant second. Together they have 40% of the global GDP, but likely something like 60-70% of the consumption of non-essential goods in nominal terms. Accordingly, average advertiser cost per clock differs ~10x between the cheapest and most expensive groups of countries.

bratwurst3000 · on Nov 9, 2022

what I find strange is that selling ads makes more money than selling service. Because at the end the people watching the ads are buying the stuff advertised so those who adverstise have more money for advertising…. So for it to be rentable people have to buy way more than the ad revenue google makes. If spending for google ads is 1% of bussines budget and budget has to be at least Revenue……..with 280 billion revenue for google it would be 280.000.000.000.000 that we as humans have to spend for stuff that was advertised on google so someone pays google for ads so they can run google search etc ….

35 dollar is maybe ok… …. Maybe better than spending 3500$ on … I dont know … stuff is good also ..

philwelch · on Nov 10, 2022

Advertising is kind of a backdoor price discrimination in that way. If you can afford to spend $10,000,000, people will advertise wealth management services to you. If you can afford to spend $1, McDonalds will advertise their dollar menu to you.

aflag · on Nov 10, 2022

If you show an ad for a car that costs $100k to 1000 users and a single user buys it (0.1%) you still sold a car for $100k. Let's say it costs $90k to make it. You could pay $5 per ad displayed and still made a profit. My point is, the people who actually buy the products are actually paying for the other people who can't afford it. So, the model doesn't rely on everyone buying a keyring from a car dealership, but on most people not buying anything, and a couple buying a car.

In a sense, that model is better at distributing income, because everyone gets a service and the people who can afford pay for everything. If everything was behind paywalls, people who can afford would have all the nice services and people who can't, will have nothing.

What's interesting is that "targeted ads" is a bit of a misnomer. It's more of an ads ranking, where the user is shown the ads they are most likely to buy. However, from the point of view of the car dealership, when they think targeted ads, they would like their ads to be shown only to people who will likely buy the car. No company would want their ads shown to someone with no disposable income, yet, they will be shown some ad.

ZephyrBlu · on Nov 10, 2022

They're selling distribution. Why is it strange to you that distribution is valued so highly? In a lot of ways it's more valuable than a product.

dismantlethesun · on Nov 10, 2022

Isn’t Google revenue from more than search?

They have a full business suite, App Store, hardware sales, fiber internet, wireless cell and internet, and a server rental division.

nottorp · on Nov 10, 2022

> ... once they run out of other ideas to grow the business.

Yep, but the main problem here is the perverse pressure to grow indefinitely.

And that comes from it being acceptable for public companies to pay no dividends and instead rely on share price increases to deliver a return on investment.

Something that slows down this growth spiral would fix things partially. Require profitable companies to pay a % of the profit as dividends to their shareholders?

csomar · on Nov 10, 2022

I just tried it for a few terms that Google found no relevant results for (related to Wasm/Rust); and holys*t there were many results from people (developers) who wrote in their blogs.

I'm a user now, and potentially a customer if I hit their free limit.

rhn_mk1 · on Nov 10, 2022

The reason that I'm not using Kagi (and some of my paid subscriptions) is that payments require login, and login makes it possible to track my search history. I'm happy with paying them with money, but not with my keywords.

Is there some zero-knowledge protocol that could help here? In particular, to establish that a user is currently subscribed, without revealing which subscription they have?

bawolff · on Nov 10, 2022

Anonoymous credentials covers that use case pretty well.

anoy8888 · on Nov 10, 2022

Very naive . It just means that in addition to whatever google do with your data , they also charge you money on top of that . When people have power to do things to maximize their gains (selling u ads/ data in addition to charging u ) , expect that it will be done. It is the same in politics. Power corrupts.

gnramires · on Nov 10, 2022

I believe it's a responsibility of organizations themselves to orient this way, and for customers to be vigilant about this orientation. Does your search engine have weird incentives to exploit you? Change search engine :)

----

I have an idea (more here[1]) that the next addition to our civilization should be a distributed system to evaluate, reward and compensate externalities. Open source software is a great example. It's free, it has massive benefits to users and for people reusing it for all sorts of projects, it's public. Yet we still can't find a great way to fund it. There are donations, and maybe that's all we need in some sufficiently enlightened society, but I think we should be addressing those more directly. And clearly there are technologies which generate friction and other social costs, that could be (differentially) quantified and prices. The same goes for pollution and a miriad of issues.

Laws can help here, but laws tend to lack softness and specificity in my opinion, and laws are strictly punitive. If you're generating a negative externality, and the legal system is well designed, maybe you'll have to compensante for it. But that relies on quick lawmaking and judges evaluating something they're not specialists at. And there's no dual for punishment (actually compensation and reeducation), there's no intrinsic reward system. There are prizes and grants, and I'm sure they generate immense value for what they are, but they're still very unsystematic and unreliable, I think.

I dream of a system where if you generate value for society, you will be recognized and make a comfortable living off it; if you need investment, you will also be supported. (I call this idea Elementalism; I'm not sure it's original)

I think we need to be a little more open to careful, thoughtful changes to our social-economic system that can improve things.

While Elementalism isn't a formal part of our system, each of us can do our part and give directly to what we understand needs the most. This is the core idea of Effective Altruism which I also extend to donating to Open Source (which I believe can be effective[2])

[1] https://news.ycombinator.com/item?id=29043752

[2] https://www.reddit.com/r/EffectiveAltruism/comments/v7ma0d/w...

knicholes · on Nov 9, 2022

Oh, but you CAN tell if they're sending those keypresses back to the mothership by monitoring the networking tab in your browser.

tyingq · on Nov 9, 2022

Maybe. They can hold batches and smuggle them in various ways.

knicholes · on Nov 9, 2022

If they have an auto-complete feature, the keypresses WILL be seen in not even in a smuggled way. They'll be directly in the URL to the search function!

corobo · on Nov 9, 2022

You could probably hide that from anything but a deep dive by chucking the data through a websocket. Unless you see and recognise the initial socket connection you may miss it entirely.

lobocinza · on Nov 10, 2022

I'm worried that I need to be logged-in and expose my CC to use such service. Nonetheless most of us use Google Search in a similar way.

sergimas15 · on Nov 9, 2022

awesome

aabaker99 · on Nov 9, 2022

I just signed up and tried it a little bit and I like what I see so far. I find myself increasingly frustrated with Google search results for a particular use case: searching for documentation. For example, today's work had me thinking about Python's datetime and timedelta and I wanted a reference on what functions are available. With Google I am annoyed with results from geeksforgeeks.org and freecodecamp.com because they are not reference materials and generally only cover some basic use cases. In Google, those two sites are in the top four results. In Kagi, they are not. Instead, there is a longer-form blog post from guru99.com, stack overflow, and the official Python documentation.

Now, I will admit that for this particular query Kagi and Google results are pretty close. But my general experience is that when I search in Google I find that I have to look farther down the search results to look past the blogspam to find the authoritative reference.

VWWHFSfQ · on Nov 9, 2022

The blogspam has made Google and Bing/DDG almost completely unusable for technical searches.

Go search for something like "postgres cte" and you won't find anything useful until probably halfway down the page. And maybe not at all.

ryandrake · on Nov 10, 2022

"Do you want to learn about blogspam? If so, you are on the right page. You will learn about blogspam here. One of the most major things about blogspam is that it exists on the Internet. You just learned that blogspam exists on the Internet. In this way, you have become educated about blogspam.

Now that you know about blogspam, we'll move on to the next topic: How to find blogspam. It's actually very easy to find blogspam. You are on the right page if you want to learn about finding blogspam..."

pjc50 · on Nov 10, 2022

This killed the blogosphere, and will kill any decentralized system that becomes popular unless they can do something extremely clever.

Instead of a dark forest (https://en.wikipedia.org/wiki/The_Dark_Forest) think of the outer internet as a fake forest. If you wander off the beaten path - the same dozen sites that everyone uses and complains about - you wander into an endless, trackless zone of fakes all of which are ultimately trying to sell you something.

probablypower · on Nov 10, 2022

Thanks for the fun analogy. It is right on the money.

Made me imagine the invention of the internet as the big bang, and that now we are watching the expansion of e-space as all the useful bodies rush away from one another and the light-years of space between them is filled by a vacuum of usefulness.

Maybe one day the search for intelligent life in space will be easier than the search for intelligent life on the internet.

dpkirchner · on Nov 10, 2022

This is also giving me Microsoft "independent advisor" vibes.

hi41 · on Nov 11, 2022

That’s funny. Is that how pages go up in rank? Make many references to the same word and make circular arguments?!

booleandilemma · on Nov 9, 2022

I searched for that (without the quotes) and this was the 2nd result:

https://www.postgresqltutorial.com/postgresql-tutorial/postg...

The first result was the official documentation.

What else are you expecting?

projektfu · on Nov 9, 2022

Google search always gives lousy results, except when you have complained about it, in which case people who check your work always get optimal results. /s

dylan604 · on Nov 10, 2022

I see this from time to time myself where someone points out how bad the results are, but different results for me when trying the example. However, there are certain other things that I have searched for myself that absolutely resulted in crap results from SEO/blogspam type of results. So I know 100% it happens.

What I'm wondering is how much of your recognized fingerprint influences the results? What causes results to be different from user to user using the same search query?

mda · on Nov 10, 2022

Typical. Some people always complain about Google search results nowadays but I very rarely see actual bad examples.

seaman1921 · on Nov 10, 2022

he was expecting nobody to check his BS

stevewatson301 · on Nov 10, 2022

What kind of results do you get when you Google the term? For me, the PostgreSQL documentation[1] came out on the top.

[1] https://www.postgresql.org/docs/current/queries-with.html

andirk · on Nov 9, 2022

For HTML, CSS, Javascript, and browser APIs, I simply add "mdn" to the search to guarantee I get the official-ish MDN docs. From there, I can dive in to W3C specs, etc if needed.

Those training wheel search results are annoying but they're highly ranked probably because most people like and use them.

overlisted · on Nov 10, 2022

There are also Chrome and Firefox extensions to remove results from W3Schools specifically

Semaphor · on Nov 10, 2022

For me, #1 is the official documentation, #2 is a decent looking tutorial, #3 has a slightly better page on google, #4 is a clear kagi winner.

Neither of them offers bad results, but we are talking about google, it makes no sense to give example searches without saying what results you receive as Google is so heavily personalized.

Kagi does personalization, but it’s explicit. You yourself decide the region to search in (my default is international, though there are quick bangs to search in other regions), and you can up- or downrank domains, as well as block them completely.

holoduke · on Nov 9, 2022

Always add the keyword 'forum' to your search terms. It filters out most of the crap.

paulmd · on Nov 10, 2022

"hackernews post, deeply knowledgeable, FAANG dayjob, startups with VC funding, increased conversion rates, trending on artstation"

adzm · on Nov 9, 2022

Maybe it's just me but search results for that were very helpful in my case. Official documentation, stack overflow questions, an informative blog about cte gotchas, all within the top half of results.

mda · on Nov 10, 2022

Just Googled it, the first result is official postgres documentation, second is a tutorial, what exactly you get / expect when you search for it?

nottorp · on Nov 10, 2022

I just duckduckgoed it and 2nd and 3rd results are tutorial spam. 2nd is geeksforgeeks...

freediver · on Nov 10, 2022

Here are the results for that query on Kagi if you want to compare:

https://kagi.com/search?q=postgres+cte&r=us&sh=5-n8GUySt5qmx...

Calamitous · on Nov 9, 2022

Another nice thing about Kagi is that you can eliminate domains from your search results. Obvious content farms are pretty easy to spot and remove.

AndroidKitKat · on Nov 9, 2022

My favorite feature is being able to boost a certain domain up in the the results (or even pin it if you really would like). I often search for different Pokemon and prefer the information that a site like Serebii.net gives me over something like Bulbapedia.

Larrikin · on Nov 10, 2022

Is there any reason why? I'm not heavy into the game anymore but usually when I'm searching Pokemon Bulbapedia has the information I want.

AndroidKitKat · on Nov 10, 2022

I think it's just because I prefer the organization of the information, giving you things like the type chart up front and center.

It's also the site I used more as a kid so there's probably some loyalty bias.

nmstoker · on Nov 9, 2022

Yes, eliminating and boosting favoured sites are both excellent feathers on Kagi

davemp · on Nov 10, 2022

That should be a good signal for ranking algorithms as well with the subscription price disincentivizing bots.

kbyatnal · on Nov 9, 2022

IMO this article misses the biggest issue with search engines today. It's less about any ranking algorithm (like PageRank), but rather the root cause is with indexing. No matter how much you fine tune your ranking system, if your index is filled with SEO junk and blogspam, you're fighting a losing battle from the start.

For some reason, a lot of these search engines like to brag about the number of documents in their index, which never made sense to me. Maybe it was true in the past, but on the modern web, larger index !== better results. In fact, I'd argue the opposite since you're much more likely to serve SEO spam.

That was my motivation to start hacking around on CrowdView (https://crew-rho.vercel.app), a search engine specifically for forums and discussion content (e.g. forums, discords, twitter, reddit, etc). It has a curated index (today, curated by yours truly) to remove SEO junk and help you figure out "what does a real, genuine human think about this think?"

probablypower · on Nov 10, 2022

Not to be rude, as I appreciate the motivation behind your work, but your post ironically followed SEO junk format:

1. Enter the discussion on a human level

2. Hook onto existing context

3. Argue issues with existing context

4. Present your own software as a solution to these issues

It is like a date that only wants to offer MLM business opportunities, or a long-form joke that ends with a disappointing "better nate than lever" punchline.

mjburgess · on Nov 10, 2022

In general scams work because the form works, and should work, but the content is an illusion.

You could equally say everyone's following Aristotle's Rhetoric.

Semaphor · on Nov 10, 2022

Kagi has a feature, lenses, which includes a discussion lens. It’s pretty much that, results from forums and reddit.

smt88 · on Nov 10, 2022

For a long time, I've been considering a different solution to the problem, which is to create a human-curated whitelist-only search engine.

I do think your idea is great and something with a lot of value for many use cases.

The only issue is that it won't surface certain things like restaurant contact info and Wikipedia that people often need to be a top result.

dchuk · on Nov 10, 2022

"a human-curated whitelist-only search engine"

This is a good idea. Basically harkening back to the old internet directory days, combined with the powerful indexing tools we have now, and then use all the great language models and topic modeling tooling to make the querying great.

Said another way: moderate and protect the index, instead of trying to clean up the results of queries.

You'd need to monitor the whitelisted sites in various ways to make sure they don't bait and switch and turn to spam later, or get hacked and go to shit.

You might even be able to monetize on the publisher side by allowing sites to pay to be indexed faster/more regularly, which could potentially (but not totally) help with spam control too. Every publisher wants traffic.

You might be able to jump start the whitelist by getting folks using a plugin so you can know what domains they spend time on in the first place and index those when signal strength warrants it. Gotta be careful about that being gamed too though. Also I think this is actually the core principal of Brave's search engine come to think of it.

smt88 · on Nov 10, 2022

> You might be able to jump start the whitelist by getting folks using a plugin so you can know what domains they spend time on in the first place and index those when signal strength warrants it.

I actually think a browser plugin is a core component, especially if people could opt in to contributing their pages to the index (with a method to make sure they're not logged in to anything).

The plugin would also allow people to flag websites as spam/scams/low-value, regardless of whether they're in the index or not.

ZephyrBlu · on Nov 10, 2022

> combined with the powerful indexing tools we have now

What powerful indexing tools exist today? I have little context on the search domain but it's very interesting.

smt88 · on Nov 10, 2022

I'm not sure, but it sounds like they mean that building and quickly searching an index has been democratized. You can now use any number of FOSS projects or SaaS offerings to do that, depending on how big you want the index to be and how much you want to spend.

dchuk · on Nov 10, 2022

That’s right. Back when google was coming to be, building a search index was basically computer science/grad school research territory. Now there’s tons of good starting points that you can throw money and hardware to scale quite a bit to at least prove out a product concept.

ZephyrBlu · on Nov 11, 2022

Which starting points are those? I'm familiar with some open source search engines, but not open source solutions for crawling and updating a search index of the web.

nagonago · on Nov 10, 2022

I like this idea in theory, but I think it would be difficult to scale up enough to be useful.

Who would be the curators? If you open up curation to volunteers, it could be gamed by bad actors. However if you have too small a team, the results will be limited and biased. For example, favoring the English-speaking or tech-sphere web while ignoring large sections of the web with which the curators are unfamiliar.

Perhaps machine learning could help with scale - start with a human curated dataset, then train a model on it. However that could end up getting gamed too.

larksimian · on Nov 10, 2022

I think curation by paying users could work. At least it would make manipulation quite expensive.

Let people downvote/upvote search results as an extra strong signal on top of monitoring what they choose to click on in the result list.

Also let people block domains from searches and provide some discoverability of commonly blocked domains.

smt88 · on Nov 10, 2022

> Who would be the curators?

That's a key question. You don't necessarily need that many of them, though, if you're whitelisting at the domain level. A few dozen people could work giving a rating to many thousands of domains every year. An ideal number might be in the thousands.

I'd also say they probably shouldn't be in industry, because they'd have an incentive to game the index.

> Perhaps machine learning could help with scale - start with a human curated dataset, then train a model on it.

While this is true, I don't think search engines other than Google and Bing need to worry that much about being gamed. It's just not worth the effort.

kbyatnal · on Nov 10, 2022

yup totally agree, but I think it's nearly impossible to compete with Google and make something much better for certain types of queries (e.g. factual info, location queries, etc). They have too much data and too much of a head start.

The best way to compete with them IMO is to choose a slice of queries that they perform very poorly at (like non-SEO spam results) and build something 10x better. But that brings up the challenge of getting users to remember to come back to your search engine.

It's tricky for sure, with no clear answer. But for the first time in a long time, I sense that we're approaching a tipping point for Google's dominance over the web.

smt88 · on Nov 10, 2022

> it's nearly impossible to compete with Google and make something much better for certain types of queries (e.g. factual info, location queries, etc)

Some of these things can be outsourced to other services. An easy Google fallback is also a common solution for new search engines.

> But that brings up the challenge of getting users to remember to come back to your search engine.

This wouldn't really be something I'd build as a business. I'd create a nonprofit, fund it with subscriptions or donations, and cover the losses myself. I really just want it to exist for myself, and if other people used it, that would be icing on the cake.

dylan604 · on Nov 10, 2022

>which is to create a human-curated whitelist-only search engine.

You might be interested in watching the series "Halt and Catch Fire".

freediver · on Nov 10, 2022

> "Halt and Catch Fire"

Another fan!

smt88 · on Nov 10, 2022

I'm not interested. Why would I be?

I also don't need fiction to know what a human-curated search engine would be like, because I was using the web in the 90s when we had things like Dmoz.

dylan604 · on Nov 10, 2022

Well, you can be a curmudgeon and not enjoy someone's recommendation for a decent bit of entertainment, or you can say just politely carry on and be a grouch in the living room of your trashcan, or you could maybe try something new and actually consider watching something you haven't seen yet.

edit: softened the tone oh so slightly

smt88 · on Nov 10, 2022

Sorry, I misinterpreted your suggestion as, "This is not a novel idea, and you should watch how badly it goes when someone on this TV show tries the same thing."

I didn't insult you or your suggestion, though. I said I wasn't interested in the show and that it doesn't inform my understanding of the concept.

Why did that hurt your feelings enough to make insulting accusations about my personality and character?

dylan604 · on Nov 10, 2022

"I'm not interested. Why would I be?"

That just reads as a rude way to respond to me. "Why would I be?" Because you just described a plot line of a show. It might be fun to see how someone used that plot to make a show that someone reading HN might actually enjoy.

Both sentences combined also reads as "I am perfectly content in my ways and will not listen to anything anyone might kindly suggest because I know all". Maybe because it's election season, but this unwilling to listen and stonewall is quite tiresome. So maybe just rubbed me the wrong way because of everything else and might not been taken the same way at a different time?

croes · on Nov 10, 2022

Fiction maybe helps to see another point of view

albatrosstrophy · on Nov 10, 2022

I like your idea of the search engine. It's what I miss from Google. At one time (2006?) we had the option to google search only forums and discussions, and that helped me get quality answers to my queries.

_aavaa_ · on Nov 10, 2022

On Kagi you as an individual can up rank and down rank websites (even hiding them entirely).

They can conceivably use this information to do the same thing, curare the results and change their ordering

yarg · on Nov 9, 2022

It has been for a long time.

PageRank was never designed for adversarial scenarios.

It reminds me of KPIs like lines of code - it's only useful if it cannot be manipulated.

"When a measure becomes a target, it ceases to be a good measure."

https://en.wikipedia.org/wiki/Goodhart%27s_law

drc500free · on Nov 9, 2022

I'd say rather that it wasn't designed for modern adversarial scenarios. Google won the search engine wars because PageRank did better in the late 90s adversarial environment, where every page was packed with white-on-white keywords. It was significantly harder to create a counterfeit influence graph than to keyword spam with hidden text elements.

That all seems pretty quaint these days, but even worms and viruses at the time were versions of "my goodness, who would ever write a script that emails itself to your address book, or copies itself across the network to all the other unsecured PCs with passwordless full hard drive access?"

PaulHoule · on Nov 9, 2022

PageRank is gameable but harder to game than some of the competing algorithms (such as a straight link count.)

The real consequence PageRank had was that it got web pages to stop linking to each other…. Google gaslighted people into removing the competition they could have had navigating from one page to another through links. (e.g. web directories)

DaiPlusPlus · on Nov 9, 2022

Google didn’t gaslight the web: Web directories died out because they go stale, fast, and require far more human curation than is economically viable. I remember Yahoo effectively deprecating their directory pages long before I heard of Google, back when I was using Dogpile and CompuServe.

PaulHoule · on Nov 10, 2022

I never saw a web directory that took a serious approach to using automation to curate the directory. It's probably more feasible than people think because it's a matter of classifying links up or down (relevant or not) which is a much more ontologically and mathematically tractable problem than learning a ranking function or trying to classify things into one of N>2 categories.

I think the problem may more have been the lack of a sustainable revenue model. Getting volunteers to curate the directory is particularly destructive because the people who most want to volunteer either (1) want to promote something or (2) want to get paid so somebody else can promote something.

danans · on Nov 10, 2022

> I never saw a web directory that took a serious approach to using automation to curate the directory.

That's what search indexes are. Automatically curated directories.

It's just that the UI to the index is natural language queries instead of clicks through a graph, because the latter isn't scalable to large corpuses.

PaulHoule · on Nov 10, 2022

No, I think a modern directory would be a set of topics in an ontology with links. If I had to seed one I would suck all the external links out of Wikipedia, impose some organizing structures (probably overlapping trees or dags) and then build a set of classifiers for nested topic relevance, spaminess, etc.

There are certain sites which have a landing page for every topic in some set of topics, you could add a lot of links quickly if you built rules for importing links from particular sites. Adding 10 sites a day with 1000 links each would be very possible, in 100 days you could build out a million links.

candiodari · on Nov 10, 2022

Exactly how do you think Google (and Bing, and ...) work? They do start from known indexes, in particular wikipedia. Hell, Google even has internal papers where they claim they improved search quality by "wikipedia's" as a unit.

PaulHoule · on Nov 10, 2022

Actually Microsoft bought a company called Powerset which did information from Wikipedia to build a "semantic as in semantic web" index, that technology became the heart of Bing.

Google was caught flat footed and wound up buying and killing Freebase in order to catch up. They lied and said they rejected "semantic web" approaches despite hiring one of the leaders of the Cyc project as their head of research.

Still there is a difference between exposing that kind of database through a full text index vs exposing it through a browsing interface.

davemp · on Nov 10, 2022

I consider Reddit to be a form of web directory in a way.

PaulHoule · on Nov 10, 2022

I can't use it though because I've got this disability that memes cause me extreme distress.

naasking · on Nov 10, 2022

That sounds like a meme.

Shorel · on Nov 10, 2022

Every idea is a meme in the classical sense.

314 · on Nov 10, 2022

Panik!! Kalm. Panik!!

ctippett · on Nov 10, 2022

It's not quite the same as the web directories of old, but I find the links and content contained in the "Awesome [topic]" lists on GitHub to be a useful resource. The curation aspect is managed via pull requests just as any other open source project, although this doesn't make it immune from going stale.

PaulHoule · on Nov 10, 2022

That's a good example and they aren't overrun with spam.

wolpoli · on Nov 9, 2022

Indeed, after Google, links slowly disappeared and no one could really "surf" the web anymore.

jsemrau · on Nov 9, 2022

Was researching Ad spend as the ad-tech industry is falling apart. https://imgur.com/gallery/1Aw3t4i This is not a search result anymore. This is an outrage.

rgbrgb · on Nov 9, 2022

There's maybe a good argument there for blackbox (non-interpretable) ranking algos.

ketzo · on Nov 9, 2022

How would you really blackbox a search ranking, though? A website owner can always just search for themselves, and see how their ranking changes.

When there’s so much money at stake, people will go to great lengths to reverse-engineer the things that put them higher in the page, no matter how you try to hide the levers that make the rankings work.

rgbrgb · on Nov 9, 2022

I mean blackbox in the sense that I can't explain how it works or look under the hood and understand it. Many machine learning models you want to be explainable for UX or debugging (e.g. Netflix's "because you watched X"). This is a rare case where it's better if you can't figure out how it produced a result.

Instead, you want to throw queries and user behavior into a blackbox algo and have it tell you a result then give it feedback on whether the result was good (did the user come back and ask the same question? did they click the top result and leave or did they have to come back and click through many more?). I think this is kind of how google works now, though results are frequently meh. Millions of backlinks will get you noticed but your ranking will just keep dropping if users don't appear to find your content useful (e.g. they hit the back button a lot and keep searching).

dorgo · on Nov 11, 2022

>(did the user come back and ask the same question? did they click the top result and leave or did they have to come back and click through many more?).

What stops me from writing a bot to simulate that behaviour for my website (or websites of my competitors) ?

PaulHoule · on Nov 10, 2022

Google has a big bag of tricks to gaslight webmasters, they even have patents that describe some of them.

One form of personalization that they applied early on is move results you click on a lot up in your results you can’t trust the serps you see at all.

They also inject a lot of arbitrariness and randomness to make it impossible to make small changes to your site to incrementally improve it the way you would improve an advertising campaign. One reason the web seems so frozen is that if you have a successful site and make major changes to the layout, titles, link structure, etc. you will possibly trigger the ‘chaos monkey’ and wreck your ranking and it may never recover.

leobg · on Nov 9, 2022

Personalized results. If you Google yourself, you just see what you see. But you can never know what others will see.

nordsieck · on Nov 9, 2022

> Personalized results. If you Google yourself, you just see what you see. But you can never know what others will see.

That just turns it into a statistical problem.

I don't think you can escape the problem of a search engine being used as an oracle.

pornel · on Nov 9, 2022

Ranking has many many signals, which are mixed in non-linear ways, and are dampened and vary over time. You only get low-resolution low-frequency sampling of the result.

QuadmasterXLII · on Nov 9, 2022

One option is to create a 'slow search engine' that only re-ranks pages once a year, doing the computation during the year and then updating the ranks all at once. You could evaluate the ranking before releasing it and patch obvious exploits, or even patch out exploits after the yearly release using data from before the release. This should slow the info leak to a crawl

TimTheTinker · on Nov 9, 2022

Then you get SEO witchcraft consultants who charge an arm and a leg for what they promise is a secret but oh-so-good method of improving page rank.

rgbrgb · on Nov 9, 2022

The current state of the world!

yarg · on Nov 10, 2022

That looks a bit too much like 'security via obscurity' to suit my tastes.

snowwrestler · on Nov 9, 2022

In a word, no. I’ve spent a lot of time on SEO over the past couple years, and inbound links still matter a lot to search rankings and traffic. This is clear evidence that PageRank still matters.

From a more macro perspective, I’ll believe Google is failing when a competitor starts eating their lunch. What I see right now are a bunch of would-be competitors who want to eat their lunch, including this company. The blog post is probably best understood as aspirational rather than descriptive.

As a user of search, Google results are frustrating at times, but is that because “pagerank is over?” Or because it’s an incredibly hard problem they’re working on? Google does not have to be objectively perfect to keep succeeding, they just need to be better than other search engines.

twelve40 · on Nov 10, 2022

maybe I'm slow today, but I didn't get from their long rant what did they replace page rank with? I get it, ads suck, SEO spam sucks, PR is gameable, serving the user is noble. But, how did they solve the actual problem of ranking the search results?

thfuran · on Nov 10, 2022

I think they're more like trying to grab several of the crumbs that Google left behind. I don't think it's possible for a paid search engine to eat Google's lunch.

Semaphor · on Nov 10, 2022

Yeah, the devs stated again and again, that they don’t plan or expect to be a mainstream search engine.