Hacker News new | past | comments | ask | show | jobs | submit login
The Age of PageRank Is Over (kagi.com)
558 points by darthShadow on Nov 9, 2022 | hide | past | favorite | 373 comments



I've been using Kagi the last few months. I've never had a reason to complain about its search results - they seem to work plenty well enough day to day that I don't really 'notice' them. Like most, I now search reflexively, as an extension of the mind, so I only 'notice' search when it's bad.

What really excites me is that is that I'm paying them. That sounds odd, but seriously. It's incredibly refreshing to know that the company providing my search results has an incentive to make things better for _me_ and not a legion of advertisers. With Google I can't help thinking about every keypress being logged to optimize sales pitches at me. I just don't feel that with Kagi, because I'm paying them.

Sure, they might be logging every keypress (I don't actually think they are, but you never can tell) but even if they were, I could be reasonably certain they were doing it to retain my subscription, which probably means making my search better, not selling me other stuff.

It's a priveleged position to be in, and the economic argument isn't watertight, but in the "search as a brain extension" space it still _feels_ premium, because it creates trust. And that frees up brain space for other things - like where the hell was that article I was looking for?


>It's incredibly refreshing to know that the company providing my search results has an incentive to make things better for _me_ and not a legion of advertisers

Cable television and netflix have made it quite clear that payment does not mean “no ads”, it just means “no ads, yet”.

It still might be a better alternative in the short term, but the moment growth starts to peak, companies follow where the incentives lead them.


I'm a little tired of the rhetoric about Netflix adding ads. It's only on an entry-level account that is subsidized from the normal pricing (IIUC, correct me if I'm wrong). You could argue that they've artificially inflated the price of the normal plan or that it's a slippery slope, but it's not like they've forced ads on the existing plans.

I think what's more interesting/concerning/insidious is hidden ad content like product placement. It's getting to the point where personalized product placement could be embedded in shows dynamically.


It's not really about Netflix. The point is merely that "just because you pay them, doesn't mean you're not also a product to them". Companies love nothing more than to charge both ends of a deal.

Take Spotify as another example. I pay them, along with 188 million other people. Does that mean they won't turn around and ask artists, record companies and rights-owning conglomerates for money (or rebates) for putting their stuff in front of me? Of course not. Paying them means they have some interest in pleasing me, but it's far from the only way.

On the off chance that Spotify was being scrupulous about not taking payola in any form, it would be impossible for me to verify. Which is in itself a reason for them to cheat; they don't get much economic benefit from goodwill if no one can actually see them being honest.

This is not a reason to not use Kagi, it's just a reminder of what forces we're up against. Kagi will need an unusual amount of transparency in everything they do, in order to stand a chance in the long term.

And it IS a reason to not get warm fuzzy feelings merely from the fact that you're paying them.


What you say is true.

However, for sure that will get out eventually.

If that happens, I would then look at other search engines as well.

Most people wouldn't once they get used to doing it one way, so that's all to the good for Kagi. But people like me wouldn't then use it, which is what life is about - your own personal choices.

I prefer to pay for services because at least there is a slight chance that they will follow their business plan.

But nothing stays the same for forever. As a consumer, we must also change to the changing environment. I'm completely weened off Google, for quite a while now, for example. I'm paying for everything that I use. But it is so little as to be laughable. For example, I use tutanota for my email, first account is free and subsequent ones are $1/month. Big whoop. $12 per year for private emails. I have a number of emails through them because I segregate all my emails - one for just friends, one for business, one for education, etc.


I would consider the Netflix UI to be ad laden for some years now with hundreds of originals shoved in your face in a non-customizable fashion and autoplaying etc. Also personalized thumbnails based on past watching behavior. You could argue that it's a good UX for most people, but for me it's always felt a bit more like user hostile marketing.


Netflix doesn't have enough content, so they resort to these tactics.

All major content providers left them years ago and started their own Netflixes. So they're left with a few movies from ten years ago, a couple of recent-ish releases, and their own content.


really, content creators should be banned from creating markets like Netflix (and eventually the other way around), but that would require the regulation to not be asleep at the wheel.


Are you really saying that creators shouldn’t be able to publish their own content on their own infrastructure?


Should've specified I meant movies and other large conglomerates that tend towards oligopoly. I mean things like Disney+ shouldn't exist, its extending the concentration of market power vertically too.


I real wish that HN wannabe lawyers would stop throwing *poly words around with no legal justification.

In the streaming space in the US there is: Netflix, Disney+, AppleTV, Amazon Prime Video, HBO/Discovery, Paramount Plus, Peacock, STARZ, and a few other players. There is no “opoly” in streaming video.


That’s still a small number of players dominating the market. That’s the definition of an Oligopoly, and the Disney/HBO offerings very much fall into that.

They certainly have the ability to uniformly raise prices (tacit collusion) with no viable competition to enter the market and fill the gap (as even the vast sums others have thrown into it have shown how hard it is to produce good original content).

This is probably due to the characteristic that producing goods (decent original content) in this market is a big barrier to entry - which naturally leads to a small number of players. Natural monopolies and oligopolies are common - but do require closer regulatory attention to ensure desirable consumer outcomes than just letting the free market decide.

It may not entirely fit all definitions, but the general economic theory and applications/implications are relevant to consider this market.

The original post that you questioned was related to vertical integration - you could effectively find and replace it to “so you’re saying producers of operating systems shouldn’t be able to make their own web browser?”


> That’s still a small number of players dominating the market. That’s the definition of an Oligopoly, and the Disney/HBO offerings very much fall into that.

The market is “content”. Netflix competes with YouTube content producers, TikTok and it even said that one of its biggest competitors is Fortnite.

> This is probably due to the characteristic that producing goods (decent original content)

This goes back to YouTube. You and I may not think that YouTubers and TikTokkers are producing “decent original content”. But there is a generation that spends hours on both.

Besides that, there were over 550 original series being produced last year (https://collider.com/too-many-tv-shows-550-series-2021/). Competition is much fiercer for your attention than it was when you only had the three major networks producing content and everyone else buying rights to show reruns.

There are bidding wars between all of the streaming services for new content from producers. Competition is more fierce than ever.

The price of streaming before was never sustainable. Netflix was borrowing billions a year for years to produce and obtain content. Disney+ was never going to be profitable selling its service at the introductory price. It’s not “collusion”. Every company has to turn a profit eventually.

Yes I realize that Netflix was “profitable” by GAAP standards. But it was getting deeper in debt every year.


> Netflix competes with YouTube content producers, TikTok and it even said that one of its biggest competitors is Fortnite.

It competes with these for screen time, not for content.

Meanwhile Disney owns how much content (movies, series and related IP)?


Very little as a percentage of all the professional content that’s in the world.

And the value of IP without execution capability is overrated. Warner also has iconic content. But Disney has been able to make successful movies out of its 3rd tier IP while Warner has struggled with its first tier.

Sony isn’t doing too well with most of its Marvel content except Spider-Man and that’s produced by Disney.


>The original post that you questioned was related to vertical integration - you could effectively find and replace it to “so you’re saying producers of operating systems shouldn’t be able to make their own web browser?”

I think time has shown MS was correct in making their own browser, but of course incorrect in all the corrupt tactics they used to make their browser succeed over Netscape's.


This is severe rose colored glasses. Netscape was always a horrible application and crash prone on every operating system it ran on. There were geek wars back in the day bragging about how well our operating systems handle Netscape crashes - classic MacOS failed miserably.

IE was a much better browser. Especially the Mac version that when it was introduced, was the most CSS compliant browser that existed.

Netscape starting over from scratch was cited as “Things you should never do” in an article written by Joel Spolsky (StackOverflow cofounder) two decades ago.

https://www.joelonsoftware.com/2000/04/06/things-you-should-...


It's not just netflix. I hate the my Google TV pops up with ads for media. My PS5 boots straight into the store. At least my Apple Tv mostly doesn't shove ads in my face though my iPhone seems try try to shove Apple Music and/or Apple TV+ at me now and then


1. Increase the price of the normal plans.

2. Create a new plan at the price of the old plan. With ads.

I'm sorry, but pretending that they're not adding ads to their "regular" pricing plan is semantic at best. This isn't something they dreamed up overnight. They've been planning this for years and increasing their prices accordingly.


I agree with your product placement comment. I feel gross when I notice it. I also wonder what happens when Netflix decides it makes way more money on people watching the ads - ie. Google initially calling ads a detriment to search quality but not being able to resist the $. I could see a day when they remove the no-ads plan.


Chromecast screensavers _are_ an add for other shows in Netflix.

I am paying and have no way to disable that visual pollution nonsense.

Extremely exasperating when you are trying to choose something adequate for toddlers and they keep seeing flashy stuff on the screen and saying they want to watch that.


It's not about the user having to see ads. It's about Netflix trying to sell ad spots to advertisers, which might affect all (even paying) users. I.e. by getting pressured into censoring or promiting certain shows, to close the advertising deal.


> It's only on an entry-level account

So far. I imagine they will be expanding it over time.


HBO started broadcasting 50 years ago, charging a monthly fee for commercial free movies (often released much earlier than on cable) and tv shows.

With the exception of filler ads for their own content that occurs when they need to wait for the next quarter time, ex movie ended at 12:55pm. There have been no ads


It's the exception that proves the rule. HBO has always marketed as a premium service, while Netflix's goal is to reach every home.


I mean, also Showtime, Starz, Cinemax...

To say nothing of the fact that Hulu has maintained an "ad" and "ad-free" tier for their original programming.


Aren't those called "promos" since they aren't ads, specifically as they aren't paid for?


> It's only on an entry-level account

for now....


I think it’s important to consider whether this happens with all companies, or whether venture capital seeking to maximize returns to the detriment of both the business and the user is a factor here.


If they plan to go public ever, its pretty much an inevitability, at least if US market incentives stay as they are. Cable isn't venture capital funded, and they're rife with profit maximization.


Having the actual user pay is a necessary but not sufficient condition for a user centered product or service. It doesn’t guarantee the company won’t double dip but without a direct economic model nothing is even possible but ads and surveillance.


It strikes me that a company like Kagi should be able to craft a legally enforcable agreement with its customers which expressly forbids the company from selling ads and conducting surveillance.

The agreement could be carefully written by a skilled lawyer to define the things Kagi cannot do, the proof customers must present in order to proceed with a valid lawsuit, and even the maximum damages that the customer may sue for.

In that case, if Kagi was found at some point to be using customer data for these purposes, it could be sued very easily and by many parties.

People are calling for regulation for data privacy. In the meantime, Kagi can create its own regulation it will hold itself to for the benefit of its users, can it not?


Can you add a poison pill to a company where a select group of people can decide you broke one of your founding principles and hand the company over to someone else?

No VC startup would do that but if possible would allow trust.


Sure you could. Of course no serious investor would make the acquisition.


Also smartphones and smart TVs.


Even Amazon


Well ... I also made Netflix clear that my subscription means "no torrents, yet"


Counter-powers is what makes the world go round.


Yes, we can't emphasize this enough.

Companies have an incentive to make money off of their customers' data, and technology just means that they can make money at every level (your ISP, your smart TV, the apps you use on it and the websites your browse).

We're so accustomed to entities gathering data about us that it's become part of the background.

Why is it the default that my home is on Google Maps/Google Earth? I should have to opt in, not opt out.

Just because there is "public" information about me as a registered voter or home owner doesn't mean that anyone should be able get a copy of this data, put it online and connect it to other data?


Cable was never intended to be ad free. It was originally a method to rebroadcast over the air content from the broadcast stations in areas that couldn’t get a signal. You’re paying for the infrastructure to make that possible - not the content.

Complaining that cable TV isn’t ad free because you pay for it is like saying all content over the internet isn’t Ad free because you pay for your internet connection.


As long as there are good alternatives, it's fairly easy to switch, especially for something like search engine or streaming service.


>Cable television and netflix have made it quite clear that payment does not mean “no ads”, it just means “no ads, yet”.

'It's better to be a millionaire hero than a billionaire asshole' (Ben Elton, Gridlock)

Of course at $10/month even a 1% share of the search engine market would make the company worth billions.


Perhaps, but the millionaire hero will likely be outcompeted by the billionaire "asshole"


If the only metric of success is yearly revenue, yes.

If that is the only metric, the best business model is to extract as much money out of your user, while simultaneously offering less for the same amount of money, until you reach the asymptotic ideal of 100% profit.

If you were to start a company, would you be willing to do that? Would you choose profit over everything else? Some would, but not everybody.


Netflix is still ad-free for me (product placements in content aside). Multiple tiers is reasonable.


The economic argument not being watertight is the problem. This is why tech companies keep pivoting to ads, they just didn't make enough money otherwise.

Afaik [1], there's about 5 employees and the revenue only covers server expenses while they're still trying to get more headcount. Not an expert on bootstrapping but I'm pretty sure you don't want to expand faster than your revenue does otherwise you stop being able to make all the decisions for the company.

[1]: https://blog.kagi.com/status-update-first-three-months


"This is why tech companies keep pivoting to ads"

I think another factor is the paradox that the user who has enough money to be willing to pay to not see another ad is exactly the most valuable user to advertisers— they've pre-qualified themselves on multiple axes.

So there is always going to be an enormous moral hazard here with the ad-driven-freemium model— always a temptation to run an occasional high-value ad to the paying users, or to disguise the ad as a "recommendation" or position it instead as a "sponsorship", or whatever else. Other than users reacting swiftly and decisively against this kind of thing, it's hard to know how else to keep companies away from it.


This showed up at an internal Twitter meeting this week. Mr. Musk proposed making Twitter ad free for 8$ accounts before being politely informed that 8$ accounts actually produce 50$ per month in ad revenue.

An angry bird informed me about this :)


How do they know that, given that $8 accounts must be still pretty rare given they were just introduced?


Because it's a pattern that manifests everywhere and can be extrapolated.

Though in this case, it's likely that Twitter has repeatedly run internal studies about what a paid-tier would look like, and each time concluded exactly what the GP is passing on: that it would be a net loss because the most-valuable-to-advertiser accounts would be the first ones to pay.

Even just psychologically, it would be a major blow to their sales pitch, going from "pay $X to put your name in front of the top movers and shakers in the world" to "pay $Y to put your name in front of all the Twitter users too cheap to cough up $8/mo to not have to see you." Not hard to see that $Y is going to be less than $X... probably a lot less.


I think the analysis was for pre existing blue check accounts.


One of the problems is price point. They are charging $10/month. While increasing the price will certainly decrease the number of users, a much higher price point could select for "power users" that will churn less and pay substantially more. In the long run that could lead to greater revenue.

Of course this depends on how the supply demand curve looks like for their specific business.


Catering to the whales, in casino speak.


The problem is that ad revenue so often becomes the escape hatch from an economically hard problem.

Need more money? Economic model not quite working? Companies face this all the time. Some solve it innovatively, survive, and show everyone else the way forward. Others fold or pivot.

But ad revenue short-circuits the hard process of economic innovation by tempting tech companies to take an easy way out, even when the problem is solvable without it.

(All of this wouldn't really be a problem if the ads business model weren't inherently adversarial against users.)


> This is why tech companies keep pivoting to ads

The reason they advertise is because they can. It doesn't matter how much money they're making now, they can always make more by advertising. Paying money just makes your attention more valuable. The only way to stop them is to make it impossible or illegal to advertise. Blocking ads everywhere and making no exceptions.


When google started, they were having difficulty staying solvent until they added ads, even though they had a better product than the other search engines. Its a space that's ill-suited to market solutions.


If their solution to insolvency is advertising, then let them be insolvent. We're not gonna keep enabling this behavior.


They mention that users can invest in them through SAFEs. This only makes sense if they plan an exit, either by selling or through an IPO, or am I missing something?


I think it implies that they intend to take VC money at some point in the future, like a series A, and they’re trying to avoid VC money now while they can get better terms. I don’t know if that necessarily means they plan an exit, or if they’d be happy just being profitable, but I do think it implies they plan to grow significantly.

EDIT: Their website says that they don’t plan to take VC money, so I have no idea what a SAFE means in this case. The point of a SAFE is that the investor is guaranteed the best terms when a priced round of investment happens.


Other options include going public or paying out dividends.


Going public is a qualified event for a SAFE to convert -- it is a financing -- but curious about the dividends

SAFEs are often capped, so taking awhile doesn't matter so much, except maybe interest calcs


If a company is doing sustainably well, supposedly you could have a mutually agreeable mechanism to convert SAFEs to equity (at cap for example) and treat shaleholders with dividends.


Is this some sort of hack towards a co-op? I couldn't find anything about it on their website


"Enough money" is usually defined by venture capitalists who want ALL THE MONEY and not by normal investors who want a modest predictable return. Patreon, for example, could have been a modestly profitable business which owned itself, but instead they keep flailing and spending lots of other people's money to satisfy the next batch of investors that one day they will be as rich as Google.


The sad truth is on average advertisers will pay more for your attention than you’ll pay not to have ads. So people continually complain about YouTube ads but ignore their option to pay to have them removed.


I pay for YouTube too. Between that and Kagi search, it's ~$20 per month. Which seems crazy in today's environment of 'everything free'. But seriously, I use these both all day for both work and play, but I only watch an hour of Netflix a day. So which is really more valuable?

I agree that on the whole, people will continue to go for the free option, but I'm just glad that some options are appearing to pay, which rebalances the incentives a bit.


Why wasting money on "feature" that can be done with same result for free? uBlock, NextDNS, pihole, etc. No-one is asking for ads by default, ever.


With YouTube premium you can listen to videos on the iPhone with the app in second plane or even with the phone locked.

Yeah, I should use a hacked rooted Android yadda yadda. But I just spent a week hacking and hawking for my life in an ICU, and I certainly didn’t want sanity-saving podcasts to shut down whilst on an hour-long stay at the breathing machine. Life is short.


While I am very willing to pay money for a user-centric search engine the requirement to do so directly conflicts with the greatest must-have feature that I need from one: Anonymity.

Now the issues is not just privacy but making sure it does NOT optimize search results based on prior searches or other facts it knows about me.

Now the big problem is that one would think delivering the exact search results that I am looking for is a good thing. No. It is super dangerous. It can causes me to live in a bubble, blinding me of possible opposing opinions. It is feeding my ignorance.

So there is a goal conflict here. I don't know how to solve it. I wish there would be something like Wikipedia for search engines but even the wiki models has its problems and biases.


It can causes me to live in a bubble, blinding me of possible opposing opinions. It is feeding my ignorance.

This made internet completely unusable for discovery. A big shiny window into another tiny room. Absolute BS.

Recently I discussed it here on HN and was advised to like, dislike, subscribe and so on on youtube (things I rarely have done on my old account) to break out of this bubble. Well, it doesn’t work. It just suggests the content I immediately dislike or ban entire channels and it all settles to the same echo chamber as before.


I think many platforms optimise for interactions, with no regard for whether they are positive or negative.


As a result, I often find myself closing youtube tab because there is not only nothing to watch, but nothing to scroll through even. On a platform with billions of videos, at least thousands of which I might like to watch if given a chance. I find this incredibly stupid.

Same for google search, really. It turned from “I’ll find you that if it exists” into “Look, if you change your query to ‘cool products if at all barely relevant to …’, I have literally billions of results for that”.


I like the idea. Maybe a 'Search for me' button instead of the current 'I'm feeling lucky'. Or two columns or tabs for personalised and general results. Sounds easy, but I am not aware of any search engines that offer this, unfortunately.

On the other hand, a lot of the search results bias stems from the search query itself. "Does X cause Y" for example will show results that echo the search, and not show opposite results. Simply because you did not ask the opposite or a more open/unbiased query like "X Y".


I think if you can assume some level of trust in your search provider (i.e. through payment - let's get to that in a minute), without which you should probably be looking for a new search engine, then you can trust the search engine to not optimise results for you. If they tell you they don't, then do, then you likely have deeper-rooted issues with the search engine you use.

On the point about payments, there are some potential approaches to handling this, but they tend to be a bit complex and technical. I"m not sur ethis is really a technical problem though (or at least one where a technical solution is what's needed).

A challenge of taking traditional payments is that the identity of the payer is inherently "linked" (to some extent) to the transaction. You could maybe use a privacy.com type intermediary or similar.

What you could do however, is buy a "kagi voucher" or similar, which would be an RSA-blind-signed token that you generated client-side in JS. You'd send it to the server during the payment transaction, and be given back a signed version of the token, which you unblind client-side, then give a backup copy to the user to save, and set it as a cookie to authenticate payment has been made.

That should, in theory, at least un-link the payment from the user. Ultimately though, a bad actor can still trivially correlate your searches (and payment, if you used same IP or browser fingerprint), and all your searches are linked by a common cookie.

So you decide to create some kind of token fountain, which you can auth to with a blind-signed token, and get 50 "search passes" from. Those could be unique per-search, and just be "bearer tokens" you can use. You'd have to trust that this token fountain doesn't record which search passes were issued to which "blind token".

Ultimately, it feels like this problem then becomes "do you trust your search engine?" - I'm not sure this is a problem that technology needs to solve, as ultimately your search provider is seeing your queries and IP, and most users won't realistically change this. A provider that isn't incentized to act against your interests (i.e. something like Kagi which isn't advertising to you or selling data) seems to address this for most user scenarios.


I don't think there is much cause for worry that Kagi themselves might use the data for anything bad BUT data leaks do happen and more importantly Kagi will need to play ball with whatever government that exists in the markets they want to have users in, be it the USA but also China, Russia or whatever.

We already know that Google is in fact cooperating with these entities. Not only by giving out user data but also by actively censoring certain things. So there is a precedent.


It sounds like you use the internet way different than me. I can't remember the last time I searched something where the end result wasn't mostly objective. Documentation, directions, tour information, movie times, programming questions, patch notes. Maybe seek contrary views out in the world. The internet is a terrible place to form a world view and to really communicate in general.


Agree, however we are already in trap of algorithms. Look how few years of using social networks spread ideologies that would be laughable back then, ie "there is more than two genders" or "earth is flat".


I thought: couldn't the details about the user live on the client? And the user could delete, modify, decide not to use them.

This way you get anonimity and custom results


That doesn't work unless the server sends all 1 million results (say) to the client up-front. Otherwise, if you prefer results from example.com but the server only sends example.com results on page 3, your client has no way of showing you those results on page 1.


I think I've been misunderstood, in my mind the client sends all the details about themselves to the server, the server builds up the search results and send them back.

Yes, the server has access to the information the client has, but it doesn't have any way to confirm that's true. It also could avoid storing that information all together if the client sends it again every time


I'm with you; I'm a paid Kagi user for a while now, and unlike various attempts at switching to duck duck go (et al), Kagi is actually fully working for me. I do not switch back to Google more than once a month, and each time I've gotten "worse" results so far.

I'm sold, literally.


Do you stay under 30 searches a day? On their pricing plan it seems users searched a lot more in their beta program than anticipated. 240 searches cost them around 3 dollars.


I just tried Kagi and was surprised that my open source project shows up in the results for relevant keywords. All the other search engines I tried bury it completely out of view no matter how specific the keywords are. It makes me wonder how many other indie projects like mine are excluded from mainstream search engine results.


See if the founder’s side project Teclis [0] (non-commercial search) finds it. Kagi uses the project’s index in addition to Google and Bing.

[0]: https://teclis.com/


Yes, it finds it. It ranks quite well for most keywords I tried. It even got #1 spot for one of the keywords I tried. I also noticed some other related open source projects show up (by other people) which I hadn't heard about for several years.

Google search only shows paid SaaS platforms in the first page of results.


"At USD $10/month, the price does not even cover our cost for average use"

240 searches cost them over $10.00. They will need to find additional revenue or reduce costs to even reach a break even point.

They will have to raise prices or make you pay per search or sell your data or better yet all three.


I wonder how much the per-search cost is marginal cost vs. amortized fixed costs.

If their paying user base grows by 10x, do the searches drop from 4¢ each to 2¢ each?


Most of their cost comes from API accesses to other services (e.g., Bing), so unless they can replace those API calls the price cannot go down very much.


API access is negotiated by volume, so as they grow, they can negotiate preferential pricing.


> Our proposed price is dictated by the fact that search has a non zero cost. With other search engines, advertisers cover this cost. But it costs us about $1 to process 80 searches.

.0125 per search. The teams rate is twice that.


Or give notice to power users that they are exceeding quotas by a lot. Have a power user subscription perhaps. Many people don’t download a lot and ISPs are good, but you have outliers, datahoarders perhaps who take a lot of bandwith.


Google's revenue is over 280B annualized from last quarter's numbers. Let's just assume that every person on earth is using Google at an equal rate. That's $35 worth of revenue per user per year. There is no way that you can come close to that without doing ads; the median global income is less than 4000 international dollars per year. So naturally, there is a strong pressure for companies to pivot to ads once they run out of other ideas to grow the business.


I very much disagree with your premise

* one does not need $200-300B ARR to run a search engine; and merely making a ton of money does not mean that the way that one has been making that money will continue forever

* using Kagi, one realizes how much the web has been perverted, in my opinion, by the obsession with the ad model; there are many nice, novel things that Kagi could introduce which would reduce ad revenue for a company like Google, but would be useful and novel for Kagi without loss in revenue (on the contrary — could help); ie google is incentivized to keep people on the site through various irritating means; a search engine that isn’t distracting and gives me what I want — I would be more than glad to pay for (and already do pay Kagi $10/month)

Other thoughts:

* Supposing $10/month (honestly I would consider $20) with 100M paying users; that is $1B MRR which is not bad at all and is more than plenty for a meaningfully sized team with meaningful salaries.

* Just like how lowered energy prices during and after the Industrial Revolution made manufacturing at scale feasible, the incredible amount of high quality OSS and infrastructure these days is making it increasingly feasible to do things Google did 20 years ago — something unthinkable for even the best engineers of that era. Not to mention the relative ease of collecting capital with payment services (let alone VCs, etc.) today — even with a looming recession factored in.


> $10/month (honestly I would consider $20) with 100M paying users

That would actually be $12B ARR, since that price is per month while ARR is usually per year.


Oops, yup, just came back to fix this; ha!


Looking at the global median income and coloring it with the mean revenue per user paints an incorrect picture. Advertisers are not paying even $35/yr for the attention of people making the global median wage.

US attention is the most valuable because it is the richest large nation and the largest rich nation.

European attention comes a distant second. Together they have 40% of the global GDP, but likely something like 60-70% of the consumption of non-essential goods in nominal terms. Accordingly, average advertiser cost per clock differs ~10x between the cheapest and most expensive groups of countries.


what I find strange is that selling ads makes more money than selling service. Because at the end the people watching the ads are buying the stuff advertised so those who adverstise have more money for advertising…. So for it to be rentable people have to buy way more than the ad revenue google makes. If spending for google ads is 1% of bussines budget and budget has to be at least Revenue……..with 280 billion revenue for google it would be 280.000.000.000.000 that we as humans have to spend for stuff that was advertised on google so someone pays google for ads so they can run google search etc ….

35 dollar is maybe ok… …. Maybe better than spending 3500$ on … I dont know … stuff is good also ..


Advertising is kind of a backdoor price discrimination in that way. If you can afford to spend $10,000,000, people will advertise wealth management services to you. If you can afford to spend $1, McDonalds will advertise their dollar menu to you.


If you show an ad for a car that costs $100k to 1000 users and a single user buys it (0.1%) you still sold a car for $100k. Let's say it costs $90k to make it. You could pay $5 per ad displayed and still made a profit. My point is, the people who actually buy the products are actually paying for the other people who can't afford it. So, the model doesn't rely on everyone buying a keyring from a car dealership, but on most people not buying anything, and a couple buying a car.

In a sense, that model is better at distributing income, because everyone gets a service and the people who can afford pay for everything. If everything was behind paywalls, people who can afford would have all the nice services and people who can't, will have nothing.

What's interesting is that "targeted ads" is a bit of a misnomer. It's more of an ads ranking, where the user is shown the ads they are most likely to buy. However, from the point of view of the car dealership, when they think targeted ads, they would like their ads to be shown only to people who will likely buy the car. No company would want their ads shown to someone with no disposable income, yet, they will be shown some ad.


They're selling distribution. Why is it strange to you that distribution is valued so highly? In a lot of ways it's more valuable than a product.


Isn’t Google revenue from more than search?

They have a full business suite, App Store, hardware sales, fiber internet, wireless cell and internet, and a server rental division.


> ... once they run out of other ideas to grow the business.

Yep, but the main problem here is the perverse pressure to grow indefinitely.

And that comes from it being acceptable for public companies to pay no dividends and instead rely on share price increases to deliver a return on investment.

Something that slows down this growth spiral would fix things partially. Require profitable companies to pay a % of the profit as dividends to their shareholders?


I just tried it for a few terms that Google found no relevant results for (related to Wasm/Rust); and holys*t there were many results from people (developers) who wrote in their blogs.

I'm a user now, and potentially a customer if I hit their free limit.


The reason that I'm not using Kagi (and some of my paid subscriptions) is that payments require login, and login makes it possible to track my search history. I'm happy with paying them with money, but not with my keywords.

Is there some zero-knowledge protocol that could help here? In particular, to establish that a user is currently subscribed, without revealing which subscription they have?


Anonoymous credentials covers that use case pretty well.


Very naive . It just means that in addition to whatever google do with your data , they also charge you money on top of that . When people have power to do things to maximize their gains (selling u ads/ data in addition to charging u ) , expect that it will be done. It is the same in politics. Power corrupts.


I believe it's a responsibility of organizations themselves to orient this way, and for customers to be vigilant about this orientation. Does your search engine have weird incentives to exploit you? Change search engine :)

----

I have an idea (more here[1]) that the next addition to our civilization should be a distributed system to evaluate, reward and compensate externalities. Open source software is a great example. It's free, it has massive benefits to users and for people reusing it for all sorts of projects, it's public. Yet we still can't find a great way to fund it. There are donations, and maybe that's all we need in some sufficiently enlightened society, but I think we should be addressing those more directly. And clearly there are technologies which generate friction and other social costs, that could be (differentially) quantified and prices. The same goes for pollution and a miriad of issues.

Laws can help here, but laws tend to lack softness and specificity in my opinion, and laws are strictly punitive. If you're generating a negative externality, and the legal system is well designed, maybe you'll have to compensante for it. But that relies on quick lawmaking and judges evaluating something they're not specialists at. And there's no dual for punishment (actually compensation and reeducation), there's no intrinsic reward system. There are prizes and grants, and I'm sure they generate immense value for what they are, but they're still very unsystematic and unreliable, I think.

I dream of a system where if you generate value for society, you will be recognized and make a comfortable living off it; if you need investment, you will also be supported. (I call this idea Elementalism; I'm not sure it's original)

I think we need to be a little more open to careful, thoughtful changes to our social-economic system that can improve things.

While Elementalism isn't a formal part of our system, each of us can do our part and give directly to what we understand needs the most. This is the core idea of Effective Altruism which I also extend to donating to Open Source (which I believe can be effective[2])

[1] https://news.ycombinator.com/item?id=29043752

[2] https://www.reddit.com/r/EffectiveAltruism/comments/v7ma0d/w...


Oh, but you CAN tell if they're sending those keypresses back to the mothership by monitoring the networking tab in your browser.


Maybe. They can hold batches and smuggle them in various ways.


If they have an auto-complete feature, the keypresses WILL be seen in not even in a smuggled way. They'll be directly in the URL to the search function!


You could probably hide that from anything but a deep dive by chucking the data through a websocket. Unless you see and recognise the initial socket connection you may miss it entirely.


I'm worried that I need to be logged-in and expose my CC to use such service. Nonetheless most of us use Google Search in a similar way.


awesome


I just signed up and tried it a little bit and I like what I see so far. I find myself increasingly frustrated with Google search results for a particular use case: searching for documentation. For example, today's work had me thinking about Python's datetime and timedelta and I wanted a reference on what functions are available. With Google I am annoyed with results from geeksforgeeks.org and freecodecamp.com because they are not reference materials and generally only cover some basic use cases. In Google, those two sites are in the top four results. In Kagi, they are not. Instead, there is a longer-form blog post from guru99.com, stack overflow, and the official Python documentation.

Now, I will admit that for this particular query Kagi and Google results are pretty close. But my general experience is that when I search in Google I find that I have to look farther down the search results to look past the blogspam to find the authoritative reference.


The blogspam has made Google and Bing/DDG almost completely unusable for technical searches.

Go search for something like "postgres cte" and you won't find anything useful until probably halfway down the page. And maybe not at all.


"Do you want to learn about blogspam? If so, you are on the right page. You will learn about blogspam here. One of the most major things about blogspam is that it exists on the Internet. You just learned that blogspam exists on the Internet. In this way, you have become educated about blogspam.

Now that you know about blogspam, we'll move on to the next topic: How to find blogspam. It's actually very easy to find blogspam. You are on the right page if you want to learn about finding blogspam..."


This killed the blogosphere, and will kill any decentralized system that becomes popular unless they can do something extremely clever.

Instead of a dark forest (https://en.wikipedia.org/wiki/The_Dark_Forest) think of the outer internet as a fake forest. If you wander off the beaten path - the same dozen sites that everyone uses and complains about - you wander into an endless, trackless zone of fakes all of which are ultimately trying to sell you something.


Thanks for the fun analogy. It is right on the money.

Made me imagine the invention of the internet as the big bang, and that now we are watching the expansion of e-space as all the useful bodies rush away from one another and the light-years of space between them is filled by a vacuum of usefulness.

Maybe one day the search for intelligent life in space will be easier than the search for intelligent life on the internet.


This is also giving me Microsoft "independent advisor" vibes.


That’s funny. Is that how pages go up in rank? Make many references to the same word and make circular arguments?!


I searched for that (without the quotes) and this was the 2nd result:

https://www.postgresqltutorial.com/postgresql-tutorial/postg...

The first result was the official documentation.

What else are you expecting?


Google search always gives lousy results, except when you have complained about it, in which case people who check your work always get optimal results. /s


I see this from time to time myself where someone points out how bad the results are, but different results for me when trying the example. However, there are certain other things that I have searched for myself that absolutely resulted in crap results from SEO/blogspam type of results. So I know 100% it happens.

What I'm wondering is how much of your recognized fingerprint influences the results? What causes results to be different from user to user using the same search query?


Typical. Some people always complain about Google search results nowadays but I very rarely see actual bad examples.


he was expecting nobody to check his BS


What kind of results do you get when you Google the term? For me, the PostgreSQL documentation[1] came out on the top.

[1] https://www.postgresql.org/docs/current/queries-with.html


For HTML, CSS, Javascript, and browser APIs, I simply add "mdn" to the search to guarantee I get the official-ish MDN docs. From there, I can dive in to W3C specs, etc if needed.

Those training wheel search results are annoying but they're highly ranked probably because most people like and use them.


There are also Chrome and Firefox extensions to remove results from W3Schools specifically


For me, #1 is the official documentation, #2 is a decent looking tutorial, #3 has a slightly better page on google, #4 is a clear kagi winner.

Neither of them offers bad results, but we are talking about google, it makes no sense to give example searches without saying what results you receive as Google is so heavily personalized.

Kagi does personalization, but it’s explicit. You yourself decide the region to search in (my default is international, though there are quick bangs to search in other regions), and you can up- or downrank domains, as well as block them completely.


Always add the keyword 'forum' to your search terms. It filters out most of the crap.


"hackernews post, deeply knowledgeable, FAANG dayjob, startups with VC funding, increased conversion rates, trending on artstation"


Maybe it's just me but search results for that were very helpful in my case. Official documentation, stack overflow questions, an informative blog about cte gotchas, all within the top half of results.


Just Googled it, the first result is official postgres documentation, second is a tutorial, what exactly you get / expect when you search for it?


I just duckduckgoed it and 2nd and 3rd results are tutorial spam. 2nd is geeksforgeeks...


Here are the results for that query on Kagi if you want to compare:

https://kagi.com/search?q=postgres+cte&r=us&sh=5-n8GUySt5qmx...


Another nice thing about Kagi is that you can eliminate domains from your search results. Obvious content farms are pretty easy to spot and remove.


My favorite feature is being able to boost a certain domain up in the the results (or even pin it if you really would like). I often search for different Pokemon and prefer the information that a site like Serebii.net gives me over something like Bulbapedia.


Is there any reason why? I'm not heavy into the game anymore but usually when I'm searching Pokemon Bulbapedia has the information I want.


I think it's just because I prefer the organization of the information, giving you things like the type chart up front and center.

It's also the site I used more as a kid so there's probably some loyalty bias.


Yes, eliminating and boosting favoured sites are both excellent feathers on Kagi


That should be a good signal for ranking algorithms as well with the subscription price disincentivizing bots.


IMO this article misses the biggest issue with search engines today. It's less about any ranking algorithm (like PageRank), but rather the root cause is with indexing. No matter how much you fine tune your ranking system, if your index is filled with SEO junk and blogspam, you're fighting a losing battle from the start.

For some reason, a lot of these search engines like to brag about the number of documents in their index, which never made sense to me. Maybe it was true in the past, but on the modern web, larger index !== better results. In fact, I'd argue the opposite since you're much more likely to serve SEO spam.

That was my motivation to start hacking around on CrowdView (https://crew-rho.vercel.app), a search engine specifically for forums and discussion content (e.g. forums, discords, twitter, reddit, etc). It has a curated index (today, curated by yours truly) to remove SEO junk and help you figure out "what does a real, genuine human think about this think?"


Not to be rude, as I appreciate the motivation behind your work, but your post ironically followed SEO junk format:

1. Enter the discussion on a human level

2. Hook onto existing context

3. Argue issues with existing context

4. Present your own software as a solution to these issues

It is like a date that only wants to offer MLM business opportunities, or a long-form joke that ends with a disappointing "better nate than lever" punchline.


In general scams work because the form works, and should work, but the content is an illusion.

You could equally say everyone's following Aristotle's Rhetoric.


Kagi has a feature, lenses, which includes a discussion lens. It’s pretty much that, results from forums and reddit.


For a long time, I've been considering a different solution to the problem, which is to create a human-curated whitelist-only search engine.

I do think your idea is great and something with a lot of value for many use cases.

The only issue is that it won't surface certain things like restaurant contact info and Wikipedia that people often need to be a top result.


"a human-curated whitelist-only search engine"

This is a good idea. Basically harkening back to the old internet directory days, combined with the powerful indexing tools we have now, and then use all the great language models and topic modeling tooling to make the querying great.

Said another way: moderate and protect the index, instead of trying to clean up the results of queries.

You'd need to monitor the whitelisted sites in various ways to make sure they don't bait and switch and turn to spam later, or get hacked and go to shit.

You might even be able to monetize on the publisher side by allowing sites to pay to be indexed faster/more regularly, which could potentially (but not totally) help with spam control too. Every publisher wants traffic.

You might be able to jump start the whitelist by getting folks using a plugin so you can know what domains they spend time on in the first place and index those when signal strength warrants it. Gotta be careful about that being gamed too though. Also I think this is actually the core principal of Brave's search engine come to think of it.


> You might be able to jump start the whitelist by getting folks using a plugin so you can know what domains they spend time on in the first place and index those when signal strength warrants it.

I actually think a browser plugin is a core component, especially if people could opt in to contributing their pages to the index (with a method to make sure they're not logged in to anything).

The plugin would also allow people to flag websites as spam/scams/low-value, regardless of whether they're in the index or not.


> combined with the powerful indexing tools we have now

What powerful indexing tools exist today? I have little context on the search domain but it's very interesting.


I'm not sure, but it sounds like they mean that building and quickly searching an index has been democratized. You can now use any number of FOSS projects or SaaS offerings to do that, depending on how big you want the index to be and how much you want to spend.


That’s right. Back when google was coming to be, building a search index was basically computer science/grad school research territory. Now there’s tons of good starting points that you can throw money and hardware to scale quite a bit to at least prove out a product concept.


Which starting points are those? I'm familiar with some open source search engines, but not open source solutions for crawling and updating a search index of the web.


I like this idea in theory, but I think it would be difficult to scale up enough to be useful.

Who would be the curators? If you open up curation to volunteers, it could be gamed by bad actors. However if you have too small a team, the results will be limited and biased. For example, favoring the English-speaking or tech-sphere web while ignoring large sections of the web with which the curators are unfamiliar.

Perhaps machine learning could help with scale - start with a human curated dataset, then train a model on it. However that could end up getting gamed too.


I think curation by paying users could work. At least it would make manipulation quite expensive.

Let people downvote/upvote search results as an extra strong signal on top of monitoring what they choose to click on in the result list.

Also let people block domains from searches and provide some discoverability of commonly blocked domains.


> Who would be the curators?

That's a key question. You don't necessarily need that many of them, though, if you're whitelisting at the domain level. A few dozen people could work giving a rating to many thousands of domains every year. An ideal number might be in the thousands.

I'd also say they probably shouldn't be in industry, because they'd have an incentive to game the index.

> Perhaps machine learning could help with scale - start with a human curated dataset, then train a model on it.

While this is true, I don't think search engines other than Google and Bing need to worry that much about being gamed. It's just not worth the effort.


yup totally agree, but I think it's nearly impossible to compete with Google and make something much better for certain types of queries (e.g. factual info, location queries, etc). They have too much data and too much of a head start.

The best way to compete with them IMO is to choose a slice of queries that they perform very poorly at (like non-SEO spam results) and build something 10x better. But that brings up the challenge of getting users to remember to come back to your search engine.

It's tricky for sure, with no clear answer. But for the first time in a long time, I sense that we're approaching a tipping point for Google's dominance over the web.


> it's nearly impossible to compete with Google and make something much better for certain types of queries (e.g. factual info, location queries, etc)

Some of these things can be outsourced to other services. An easy Google fallback is also a common solution for new search engines.

> But that brings up the challenge of getting users to remember to come back to your search engine.

This wouldn't really be something I'd build as a business. I'd create a nonprofit, fund it with subscriptions or donations, and cover the losses myself. I really just want it to exist for myself, and if other people used it, that would be icing on the cake.


>which is to create a human-curated whitelist-only search engine.

You might be interested in watching the series "Halt and Catch Fire".


> "Halt and Catch Fire"

Another fan!


I'm not interested. Why would I be?

I also don't need fiction to know what a human-curated search engine would be like, because I was using the web in the 90s when we had things like Dmoz.


Well, you can be a curmudgeon and not enjoy someone's recommendation for a decent bit of entertainment, or you can say just politely carry on and be a grouch in the living room of your trashcan, or you could maybe try something new and actually consider watching something you haven't seen yet.

edit: softened the tone oh so slightly


Sorry, I misinterpreted your suggestion as, "This is not a novel idea, and you should watch how badly it goes when someone on this TV show tries the same thing."

I didn't insult you or your suggestion, though. I said I wasn't interested in the show and that it doesn't inform my understanding of the concept.

Why did that hurt your feelings enough to make insulting accusations about my personality and character?


"I'm not interested. Why would I be?"

That just reads as a rude way to respond to me. "Why would I be?" Because you just described a plot line of a show. It might be fun to see how someone used that plot to make a show that someone reading HN might actually enjoy.

Both sentences combined also reads as "I am perfectly content in my ways and will not listen to anything anyone might kindly suggest because I know all". Maybe because it's election season, but this unwilling to listen and stonewall is quite tiresome. So maybe just rubbed me the wrong way because of everything else and might not been taken the same way at a different time?


Fiction maybe helps to see another point of view


I like your idea of the search engine. It's what I miss from Google. At one time (2006?) we had the option to google search only forums and discussions, and that helped me get quality answers to my queries.


On Kagi you as an individual can up rank and down rank websites (even hiding them entirely).

They can conceivably use this information to do the same thing, curare the results and change their ordering


It has been for a long time.

PageRank was never designed for adversarial scenarios.

It reminds me of KPIs like lines of code - it's only useful if it cannot be manipulated.

"When a measure becomes a target, it ceases to be a good measure."

https://en.wikipedia.org/wiki/Goodhart%27s_law


I'd say rather that it wasn't designed for modern adversarial scenarios. Google won the search engine wars because PageRank did better in the late 90s adversarial environment, where every page was packed with white-on-white keywords. It was significantly harder to create a counterfeit influence graph than to keyword spam with hidden text elements.

That all seems pretty quaint these days, but even worms and viruses at the time were versions of "my goodness, who would ever write a script that emails itself to your address book, or copies itself across the network to all the other unsecured PCs with passwordless full hard drive access?"


PageRank is gameable but harder to game than some of the competing algorithms (such as a straight link count.)

The real consequence PageRank had was that it got web pages to stop linking to each other…. Google gaslighted people into removing the competition they could have had navigating from one page to another through links. (e.g. web directories)


Google didn’t gaslight the web: Web directories died out because they go stale, fast, and require far more human curation than is economically viable. I remember Yahoo effectively deprecating their directory pages long before I heard of Google, back when I was using Dogpile and CompuServe.


I never saw a web directory that took a serious approach to using automation to curate the directory. It's probably more feasible than people think because it's a matter of classifying links up or down (relevant or not) which is a much more ontologically and mathematically tractable problem than learning a ranking function or trying to classify things into one of N>2 categories.

I think the problem may more have been the lack of a sustainable revenue model. Getting volunteers to curate the directory is particularly destructive because the people who most want to volunteer either (1) want to promote something or (2) want to get paid so somebody else can promote something.


> I never saw a web directory that took a serious approach to using automation to curate the directory.

That's what search indexes are. Automatically curated directories.

It's just that the UI to the index is natural language queries instead of clicks through a graph, because the latter isn't scalable to large corpuses.


No, I think a modern directory would be a set of topics in an ontology with links. If I had to seed one I would suck all the external links out of Wikipedia, impose some organizing structures (probably overlapping trees or dags) and then build a set of classifiers for nested topic relevance, spaminess, etc.

There are certain sites which have a landing page for every topic in some set of topics, you could add a lot of links quickly if you built rules for importing links from particular sites. Adding 10 sites a day with 1000 links each would be very possible, in 100 days you could build out a million links.


Exactly how do you think Google (and Bing, and ...) work? They do start from known indexes, in particular wikipedia. Hell, Google even has internal papers where they claim they improved search quality by "wikipedia's" as a unit.


Actually Microsoft bought a company called Powerset which did information from Wikipedia to build a "semantic as in semantic web" index, that technology became the heart of Bing.

Google was caught flat footed and wound up buying and killing Freebase in order to catch up. They lied and said they rejected "semantic web" approaches despite hiring one of the leaders of the Cyc project as their head of research.

Still there is a difference between exposing that kind of database through a full text index vs exposing it through a browsing interface.


I consider Reddit to be a form of web directory in a way.


I can't use it though because I've got this disability that memes cause me extreme distress.


That sounds like a meme.


Every idea is a meme in the classical sense.


Panik!! Kalm. Panik!!


It's not quite the same as the web directories of old, but I find the links and content contained in the "Awesome [topic]" lists on GitHub to be a useful resource. The curation aspect is managed via pull requests just as any other open source project, although this doesn't make it immune from going stale.


That's a good example and they aren't overrun with spam.


Indeed, after Google, links slowly disappeared and no one could really "surf" the web anymore.


Was researching Ad spend as the ad-tech industry is falling apart. https://imgur.com/gallery/1Aw3t4i This is not a search result anymore. This is an outrage.


There's maybe a good argument there for blackbox (non-interpretable) ranking algos.


How would you really blackbox a search ranking, though? A website owner can always just search for themselves, and see how their ranking changes.

When there’s so much money at stake, people will go to great lengths to reverse-engineer the things that put them higher in the page, no matter how you try to hide the levers that make the rankings work.


I mean blackbox in the sense that I can't explain how it works or look under the hood and understand it. Many machine learning models you want to be explainable for UX or debugging (e.g. Netflix's "because you watched X"). This is a rare case where it's better if you can't figure out how it produced a result.

Instead, you want to throw queries and user behavior into a blackbox algo and have it tell you a result then give it feedback on whether the result was good (did the user come back and ask the same question? did they click the top result and leave or did they have to come back and click through many more?). I think this is kind of how google works now, though results are frequently meh. Millions of backlinks will get you noticed but your ranking will just keep dropping if users don't appear to find your content useful (e.g. they hit the back button a lot and keep searching).


>(did the user come back and ask the same question? did they click the top result and leave or did they have to come back and click through many more?).

What stops me from writing a bot to simulate that behaviour for my website (or websites of my competitors) ?


Google has a big bag of tricks to gaslight webmasters, they even have patents that describe some of them.

One form of personalization that they applied early on is move results you click on a lot up in your results you can’t trust the serps you see at all.

They also inject a lot of arbitrariness and randomness to make it impossible to make small changes to your site to incrementally improve it the way you would improve an advertising campaign. One reason the web seems so frozen is that if you have a successful site and make major changes to the layout, titles, link structure, etc. you will possibly trigger the ‘chaos monkey’ and wreck your ranking and it may never recover.


Personalized results. If you Google yourself, you just see what you see. But you can never know what others will see.


> Personalized results. If you Google yourself, you just see what you see. But you can never know what others will see.

That just turns it into a statistical problem.

I don't think you can escape the problem of a search engine being used as an oracle.


Ranking has many many signals, which are mixed in non-linear ways, and are dampened and vary over time. You only get low-resolution low-frequency sampling of the result.


One option is to create a 'slow search engine' that only re-ranks pages once a year, doing the computation during the year and then updating the ranks all at once. You could evaluate the ranking before releasing it and patch obvious exploits, or even patch out exploits after the yearly release using data from before the release. This should slow the info leak to a crawl


Then you get SEO witchcraft consultants who charge an arm and a leg for what they promise is a secret but oh-so-good method of improving page rank.


The current state of the world!


That looks a bit too much like 'security via obscurity' to suit my tastes.


In a word, no. I’ve spent a lot of time on SEO over the past couple years, and inbound links still matter a lot to search rankings and traffic. This is clear evidence that PageRank still matters.

From a more macro perspective, I’ll believe Google is failing when a competitor starts eating their lunch. What I see right now are a bunch of would-be competitors who want to eat their lunch, including this company. The blog post is probably best understood as aspirational rather than descriptive.

As a user of search, Google results are frustrating at times, but is that because “pagerank is over?” Or because it’s an incredibly hard problem they’re working on? Google does not have to be objectively perfect to keep succeeding, they just need to be better than other search engines.


maybe I'm slow today, but I didn't get from their long rant what did they replace page rank with? I get it, ads suck, SEO spam sucks, PR is gameable, serving the user is noble. But, how did they solve the actual problem of ranking the search results?


I think they're more like trying to grab several of the crumbs that Google left behind. I don't think it's possible for a paid search engine to eat Google's lunch.


Yeah, the devs stated again and again, that they don’t plan or expect to be a mainstream search engine.


I've been using kagi exclusively for the last 3 months. I pay $30 because I feel it's worth it and I can afford it. While I tried to use duckduckgo, I found myself reverting to google on every other search. With kagi, I never had to, and I have built myself a set of useful filters based on what project I'm working on. This, along with switching to mastodon, means that I almost never encounter ads anymore. I honestly feel like I got my "internet" back. I didn't realize how much the google search suggestions would taint my day to day.

I whole heartedly recommend kagi. My favourite feature is the "blast from the past" that shows results that are not online anymore, but links to web.archive.org.


Congrats! I’ve recently started the journey towards more decentralization and kagi seems to be the next logical fit. So far I’ve self hosted backups with s3 replication and next cloud over tailscale has been amazing but I’m still trying to figure out social media.

How does the workflow look like for sharing an album of pics with friends and family (who are on traditional social media platforms)?


If you don’t mind piping data through cloudflare, you can use cloudflare access + tunnel and a self hosted photo app of your choice (I use photo structure).

Cloudflare access will handle auth and can use family members’ gmail or popular emails for authentication. It will also do https for you.

If you want the data encrypted from cloudflare, I’d probably install tailscale on family members’s machine and do ACL. Friends are gonna be challenging


We just use whatsapp / dropbox / adobe bridge / icloud. Pretty transient, but in a way that's fine. My father is a photographer and makes a lot books, so we have tangible artefacts.


You can't compare the good faith web from 1996, which consisted of a bunch of nerds blogging, to today's web. Everything of any importance in our society moved online, and much of it needs monetization. And since on average users won't pay a cent for anything, ads it is. We very much have a role in this outcome ourselves.

Even in a hypothetical situation where ads would no longer be the driving force of algorithms, something else will. As soon as something gets large enough, it will be gamed. If not for commercial reasons, it might be cultural/political influence.

I will end by reminding ourselves that us techies should spend some more time with ordinary folks. I agree with everybody here that Google Search has been getting worse and worse for years now, especially for our niche searches.

It's a mistake though to think that this is widely experienced as such. My mum looks up an unknown ingredient in a recipe and within a second sees a picture of it. That means it works. All kinds of personal data might be shared in the process but since you can't really see that, it didn't happen. From her point of view, Google Search works extremely well and is close to magic.

The point being, Google doesn't give a shit that you don't get the best answer for your query on JavaScript closures. Nobody searches for that, and those that do, block ads.


What makes you assume non-techies aren't feeling like the quality of results is lower? Plenty of people moving their search needs to reddit, tiktok, even pinterest seems to indicate that it _does_ affect them - they just don't articulate it as "the google algorithm yada yada"


Reddit search is garbage. What people do is they search on google “headphones_model review reddit” on Google which shows them somewhat less fake information than the SEOd blog posts that would come up.

Sure maybe it would be good if Google prioritised these results by default but that would just result in reddit becoming more manipulated since whatever the default search shows will be gamed.


If not priority, at least fix the actively-misleading results page. So many examples, choose your own adventure.

One that annoyed me today: on mobile, reddit search results will all say like "3 days ago" or "6 days ago" and then the actual link takes you to a six year old post.


Personally, whenever I want to search anything about TV shows (particularly the most recent ep), games (particularly sticky ones like rimworld or factorio), I quite literally search directly on Reddit, bc the Google results are never relevant


This says more about a shift in the way of consuming content, and the quality of the content than the search results quality/algorithm itself. People are tired of fluff and want direct quick answers; Reddit, TikTok, Pinterest, etc, all provide that.

People don't want to search for a bread recipe, click to see it and be forced to scroll through 3 folds of ads and the history of bread in eastern europe, to get to the list of ingredients.


> People are tired of fluff and want direct quick answers;

When didn't people want "direct quick answers"? Google got popular because that's what it used to give you - the answer you want used to always be the first or second...


Physical cookbooks are still quite nice to have around for this reason.


I would (and have) buy a cookbook on Amazon before sitting through recipe ads. It’s as bad as porn hub.


I, for one, look actively for services I can pay for.


Year ago I had a realization. I make solid money. My spare time is very limited. Why am I spending hours trying to save 20 dollars.


It's such a strange phenomenon. My well-paid tech friends scoff when I say I pay $9.99 (annually, not monthly) for my little podcast app [0]. Meanwhile they have YouTube Premium. I don't get it.

[0] https://pocketcasts.com/plus/


See I haven't even made it there. I'm in a household that subscribes to Netflix, Prime, Disney, Spotify, and several Twitch streamers, yet I'm enough of a cheapskate that I somehow still slum it with free-tier Youtube and Castbox.

At least I throw my preferred YouTubers a few bucks a month on Patreon, but that feels different since I'm paying them more or less directly for the work they do, rather than buying myself a marginally better experience from Google.


Yeah, just pirate the content from the streaming services & pay people directly where you can.


> Even in a hypothetical situation where ads would no longer be the driving force of algorithms, something else will. As soon as something gets large enough, it will be gamed. If not for commercial reasons, it might be cultural/political influence.

While this sounds true, I think it actually needs motivation.

As far as I can tell, this type of analysis typically bases itself on only a few examples.


I still want a search engine that highly ranks pages with no ads. Get us back to those 1990s days a bit more.


> The more you tell your assistant, the better it can help you, so when you ask it to recommend a good restaurant nearby, it’ll provide options based on what you like to eat and how far you want to drive. Ask it for a good coffee maker, and it’ll recommend choices within your budget from your favorite brands with only your best interests in mind. The search will be personal and contextual and excitingly so!

Actually Google is already doing this. Results are personalized and contextual. It can't really know what I feel like eating this evening because I don't, but it can guess.

I applaud the author for trying, but I don't see an alternative to PageRank being proposed. How exactly is Kagi proposing to rank results?

And no, I don't want an AI generated summary when I search for the best tutorial to do X. I want a list of tutorials. The question is how to rank that list, and I've yet to see anyone do a better job than Google.

Google is far more than PageRank. Its an AI ranking model that has the largest training dataset (queries and clicks).

Ads may suck, but simply charging for the same service isn't really innovative. Would I pay for a version of Google without ads? Probably not. But that's just me - I actually like to know who is advertising for particular searches. A company with an ad budget to rank at the top of ads is probably more trustworthy than an anonymous website.


>Ads may suck, but simply charging for the same service isn't really innovative. Would I pay for a version of Google without ads? Probably not.

Google already does this with YT; YouTube Premium[1] offers you YT experience without ads plus extra perks. And YouTube Premium has apparently more than 25 million subscribers in US alone[2] and 80 million subscribers globally[3]. My thinking is power users are ready to pay for ad-free experience and casual users probably not because they are not heavy users.

>But that's just me - I actually like to know who is advertising for particular searches. A company with an ad budget to rank at the top of ads is probably more trustworthy than an anonymous website.

A lot of spammers and fraudsters see advertising as their most effective gateway and tactic to scamming people so I wouldn't count on reliability and safety of all ads that you see.

[1] https://www.youtube.com/premium

[2] https://www.statista.com/statistics/1261865/youtube-premium-...

[3] https://blog.youtube/news-and-events/youtube-music-premium-8...


I find YT premium fully worth it; the ads are horrible, and totally inturrupt watching something - enough that I pay, as I couldn't stand it otherwise.

Google, less so - but I pay for Kagi as I like the results better, and prefer how they say they're tracking me.


That's no really relevant to Google or Kagi.

YT Premium is a reaction to the infeasibility of subsidizing the extreme cost of hosting video streaming with ads alone. The scale of ads needed to cover hours of streaming video vs. ads to cover search results are magnitudes in difference, and people are more willing to pay to make those streaming ads disappear.

Even so, with a paid option, the vast majority of YT users still use Youtube w/o the Premium subscription.


YouTube ads are much much worse interstitial style ads that won't let you see the video until you watch an ad. That's why Netflix does well too. I don't think it's a relevant comparison.


I'm paying for YouTube Premium and you get YouTube Music for free + ad-free too. I've started paying because ads were annoying on my smart TV YT app and I already used YT Music.



> Google is far more than PageRank. Its an AI ranking model that has the largest training dataset (queries and clicks).

That doesn't necessarily make it better for HN users. A more selective dataset could make it more useful to specific niches, like say, HN users?


I think the point is precisely that PageRank is not being used anymore.


Oh but it most likely does use PR. Where do you think they get the results from? Are they scraping the web themselves? I highly doubt it. Results probably come from Bing.


It’s about time. Google results are so full of promoted and sponsored content that you must use tricks to get past it. Even so most of the best things are no longer findable via Google even if you know what you are searching for!. I’ve been using Kagi for a few weeks now and really like the results. I signed up for a paid account to support their progress.


Cool but why do I need to do a login to even give it a test drive? Immediate turn off.


Because they only allow 50 searches per month in their free tier. You can check how the UI looks using the "Example search results" area in the home page.


I find myself still using Google for searching the web a lot, however, maybe 2% is the classical website search.

It’s mostly to find an image, a video, a location on maps, etc.

I wonder if Google could turn things around by rethinking the search experience. They’re still the fastest first stop on your journey to find something on the web. But the behavior has changed dramatically that a change in their UX alone could really make a big impact. Maybe moreso than just an algorithm change.

I once redesigned the search experience of a large global ecommerce company. The UX changes alone grew their revenue quite a substantially and reduced customer complaints.


I don't think Google even uses PageRank anymore, do they? So, the title is correct, but not super meaningful. It sounded like the battle cry he's sounding is more like "The Age of Search Advertisement is Over", so why not use that as a title?

By the way, I've been a satisfied Kagi customer since the day the beta ended, and I have nothing bad to say about the service. Okay, it would be nice if they remembered I like to view temperature in Fahrenheit, but whatever.


Most search engines suck because people find it too hard to do the work of tuning the parameters. There is a methodology to do that

https://trec.nist.gov/

but it takes a huge amount of training/test data to do it and takes a lot of computational work after you've got that data. What TREC revealed is that most of the things that would obviously improve search relevance don't, and it took 5 whole years of 20 teams working on it before a useful discovery was made.

There was an article about TikTok that revealed just how wrongheaded the viewpoint of the current web is. As much as Google fetishizes data, the data collected by sites like YouTube is useless because they offer you too many things to click on so you click around like a chicken with its head cut off. There are maybe 5 things that you like out of 20 that they show you, but which one you click on is random so the signal is mostly noise and worst of all they can't come to the conclusion that you didn't like any of the other 19 things they showed you.

TikTok shows you exactly one thing at a time so the opinion that they capture is meaningful.

It reminds me of one of the first "learning to rank" papers where I talked the management at the CU Library to let the Thorsten Joachims group run our search engine and we realized just how poor of a signal you get from search engine usage and how challenging it is to feed it back to improve your results.

Really marking up judgements for all the results that turn up and using "pooling" to add new results to the original set when your search engine is essential to make really better search. Otherwise you can be just another one of those guys who posts to Medium about the "semantic search" engine he built but can't tell you if the results are any better than any other search engine.


Any link to the tiktok article?


The people who used to make niche sites that helped people and linked to other sites moved to youtube.

Plenty of excellent youtube channels that 10 years ago would've been websites.

I don't really have a problem with this, all things change.


But then, you can't really search youtube as efficiently as you can texts. That's the only downside for me, sieving through tens of videos just to find something that might be in the middle that's releavant to what i'm searching.


That’s true but there’s also a steep positive benefit in information density and fidelity.

Compare written instructions for e.g. fixing a car vs a video of a person showing you how to do it. Latter is much more helpful


It really depends on the task, and the best approach is often to combine written instructions with a video.


That's when the video solves your problem. But how do you find the video that solves your problem? You can't skim through it quickly like you can with text.


> In the future, instead of everyone sharing the same search engine, you’ll have your completely individual, personalized Mike or Julia or Jarvis - the AI. Instead of being scared to share information with it, you will volunteer your data, knowing its incentives align with yours.

This is true IFF this software is open source and running on my own server. Hard to see how this can happen.


Oh, it's an ad.

Since Google turned to the dark side on August 9, 2006[1], ranking by links weight has been hopeless. But the alternatives are not much better.

[1] https://www.google.com/press/podium/ses2006.html


I started using Kagi and Orion a while back and it has changed how I used the web. Both the products Kagi offers has only increased my productivity and removed the internet junk that I am used to seeing on the web. A note on Orion, the ability to use extensions from both Firefox and Chrome store is why it is admirable, the cherry is Mac integration and its similarity to safari with the tree-view tabs.


I’ve been using orion for a while, but sometimes with a couple of YouTube tabs open the bottom of my M1 MacBook Air gets uncomfortably hot.


There was a recent bug fixed, specifically on YouTube, due to ad-blocking causing a constant loop fetching and reloading an icon or something to that end, I believe - it was fixed in a recent release.

I suspect that was what you were seeing, and if so, updating should have addressed it.



This is marketing. That's all it is. It may as well be called "Search engine alternative to Google declares Google's secret sauce is dying".

This reminds me of posts from 10 years ago where "the future of search of social" (this post says "user-centric" instead). Even Google feared that outcome (given the then-meteoric rise of Facebook; oh how the turns have tabled). Personally I find the musings in this post to border on nonsencial.

But here's the interesting part: certain ideas are sticky just people want them to be true. The core one here is that "search is broken" or even the lesser "Google is getting worse". It's one many on HN like to push. I don't think theres any quantitative empirical evidence of this. It's people wanting it to be true and you'll see it on every thread about DDG (as one notable example).

Silicon Valley is littered with the corpses of "Google killers" who seem to have all fallen into different versions of this trap.


But Google search results ARE not satisfactory. The existence of Q&A forums in the age when information is broadly online (and thus is accessible to Google) indicates a need that is not answered by Google.

Google's algorithm may be the best out there, but it still falls short of what is needed. Anecdotally, I often find it fails to conduct word-sense disambiguation, handle negation or answer edge case. My general impression is that Google's algorithm focuses on topical similarity (pages that talk about the query's topic) rather than answering the query.


>In the future, instead of everyone sharing the same search engine, you’ll have your completely individual, personalized Mike or Julia or Jarvis - the AI. Instead of being scared to share information with it, you will volunteer your data, knowing its incentives align with yours. The more you tell your assistant, the better it can help you, so when you ask it to recommend a good restaurant nearby, it’ll provide options based on what you like to eat and how far you want to drive. Ask it for a good coffee maker, and it’ll recommend choices within your budget from your favorite brands with only your best interests in mind. The search will be personal and contextual and excitingly so!

This is exactly what Google is trying to do....unsuccessfully. I doubt you will have the scale to outperform Google if you try to replicate Google's AI model and replace ad based search with subscription based search. My point is don't do what Google is doing, think differently.


The difference is Google is trying to do this with the data extracted/mined/tracked from the user (pulled), while the model proposed in the article is based on users volunteering their data to the search engine (push) because incentives are aligned and they trust it. A sbtle difference, but the latter can produce vastly better outcomes as the quality of the data would be presumably much higher. (auther here)


You said >In the future, instead of everyone sharing the same search engine, you’ll have your completely individual, personalized Mike or Julia or Jarvis - the AI.

This is what Google is doing under the hood and they have the upper hand because they collect absolutely everything about the user. Yea there is a lot of noise but their advantage is enormous because theoretically the more they know about you the better search results you will get.

I respect the policy of letting users decide what data they want to hand in(volunteering their data) but I don't see how this can scale in the long term because users most probably don't even know what they know and they don't know what exactly they want.


I agree that automatically mining user data is much more scalable then building a relationship with aligned incentives, in which users would volunteer it to you.

My argument is that this is exactly the main reason for the mess on the web we are in now, and that the way forward is not doing more of the same thing that got us in this place, but doing the opposite, regardless of how hard and not scalable it is. Alas, we will at least try.


> theoretically the more they know about you the better search results you will get

Better from whose point of view? ;)


user's


The author quotes Page and Brin's original paper:

> “Currently, the predominant business model for commercial search engines is advertising. The goals of the advertising business model do not always correspond to providing quality search to users. For example, in our prototype search engine one of the top results for cellular phone is “The Effect of Cellular Phone Use Upon Driver Attention”, a study which explains in great detail the distractions and risk associated with conversing on a cell phone while driving. This search result came up first because of its high importance as judged by the PageRank algorithm, an approximation of citation importance on the web [Page, 98].

A search for cellular phone now returns a page of links to plans, and one link to Wikipedia.


Why does everything in the future-web have a price tag attached? If anything I want less business middlemen on the web period. I don't understand why funding through direct sales will give a better product than funding through advertising:

"And yes, the non-zero price point will mean you have to budget it with your other costs. But faster access to higher quality information will make you much more competitive globally, so you can decide if the investment will be worth it, like any other purchase you make. This will in turn incentivize these products to be even better, a positive feedback loop driven by entirely aligned incentives."

For businesses I think this effect is only driven by competition regardless of revenue model.


> I don't understand why funding through direct sales will give a better product than funding through advertising

The more the economic incentives of a business are aligned with the customer's interest, the better. Of course, it's not perfect, and business still might have an incentive to show ads on top of charging you, but it's certainly much better than having hugely misaligned incentives as with Google and social media today.

> Why does everything in the future-web have a price tag attached?

Because it uses real resources like humans and hardware that cost real money in the real world. You don't ask why you need to pay money to buy an apple at the grocery store, do you?


I hate this argument. It is economic dogma ... and it's bullshit. Why? Bread in Jerusalem: the law of demand and supply doesn't exist for the vast majority of goods.

Jerusalem is a city like any other. Everybody has food. You can buy bread there. And while the city has big problems, not many people are starving (not quite zero, but which city can say that). Economic theory dictates that bread would follow the law of supply and demand. Is that so?

Let's say the price of bread halves. Will tomorrow see more bread sales in Jerusalem? Here's the critical piece of evidence: NO, it won't. It won't even shift from one kind to a fancier kind of bread. Let's say the price doubles. Same thing. We're in a ZERO price elasticity regime. Sure there are prices where supply and demand would work, but we're far away from those prices. But it gets worse.

Let's look at supply. Bread is incredibly cheap to make. There is at least a 80% or so margin on bread (it's quite expensive, or profitable on the other side, if you look at it from a COGS perspective). Let's say the price of bread doubles (you could argue it has, many times even). Will more bread be produced in Jerusalem? NO. Again there are prices where we'd see supply tightly tied to prices, but we're very far away from those prices.

BOTH on the supply and demand side we see ZERO price elasticity. If you look up in your economics book you will not find this situation, and even in academia people aren't exactly comfortable discussing it. Price elasticity is ZERO for bread, both on the demand and the supply side, if you belong to the economists that at least split that up.

So what makes the difference? You can make a good living selling bread, but you can't attract customers using prices ... nor can you change your profits much by playing with prices (in fact the very act of trying out prices will lose you a lot of customers). So something else needs to make a difference. What determines which bread you buy? Google does (and Facebook, and ...)

And this applies not just to bread. This applies to the vast majority of foods. One of the big complaints in European countries these days is that entire economies are running headfirst into this problem. Even labour! People can't change jobs (supply of labor is regulated by employer demands, or outright regulated with laws) and employers can't change the pool of labor by much by raising wages. Unless you're exceptional or outside of the norm (yes, programmers are, so this is the wrong forum to make that particular argument), you cannot change your income much by providing more labor! Given "no skills" labor even the difference between working and not working ... is not very large and dropping.

This, obviously, enables FANG companies to extract the profits of normal businesses. They can choose who gets the profits of many products, and demand a growing share of the profits ...


I hate to seem condescending, but the "why" is basically what the essay is about for the first 75%. The issue that advertisements have is that the incentives and desires of advertisers usually misaligns with users, sometimes greatly. It's not a concept that's particularly new; for example, take Wikipedia's reasoning for not accepting advertisements. (I'd like to ignore the unrelated issues with Wikipedia funding here, since it's not relevant to this particular point.)

https://en.wikipedia.org/wiki/Wikipedia:Funding_Wikipedia_th...

I don't believe direct sales fixes everything. There's no single party whose ultimate wishes are fully aligned with what's best for everyone. I do think, however, that direct sales and donations are preferable to ads in this regard.


I could have been more clear: I don't understand why the non-advertising business's interest is aligned with that of the user. I follow the idea that advertising creates a conflict of interests, but I don't follow that not advertising aligns interests. The business is there to sell a product for the most profit. Making money by selling the product doesn't by itself give them an incentive to produce a better product.


If they don't make the website provide value to me, I will stop paying them. If things are popping up that are irrelevant to my search, that's not helping me. If it's not helping me, I stop paying, and they feel a hit. It's probably not a big deal if a single person does it, but if enough people do it, then they either change to get those people back (or replace them with new customers), or they go out of business.


I don't entirely follow, though; the idea is that you want the people who pay you to be satisfied with what they're paying for so that they continue to do so, convince others to do so, and are willing to increase their spending. If that wasn't the case, there wouldn't be a conflict of interest for advertisers in the first place.

There's reasons that the incentive to keep the users satisfied may not be followed, but it's mostly tangential, and relates more to other forces than it does where the money comes from.


> Making money by selling the product doesn't by itself give them an incentive to produce a better product.

Selling more product / continued subscriptions is the incentive.


If you want less middlemen on the web, you are probably not in favor of advertising. Price tags only involve a middle man of the payment processor. Advertisement involves ALL of the advertisers on the platform.


Do you think that running servers, writing code, maintaining systems happens for free? How else is the service supposed to fund itself?


Doing things requires hardware, which uses electricity and needs software which costs man hours.


What a completely disconnected-from-reality article.

There are so many ways to use Google to obtain the information you're looking for, and so many people do it literally on a minute-to-minute basis that it's flat absurd to call that ineffective.

It's trivially easy to learn something by typing a question into Google; maybe when writing an article try gut checking it against obvious observed reality first.

I simply cannot fathom a noble reason why the author would decide to publish this.

Edit: Oh wait I found it; the author is Vladimir Prelovac, CEO, Kagi Inc. What does Kagi Inc. do? I'll leave that as an exercise to the reader, but I bet you guess it in one try...


> Edit: Oh wait I found it; the author is Vladimir Prelovac, CEO, Kagi Inc. What does Kagi Inc. do? I'll leave that as an exercise to the reader, but I bet you guess it in one try...

Point taken. But it's not as though this was concealed. The top line of the page is "Tales from Kagi." The piece is hosted on blog.kagi.com. The author's signature at the end of the piece reads "CEO, Kagi Inc." The final line of the post is "I hope you join us [i.e., Kagi] on this journey."


It just bothers me that this blatant SEO attempt is an article about how SEO ruined PageRank.

Maybe it doesn't bother you, just thought it was worth calling out explicitly.


> blatant SEO attempt

It is marketing and trying to influence the discourse, but I don't see how you can call it SEO.


Writing blog articles like this and posting them to social media to create crosslinks is a very common SEO strategy to "trick" PageRank into promoting your site higher in search results.

It's the whole purpose behind all of those random blog articles on corporate sites; to improve PageRank ranking.


I have seen a lot of SEO content spam, and this article does not come across as that to me.


Search peaked 10 years ago. There's lots of regression in Google Search. For example, you used to be able to search images by specific dimensions. You can do this today on Yandex. The quality of results and promotion of sponsored results, the scraping and displaying of site contents so users visit less (how is this legal btw?) Not respecting verbatim searches.


About '08. Spam took over right after that. Used to go in waves of better/worse as they played cat-n-mouse with spammers, but they seemed to kinda give up right around then.


That's not really what this article is about though, it's about how PageRank doesn't work anymore because advertising is at odds with returning valuable results, which is complete nonsense for a gigantic subset of the value that Google offers.

Also it's written by a CEO of a competing search company. It's literally the very SEO he's railing against...


Which part is disconnected from reality?

Google is a horrible search engine, you routinely have to hack it to get searches not linking to the centralised web. The first 5 results are ads, followed by Youtube links or a Quora/Reddit on the topic.

If you used the web 20 years ago you know what a good search engine was like.


For me, almost everything that's not coding related, and that's after all the SO clones etc have been removed from the results, is SEO "optimized" at best and SEO spam at worst. Most websites that show up on Google follow the same recipe of headlines, quotes, stock images and repetitive content and annoying introduction paragraphs that place them at the top of the list.


> There are so many ways to use Google to obtain the information you're looking for

Especially if you're looking for Google users' private information, and you're a business partner of Google's.


Unrelated to this from the pricing page[1] of Kagi:

> It costs us about $1 to process 80 searches.

This caught me by surprise. It would be good to know if this is pure hardware and bandwidth cost, or if they are factoring in all the costs associated with running a business. I wonder if costs for Google are also on the same level.

There team's plan seems to quite expensive considering we are so used to free (ad supported) search.

[1]: https://kagi.com/pricing


"We Are Building A Web Where The Default Is Social" -- Mark Zuckerberg, 2010

He then described how people will first do their usual _web searches_ via FB search bar. To my knowledge, that didn't happen. And he has since pivoted to "the future is private".

[0] https://techcrunch.com/2010/04/21/zuckerbergs-buildin-web-de...


Hot take. As long as ads are clearly indicated, and non-promoted results are easily accessible and usable, I don't have a problem with Google relying on ad funding for their main development efforts. The amount of behavior they have made accessible to anyone for free has produced an immense amount of qualitative societal value. IMO this far outweighs the annoyance of ads present on various search results.


The problem with ads isn't that they're present on screen (though that can get annoying), it's that their presence causes a fundamental misalignment of incentives. A paid service without ads is primarily incentivized to provide value to the customer. An entirely ad supported service is incentivized to provide value to advertisers first, potentially to the detriment of the users, and to users only to the extent that it brings in more ad money.


I think this article is missing the point that the technology of a solution is different from its intrinsic motivations.

PageRank could have been fine if the intrinsic motivations were to educate, enlighten, and help people.

Just because a search technology is wrapped up to look like a personalized AI doesn't mean it is immune to the same intrinsic motivations.

Who is to say that Google won't release a 'Personalized Search AI' that is free for the user and it continues to follow the same motivations that their PageRank solution does.

I think custom AI's tuned for each person will be the defining technical foundation for the next few decades, but I'm extremely concerned that the fundamental business drivers that compete for sales, view time, and divisiveness will not be resolved.

Nothing will stop an authoritarian government, an ad driven company, an entertainment company, or a political research company from giving out 'free' personal AI's.

If the history of the Internet has anything to teach us then it is this:

The free ones will always be more successful than the paid ones.


I've always wondered why Google doesn't have some way to provide an explicit signal from the users like favorite/bookmark or explicitly like/dislike a search result.

It's often pretty fustrating to try and find something you know you've seen before but don't remember quite the name of or exact keywords.


Google added some "plus" button when Google+ was pretending to be a thing. It made no sense though as how would someone know it's a good result before they visit it?


Yeah I think a browser frame button like "favorite" is what is needed.

Ideal would probably be face/eye tracking to check attention/response but thats way too creepy to work.


Brave browser uses "attention" as the main factor. It's currently used for their paying the content creators and charging for ads, but with their new search engine [0] it may be a factor there too.

[0] https://search.brave.com/


> Ask it for a good coffee maker, and it’ll recommend choices within your budget from your favorite brands with only your best interests in mind.

It sounds all great and I agree that we should be prepared to pay for quality services instead of expecting everything to be free (so I will have a look at kagi). But this promise makes me skeptical: Twitter is apparently going to lose money on their "$8/m for half the ads" deal with US users because the ads make more money than that for them. Sure, paying for your services is going to mitigate the incentive problem, but it might not fully eliminate it. Of course, kagi has a reputation to lose with its users, so that is another line of defence, but I guess the best way to build such trust would be to be maximally transparent, even about the incentive structures.


It's an easier sell than $20/month, for example, especially since the idea of paying for search is already a hard sell for most people. Even if Kagi eventually fails, it'll be an interesting experiment and finally gives all the people who repeat "if you're not paying for it, you're the product" an opportunity to put their ideology into action.


Honestly not sure if search engines are the future at all, at least for commercialisable content.

More and more I find myself searching through moderated social media like Reddit or Facebook Groups/Marketplace for product recommendations or local services.

Gave Kagi a go, but it seemed to be even worse than Google for what I tried.


This feels like a consequence of a centralized, gardened web, not because Reddit / FB are inherently superior experiences.


Isn't FB marketplace just another search engine? It seems unlikely that the algorithm would be able to purely guess what you currently (actually) need.

And moderated groups don't sound like a solution to the 'I have a concrete question that I want an answer for right now' problem either (unless you're thinking of massive live chats with many lurkers, which sounds more like the web of the past, tbh).


i think general purpose search engines will always have a place, but they're ultimately a tool you use for finding information, not necessarily a tool you should be expecting to make decisions for you.

so many of the complaints about google search results seem to be along the lines of "i asked google what X i should purchase, and they served me an ad", whether that's a first-party ad on google or search results that are external ads. If you're only attempting to use a search engine to find facts, it's a much more satisfactory experience.

search results for opinions are and have always been trash - they don't serve you the best opinion, or the most trustworthy, or the most well-reasoned. they serve you the opinion most relevant to your search terms. and that's probably not what you want.


I wonder how Kagi’s crawler gets around the Cloudflare protection that many sites are using today.


PageRank was never as important to Google as Google said it was, the idea of indexing keywords that appear in link text was much more important.

It took years for peer-reviewed papers to show any benefit from PageRank at all, part of it is that a real relevance function has to balance the keyword x document influence vs the document influence and you don’t come out ahead ranking an irrelevant document highly if it has a high page rank. (E.g. a popular document that is irrelevant is… irrelevant)

If you believe the original paper, PageRank is simulating the density of a random walk over web pages and Google has been able to sample that density directly w/ Google Analytics, Chrome browser telemetry and all their other web bugs.


I've been a beta user but the 10$/month price tag is too steep for me.

That being said, I'm beginning to think that having a price tag as entry barrier is a good thing for a search engine.

One of the reasons for Google's downfall is that they had to cater to dumb and easily controllable masses.

Also, the aggressive SEO that made the search results so bad made economical sense as soon as those masses flocked at the platform.

Kagi may only stay this good if it does not become too popular. I hope the developers think about the fact that their best strategy might be focusing on their current sizeable niche market of nerds and intellectually curious people.


I've used Kagi as my primary search engine since they went live and I've been extremely pleased so far. Funnily enough, I only use !g now when I want to get Google Shopping results (for price comparison) or look at product reviews.


PageRank was all Larry Page. The cited article was about the search engine.

https://patents.google.com/patent/US6285999B1/en


As other comments have noted, PageRank didn't come out of a vacuum; some history is relevant: https://en.wikipedia.org/wiki/PageRank#History


Technically the title is correct since Google has deprecated PageRank for years. But it doesn't looks like the point of this article since it uses the majority of its texts on criticizing ad supported search business model in favor of subscription models which PageRank has no business at all. You can use PageRank (or not) regardless of the business model right? I'm still not sure what it is trying to achieve by bringing PageRank to the table. If you substitute "PageRank" with "Google" then it actually makes more sense though.


I have always imagined PageRank to be modeled after how academic papers are judged. What cites that publication, and who that publication cites, are quite important information to put a publication in context. For the same reason, I don't think PageRank will go away.

Two improvements that I'd like to see in search results are: (1) the ability to exclude particular domains from search results; and (2) the ability to over-weight certain referrers (i.e. a link from Encyclopedia Britannica is worth more than a link from Wikipedia, which is worth more than a link from MySpace.)


>I have always imagined PageRank to be modeled after how academic papers are judged. What cites that publication, and who that publication cites, are quite important information to put a publication in context. For the same reason, I don't think PageRank will go away.

You are correct but in the scientific community there are no armies of spammers who persistently try to outperform other scientists by writing fake research papers and spamming citations. There are cases of fraud here and there but not at the scale of the Internet or to be exact the World Wide Web(WWW).

>the ability to over-weight certain referrers (i.e. a link from Encyclopedia Britannica is worth more than a link from Wikipedia, which is worth more than a link from MySpace)

Google probably already does this but exactly how nobody knows because of their blackbox algorithm policy. Speaking about this suggestion; it depends on what you search query is....you need to assign specific weight values to certain referrers in relation to search queries from X,Y,Z categories.


Post states that Google search is getting worse, "...the quality of search started to deteriorate", and links to this article to justify the claim:

https://dkb.io/post/google-search-is-dying

Big discussion here and on Reddit when that article was posted. It doesn't set out to say Google search is worse/getting worse, but that people don't trust its results as much as human generated content (thus appending 'Reddit' to queries).


If Kagi gets popular it would have the same problems as Google today has. They’re unavoidable.

You can mitigate it with curation, but then you’re bound to niches and bias, which is true with automation as well. No victory here.


I've been using Kagi on and off for the last few months. I often forget about it since I can't set it as my default browser in Safari.

But in general, especially when looking for specific phrases or trying to find discussions related to a topic, it's working extremely well. Whenever I used it, I was happy with it, and the search result quality was better than in Google.

It's good to note though that iirc Kagi does actually use commercial search services by Google/Microsoft behind the scenes, in addition to their own custom components.


> since I can't set it as my default browser in Safari

Wait, what? It boggles my mind that technologically inclined people continue to use this user-hostile, walled garden nonsense.


They have a Safari extension. Have at it!



> As some websites became more prominent thanks to their page rank ... > their publishers also realized they could monetize the traffic > they started receiving.

This is a non-sequitur. Publishers could monetize their traffic without PageRank, it is not a necessary condition for the publishers to realize they could monetize the traffic. This thesis statement is the crux of the rest of their argument, which makes the rest of this not make any sense.


Can you share some insight on how/if Kagi is different from Neeva[1] in its approach?

[1] https://neeva.com/


Can the technology behind GPT-3 be used to find relevant articles given a query?

E.g. instead of training it to generate sentences given a prompt, train it to generate URLs given a query (?)


I would hope that GPT-4 or 5 will obviate the need for this entirely. Just ask the LLM your questions and it will answer.


I wonder how they will fight SEO, though...


Google doesn't use page rank anymore. They have a signal called navboost which is the strongest signal in ranking and retrieval. And that idea originates from Yahoo paper in which they revealed the gems of search engine algorithms: https://www.kdd.org/kdd2016/papers/files/adf0361-yinA.pdf


To me it looks like they broke search to only return mainstream stuff for any search with any connection to an official narrative. Pagerank is the opposite of that. That could be why Sergei and Larry aren't doing much work these days with Google. Larry is in Fiji and hasn't been heard from much in the last two years.

IMHO, Yandex feels like the old Google. It returns lots of small sites for most searches and does way better on controversial topics.


"With the inevitable advancement of our civilization"

unless advancement is understood as necessarily attendant upon linear time unspooling, it is no ways inevitable


Kagi is indeed a great search engine. Why oh why can't it just be an option on the Safari search engine preference panel? Maybe the login.


You can set as the default search engine with a Safari extension


Seemed like a perfect opportunity for Apple -- they could give their customers an actually useful search engine subsidized by their hardware sales. At the same time they could starve google and bling of the user data provided by customers using search in safari or siri.

Instead they are going the other way. Sad.


Comparing Apple Maps and Google Maps, I don't have much hope of this working out the way you imagine it.


Apple maps started out poorly but these days I prefer it to google maps.


This begs the question:

Given Some social media platforms are moving towards some embrace of AI feeds, could those platforms retain their 2nd search engines role such as Youtube, Instagram, etc?

My own biased use case is that I started following a whole bunch of designers on Instagram to get new design news.


Been a paying customer since sometime last spring and am very happy with Kagi. I give it two big thumbs-up.


The OP over-reaches in that PageRank is just a small ingredient in Google's ranking, which covers many other components from TFIDF score to anchor text features and yes, PageRank. Naturally, Google used PageRank as a differentiator in its marketing, but technically. Bing has been known to use >150 features to score the relevance of a page, given a query, so one would expect Google to exploit a similar order of magnitude of evidence in its ranking.

> Yet, despite being acutely aware of the dangers of ad-supported search, selling ads was adopted as the primary business model of the new search venture just a few years later.

Google was successful as a superior search engine, pushing out AltaVista and similar, earlier alternative search engines and Yahoo!-style portals curated by humans. However, Google wasn't a successful business for quite some time. Eventually, they adopted (some people would say "stole" - there was a lawsuit) Yahoo!-owned Overture's ad model, which changed everything. Yahoo! owned Overture, and Overture had a critical patent. Yahoo! made one critical mistake: they settled the lawsuit for relatively little money. The rest is history.

Now many people complain about decaying search result quality levels. That just means there is space for a new search engine, how exciting! The good news is it has never been easier and cheaper to start a full-text index of the Web and associated search. For about 50k (a Xoogler's estimate, not mine) you should be able to get going. Sites like Gigablast show it can even be done as a one-man show, which I would not recommend (to many complexities in "small" bits even HTML to plan text conversion, load balancing, incremental inverted index updating etc. - all requiring nowadays some specialist expertise in a game where you can't afford to reinvent the wheel because you don't know the scientific literature/state of the art). The one thing that is hard to get is initial user traffic. But I think HNers will be happy to give each new engine a try!

In summary, I think there never was an "age of PageRank". But you may say Google Web search is past its prime. Perhaps Google could change that if they wanted - it may be that it isn't much of a priority at the moment, hard to say (they are (too?) big now).

Edit: Here, I've interviewed Shadi Saleh, the architect of Syria's search engine (if you think it's impossible to get up and running with a small team): https://irsg.bcs.org/informer/2019/07/syrias-first-web-searc...


Thanks for posting the interview with Shadi Saleh. I found it interesting they were able to mitigate sanctions by using FOSS software. I wish Saleh had gone into more depth on the hardware side of things which I imagine is also complicated by sanctions.


Still see SEO spam beating quality websites.

https://kagi.com/faq#Where-are-your-results-coming-from

"Our searching includes anonymized requests to traditional search indexes like Google and Bing as well"

Lol okay


I don't understand why advertising-based revenue models are bad for consumers. The most profitable ads are those that are the most relevant to the consumer. It leads to advertisements that are most likely to provide commercial information that the consumer finds valuable.


The blog post credits Google founders for PageRank but probably the credit should go to Baidu instead.

Check the history section: https://en.wikipedia.org/wiki/PageRank


The history section mentions the work of Baidu’s founder as somewhat related to PageRank in that it scores based on links. Work of others are also mentioned in the same section as “influential” for the development of PageRank.

If we just trace back bibliographic references to decide on the inventor of PageRank, it will probably be Archimedes.


Yeah if Archimedes built a competing search engine that uses link ranking algorithms sure.


I will always remember a commercial wherein some shitty marketing company promised to hack Google's algorithm and get you on the first page.

So what destroyed the relative purity of search engines and the early web? Money. How do we solve it? Money apparently.

Yeah good luck with that.


Google is not using raw PageRank anymore, they use some sort of machine learning, which is overly optimized based on flawed abusable SEO techniques and probably user count, rather than actual value and authenticity.


Purely anecdotal, but as a matter of fact yesterday I spent 15 minutes searching for a web site on google (I couldn't remember the domain name), threw up my hands, went to duckduckgo, and nailed it on the first try.


The article suggests that search engines will become a personal choice. I rather think that good search engines will become such a competitive advantage that companies will buy search engine access for their employees.


>In the future, it is likely that if the current mainstream search engines want to survive, they will have to go back to their roots, dismissing ads as their primary business model

What an optimistic perspective.


There is a simple test to see if a search engine acts in your interest: Seach for “pornhub” with any NSFW filter off. Kagi fails to list the obvious result.

I’ll stick with DuckDuckGo, thank you.


I tried, the first result is Pornhub. What are you expecting?


> it costs us about $1 to process 80 searches.[0]

I am genuinely curious how did Kagi arrive at this cost.

[0]: https://kagi.com/pricing


I saw that too. Seems pretty high. I also think their plan of hoping future users search less on average is probably bad. I would want users to use my product more and I definitely wouldn't want to have an incentive to make users use it less. If the company is basically default dead unless usage drops... That seems really bad.

If I were them I would be focusing on: How do we get to 8k searches for a dollar?


I’m guessing they’re running NLP inference on some GPU server that costs X per day and can process Y pages. I would guess that it’s the indexing that incurs the most of the costs, not the actual queries performed by the users - though I could be wrong.


I was curious and went to their Pricing page. Their link to pledging "5% of its profits" to supporting non-profit organizations for a more humane internet leads to a dead AWS page


Oops.. The correct link should be https://help.kagi.com/kagi/company/donations.html


A bit late in my response, but thank you for this. It's really cool to see Fairplay included in here. They are doing great work


What an incredibly pointless and long ad disguised as a blog post.


Is there a way to provide feedback to Kagi that a link isn't relevant in their search? Searching "us election results" for example gives me the results from 2020.


There is a 'send feedback' link at the top right on desktop


Why would you create a search engine that looks exactly like every other search engine? And further, specifically Google.

Why would you not take the chance to completely reimage search into something way better than the absolute random crap we have today?

Whatever the next web discovery engine looks like, it'll look and feel entirely different.

It won't be the equivalent of someone saying, "Hey Facebook is bad, look at MY social network!" and it being the exact same layout and user experience as Facebook, with different margins, padding, fonts and colors.

Whatever comes next, it'll be like comparing Twitter to Facebook, or TikTok to Instagram.

As the kids used to say, this ain't it, chief.


At this stage, the chief complaint of most prospective users is the quality of search results from the market leader, rather than the layout or user experience. It's a 5-person company, and I'd rather they focus their efforts on the search algorithm.


Simply because you need to get the basing things right like the correct infrastructure and it is a safer bet, specially when you are bootstrapped. The article lays out the vision for the future of the search that Kagi wants to build.


This reads like this is the death of all network centrality algorithms, and no way, guys, no way. At most this application will somehow decline even if only for a while.


Their pricing page says that it costs them around 1$ to serve 80 searches. I really wonder how they arrive at that number. That seems shockingly expensive somehow.


Something I've always wondered about:

How about personalized pagerank search that assigns any URL I've bookmarked a high value for E?

Is there a reason no one has done this yet?


Is there a search engine that just omits pages with surveillance tracking scripts? Could it be done?

Then you can check google or a mainstream engine when needed.


Yes, Kagi has that option (it is called non-commercial lens). It also shows number of ads/trackers for search results.


So is Kagi a stand alone search engine or is it like every other Google alternative that just pays Bing for API use?


Off-topic rant: Why is it so hard in 2022 to go from your blog to your main page?


The way iOS search is done by Kagi is a little creepy. You have to load an extension and allow it the ability to inspect everything. It then intercepts searches and redirects the browser to Kagi. I’m sure it’s what’s necessary to add a search in iOS, but still a sad compromise. Regardless the search results from Kagi seem much less SEO spammy which is nice!


Safari does not allow custom search engines, and this is the only way possible, also used by any other search engine with an iOS Safari extension.

Besides, Kagi Safari iOS extension is full open source: https://github.com/kagisearch/Kagi-Search-for-Safari-iOS


FYI the "subscribe to kagi" button links to kagi.com, not to the subscription page.


The age of orcs has begun


I love Kagi! Been paying for the past 3 months.


Is this a clone of bearblog (is bearblog a clone of this) or do they share an open source?

EDIT: I'm dumb. Please do not reply or I will Sylvia Plath myself.


> Over the years, the web deteriorated to the state it is in now - a highly destructive force.

You can’t be serious with a claim like this


Reject modernity, embrace tradition: we're going back to user curated indexes like it's the 90s!

Time to join a web ring.


My favorite was LinkExchange. An interesting little success story too [0]. We should also bring back the term "webmaster" so when people ask what you do, respond, "I'm a webmaster".

[0] https://en.wikipedia.org/wiki/LinkExchange


I do wonder how unique web searches really are. It'd definitely take some time to achieve a critical mass but I swear most of the stuff I want to look up I could do with an index although sites come and go so the larger your index gets the more upkeep it costs.

I think the biggest issue is live news but I mean then just pay like 12 people to watch news/sports networks and you won't be that far behind. (Or just have an section of the index for live information and let users to go nfl/espn/oan/cnn).


StumbleUpon!


it amuses me greatly that 30 years in to the great commercialized internet experiment, we still don't have durable mechanisms for publishing, spam filtering, identification, reputation, moderation and discovery.


Too bad Kagi shows Amazon shopping results, from what I hear that's a horrible company for its own employees.

Otherwise I might have given it a shot.


Kagi gives you the option of blocking results from any particular domain, if you like.


There are a lot of public articles stating that PageRank is no longer in top signals any of the mainstream search engines for more than a decade now. The patent already expired years ago and no one cared. It's surprising CEO of a search engine company just woke up to this. The future of search is AI model-as-index infrastructure.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: