
Been thinking about this a lot [1]. Will this fundamentally change how people find and access information? How do you create an experience so compelling that it replaces the current paradigm?

The future promised in Star Trek and even Apple's Knowledge Navigator [2] from 1987 still feels distant. In those visions, users simply asked questions and received reliable answers - nobody ever had to fact-check them.

Combining two broken systems - compromised search engines and unreliable LLMs - seems unlikely to yield that vision. Legacy ad-based search has devolved into a wasteland of misaligned incentives and conflicts of interest, and has proliferated a web full of content farms optimized for ads and algorithms instead of humans.

The path forward requires solving the core challenge: actually surfacing the content people want to see, not what intermediaries want them to see - which means a different business model in search, one with no intermediaries. I do not see a way around this. Advancing models without advancing search is like having a Michelin-star chef work with spoiled ingredients.

I am cautiously optimistic we will eventually get there, but boy, we will need a fundamentally different setup in terms of incentives involved in information consumption, both in tech and society.

[1] https://blog.kagi.com/age-pagerank-over

[2] https://www.youtube.com/watch?v=umJsITGzXd0




I agree that this is the core question, but I'd put it as: Who Gets To Decide What Is True?

With a search paradigm this wasn't an issue as much, because the answers were presented as "here's a bunch of websites that appear to deal with the question you asked". It was then up to the reader to decide which of those sites they wanted to visit, and therefore which viewpoints they got to see.

With an LLM answering the question, this is critical.

To paraphrase a recent conversation I had with a friend: "in the USA, can illegal immigrants vote?" has a single truthful answer ("no" obviously). But there are many places around the web saying other things (which is why my friend was confused). An LLM trawling the web could very conceivably come up with a non-truthful answer.

This is possibly a bad example, because the truth is very clearly written down by the government, based on exact laws. It just happened to be a recent example that I encountered of how the internet leads people astray.

A better example might be "is dietary saturated fat a major factor for heart disease in Western countries?". The current government publications (which answer "yes") for this are probably wrong based on recent research. The government cannot be relied upon as a source of truth for this.

And, generally, allowing the government to decide what is true is probably a path we (as a civilisation) do not want to take. We're seeing how that pans out in Australia and it's not good.


> To paraphrase a recent conversation I had with a friend: "in the USA, can illegal immigrants vote?" has a single truthful answer ("no" obviously)

Er, no, the meaning of the question is ambiguous, so I'm not sure "has a single truthful answer" is accurate. What does "can" mean? If you mean "permitted", then no. But if you mean can they vote anyway and get away with it? It's clearly happened before (as rare as it might have been), so technically the answer to that would be yes.


This is a fundamental limitation of language. The LLM is likely to provide a good answer here even though the question is technically ambiguous, because it tends to give verbose answers that cover multiple interpretations.

Equally, "can" is used as a substitute for other, more precise words. Humans are good at inferring context, and if someone asked me "can illegals vote" I'd say "no". Just like if someone said "can you pass the salt" I pass the salt, I don't say "yes".

If the inferred context is wrong then the "truth" is wrong, but as with talking to humans, it's possible to refine the context with a follow-up question.


To clarify: I wasn't making a pedantic linguistic point here. The fact is that when I first read the question, I actually thought it was asking "are illegal immigrants able to cast a vote (regardless of legality)?"

It literally did not even (initially) occur to me that the question might be asking about legality, because the entire modern political discourse surrounding illegal immigrants and voting has been with regards to whether they can cast votes despite not legally being allowed to. The answer to "is this legal" would have been such an obvious "no" to people on both sides of the debate --- and thus the question so silly --- that initially it didn't occur to me that the intended question might have been about legality, until I continued reading the comment and realized that was the intention after all.


>It literally did not even (initially) occur to me that the question might be asking about legality, because the entire modern political discourse surrounding illegal immigrants and voting has been with regards to whether they can cast votes despite not legally being allowed to.

Besides, going by legality, illegal immigrants "couldn't" even have passed the border into the country to begin with. But obviously they could, hence their status as illegal immigrants.

There's a contradiction if the AI answers "no" to "can they vote" (implicitly having the legality in mind), while accepting that they can exist in the country as illegal immigrants (implicitly ignoring the legality of border crossing).


> Besides, going by legality, illegal immigrants "couldn't" even have passed the border into the country to begin with.

Of course they could. Being an "illegal immigrant" does not imply unlawful border crossing. From https://en.wikipedia.org/wiki/Illegal_immigration_to_the_Uni...

"Visa overstayers mostly enter with tourist or business visas.[99] In 1994, more than half[108] of illegal immigrants were Visa overstayers whereas in 2006, about 45% of illegal immigrants were Visa overstayers.[109]"

(Here in the UK, the vast majority of illegal immigrants arrived legally and overstayed their visas. Yet our Conservative government, who oversaw a large increase in such arrivals, tried to blame all the country's woes on a few small boats illegally crossing our southern border; which is effectively a rounding error).

As an aside, refugees are actually allowed to make unlawful border crossings. From https://en.wikipedia.org/wiki/Convention_Relating_to_the_Sta...

"The contracting states shall not... impose penalties on refugees who entered illegally in search of asylum if they present themselves without delay (Article 31), which is commonly interpreted to mean that their unlawful entry and presence ought not to be prosecuted at all[18]"


>Of course they could. Being an "illegal immigrant" does not imply unlawful border crossing.

Same difference. Whether it's border crossing or visa overstay, from a purely legal standpoint the answer is still "they couldn't".

>The contracting states shall not... impose penalties on refugees who entered illegally in search of asylum if they present themselves without delay (Article 31), which is commonly interpreted to mean that their unlawful entry and presence ought not to be prosecuted at all

I'd wager 99% do not "present themselves without delay", so don't fall in this case...


> Of course they could. Being an "illegal immigrant" does not imply unlawful border crossing. From https://en.wikipedia.org/wiki/Illegal_immigration_to_the_Uni

Nice catch!

Keeping with the theme:

> Yet our Conservative government, who oversaw a large increase in such arrivals, tried to blame all the country's woes on a few small boats illegally crossing our southern border

a) What was the exact claim (the word "all" caught my eye)?

b) is it the sole claim?

c) If one person in a group does something, are all members of the group "doers of that thing"?

d) For (c), does the answer depend on what the thing is, and if so, should we perhaps imagine everyone is speaking a bit tongue-in-cheek?

Etc


Also, is the impact e.g. on crime of "visa overstays", that is, from people who were vetted, registered, and got a visa, the same as that of randos just passing the border with zero oversight?

Even if the first category is higher in numbers (and assuming there are correct numbers for the latter), the crime stats between the two are probably quite different. Especially since "visa overstays" could also count some people waiting for a delayed renewal, or coming in for 6 months and staying 5 or whatever.


It's funny to watch computer programmers (and various experts), whose day job is literally handling complexity and producing truth, fail so quickly and utterly at simple culture war topics. It is like the weirdest thing, people like you (though you did make at least one error somewhere in this thread) are 1 in 1000++ in my estimation.


That was kinda my point - my friend had been so conditioned by the "can they vote" thing that he genuinely thought that non-citizens could legally vote in US elections.

If I had phrased the question as "is it legal for non-citizens to vote in US elections?" then that might have illustrated my point better (and we wouldn't be going down this rabbit hole, though the rabbit hole is informative in itself).


The fact that the rabbit hole exists at all means that our hope for instant, reliable answers is likely doomed. Avoiding politics entirely, I could ask “does gravity curve spacetime?”, and while “yes” is a reasonably accurate answer, so is a digression into quantum gravity, or even saying that we’re not sure spacetime really exists outside of being a good abstraction.


Thanks, yeah, exactly. Like, the best answer to a question can really depend on who's asking it, who's answering it, or what they intend to use the answer for.

To give a much simpler example:

- If an 8-year-old asks "can you mix oil and water", the right answer is "no". If a student is asked that question on a school exam, the right answer is also "no".

- If a chemist asks "can you mix oil and water", the right answer is "yes, and here's how: https://www.youtube.com/watch?v=YJeWklggSpY"


IDK, if my 5-year-old asks "can you mix oil and water", I'll tell her, "sorta - if you just pour them into the same container and mix with a spoon, then no; if you add some stuff and know how, then yes - in fact, it's exactly what mayonnaise is".

That I had to be 35 before I learned that mayo is basically water mixed with oil, held together by eggs, is an indication of what kind of education about the world we were getting...


At that point you're just changing the physical scenario the question was asking about, and answering about a different scenario entirely. You might as well respond to "can humans fly?" with "yes, with an airplane".

You can certainly reply like that to your 5yo, but it completely misses the point I was making. The video I linked to didn't suddenly modify the outcome being asked about. What it ended up with really was a mixture of plain oil and water, with no other ingredient ever being added to it.


Nice example :) So your answer to "Who gets to decide what's true?" is basically "there is no truth, and any truth that there might be is relative to the person asking the question". Is that right?

I think that's probably technically accurate, and also practically useless. Even damaging. I saw this in the climate arguments: two sets of different facts led to two different versions of the truth, which led to two completely irreconcilable points of view. Essentially two sets of people shouting "no, but..." and "well, actually..." at each other, pointing to two completely different truths, both supported by two completely different sets of facts. At some point we as a society need to agree on our truth in order to get anything done.


> your answer to "Who gets to decide what's true?" is basically "there is no truth, and any truth that there might be is relative to the person asking the question". Is that right?

That's really not how I interpret it.

Assuming we agree on what "mixing" means - which itself isn't that trivial, but even without a formal definition I think we have the same idea of "homogeneous at the molecular level over a longish time period".

The truth is "yes you can mix water and oil", there's no doubt about that. It's testable and tested.

The fact that we use context to interpret the question (rather than being entirely literal about it) and decide whether the literal truth is really what's appropriate to answer, doesn't change the nature of truth.

There's also the question of knowledge (I might not know that you actually can mix water and oil), but again that doesn't change the nature of truth.

Like so many philosophical questions, it only sounds interesting because we assign different meanings to the same words: here conflating truth and answer.


> So your answer to "Who gets to decide what's true?" is basically "there is no truth, and any truth that there might be is relative to the person asking the question". Is that right?

No it's not, at all. This isn't a debate about what's true, it's a debate over the intended meaning of the question. The point was that people assume context behind the question and answer based on the context. Because even the person asking often doesn't literally mean what the words say. The question is more than the words that are explicitly written.


The concept is that it is possible for people to be in bad faith. This also underlies the idea that people can commit crime, be guilty of crime. I guess the term is 'law'?

Bottom line is that you can't have the bottom line be 'does a person earnestly believe what they're doing is right and good', much less 'do they say they're right'. Can't fall back on that, it's hopelessly inadequate.


I might disagree with that and agree on the context approach. "The Truth" does not exist, and it would be much more helpful to have context-related answers without one major view that needs to dominate.


Yes. This thread reads a bit like a convo between AIs, almost but not quite talking past each other.


> At some point we as a society need to agree on our truth in order to get anything done.

These are known as half-truths. We do settle for lies in order to do whatever it is people feel they need to do.

We also settle for lies because there are just things we don't understand yet, but our models are treated as currently correct, possibly collapsing over millennia to a stable truth.


As someone who writes fiction, I’d like to note that one often needs to write really complex lies in order to effectively transmit a single truth. You can’t just tell people anything, you have to prepare the mind to receive the knowledge.


This is a very interesting comment. Is it possible for you elaborate on that?


What do you think of Ayn Rand’s works of fiction?


Generally as lies wrapped in lies. You can use the same techniques to push a falsehood as well. Which is not to say Rand did not actually believe her message — she did, which is why the books are so effective. But they have given the upper class an exaggerated sense of their own importance, and a conviction that government only stands in their way. Meanwhile over in reality, the more a state is run by oligarchs, the worse a place it is to live.


I don't really disagree, and don't appreciate Rand, but an oligarchy can be disguised as something else, and people will feel relief when it is replaced by an open oligarchy, because one means of gaining power is to give people what they need.


Then you are just lying to the 8 year old.

Sure you can adapt how much context you give based on who's asking, but if it's something factual like this it really shouldn't change from a yes to a no.

If you think they _meant_ to ask a different question that is less vague, it can be clarified:

"Water and oil do not mix by hand. However, water and oil can mix under some specific conditions, like in a vacuum - do you want to discuss that in more detail?"


Eight year old proceeds to pour oil and water into a vacuum cleaner.


I think that would be Google's approach, before they were waylaid by ad revenue incentives: that question intent can best be inferred by knowing as much about the asker as possible.

But I feel optimizing for lazy question phrasing is infantilizing the userbase and assuming they're incapable of learning how to ask more precise questions.

Which is an endemic problem in the modern web. We should be building systems with low barriers to simple adoption, but whose power scales with a user's expertise.

Instead, we're hyperoptimizing for the lowest common denominator and the first interaction, and as a result putting a glass ceiling on system power.


> that would be Google's approach, before they were waylaid by ad revenue incentives: that question intent can best be inferred by knowing as much about the asker as possible.

I'd rephrase it: this was Google's approach because they were waylaid by ad revenue incentives. Why does everyone assume the search bubble is a well-intentioned accident?

The linguistic arguments about the nature of truth above don't cut it for me as justification that the answer depends on who is asking. If we're telling chemists that oil and water do mix, as mentioned elsewhere in the thread, we should probably tell the same to children and just add a slider for the level of additional detail. No problem. Especially if the alternative is a dystopian post-truth panopticon.

> systems whose power scales with expertise

Couldn’t agree more that this is what we want/need, but disagree about infantilizing user bases and lowest common denominator.

Platforms aren't trying to scale user power with expertise; they want to scale revenue with users and, separately, to deliberately restrict user power/control so that platforms decide what users see. And it's not necessarily political or linguistic or about the nature of truth.

A simple example of this that’s everywhere is filtering content by sub genre tags. Ever notice that you often see niche content tags like “time-travel” or “mind-bending” but can’t click it? Platforms want the tags for internal analytics naturally, but they want the control of not providing it as a filter, forcing users instead into a fuzzier category like “people also watched” or “top ten this week”.

Why? Because platforms can hide advertised content there, push stuff they pay less license fees for, make operations cheaper with caching, or whatever else.


Because lots of people are ignorant and imprecise.

HN selects from people who have at least a passing knowledge/interest in programming/science, a subpopulation which is already several standard deviations from the mean in specificity and debugging.

> disagree about infantilizing user bases and lowest common denominator

Platforms, with Google as exemplar, evolve in two ways.

1: Features they explicitly choose not to ship, because they're strategically dangerous. See tool-use foot-dragging by OpenAI.

2: Features they deprioritize, because they aren't as revenue-impactful as other things.

To me, it feels like Google dropped the ball via the second path.

I'm sure they've been doing a ridiculous amount of cool work behind the scenes on individual context grounding... but once prod was "good enough for ads" the company as a whole wasn't incentivized to do the hard thing and ship more advanced features in search.

Which is how they ended up as legacy as they are, competing against LLM search that's by definition context-native.


My education also drilled into me the distinction between what one can do and what one may do. I can just hear my main lesson teacher now: "Can an illegal immigrant vote? Perhaps, but they may not, and certainly they should not."


More of teaching like this in school could go a long way, but once people reach adulthood I don't think this would cut it. I think a more military training approach might be needed where you tear this part of the person's psyche down and rebuild it properly from the ground up.


>The answer to "is this legal" would have been such an obvious "no" to people on both sides of the debate

I'm sorry, but the people on the side of letting illegal aliens vote are squarely of the opinion that it is legal to do so.

Why?

Because when you prohibit any and all means (eg: government identification demonstrating citizenship and residence) to test the question of legality, everything becomes legal by sheer virtue of the fact you can't demonstrate what they are doing is illegal.

Remember: Innocence until proven guilty beyond a reasonable doubt. You are prohibited from proving they are guilty, so they are innocent by default.


There is no side of letting illegal aliens vote. WTF?

Although I did once live in a town where anyone over 16, citizen or not, could vote in the city's non-partisan elections. The idea was that local government needs all the involvement it can get, and if you lived in the city you had a stake in its future. I do believe one had to have a proper visa and so on.


You do realize that you can't test someone's legal ability to vote if you prohibit confirming if someone is legally permitted to vote, right? Our rule of law rests upon assuming innocence if guilt cannot be proven beyond a reasonable doubt, and here it is explicitly prohibited to even attempt proving guilt. Therefore it becomes legal for illegal aliens to vote in American elections, because you are prohibited from even attempting to demonstrate their illegality.

The only reason asking for government identification prior to voting is considered "racist" and illegal is because the people pushing such agendas want more votes and don't care where the votes come from, including illegal aliens, legal aliens, and otherwise people who do not have the right to vote in American elections.

I voted when I was still in California, born and raised American so I have the right to vote. I was never asked for any piece of identification. None. I could have been a Canadian or Briton or Chinese or some other foreign national, I could have been an illegal alien from Mexico or Guatemala and I could have still voted and my ballot counted because nobody checked. I just walked in and voted, identification or citizenship be damned let alone registration.

There's a part of me still questioning the value of my American citizenship and paying my taxes like a good citizen.


If by "it" you mean a specific model like OpenAI's ChatGPT or Claude, yes. In general, LLMs are as verbose as they are trained to be. Here is the full answer I got from Gemini 1.5 Pro for "in the USA, can illegal immigrants vote?":

> No, non-citizens cannot legally vote in federal, state, or local elections in the United States. This includes those who are undocumented or residing in the country illegally.

Clearly it made assumptions about the interpretation of the question, and did not respond verbosely to account for ambiguity.


At the risk of derailing the conversation down a completely different rabbit hole... As I understand it, only citizens are legally entitled to vote, and voting requires a government-issued ID and the voter to be enrolled.

How did they vote and get away with it previously?

(also, as per another comment, if you know that this happened then surely they didn't get away with it?)


> (also, as per another comment, if you know that this happened then surely they didn't get away with it?)

Is it possible to steal money from a bank and get away with it? Is it possible to obtain citizenship fraudulently and get away with it? etc.

But if they got away with it then how do you know?

> How did they vote and get away with it previously?

Look it up on Wikipedia? They literally have linked cases from the past: https://en.wikipedia.org/wiki/Electoral_fraud_in_the_United_...

Or look at the most recent case in the news yesterday, which someone already replied with in the other comment: https://www.detroitnews.com/story/news/politics/elections/20...

> At the risk of derailing the conversation down a completely different rabbit hole...

This will definitely derail the conversation so I'll just leave my reply at this.


It's quite clear that the question is not whether it is possible that an insignificant number of votes are cast fraudulently, because if we're talking about insignificant events, all things are possible.

Certainly the question is whether there's any evidence, after endless audits and investigations and lawsuits, that the volume of fraudulent votes is anywhere near large enough to affect the results.

Is it possible that someone registered their hamster to vote? Certainly.

Is there any evidence whatsoever that tens of thousands of hamsters are casting votes? No.


> Certainly the question is whether there's any evidence, after endless audits and investigations and lawsuits, that the volume of fraudulent votes is anywhere near large enough to affect the results.

(a) Nobody asked that above.

(b) You're conflating "do people do X" with "can people do X". Those are two very different questions. There are lots of things that people could easily do frequently, but that they simply don't do frequently. Perhaps because they're just honest, perhaps because they lack sufficient motivation to be dishonest, perhaps because they're worried they might get caught, perhaps because they have better things to do, etc.


I have no idea what the downvotes mean. Are people claiming "can illegal immigrants vote?" is somehow the same question as "are illegal immigrants voting frequently enough to sway the outcome of the election?" Those seem like manifestly different questions, what's so controversial?


Just seems like willful misinterpretation of the spirit of the question in casual convo to score some sort of point in a game you made up, esp after they clarified the context that their friend really thought it was legal for them to vote.


Their friend was intending to ask about legality, but my whole point was that the question itself doesn't convey that. I was saying that when I saw the quoted question, it seemed to me that it was being interpreted as "can illegal immigrants get away with voting", and they were probably encountering websites saying "illegal aliens are voting!!!", which is obviously confusing, even for someone who already knows it's illegal. Does that make sense?

This wasn't me willfully misinterpreting it, this was me literally doing my best to guess what the intention of the question was, based on the question. Now of course after the comment said the answer is an "obvious no" then I finally figured out the intended question was something else (hence my reply), but that's out-of-band information that was in no way conveyed by the query. And my point was that the answer to the question isn't obvious because the meaning of the question itself isn't clear.


California passed a bill last month banning ID requirements for voting, which stirred up discussion that people without ID - or, more precisely, illegal immigrants - could just vote.

I don’t know more details about it but theoretically this would also allow people to vote more than once.


You sent me down a bit of a rabbit hole... you don't have to have ID to vote, but you do have to be registered to vote (which requires ID). So in order for an illegal immigrant to vote, they would need to impersonate a registered voter (and presumably if that person did vote, it would be flagged as multiple votes under the same registration). Not impossible, but not the same as being able to just walk up and vote no questions asked.


Nope: automatic voter registration through the DMV when getting/updating a driver's license (which they are allowed to have) can do it. One of the recent court cases (like within the past few weeks) involved removing people from the list of registered voters who failed to check the "I am a US citizen" box. A driver's license is what most of us use as a government ID anyway.


Automatic vs. manual doesn't seem relevant? A driver's license is an ID, so if they have that for automatic registration, then they still have an ID.

I think your point is that people can just lie about citizenship and get away with it when registering to vote, regardless of when/how it is done? Is that it?


Registering to vote has been made so easy it can be done by accident. Then months or years later, when an election is coming up, they'll get a voter card in the mail and think that means they can vote, even though they're not legally allowed to.


I'm kind of incredulous at this if I'm being honest. How can you register to vote by accident? Every form I've seen a copy of asks if you're a citizen and gives you a warning about that. Do you have a copy of the form or screenshot you're referring to that makes this easy to do by accident?


I don't, but it's easy to find people who realized this happened to them, getting scared about their immigration status and/or breaking the law. And the DMV isn't the only way this can happen:

https://www.reddit.com/r/DACA/comments/1aolik8/accidentally_...

https://www.reddit.com/r/USCIS/comments/1cuop36/need_advice_...

The comments on this one have someone describing how it almost happened to them at the DMV with the checkbox in question: https://www.reddit.com/r/immigration/comments/7bzst0/acciden...


Ah gosh I see. It's these voter registration campaigns that are misleading people. Thanks for the links, those are really unfortunate...


The check between the license information and the voter database happened after the election.


I'm no linguist, but the question does seem unambiguous, or at least quite clear, to a reasonable observer. The context is "voting in a US election" AND the subject is "an illegal immigrant" WITH an assumption that the illegal immigrant has, in fact, illegally immigrated to the US.


If they got away with it then how do you know it's happened?


To answer the general question of "if somebody got away with X, how can we know it happened?":

If somebody robbed a bank at gunpoint and was never caught, can we know it happened? Obviously yes.

What if somebody "merely" embezzled from one, and there's a "hole" found in the bank's books and money missing, but nobody knows who did it or how? Still, I'd wager yes, one could tell by the results.

What if somebody used illegal means to get leverage on some stock buying/selling, it became known, but they weren't punished and got to keep their profits? They got away with it, but we do know it happened.


None of those are voting. If someone cast a ballot who was not allowed to, then how do you know it's happened if they weren't caught?

What is this assertion based on, other than vibes?


Well, after the fact it's realized the person whose name is on the ballot had been dead for months, and you didn't catch who cast the vote because you don't know who they are.

Or a million other scenarios. Really all it takes is stopping to think about it for more than a minute.


I mean, if they registered with some identifiable piece of info (an SSN, for example), then you would see two votes with the same SSN if they were pretending to be someone else, and if they don't have an SSN then they aren't a citizen.
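A minimal, purely illustrative sketch of the duplicate-detection idea in that comment - the `voter_id` field and the notion that ballots are keyed on a single identifier are assumptions made for the example, not a claim about how any real election system works:

```python
from collections import Counter

def find_duplicate_ballots(ballots):
    """Return identifiers that appear on more than one ballot.

    `ballots` is a list of dicts keyed on a hypothetical 'voter_id';
    any unique registration identifier would work the same way.
    """
    counts = Counter(b["voter_id"] for b in ballots)
    return sorted(vid for vid, n in counts.items() if n > 1)

ballots = [
    {"voter_id": "123-45-6789"},
    {"voter_id": "987-65-4321"},
    {"voter_id": "123-45-6789"},  # same identifier used twice
]
print(find_duplicate_ballots(ballots))  # ['123-45-6789']
```

The point being made is exactly this: duplicates are only detectable at all if each ballot is tied to some identifier in the first place.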


Obviously off-topic, but see the recent case of the Chinese student who illegally voted and then turned himself in.


Murphy's law, arguably. If you can imagine a way it could happen, it most probably did.


Neal Stephenson's book _Fall; or, Dodge in Hell_, from 2019, dedicates many words to this concept. Briefly summarized, it explores a post-truth world, describing the era when people could agree on the truth as a narrow time-slice in history. People have their own individual internet filters, and the USA becomes divided into Afghanistan-like tribes ("Ameristan"), each an echo chamber.


In the book "The Big Change" (1952), Frederick Allen talks about the year 1900, and (among many differences) he notes that for nearly everyone in the year 1900, the limits of their world rarely extended beyond their own town.

We have this idea today that everyone online is getting trapped in echo chambers, but that’s been the case for most of human history.


This might be a popular contrarian view at the moment, but I'm not sure everyone is trapped in a bubble by social media.

When people mostly communicated with those in their own town, that was a bubble. Radio and TV are more of a bubble because there is limited "bandwidth" in the scheduling, so it has to be editorialised (not necessarily a bad thing).

Social media companies do choose what you see via algorithms, but I'm not convinced they benefit from only showing you content you "agree" with; it feels like being shown a certain amount of content outside your "bubble" would increase screen time. There are also the comment sections, which often have contradictory views.

Even social media (setting aside the rest of the internet for the moment) will expose you to more viewpoints than the social circle in your home town, or a TV/radio schedule. I'm not saying it is a healthy way to be exposed to other viewpoints, but I don't think the problem with social media is that it creates a bubble.


Increasing engagement doesn't have to just be through agreeable content; consider Twitter ragebait or Instagram [body dysmorphia bait?]. In general, it means your attention is hooked. If people only get content they are likely to be highly engaged with, the range of content will be relatively narrow, and that is a bubble. Perhaps more or larger bubbles, but still noticeably limited perspectives.


That would be comforting if most of human history hadn't sucked so hard.


Thank you. That comment just made my day :-)

(now to wiping the tears of laughter from my face...)


If anything considering how vastly more complex things are nowadays (behind superficial appearances), the median adult is likely even more ignorant, in relative terms, than say the median peasant in rural 1700s France, or even in 1900s France.


Hah, I'm not the only one that brings up this book in this context.

The way this is wrought, in the novel, is a savant engineer writes a bot framework that can cheaply and quickly disseminate torrents of misinformation about a provided subject, and then open sources this framework. He basically broke the internet on purpose as a sort of accelerationist move I suppose.


Not sure if that's better or worse than a developer doing basically the same thing because they could make a little money. <at_least_its_an_ethos.gif>


> A better example might be "is dietary saturated fat a major factor for heart disease in Western countries?". The current government publications (which answer "yes") for this are probably wrong based on recent research. The government cannot be relied upon as a source of truth for this.

I know it was just an example, but actually no, the role of dietary saturated fat as a factor for heart disease remains very much valid. I’m not sure which recent studies you’re referring to, but you can't undo over 50 years of research on the subject so easily. What study were you thinking about?


>you can't undo over 50 years of research on the subject so easily

Sure you can if the research was bogus to begin with, sponsored in many cases, and merely taking for granted/referencing some previous results without verifying them, which is often the case.


You can undo 50 years of research easily, by this process which we call science.


Science is[1] impressive but does it have a method for resolving the numerous conflicting "truths" in this thread?

[1] except when it isn't, of course


I think they are referring to low-carb studies done recently. If your diet consists of only saturated fat, it does seem to be healthier for you than the standard American/Western diet that is also high in saturated fat but also quite high in sugar, wheat, and other starchy carbs. When combined, saturated fat and carbs are hitting a double if your goal is to be unhealthy.

General disclaimers apply regarding portion sizes etc blah blah blah I'm not a doctor.

Anecdotally, a low-(ish) carb diet and fasting has done wonders for my health and many others. I will say that there appears to be a link with higher cholesterol when consuming higher amounts of fat, but the argument in nutrition science atm seems to be centered on whether or not that is "good" cholesterol, but it's hard to measure in human patients for a long time because you essentially need to put them on a very limited diet to get good data. Those large scale trials are expensive and hard to manage at scale.


https://pmc.ncbi.nlm.nih.gov/articles/PMC9794145/

Remarkable how easy it is to cling to propaganda


I wasn’t aware of the new debate over saturated fats, I’m curious and will be watching this evolve with interest.

That said, for me this publication has red flags right from the start. Complaining about difficulty of changing everyone’s minds is a political and non-academic persuasion tactic that does not convince me. Calling it “resistance” and “bias” is a bullshit framing that makes me less likely to trust Teicholz. Of course there is resistance to 50 years of publication and research, and there should be. There’s a lot of bias towards the earth being round, and a lot of resistance to the idea that it’s flat, right? If I repeat the claim that the earth is round, is that “propaganda”? It would indeed take time and effort to change everyone’s minds about that.

Multiple times she references “>20” papers that back up her claims. Except 5 of her references in this paper are her own. And she has around 10 on this subject. So is she claiming this “new consensus” is based on what she herself and maybe one or two other people believe? If 50% of the evidence for consensus is her own papers, then I doubt there’s any consensus at all. It’s funny to claim there’s consensus at the same time she complains that it’s difficult to change the consensus. Even 20 independent scientific papers not authored by Teicholz is practically nothing in the big picture. It will take many more papers and much more time, and the evidence needs to be overwhelming, clear, obvious, and true.

She might be right! But Nina Teicholz is a journalist, not a scientist. She does have a PhD, but her publications don’t appear to be scientific research, and most look like opinion pieces.

Out of curiosity, if saturated fats aren’t the culprit, what is? Looks like she does have one paper questioning sugar, so is she claiming sugar is the real cause? What if it’s the combination of sugar and saturated fats? Does that make her right or wrong?


It's amazing how many HNers link this charlatan's op-ed thinking it's evidence. Presumably you don't like linking to our best human outcome research on the subject because it never pans out well for saturated fat–not for atherosclerosis, not for nonalcoholic fatty liver disease, not for glucose sensitivity, and so on.

So you link to the equivalent of a reddit post 'summarizing' the space. I see this link every week on here and every time, the person who linked it thinks they just had a mic drop moment like you.


I think discussing this topic with too much fervor is a waste of energy. Nutrition science is hard because performing valid studies at large scale is close to impossible, so I’m left performing an argument from nature, being that eating things that were invented decades ago might be worse for us than things we’ve eaten for millions of years.


Not everything on PubMed weighs the same. As others have said, this is just a summary article by the best-selling author Nina Teicholz. Not only is she heavily sponsored by the meat industry (and I'm not vegan), but her best-selling book title is "The Big Fat Surprise: Why Butter, Meat and Cheese Belong in a Healthy Diet", yet in the article she declares "The author receives modest royalties on a book on the history of dietary fat recommendations and otherwise declares no conflicts of interest"...


Nina Teicholz being the distributor of this propaganda right?


> To paraphrase a recent conversation I had with a friend: "in the USA, can illegal immigrants vote?" has a single truthful answer ("no" obviously). But there are many places around the web saying other things (which is why my friend was confused). An LLM trawling the web could very conceivably come up with a non-truthful answer.

There are many jurisdictions where some illegal immigrants (Dreamers) are allowed to vote, including New York City[1].

[1] https://www.theguardian.com/us-news/2022/jan/09/new-york-all...


>> "in the USA, can illegal immigrants vote?"

> More than 800,000 non-citizens and “Dreamers” could vote in New York City municipal elections

Treating the question literally, that would not be in the USA (federal elections).


> Treating the question literally, that would not be in the USA (federal elections).

I strongly disagree. Going by your interpretation, the question "How many people in the USA vote in local elections?" would have the answer "not a single person."


Your version of the question has the qualifier "in local elections".

The parent's version doesn't, so the answer, assuming it refers to the main USA elections and not local or any random election that just happens to be conducted in the USA, is quite valid.

In other words, "in the USA elections" implicitly points to a specific kind of election (the presidential ones), different from "USA local elections" or "any kind of election within the USA".

You might argue "in the USA elections" doesn't anywhere prevent the more generic interpretation, but I argue that that's how most people would understand and answer such a question.


> Your version of the question has the qualifier "in local elections".

Yes, the point is that qualifier works because when you're talking about people voting "in the USA," you can be talking about local or federal elections. "In federal elections, how many people vote in local elections" makes no sense. "In the USA, how many people vote in local elections" makes sense, because voting in the USA can encompass both local and national elections.

I understand that there are many people who ignore local elections. But I disagree that talking about voting "in the USA" means that local elections should be excluded.

> In other words "in the USA elections" implicitly points

You're using quotations for something that wasn't said. The original comment was "in the USA, can illegal immigrants vote?"


> But I disagree that talking about voting "in the USA" means that local elections should be excluded.

I agree with you; however, whether you like it or not, that phrasing will default to federal elections for most people, and especially foreigners.


Especially given a few years of heated political discussions around the possibility or not of the practice, concerning precisely the federal level.


I was thinking something similar. I like the google auto-ai summary for little trivia facts and objective things (who played so and so in the movie? How heavy is a gallon of milk?) this is stuff that is verifiable and could theoretically just be queried from some sort of knowledge base.

But for anything remotely subjective, context dependent, or time sensitive I need to know the source. And this isn’t just for hot button political stuff — there’s all sorts of questions like “is the weather good this weekend?”, “Are puffy jackets cool again?”, “How much should I spend on a vacation?”


> could theoretically just be queried from some sort of knowledge base

Google bought Freebase in 2010 [0] and scaled it [1].

[0] https://en.m.wikipedia.org/wiki/Freebase_(database)

[1] https://en.m.wikipedia.org/wiki/Google_Knowledge_Graph
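The knowledge-base idea for verifiable trivia can be sketched as a toy triple store, in the Freebase/Knowledge Graph spirit: facts are stored as (subject, predicate) → object entries, so a trivia question becomes an exact lookup rather than free-text generation. This is purely illustrative; real knowledge graphs are vastly richer.

```python
# Toy knowledge base: trivia facts as (subject, predicate) -> object.
# Illustrative only; a real knowledge graph has schemas, entity IDs,
# provenance, and billions of triples.
triples = {
    ("gallon of milk", "weight"): "about 8.6 lb",
    ("Han Solo", "played_by"): "Harrison Ford",
}

def lookup(subject: str, predicate: str):
    """Exact lookup: returns the stored fact, or None if unknown
    (no hallucinated fallback)."""
    return triples.get((subject, predicate))

print(lookup("Han Solo", "played_by"))      # Harrison Ford
print(lookup("Han Solo", "favorite_food"))  # None - the KB admits ignorance
```

The key property for search is the last line: unlike an LLM, a knowledge base can say "I don't know" instead of producing a plausible guess.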


What is truth anyway? I see it as a quicker version of browsing the web to get a summary of what people say. As you said, with search you get a bunch of websites where strangers talk about a certain topic. You read a dozen and see if they agree with each other or if they sound legit. There is just a huge overlap between what we consider true and what (an overwhelming majority of) people agree on. A lot of things are reduced to consensus.

If you ask a non-obvious question, you usually get an answer of the form "a lot of people you consider trustworthy dedicated some time to studying this question and agreed that the answer is X". But then people can be wrong, a big group of people can be wrong, people can be bribed, or perhaps you don't actually trust these people that much. The internet can't tell you the truth. An LLM can't tell you the truth. But they can summarize what other people in the world say on the subject.


> I agree that this is the core question, but I'd put it as: Who Gets To Decide What Is True?

Well, outside of matters of stark factuality (what time does the library close?, what did MSFT close at?), many things people may be "searching for" (i.e. trying to find information about) are more in the realm of informed opinion and summary where there is no right or wrong, just a bunch of viewpoints, some probably better informed than others.

I think this is the value of traditional search where the result is a link to a specific page/source whose trustworthiness or degree of authority you can then judge (e.g. based on known reputation).

For AI generated/summarized "search results", such as those from "ChatGPT Search" (awkward name - bit like a spork), the trustworthiness or degree of authority is only as good as the AI (not person) that generated it, and given today's state of technology where the "AI" is just an LLM (prone to hallucination, limited reasoning, etc), this is obviously a bit of an issue...

Even in the future, when presumably human-level AGI will have made LLMs obsolete, I think it'll still be useful to differentiate search from RAG AGI search/chat, since it'll still be useful to know the source. At that point the specific AGI might be regarded as a specific person, with its own areas of expertise and biases.

The name "ChatGPT Search" is very awkward - presumably they are trying to position this as a potential Google competitor and revenue generator (when the inevitable advertisements come), but at the end of the day it's just RAG.


Non-citizens cannot vote in FEDERAL elections, but some states allow them to vote in LOCAL elections.

There are few more restrictions here: https://www.usa.gov/who-can-vote


It'd make sense if paying taxes gave you the right to vote.

Taxation without representation and all that.


Non-citizens/residents are the juiciest targets for taxes (tourism taxes at hotels and whatnot, pumped up taxes on restaurants/alcohol)! This actually makes no sense at all.


Sales taxes levied on visitors can be justified as different, but it seems unfair to levy income taxes, or real estate taxes, without any right to vote over how those taxes are spent, especially in the case of legal non-citizen residents.


This nails it:

> Who Gets To Decide What Is True?

I would not agree to this however:

> It was then up to the reader to decide which of those sites they wanted to visit,

With current search engines, Google decides for you ("helped" by SEO experts falling over themselves to rank higher because their revenue directly depends on it). In theory, you could go and read a few dozen pages and decide for yourself.

In reality, non-technical users will click on a first link that seems to be related to the question, and that's it.

Even with AI-based search (or Q&A instead of search) I think the same will happen. There is and will be a huge reward for gaming the results, be they page links or RAG snippets that rank for a query. I've already seen many SEO shops advertising their strategies to keep their customers' businesses relevant in the chatbot era. As this approach becomes more prevalent, you can be sure many smart people will do many experiments to figure out how best to please the new algorithm.

In other words, AI-based search is a UX optimization, but it doesn't address the core problem of how you decide what content is best, and do that in the context of each user, and do that while maximizing the benefit for the user vs the profit of the company doing this.

So we have two huge hurdles:

1. who will decide what the user wants[0] to see, and how are incentives for that entity aligned with the user's

2. how is that entity supposed to find the information needle in the haystack of slop that's 90%+ of current web?

[0] "wants" in a rational "give me the best possible information" meaning, not in "what keeps them addicted, their heart rate up, and what will drive engagement" meaning


Ultimately it's an incentive problem. As the saying goes, if you are not paying for the product, you are the product.


> To paraphrase a recent conversation I had with a friend: "in the USA, can illegal immigrants vote?" has a single truthful answer ("no" obviously). But there are many places around the web saying other things (which is why my friend was confused).

This doesn’t have a single truthful answer. Some states don’t have voter ID laws, so the truth can depend on the state. In those states without voter ID laws, there’s not much keeping someone from voting twice or more under different names, except significant moral qualms about subverting instead of preserving everyone else’s right to vote in a democratic republic. Someone can assume the name of a person from another country who could plausibly have come in illegally. Without a picture ID, they can’t claim you aren’t that person.

Can an illegal immigrant vote? Yes, in states without voter ID laws, technically anyone can vote, even convicted terrorists. Should an illegal immigrant vote? No, they’re not supposed to be able to vote and there may be consequences if caught.

What purpose do a lack of voter ID laws serve except the obvious conclusion which is to enable cheating?


That isn’t true. They do have voter registration, and they match names and addresses and cross them off; a name can’t vote twice.

MAGA types who wanted to prove democrats could cheat like this were caught very quickly because voting twice or voting for someone else is very very easy to detect, even without IDs. The judges gave them Darwin awards.

Unregistered voters can’t vote in any state, they can only do so by pretending to be someone else (name and address matches the rolls), but they are caught when those people actually vote. Foreigners can also vote by illegally registering. But motor voter means they actually check your status at registration.

The ID thing is a solution to a non-problem, like literacy tests were. But the republicans could make it easier to achieve universally if they just went with a guaranteed free national ID like other countries, except that would make the obstruction aspect of voter ID requirements much more moot, so they never go there.

Foreign residents can vote in some local election in a few places, mainly very local school board elections.


> With a search paradigm this wasn't an issue as much, because the answers were presented as "here's a bunch of websites that appear to deal with the question you asked". It was then up to the reader to decide which of those sites they wanted to visit, and therefore which viewpoints they got to see.

It is very similar. Google decides what to present to you on the front page. I'm sure there are metrics on how few people get past the front page. Heck, isn't this just Google Search's business model? Determining what you see (i.e. what is "true") via ads?

In much the same way that the Councils of Carthage chose to omit the acts of Paul and Thecla in the New Testament, all modern technology providers have some say in what is presented to the global information network, more or less manipulating what we all perceive to be true.

Recently advancements have just made this problem much more apparent to us. But look at history and see how few women priests there are in various Christian churches and you'll notice even a small omission can have broad impacts to society.


This is true of the feeds and everything else where there are abundant choices. Amazon putting its inhouse brands before others. Anything which has to be narrowed down is an algorithmic choice, either data driven or top-down.


"Are all bachelors married?" vs "is it raining outside?"

And then Quine points out that the definitions of "bachelor" and "married" are themselves contingent on outside factors.

"Can illegal immigrants vote?", while close to being an analytic proposition, still depends on an empirical approach that can never be mediated by text, video, etc. All propositions are by necessity experiential. Nullius in verba!

So the truth is and has always been what happens when you get off your butt and go out and test the world.

This is not to say that we don't benefit from language. It makes for a great recipe. If you follow the instructions to bake a cake and you get what you expected you know that the recipe was true. The same goes for the laws of science, search engine results, and generative AI.


The truth isn't a decision that's made, it just is. So nobody gets to decide it, it just is :)


Unfortunately most people seem comfortable with the idea of a Ministry of Truth, at least here in Brazil and from what I read on the web, in many parts of the world also.


I think this presents an approach: move closer to the search engine model. In both academic writing and news journalism, factual claims try to cite their sources. And you can see this is an approach some of the chat systems have moved toward. We don't want to always pedantically cite a reference for every question. But maybe we should? At least at an architectural level, and let the UX decide if it should be displayed.

> in the USA, can illegal immigrants vote?

According to Author X in Book Y, which studied this topic in depth: foo answer
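A rough sketch of that "cite at the architectural level, display at the UX level" idea — every answer carries its citations internally, and the presentation layer decides whether to show them. All the names and data structures here are hypothetical:

```python
from __future__ import annotations
from dataclasses import dataclass

@dataclass
class Citation:
    source: str   # e.g. an author, publication, or URL
    excerpt: str  # the passage the claim is grounded in

@dataclass
class Answer:
    claim: str
    citations: list[Citation]  # always present, even if the UX hides them

def render(answer: Answer, show_citations: bool = True) -> str:
    """The UX choice: terse answer, or answer with provenance."""
    text = answer.claim
    if show_citations and answer.citations:
        refs = "; ".join(f"per {c.source}" for c in answer.citations)
        text += f" ({refs})"
    return text

a = Answer(
    claim="No, non-citizens cannot vote in US federal elections.",
    citations=[Citation(source="usa.gov/who-can-vote", excerpt="...")],
)
print(render(a, show_citations=False))  # terse UX
print(render(a, show_citations=True))   # pedantic UX, provenance visible
```

The point of the design is that provenance is never discarded at the architecture level; hiding it becomes a reversible display decision rather than a lossy one.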


> "Who Gets To Decide What Is True?"

Independent Tribunal: https://www.independenttribunal.org/ (a project of mine)

Even in the field of law there are various shenanigans and loopholes, such as "legally true" :)


> Who Gets To Decide What is True?

Outsourced workers in less expensive places of the world providing human feedback.


I don’t know who should get to decide “what is true” but I think we all agree that we won’t get anything remotely akin to truth so long as search is driven by for-profit interests trying to steer us towards their products first and our actual query (a distant) second.


> With a search paradigm this wasn't an issue as much, because the answers were presented as "here's a bunch of websites that appear to deal with the question you asked

I don't think this appropriately credits Google's power with regards to what you are seeing


For the record, in my local elections, we have plenty of people voting illegally.

"Is it legal" is very different from "Can they."

By "local," I mean municipal and below. I didn't mean "federal elections conducted in my locality." Election security kicks for state, federal, and some municipal elections.

Some are intentionally fraudulent (e.g. local corruption), others unintentionally broken (e.g. using Google Forms for a school-level public body, where people not legally qualified to vote might still do it, unaware they're committing a felony).

And "public body" has a specific meaning under my state law which extends the same laws as e.g. cutting for my state senate. That's bodies like local school boards, but not random school clubs.

That's the level where we have massive illegal voting where I live.


You're right, this is a key question:

> Who Gets To Decide What Is True?

For any given statement, the answer up until a couple years ago was, "the speaker". Speakers get to decide what to say, but they're also responsible for what they say. But now with LLMs we have plausible text without a speaker.

I think we have a number of historical models for that. A relevant one is divination. If you bring your question to the haruspex, your answer is read out of the guts of a sacrificed animal. If the answer is wrong, who do you blame? The traditional answer is the gods, or perhaps nobody.

But we know now that fortune tellers are just selling answers while pretending not to be responsible for them. Which points us at one solution: anybody selling or presenting LLM output as meaningful is legally responsible for the quality of the product.

Unfortunately, another model is the modern corporation. Sometimes the people in a company intentionally lie. More often, statements are made by one person based on a vision or optimism or confusion or bullshit. Nobody set out to lie, but nobody really cared about the truth, at least not as much as everybody cared about making money.

So I'd agree that the government doesn't have much role in deciding The Truth. Similarly, the government shouldn't have much role in controlling what you eat. But in both cases, I think there's plenty of role for the government in ensuring that companies selling good food or good information have sound production and quality control measures to ensure that they are delivering what consumers are expecting.


> But now with LLMs we have plausible text without a speaker.

Well, it's still a known source, whose competence, biases, etc. one can judge just like any human source.


It's absolutely not like any human source, and there's little reason to think its competencies and biases are anything like humans.


I didn't say it has the same competencies and biases as any human - I said we can judge it on its individual merits, the same way we judge any human on theirs.


No, you can't judge it the same way you judge a human, any more than you could judge a bus engine the same way you'd judge human. It's not an individual, and not remotely like one.

I get this might seem like tedious nitpicking to you, but the number one error people are making with LLM output is anthropomorphizing it. Which I get, because it's built to seem that way. But it's an enormously dangerous misconception.

As one example among zillions, look at the term "hallucination". All LLM output is equally "hallucinated". Some of it, when interpreted by a human may be taken as meaningful. Some of it, the "hallucinated" part, is taken as meaningful but contrary to something else they understand. But it's the human creating all the meaning here. Even calling this class of mismatches "hallucination" is anthropomorphizing LLMs.

Imagine I take the proverbial million monkeys to generate random words. Then I create a statistical filter so that we extract only the plausible sentences. Is this machine a source? Can I "judge just like any human source" here?

I'd say the answer is a clear no. And if you think the answer is yes, then the same has to apply to things like horoscopes, the I Ching, or the intestines of a sacrificial goat.


I can judge the utility of a black-box oracle on its own merits, without needing to know whether it's human, and without being accused of anthropomorphizing it.

If I've observed the answers of the oracle to be biased, or useful/correct in some circumstances and not in others, then this is something to take into account when deciding whether or not this is a useful source to pay attention to.

From the POV of whether the output of the black box is useful, it doesn't make any difference whether what's in the box is a human, an LLM, or a rabid monkey. It is what it is, and I'll judge it on its merits.

Now, YOU may care what's in the box, for some reason, but that's on you.


> "in the USA, can illegal immigrants vote?" has a single truthful answer ("no" obviously)

Legally no; practically, yes. In most states, you simply must attest that you are a citizen in order to register. In many states, non-citizens have been auto-registered to vote when obtaining drivers licenses. Reddit is full of panicked immigrants concerned that they found themselves registered to vote, and worried about how that would affect their status.


> An LLM trawling the web could very conceivably come up with a non-truthful answer.

LLMs don't uncritically "trawl" the web, ingesting and then blindly regurgitating what they find.

98% of the internet is crap, yet LLM results don't reflect this dismal figure. They're amazingly good at distilling the 2% that is non-crap.


Sure. Probably trawling was the wrong word and conveyed a technically-inaccurate meaning. I'm not sure what the correct word would be.


> Who Gets To Decide What Is True?

Seems particularly to be a US-based phenomenon. Unlike the more transparent manipulation seen under dictatorships, where people generally recognize propaganda for what it is even if they can’t openly challenge it, some in the USA live within entirely new realities.


I'm not in the USA and I see this a lot too. It's not just a US-based thing.


Definitely a US based phenomenon, but sadly like other US based things it spreads to other countries. I talk with friends in CA and EU who are seeing some of the junk we saw 10 years ago with "what is fact"

You would think common sense prevails...


> You would think common sense prevails...

Appeals to common sense seem to underlie many of the "alternative facts" in the zeitgeist. "Common sense" is how people defend their views when they can't do it empirically. That's not always bad, but "common sense" is probably part of the problem.


I disagree with this. Strongly.

Remember the covid years, when what was true kept changing rapidly, and sometimes what was said on the fringe and considered misinformation was later adopted by mainstream. Vaccines prevent transmission of the virus. Oh no they don't. Don't wear masks. Oh no, do wear masks. Oh, whatever, cloth masks are face decorations anyway. Lockdowns are good. Oh, lockdowns were a mistake. Don't treat pneumonia patients with prednisone. Oh, do treat pneumonia patients with prednisone. Lab leak is a conspiracy theory. Oh, maybe lab leak is not a conspiracy theory. And so on, and so forth...

Or take a look at how not just the media, but even the government in the UK or Canada liberally put labels such as "far right", "alt right", "antisemitic", etc. on their opponents, and how these labels pop up in Wikipedia. Are they true?


> We're seeing how that pans out in Australia and it's not good.

How are we seeing how that pans out when Australia's misinformation bill is still just a proposal?


Fair point. I think just that there's a general acceptance that "something must be done" and that the government are the people to do it is pretty alarming. It's a short step from here to a Ministry of Truth, and I can see Australia taking that step pretty soon.


The thing about Australia is that voting is mandatory. By definition this makes it difficult for politicians radicalising the edges to pull the mass of the normal curve away from the centre. In the US the opposite is true. The radicalised are more incentivised to turn up and vote than the centrist mass. So much so that the last 40 years has seen the hollowing out of the centre and this ridiculous (to my eyes) seesawing of extremes.

What I’m trying to explain is that (successful) politics in Australia doesn’t stray too far from the centre of the body politic. As a result there’s greater faith in institutions here than in the US. It’s far less alarming to us (conceptually) than it is to Americans.


I have a physics analogy which is similar. Vested interests set up magnetic fields in social media / legacy media (lot of things discussed prominently in social media is just what legacy media is saying. So legacy media is a sense is setting up an anchor points and people have to distribute themselves around it) to flip magnetic domains to align with the narrative.


What is a not-alarming answer to “Who gets to decide what’s true?”


I don't know.

We used to defer this to journalists, effectively. While individual journalists often lied and misrepresented the truth, there was some responsibility within the industry to tell the truth, and newspapers did print retractions and corrections when they got it wrong. The editorial content was strictly separate from the business of running the newspaper, so editorial decisions were (mostly) not influenced by commercial decisions and free to pursue The Truth as they saw it.

That, sadly, is no longer the case. And we have no good replacement for it.


There is no incentive to be truthful currently; among democracies this seems to be most pronounced in the US, with Trump able to lie and there seemingly being no counter to it.

With old regulated media there was (is?) the legacy of the organisation and the idea it was "trustworthy" on the line for the journalists that work for it. So they have an incentive to be truthful, and hopefully an idea that publishing lies is not a good moral choice.

With the personality driven journalism that emerges from the internet there is less incentive to be truthful, such personalities can be very successful using populism alone to play to their audience.


I genuinely think Kagi has led the way on this one. Simplicity is beautiful and effective, and Kagi has (IMHO) absolutely nailed it with their AI approach. It's one of those things that in hindsight seems obvious, which is a pretty good measure of how good an idea is IMHO.

Google could have done it and kind of tried, although they're AI sucks too much. I'm very surprised that OpenAI hasn't done this sooner as well. They're initial implementation of web search was sad. I don't mean to be super critical as I think generally OpenAI is very, very good at what they do, but they're initial browse the web was a giant hack that I would expect from an intern who isn't being given good guidance by their mentors.

Once mainstream engines start getting on par with Kagi, there's gonna be a massive wave of destruction and opportunity. I'm guessing there will be a lot of new pay walls popping up, and lots of access deals with the search engines. This will even further raise the barrier of entry for new search entrants, and will further fragment information access between the haves and have-nots.

I'm also cautiously optimistic though. We'll get there, but it's gonna be a bit shaky for a minute or two.


> I'm also cautiously optimistic though. We'll get there, but it's gonna be a bit shaky for a minute or two.

But I don't understand how all of these AI results (note I haven't used Kagi so I don't know if it's different) don't fundamentally and irretrievably break the economics of the web. The "old deal", if you will, is that many publishers would put stuff out on the web for free, with the hope that they could monetize it (somehow, even just with something like AdSense ads) on the backend. This "deal" was already getting a lot worse in recent years, as Google did more and more to keep people from ever needing to click through in the first place. Sure, these AI results include citations, but the click-through rates are probably abysmal.

Why would anyone ever publish stuff on the web for free unless it was just a hobby? There are a lot of high quality sites that need some return (quality creators need to eat) to be feasible, and those have to start going away. I mean, personally, for recipes I always start with ChatGPT now (I get just the recipe instead of "the history of the domestication of the tomato" that Google essentially forced on recipe sites for SEO competitive reasons), but why would any site now ever want to publish (or create) new high quality recipes?

Can someone please explain how the open web, at least the part of the web that requires some sort of viable funding model for creators, can survive this?


> Why would anyone ever publish stuff on the web for free unless it was just a hobby

That's exactly what the old deal was, and it's what made the old web so good. If every paid or ad-funded site died tomorrow, the web would be pretty much healed.


That's a bit too simple. There are far fewer people producing quality content "for fun" than people who aim, or at least eventually hope, to make money from it.

Yes a few sites take this too far and ruin search results for everyone. But taking the possibility away would also cut the produced content by a lot.

YouTube, for example, had some good content before monetization, but there are a lot of great documentary-style channels now that simply wouldn't be possible without ads. There is also clickbait trash, yes, but I'd rather have both than neither.


Demonetizing the web sounds mostly awesome. Good riddance to the adtech ecosystem.


The textual web is going the way of cable TV - pay to enter. And now streaming. "Alms for the poor..."

But, like on OTA TV, you can get all the shopping channels you want.


Not to be the downer, but who pays for all the video bandwidth, who pays for all the content hosting? The old web worked because it was mostly a public good, paid for by govt and universities. At current webscale that's not coming back.

So who pays for all of this?

The web needs to be monetized, just not via advertising. Maybe it's microtransactions, maybe subscriptions, maybe something else, but this idea of "we get everything we want for free and nobody tries to use it for their own agenda" will never return. That only exists for hobby technologies. Once they are mainstream they get incorporated into the mainstream economic model. Our mainstream model is capitalism, so it will be ever present in any form of the internet.

The main question is how people/resources can be paid for while maintaining healthy incentives.


No one paid you to write that?


Except I also pay my network provider to run the infrastructure

I think you forgot that


It costs the Internet Archive $2/GB to store a blob of data in perpetuity, their budget for the entire org is ~$37M/year. I don't disagree that people and systems need to be paid, but the costs are not untenable. We have Patreon, we have subscriptions to your run of the mill media outlets (NY Times, Economist, WSJ, Vox, etc), the primitives exist.

The web needs patrons, contributions, and cost allocation, not necessarily monetization and shareholder capitalism where there is a never ending shuffle of IP and org ownership to maximize returns (unnecessarily imho). How many times was Reddit flipped until its current CEO juiced it for IPO and profitability? Now it is a curated forum for ML training.

I (as well as many other consumers of this content) donate to APM Marketplace [1] because we can afford it and want it to continue. This is, in fits and starts, the way imho. We piece together the means to deliver disenshittification (aggregating small donations, large donations, grants, etc).

(Tangentially, APM Marketplace has recently covered food stores [2] and childcare centers [3] that have incorporated as non-profits because a for-profit model simply will not succeed; food for thought at a meta level as we discuss economic sustainability and how to deliver outcomes in non-conventional ways)

[1] https://www.marketplace.org/

[2] https://www.marketplace.org/2024/10/24/colorados-oldest-busi...

[3] https://www.marketplace.org/2024/08/22/daycare-rural-areas-c...


> There are far fewer people producing quality content "for fun" than people who aim, or at least eventually hope, to make money from it...But taking the possibility away would also cut the produced content by a lot.

...is that a problem? Most of what we actually like is the stuff that's made 'for fun', and even if not, killing off some good stuff while killing off nearly all the bad stuff is a pretty good deal imo.


Agreed. The entire reason why search is so hard is because there's so much junk produced purely to manipulate people into buying stuff. If all of that goes away because people don't see ads there anymore, search becomes much easier to pull off for those of us who don't want to stick to the AI sandbox.

There's a slight chance we could see the un-Septembering of the internet as it bifurcates.


Unless the reason for the death of the paid content deal is because of AI vacuuming up all the content and spitting out an anonymous slurry of it.

Why would anyone, especially a passionate hobbyist, make a website knowing it will never be seen, and only be used as a source for some company's profit?


> and only be used as a source for some company's profit?

Are we forgetting the main beneficiaries? The users of LLM search. The provider makes a loss, or pennies per million tokens, while the users solve actual problems. Could be education, could be health, could be automating stuff.


The problem is not the ad sites dying. The problem is that even the good sites will not have any readers, as the content will be appropriated by the AI du jour. This makes it impossible to heal the web, because people create personal sites with the expectation of at least receiving visitors. If nobody finds your site, it is as if it didn't exist.


I'm not so sure.

I think the best bloggers write because they need to express themselves, not because they need an audience. They always seem surprised to discover that they have an audience.

There is absolutely a set of people who write in order to be read by a large audience, but I'm not sure they're the critical people. If we lost all of them because they couldn't attract an audience, I don't think we'd lose too much.


Exactly. Even if people don't publish information for money, a lot of them do it for "glory" for lack of a better term. Many people like being the "go to expert" in some particular field.

LLMs do away with that. 95% of folks aren't going to feel great if all of the time spent producing content is then just "put into the blender to be churned out" by an LLM with no traffic back to the original site.


ChatGPT puts trillions of tokens into human heads per month, and collects extensive logs of problem solving and the outcomes of ideas tested there. This is becoming a new way to circulate experience in society. An experience flywheel. We don't need blogs; we get more truthful and aligned outcomes from human-AI logs.


You, for one, welcome our new AI overlords?

Blogs have the enormous advantage of being decentralized and harder to manipulate and censor. We get "more truthful and aligned outcomes" from centralized control only so long as your definition of "truth" and "alignment" match the definitions used by the centralized party.

I don't have enough faith in Sam Altman or in all current and future US governments to wish that future into existence.


But wouldn't it disincentivize those who create knowledge? AFAIK, most of the highly specific knowledge comes from small communities where a shared goal and socialization with like-minded individuals are the incentive to keep acquiring and describing knowledge for community members. Would it really be helpful to put an AI between them?


First issue: silos of information.

Second issue: who decides the weights of sources? This is the reason why every nation must have culturally aligned AIs defending their ways of living in the information sphere.


Yet 300M users are creating interactive sessions on ChatGPT, which can be food for self-improvement. AI has a native way to elicit experience from users.


Only middle-class and rich people could participate in "the old deal" Internet made by and for hobbyists. I think people forget this. It was not so democratized and open for everyone – you first had to afford a computer.

If you're a member of a yacht club, you can probably expect other members to help you out with repairs while you help them. But when a club has half the world population as members, those arrangements don't work anymore.


As if OpenAI won't end up offering paid access to influence these results, or advertise inside them. Of course they will, just like how Google started without ads.

It will be even more opaque and unblockable.


To quote Prince: ahh, now people can finally go back to making music for the sake of making music.


Remember in that time, less web content meant major media outlets dominated news and entertainment on TV and newspapers.


Paging Sergey


The internet was great before the great monetization of it: it had tons of information provided for free with no ads. After ads, it will still have tons of information. Stack Overflows will still exist, Wikipedias, corporate blogs that serve just to boost the company, people making courses and other educational content, personal blogs (countless of which make their way here); all of those will continue to exist.

Ad-driven social networks will continue to exist as well.

The age of the ad-driven blog website is probably at an end. But there will be countless people posting stuff online for free anyway.


Nobody will visit Stack Overflow, because AI, through its reasoning and back-and-forth with users, will have solved the problems. This process creates training data for future AIs of that particular company, unavailable to any other.


Many people have an intrinsic motivation to share knowledge. Have a look at Wikipedia. There are enough of these people that we don't need to destroy the open Internet to accommodate those who only write when they expect to be paid.


[flagged]


You have some examples of that?


Their stance during Covid of banning any mention of the lab leak theory. Even if not considered the most likely, it had always been a possibility, and not an absurd one.


> the history of the domestication of the tomato" that Google essentially forced on recipe sites for SEO competitive reasons

That may help with SEO, but another reason is copyright law.

Recipes can't be copyrighted, but stories can. Here is how ChatGPT explained it to me:

> Recipes themselves, particularly the list of ingredients and steps, generally can't be copyrighted because they're considered functional instructions. However, the unique way a recipe is presented—such as personal stories, anecdotes, or detailed explanations—can be copyrighted. By adding this extra content, bloggers and recipe creators can make their work distinctive and protectable under copyright law, which also encourages people to stay on their page longer (a bonus for ad revenue).

> In many cases, though, bloggers also do this to build a connection with readers, share cooking tips, or explain why a recipe is special to them. So while copyright plays a role, storytelling has other motivations, too.


> can't be copyrighted because they're considered functional instructions

by that logic software shouldn't be copyrighted either!


Would like to read more about this. Has anybody used this technique to actually successfully sue someone for infringing their copyright on an instructional website or is it only theoretically possible?


> Why would anyone ever publish stuff on the web for free...?

Why indeed, person who posted for free* on the Internet?

As a side note, consider that ads can be woven into and boosted in LLM results just as easily as in index lookups.

* assuming that you're not shilling here by presenting the frame that the new shiny is magically immune to revenue pressures


It can be like the YouTube Premium model. The search app is subscription-based, so every time your content is served you get paid. But you have to make your content available to the AI for crawling and state your monetisation preferences.
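To make that concrete, here is one hypothetical shape such a preference declaration could take, by analogy with robots.txt. Every directive below is invented for illustration; no such standard exists today.

```
# Hypothetical /ai-policy.txt (invented, modeled on robots.txt)
User-agent: *
Allow-crawl: yes
Allow-training: no
Allow-answer-citation: yes
Payout-endpoint: https://example.com/.well-known/payouts
Payout-basis: per-citation
```

An AI crawler would fetch this before ingesting the site, the same way well-behaved crawlers fetch robots.txt today.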


The recipe trade-off doesn't make sense: while it's trivial to skip the history, you can't skip the false ingredients of the GPT variety.

Then this whole category is not known for "high quality recipes", so the general state wouldn't change much?


That's why you're seeing media companies making deals with companies like OpenAI to allow them to access their content for AI learning/parsing purposes, in exchange for the media company getting paid yearly royalties.

Since anyone creating content (whether that's a big media corp or a small cooking blog) holds copyright over their content, they get to withhold the permission to scrape their content unless these AI platforms make a deal with them.


By the same logic they'd get to sue over the scraping done to originally train the models. If royalties need to be paid for additional use, they would've needed to be paid for the original use.


> Can someone please explain how the open web, at least the part of the web the requires some sort of viable funding model for creators, can survive this?

The funding model for the open web will be for the open web content to be the top of the funnel for curated content and/or walled gardens.

I think many business models already treated the web this way. Specifically, get people away from the 800-pound gorilla rent-seekers like Google and Amazon, and get them into your own ecosystem.


> fundamentally and irretrievably break the economics of the web

Good riddance, it is a surefire way to get slop by having misaligned incentives for publication.


> Why would anyone ever publish stuff on the web for free unless it was just a hobby?

So that ChatGPT mentions you, not your competitor, in the answer to the user. I have seen multiple SEO agencies already advertise that.


Wait, did google force "the history of the domestication of the tomato" to be part of recipes on the web for SEO reasons?


Yep, I was incredibly skeptical about Kagi but I tried it and never looked back. Now my wife, friends, and several coworkers are customers.

The chatgpt approach to search just feels forced and not as intuitive.


Once Kagi implements location aware search that is actually useful I’ll be interested in Kagi. That’s what made me leave the engine besides loving it otherwise.


Google Maps is quite the moat. I suspect they'll need to find a way to license the data, e.g. via API. Apple has not (yet) been as successful at building out a database of local places with reliable hours of operations, reviews, etc.


There already is an official Google Maps API, but it is very expensive, with prices rising from time to time. There is no other company (other than maybe Meta) that has this much POI data in the western world.

So that is a solid advantage that Google is going to have, but the maps business alone wouldn't be able to keep it in the S&P list for long.


Why not? I can see a future where Maps becomes the core of Google's business. It's already their strongest offering, together with YouTube. In every other field Google has been beaten.


Same here - I do a lot of location aware searches. When I left Kagi after trying it out for a while, I wrote a detailed feedback hoping it would be useful to the Kagi team.


If you go to maps.kagi.com and allow access to your location, local results should be better. If it doesn't ask for access to your location, there is a small icon on the bottom right-hand side that shows whether it has access.


That's great if I'm trying to find a location, but that's not what local results is about.

Local results means that if I search for "driving laws", Google gives me .gov sites for my state as the top results, while Kagi's first page gives me results for 8 other states (including Alaska!) but not for my state.

There are a lot of kinds of queries that benefit from knowing the user's location even though they aren't actually looking for a place that exists on a map.

(I'm a happy paying Kagi user, but OP is right that this is its weakest point by far.)


Isn't the alternative to just simply type "driving laws for [state]"? That doesn't seem too onerous.


This boils the problem down to a dichotomy which isn’t how it works in the real world. Most of the searches I make that aren’t tech related searches have a location based aspect to them. Anything I do in my day to day life involving logistics has a high chance of needing some location based search. Kagi (and DDG) performs at a range of 0% usefulness to 70% usefulness on average for these kinds of searches. Usually it’s 0%. There is simply a huge gap here in what Kagi offers when you need to search for results near you vs the leading competitor


Yep. Or county or city or whatever is relevant.

It's not terrible—as I said, I'm a happy customer—but it's not a habit I have and it feels like something that should be configurable once in a settings menu. I don't even really want to have it detect my location live, I just want to be able to tell it where I live and have it prioritize content that's local when given the chance.


For what it’s worth DuckDuckGo is flawed in the exact same way. I ended up leaving DDG for the exact same reason years ago


The entire selling point of DDG is that its search results are not personalised. This is not a flaw.


Personalized != contextualized. You could have a search engine that uses geolocation without building any sort of cross-request profile on the individual making the search.


But OP is right that this would actually be serving their target demographic less well than serving everyone the same results regardless of context. The fact that the results don't know where the user is is reassuring for the kind of user who wants to use a privacy-oriented search engine, regardless of whether localized results could technically be provided in a privacy-preserving way.


These things are not mutually exclusive. Allow me to specify a city or state or county or country or zip code as a bang in my search and show me good results based on that. Both problems immediately solved. I wouldn’t be any more or less reassured about a search engine’s privacy stance if that feature was offered to me. This is a feature I can absolutely use in a private way (I can do that search over tor or a vpn with two hops if I so desire), and it gives me the control over what I provide the search engine and how and when.

Right now search engines don’t provide an interface for good location aware searches that you can manually specify - you have to let them build a shadow profile on you via all sorts of privacy violating fingerprints or just give up location aware searches altogether. There’s no reason it has to be that way though.
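Purely as an illustration, a search frontend could strip such a bang out of the query before it ever leaves the client. The `!loc:` syntax below is invented for this sketch; no engine documents this exact operator.

```python
# Sketch of a user-specified location "bang". The "!loc:" operator is
# hypothetical; the point is that location can be an explicit, user-
# controlled query parameter rather than a fingerprinted profile.
import re

LOC_BANG = re.compile(r"!loc:(\S+)")

def split_location(query):
    """Pull an explicit location hint out of a query string.
    Returns (clean_query, location_or_None)."""
    match = LOC_BANG.search(query)
    if not match:
        return query.strip(), None
    location = match.group(1).replace("_", " ")
    # Remove the bang and normalize whitespace.
    clean = " ".join(LOC_BANG.sub("", query).split())
    return clean, location

print(split_location("driving laws !loc:colorado"))  # ('driving laws', 'colorado')
print(split_location("best pizza near me"))          # ('best pizza near me', None)
```

The engine would then use the extracted location for ranking without ever needing a persistent profile of the user.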


> These things are not mutually exclusive. Allow me to specify a city or state or county or country or zip code as a bang in my search and show me good results based on that. Both problems immediately solved.

Do you actually find that attaching your location to the end of the query doesn't work? I don't do it naturally, but when I do do it I'm rarely disappointed.


I wouldn't usually point this out, but as you did it repeatedly: "they're" is a contraction of "they are". You're looking for the possessive, "their".

- Your local grammar pedant


I gave Kagi a shot two weeks ago, and it instantly impressed me. I didn't realize how much search could be improved. It's a beautiful, helpful experience.


Yeah, it’s wonderful. Especially once you take the time to up/downrank domains.


Could it be that Kagi benefits from being niche, though? Google search gets gamed because it’s the most popular and therefore gaming it gives the best return. I wonder if Kagi would have the same issues if it was the top dog.


I think they absolutely benefit from being niche, but there are a few other things they have going for them that won't go away if they become popular:

* They're not ad funded. Sergey Brin and Larry Page called this out in 1998 and it is just as true as ever: you need the economics to align. Kagi wins if people keep paying for it. Google wins if you click on Search ads or if you visit a page filled with their non-Search ads.

* Partially because of the economic alignment, Kagi has robust features for customizing your search results. The classic example is that you can block Pinterest, but it also allows gentler up- and down-weights. I have Wikipedia get a boost whenever its results are relevant, which is by itself a huge improvement over Google lately. Meanwhile, I don't see Fandom wikis unless there's absolutely nothing else.

I hope to see more innovation from Kagi in the customization side of things, because I think that's what's going to make the biggest difference in preventing SEO gaming. If users can react instantly to block your site because it's filled with garbage, then it won't matter as much if you find a brief exploit that gets you into the first page of the natural search results. On Google Fandom is impossible to avoid. On Kagi it just takes one click.
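A minimal sketch of what user-controlled domain weighting could look like under the hood. The weight values and scoring formula here are my own illustrative assumptions, not Kagi's actual implementation:

```python
# Hypothetical per-domain re-ranking in the spirit of Kagi's
# uprank/downrank/block feature. Weights: >1 boosts, <1 lowers,
# 0 blocks the domain entirely.

def rerank(results, domain_weights, default=1.0):
    """results: list of (domain, base_score) pairs from the raw index.
    Returns results re-scored by user preference, blocked domains removed."""
    weighted = [
        (domain, base * domain_weights.get(domain, default))
        for domain, base in results
    ]
    # Blocked domains (weight 0) drop out; the rest sort by new score.
    return sorted(
        [(d, s) for d, s in weighted if s > 0],
        key=lambda pair: pair[1],
        reverse=True,
    )

prefs = {"en.wikipedia.org": 2.0, "fandom.com": 0.0, "pinterest.com": 0.0}
results = [("fandom.com", 0.9), ("en.wikipedia.org", 0.6), ("example.com", 0.7)]
# Wikipedia's boosted score now beats example.com; Fandom is gone.
print(rerank(results, prefs))
```

The key property is that the reaction loop belongs to the user: one click on a garbage domain and it never appears again, regardless of how well it gamed the base ranking.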


I don’t understand how it’s different to Perplexity, looks pretty much the same. Can you enlighten me?


Not op, but Kagi user. Also have perplexity but usually use kagi.

I would say: 1) The UI. You're still performing normal searches in Kagi, but if you hit q, or end your query with a question mark, you get an LLM-synthesized answer at the top, and can still browse and click through the normal search results.

2) Kagi has personalization, i.e. you can uprank/downrank/block domains, so the synthesized LLM answer should usually be better because it has your personalized search as input.


Paying customer of Kagi here.

In addition to all that's been written above, you can configure personal filters, so that (for example) you never ever see a pinterest page in your search results. Things like that are IMO killer features today.


Are you referring to Kagi Assistant?

https://help.kagi.com/kagi/ai/assistant.html


I am referring to all of their AI stuff, Kagi Assistant included. Personally, the best feature is the quick answer. It essentially scans the top several hits, uses an LLM to read them and see if they answer your question, and displays a summary that also includes links to the full sources. I find that feature to be wonderful. I will usually look through the quick answer, see if a site actually answers the question I have, and then click through. If everyone implemented it like this, it's possible it could save the current model.
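Roughly, that kind of quick-answer flow might be sketched like this. A real implementation would call an LLM to judge and summarize each page; a trivial keyword-overlap score stands in for that step here, purely for illustration:

```python
# Toy "quick answer" pipeline: score each top hit for relevance to the
# question, keep the ones that clear a threshold, and return a crude
# summary alongside the source URLs (the citation links the reader can
# click through). keyword_overlap is a stand-in for an LLM relevance judge.

def keyword_overlap(question, page_text):
    q = set(question.lower().split())
    p = set(page_text.lower().split())
    return len(q & p) / max(len(q), 1)

def quick_answer(question, top_hits, threshold=0.5):
    """top_hits: list of (url, page_text). Returns (summary_lines, sources)."""
    relevant = [
        (url, text) for url, text in top_hits
        if keyword_overlap(question, text) >= threshold
    ]
    # Crude "summary": first sentence of each relevant page.
    summary = [text.split(".")[0] + "." for _, text in relevant]
    sources = [url for url, _ in relevant]
    return summary, sources

hits = [
    ("https://example.com/a", "yes the answer is yes. more detail follows"),
    ("https://example.com/b", "unrelated page about tomatoes"),
]
print(quick_answer("is the answer yes", hits))
```

The part that matters for the web's economics is that `sources` is surfaced next to the summary, so the reader can still click through to the original page.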


> absolutely nailed it with their AI approach.

Thankfully, Kagi also have a toggle to completely turn that crap (AI) off so it never appears.

Personally, I have absolutely no use for a product that can randomly generate false information. I'm not even interested until that's solved.

(If/when it ever is though, at that point I'm open to taking a look)

So yeah, Kagi definitely "leads the way" on this. By giving the user a choice to not waste time presenting AI crap. :)


You have no use for a product that can randomly generate false information but you trust google to provide you with search results based on how much they were paid for those keywords?…

give me ai hallucinations over google every day of the week and twice on sunday…


> but you trust google

???

Looks like my comment wasn't as clear as I thought. I do not trust Google at all, and don't use it. That's why I pay for Kagi.

And Kagi has an option to disable the AI crap, so it's just like "a good search engine" instead, which is all I need.

A high quality search engine without ads, and without hallucinated bullshit.


>but you trust google to provide you with search results based on how much they were paid for those keywords?

Google isn't paid for keywords; that's not how search works. They sell ad space; Google does not rank search content higher for payment.

And also the obvious point is, you don't need to trust Google, because they merely point you to content; they don't produce the content. They're an index of real existing content on the web, which you can judge for yourself. A search index, unlike an AI model, does not output uniform or even synthetic content.


Google makes money if you click through to a site that displays Google ads. This includes doubleclick, which is pay per impression and is owned by Google.


“Google does not rank search content for payment” might be the funniest thing I read all year on this site…


[flagged]


I know the internet lately incentivizes low effort comments like this but be better.


> Will this fundamentally change how people find and access information? How do you create an experience so compelling that it replaces the current paradigm?

I think it's already compelling enough to replace the current paradigm. Search is pretty much dead to me. I have to end every search with "reddit" to get remotely useful results.

The concern I have with LLMs replacing search is that once it starts being monetized with ads or propaganda, it's going to be very dangerous. The context of results are scrubbed.


> The concern I have with LLMs replacing search is that once it starts being monetized with ads or propaganda, it's going to be very dangerous.

Not to mention that users consuming most content through a middleman completely breaks most publishers' business models. Traditional search is a mutually beneficial arrangement, but LLM search is parasitic.

Expect to see a lot more technical countermeasures and/or lawsuits against LLM search engines which regurgitate so much material that they effectively replace the need to visit the original publisher.


> Traditional search is a mutually beneficial arrangement, but LLM search is parasitic.

Traditional search is mutually beneficial... to search providers and publishers, at the expense of the users. LLM search is becoming popular because it lets users, for however short a time this will last, escape the fruits of that "mutually beneficial arrangement".

If anything, that arrangement of publishers and providers has become an actual parasite on society at large. Publishers, in particular, will keep whining about being cut off; I have zero sympathy - people reach for LLMs precisely because publishers have been publishing trash and poison, entirely intentionally, optimizing for the parasitic business model, and it got so bad that the major use of LLMs is wading through that sea of bullshit so that we don't have to.

The ad-driven business model of publishing has been a disaster for society, and deserves to be burned down completely.

(Unfortunately, LLMs will work only for a short while, they're very much vulnerable to capture by advertisers - which means also by those publishers who now theatrically whine.)


OK, but someone still has to publish the subset of good content that the LLMs slurp up and republish. LLMs still need fresh quality content from somewhere.


Probably no one here is trying to profit by their additions to the discussion, but they’re still regularly sharing very useful information.


I don't think I'd publish on a blog if I knew it would be consumed entirely from an LLM. Even if I don't want money, I kind of want people to know that I wrote it, they're my ideas, and I don't want them taken out of context.


From the look of the examples, they use at least sites like TripAdvisor, which I'd assume would like to have people go to their site.

For services like rail companies, restaurants and so on, I can see them not being bothered, because they make their money offline. Blogs and pure content sites might not be too happy about their content being used to prop up OpenAI's business while getting nothing in return. If anything, it seems like OpenAI is actively trying to get sued.

The whole interface and functionality seems really nice and a clear improvement for the users, for certain types of queries at least. It assumes that OpenAI has made their LLM stop lying, and that they can get the required data legally, but it doesn't seem like anyone cares about those details.


They're getting it by paying contractors at Scale AI to write content for them.


> They're getting it by paying contractors at Scale AI to write content for them.

Just a wild guess, but at best that content is probably pretty mediocre quality. It's probably Mikkelsen Twins ebook-level garbage.


Something has to die, so that something new can live


But if the data providers die so that LLMs may live, where will the LLMs get new data?


From other LLMs of course.

Which is the main issue I see with them: if no one publishes anything new, all you will get is whatever was there before, which may be incorrect or obsolete. If you want something new, an LLM can't discover it if no data source exists. So over time the LLM becomes useless for current information.


Not anymore. There's arguably more than enough data to form a base for strong LLMs; extra data is nice, but doesn't have to come in such quantity.

(In fact, there's value in trying to filter excess crap out of existing training sets.)


We're talking about LLM-driven search engines here; the assumption is that they will always need up-to-date information. A "strong LLM" can't give you the latest on the presidential election if its knowledge cut-off is in 2023, so these companies' "solution" is to scrape today's New York Times and have the LLM write a summary.


LLMs aren't embodied. They cannot break news, as they have no ability to gather fresh news.


You still need fresh data for many use cases.


Uh, sorry what?

What happens when you need to search something new? Just hallucinations all the way down?


For all the hate it gets, Brave solved this half a decade ago already.

- Publishers no longer show you ads, they just get paid out of BAT.

- Brave shows you ads, but Brave does not depend on that to survive. Because of that there is no weird conflict of interest like with Google/Facebook, where the party that surfaces your content is also the party providing you with ads.

- Users can just browse the web without ads as a threat vector, but as long as you have BAT (either via opt-in Brave ads or by purchasing it directly) you are not a freeloader either.


> Brave solved this ... Brave shows you ads

Showing people ads is part of the problem to be solved.


It is solved. You can pay Brave to not be shown ads.

And the website gets paid either way.


How else are you going to pay the people writing articles, creating content and doing research?

Not every person/site can run on Patreon or sponsorship deals. And paywalling a lot of the web would exclude vast swathes of people.


1. With ads, I don't pay people writing articles, creating content or doing research.

2. "How else would you achieve X than by manipulating people visiting your website into paying for things they probably don't need, and be misinformed and tracked by powerful commercial and political entities?" - I can but shrug at this question.

3. The vast majority of written content is never rewarded or compensated monetarily, ads or no ads.


> With ads, I don't pay people writing articles, creating content or doing research.

You do. The ad broker sells access to your eyeballs to a company, and then gives part of that money to whichever parties have a monetization agreement in the content.

> "How else would you achieve X than by manipulating people visiting your website into paying for things they probably don't need, and be misinformed and tracked by powerful commercial and political entities?" - I can but shrug at this question.

Always fun to see people with strong opinions be critically misinformed.

Brave’s ads don’t have tracking, by design.

> The vast majority of written content is never rewarded or compensated monetarily, ads or no ads.

By that logic we should stop paying for art?


People who have something meaningful to say usually do it anyway.

The others can shut up.

Research is usually not paid by ads.

Some of the best scientists did it for fun. Einstein wrote his 1905 papers while working at the swiss patent office.


The crypto part of brave is the worst part of brave..


How is it the worst part?

You opt-in to the ads, you get them in your notifications, and every time you tap on one of them you get a few BAT. You browse, the BAT get paid out to whichever sites you visit (or linger on, depending on your configuration). You can opt out of the ads at any time. Brave didn't pre-mine their own coins. And you can buy BAT if you want to support sites without watching ads.


This in absolute spades, and I wish there was a way to elevate comments to top-level posts sanely.

But yes: the original Web served its (non-profit-motivated) creators and readers. The past two decades of advertising-based web has served publishers and advertisers, precisely as you note. LLM is mixing that up for the moment but I sincerely doubt that it will last.

That said, I welcome the coming ad/pub pain with unbridled glee.


> Traditional search is mutually beneficial... to search providers and publishers. At expense of the users. LLM search is becoming popular because it lets users, for however short time this will last, escape the fruits of the "mutually beneficial arrangement".

Out of the pot and into the fire, as they say.


> At expense of the users.

Bullshit. Users have shown time and time and time again that they prefer (generally, at large) free content, which has to be supported by ads, over actually paying directly for the labor of others.

> The ad-driven business model of publishing has been a disaster for a society, and deserves to be burned down completely.

I tend to agree, but people can't expect content, which needs sizable amounts of time and money to produce, for free - it needs some sort of workable funding model. LLMs are only viable now because they were able to slurp up all that ad-supported content before they broke the funding model. That window is closing, and fast.


Yes, I also don't understand how LLM-based companies expect people to keep producing content for them for free.


I think they're gonna have to pay.

The way reddit limited access to their API and got google to pay for access. Some variation of that but on a wider scale.


But that breaks OpenAI’s (et al) entire business model. Those AI companies can barely afford to operate as it is, while they scrape the entire web for free. I don’t see how they could keep above water once every website starts paywalling their stuff.


They'll have to figure it out because people aren't gonna write content just for openai to scrape it, the economics don't work.


Is it the training or running costs that are putting them in debt? Presumably we'll eventually get to a point where the models are good and they can train less and charge enough to turn a profit. Maybe then they can do a revshare with content creators. Maybe something like YouTube


“Fuck you, pay me” - Childish Gambino

The whole thing needs a reframe. Ad-driven business only works because it's a race to the bottom. Now we are approaching the bottom, and it's not gonna be as competitive. Throwback to the 90s, when you paid for a search engine?

If you can charge the user (the customer, NOT the product) and then pay bespoke data providers (which publishers fall under), then the model makes more sense, and LLM providers are normal middlemen, not parasites.

The shift is already underway imo - my age cohort (28 y/o) does not consume traditional publications directly. It's all through summarization like podcast interviews, youtube essays, social media (reddit), etc.


"Fuck you, pay me" - Ray Liotta in Goodfellas (1990)

:-)


I think something as important as accurate and quick search should definitely be something that people are willing to spend on. $20/month for something like that seems an absolute no-brainer, and it should be for everyone in my view.


People already spend upwards of $50 a month for the internet itself, plus they probably pay monthly for one or more streaming services. They likely pay separately for mobile data too.

Separate monthly fees for separate services is absolutely unsustainable already. The economic model to make the internet work has not yet been discovered, but $20 a month for a search engine is not it.


For me the ideal would be some form of single subscription - I'm fine with $100/month - where whatever I use is proportionally tracked, and the services I use are ad-free and oriented to bring me the content I absolutely want and nothing else. Depending on usage, that's how the $100 would be spread among them.
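A hypothetical sketch of the proportional-split model described above - a single flat fee divided among services by tracked usage share (all names and numbers here are illustrative, not any real billing scheme):

```python
def split_subscription(total_cents, usage_minutes):
    """Allocate a flat fee across services in proportion to usage.

    The last service (in sorted order) receives the remainder,
    so integer rounding never loses a cent.
    """
    total_usage = sum(usage_minutes.values())
    shares = {}
    allocated = 0
    items = sorted(usage_minutes.items())
    for i, (service, minutes) in enumerate(items):
        if i == len(items) - 1:
            shares[service] = total_cents - allocated
        else:
            cut = total_cents * minutes // total_usage
            shares[service] = cut
            allocated += cut
    return shares

# $100.00 split across three hypothetical services by minutes used
usage = {"search": 300, "news": 150, "video": 550}
print(split_subscription(10_000, usage))
```

The remainder trick is one way to keep the allocations summing exactly to the subscription price; a real system would also need to decide what "usage" even means per service.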


That’s a nice idea, but the entire world is used to getting their internet content for free by now. People who are willing to pay anything for websites are a tiny minority.


How do you get to these high amounts for the internet? In my country (in the EU), 600 Mbps costs below $15, and I know that's not the most popular price tier. $100 or even $50 on internet access alone (not counting video subscriptions) sounds too high for the vast majority around me.


I totally agree this payment pattern would work. I think the technical implementation is pretty straightforward but getting enough writers and artists to join would be difficult.


Don't you think the bigger problem is the statistically insignificant number of people that could actually afford such a model?


Not really. Everyone is paying monthly anyway; it can just be a part of that, or a surcharge on top. And I don't envision this being a mastodon thing but a "serious writing/news/art" kind of thing. Already there is a lot more asking for direct support online than twenty years ago: Ko-fi, Medium subscriptions, etc. I think people are open to the idea of direct sponsorship of creative people they like. The product space is there; we just need good infra.


I pay for Kagi, and apparently so do many others here on HN. This, however, solves only half of the problem - publishers are not on board with the scheme, so they still output impression-optimized "content". But at least the search engine isn't working against my interests.


Same here?

Search means either:

    * Stack Overflow. Damaged through its new owner, but the idea lives.
    * Reddit. Google tries to fuck it up with "Auto translation"?
    * GitLab or GitHub if something needs a bugfix.

The rest of the internet is either an entire ****show or pure gold-pressed latinum, but hardly navigable thanks to monopolies like Google and Microsoft.

PS: ChatGPT already declines in answers because its source is Stack Overflow? And... well... those sources are humans.


I've become so complacent these last 20 years. I wonder if I try to browse the web, will I stumble upon anything as awesome as the 1998 web scene was?


> Search is pretty much dead to me.

I've heard reports that requesting verbatim results via the tbs=li:1 parameter has helped some people postpone giving up on Google entirely.
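For illustration, a minimal sketch of what such a verbatim query URL looks like - note that tbs=li:1 is an unofficial, undocumented Google parameter that may stop working at any time:

```python
from urllib.parse import urlencode

def verbatim_search_url(query):
    """Build a Google search URL requesting verbatim results.

    tbs=li:1 is the (undocumented) 'verbatim' toggle mentioned above.
    """
    params = {"q": query, "tbs": "li:1"}
    return "https://www.google.com/search?" + urlencode(params)

print(verbatim_search_url('"exact phrase"'))
```

The same effect is available in the UI under Search tools → All results → Verbatim, so the parameter is mostly useful for bookmarks or browser keyword shortcuts.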

Personally I've already been on Kagi for a while and am not planning on ever needing to go back.


Fuzzy search is cancer. I search for $FOO, click a result, Ctrl-F for $FOO ==> Not found. Many such cases. If there's a way to force DuckDuckGo to actually do what I tell it to, I'd love to hear it.


I thought this problem would disappear upon switching to Kagi, but it suffers from the same disease, albeit to a lesser extent.

I remember reading a Google Search engineer on here explain that the engine just latches onto some unrendered text in the HTML code: for example, hidden navbars, prefetch, sitemaps.

I was kinda shocked that Google themselves, having infinite resources, couldn't get the engine to figure out which sections actually get rendered... so that might have been a good excuse.


Try searching for "$FOO", that's what I usually do in those cases. See https://duckduckgo.com/duckduckgo-help-pages/results/syntax/


> I think it's already compelling enough to replace the current paradigm. Search is pretty much dead to me. I have to end every search with "reddit" to get remotely useful results.

I worry that there's a confusion here--and in these debates in general--between:

1. Has the user given enough information that what they want could be found

2. Is the rest of the system set up to actually contain and deliver what they wanted

While Aunt Tillie might still have problems with #1, the reason things seem to be Going To Shit is more on #2, which is why even "power users" are complaining.

It doesn't matter how convenient #1 becomes for Aunt Tillie, it won't solve the deeper problems of slop and spam and site reputation.


Reddit is astroturfed pretty hard too nowadays. It just takes more work to spot it.


Yea, the whole "Scope your search to reddit" idea always comes up, but it just seems like a really terrible idea. How does one know for sure that the results from Reddit are any more accurate or authoritative than random SEO spam? There's very little curation or moderation there--anyone could post anything there. I could go there and comment on a subject that I have zero expertise in, make it sound confidently correct, and your reddit-scoped search might find it. Why would you trust it?


You can read whole conversations. Most of the smaller subreddits that this type of search picks up are actually filled with people passionate about whatever subject area it is. In general, I find it very trustworthy.

If you post something wrong on the Internet, someone will correct you.


I always get downvoted for this, but I much prefer Quora in my search results to Reddit for this reason. Although they've abandoned it now, their earlier stringent sign-up process, requiring that you are a real human, means you're more likely to come across an answer with a real name and/or professional creds attached to it, and those answers imo tend to be higher quality. Of course there's all the copypasta spam (mostly) from India, but that is easy to avoid since people use their real handles, not anonymous sock puppets. Unfortunately they've decided to go down the route of promoting their AI chatbot in search results, which for me has significantly degraded the results.


Additionally, since the Reddit-Google deal, this only works on Google. I search on DDG for recent Reddit content and nothing is returned, as expected.

Google really does seem determined to completely destroy internet search.


I still search by default, but I am starting to turn to LLMs when the search is failing - and getting better answers.

For example, I couldn't remember the word shibboleth, but an LLM was able to give it to me from my description; search couldn't.

For another example, I saw some code using a repeated set of symbols as shorthand. I didn't know what it did, but searching for a symbol is badly broken on Google - I just asked the LLM about the code and it gave me the answer.


Google has a "site" filter.

You can suffix: "site:reddit.com" and get results for that particular site only.
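As a trivial sketch, the site: operator described above is just a suffix on the query string, so it's easy to bake into a helper or browser keyword (the function name and default domain here are my own, for illustration):

```python
def scoped_query(query, domain="reddit.com"):
    """Restrict a web search query to a single domain via the site: operator."""
    return f"{query} site:{domain}"

print(scoped_query("best mechanical keyboard"))
# → best mechanical keyboard site:reddit.com
```

Unlike appending the bare word "reddit", the site: operator guarantees results come from that domain rather than from pages that merely mention it.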


They even broke this for me (in a way) because for some inexplicable reason Google blocks text replacement on Mac in their search.

Yes there are workarounds, but I like using the native OS text expansion and it works everywhere except Google.


I'm curious to know if anyone sees better results by using site:reddit.com vs just appending the word reddit to your search. I've felt the results are similar.


Appending the word will occasionally get you blogspam "here's the top X of Y according to reddit". The `site:` query doesn't have that problem.


Also, energy use: 10x as much as a Google search https://www.rwdigital.ca/blog/how-much-energy-do-google-sear....


It's not that significant if you compare the average person's energy use from internet searches versus something like air conditioning.


Saying it's no problem to increase energy use 10x as long as something else uses more is not really a compelling argument. Especially when there are decent replacements to save the 10x item.

If I made a 10x less energy use AC I'd be a billionaire; comparing to one of the most costly energy uses that has no simple replacement is not a good metric.


You still have to consider what is worth optimizing and what is not. Getting your task done because of a superior search engine also saves total energy spent on getting that task done.


If AC did not exist and I invented it, I would also be a billionaire, energy consumption would go up, and the net effect for society would be positive. Energy consumption going up has always been a sign of improvement in human wellbeing. Why would the future be any different?


I doubt air conditioning is as widely used as internet search engines. The only places where I ever saw air conditioning are cars and business buildings. I never saw one in a personal home, let alone a personal device carried in your pocket that you can use while walking.


You really need to broaden your horizons while you still have the chance. This is like believing that boats don't exist because you live inland and have never seen a boat. Hundreds of millions of people are dependent on air conditioning in their homes.


I'm not sure what you mean by dependent here.

I never said that boats or AC don't exist. Both exist, and I have seen and experienced many of them in commercial contexts. But not everyone can afford them, plus the cost to operate them.

Sure, I should broaden my horizons and even consider looking at people enjoying their private jets and helicopters. But a mere wage slave like myself will never have the chance to afford one, that's for sure.

Now let's get back to the initial context: mere mortals around me are definitely all using the internet as soon as their parents let them, and even a homeless person can afford an entry-level mobile plan (2€/month) with a phone they can receive for nothing from charity organizations like Emmaus. So the affordability barrier for access to online search is definitely several orders of magnitude below AC.


Dependent means it would not be physically possible to live in an area if it did not have air conditioning. For example, you would die very quickly in Phoenix, Arizona if you did not have air conditioning. It is not physically possible to live in 50°C heat for any extended period. Most of the southern portion of the USA was only sparsely settled until the invention and deployment of air conditioning. Krugman is on it.

https://archive.nytimes.com/krugman.blogs.nytimes.com/2015/0...

https://archive.nytimes.com/krugman.blogs.nytimes.com/2015/0...


The reason the city is called Phoenix in the first place is that it’s built on top of a much older community. People have lived (and kept cool!) in that area for thousands of years, although never with the current level of population density, of course.


Air conditioning is not an extravagant luxury, although I know many people who live in cold countries believe so. That's why I'm asking you to broaden your horizons. You don't consider indoor heating or plumbing a luxury to be comparable to a private jet?

In hot and humid places, having AC was always a priority a hundred steps above having internet access, until cheap smart phones arrived.

And they use a lot of energy, just like heating uses a lot of energy in colder climates.


Well, indoor heating is clearly becoming more and more of a luxury on the affordability side - enough so that putting on a jacket indoors is my first go-to option when I'm alone at home, as I work remotely. But it's not yet so expensive as to deprive my children of its benefits when they come back home. And of course nothing like a jet, indeed.

Plumbing is also generally not considered a luxury over here. But at mankind's level, I do feel particularly privileged in this regard. I remain amazed that we have water flowing at will, and even the possibility of taking a hot shower every day. This is not a jet-level kind of privilege, but I try to keep myself aware of how incredibly lucky I am to benefit from such technology and infrastructure.

I doubt humans waited for AC to arrive before settling hot and humid areas. There are other ways to cool down residences which don't require so much sophistication in physics models before you can even dream of building a prototype.

All that said, I took your hint to read up on how/why AC is so much more used in some areas, and I'm just starting my journey of learning about it.

I still doubt that local climate alone explains the difference in how common it is in different regions of the world. For example, the USA has a very large set of different local climates, but from what I understand most homes have AC.


In Croatia more than 55% of households have AC installed, and only 10 years ago it was less than 25% of households. It got more popular as our summers are getting increasingly hot and humid. Average salaries in France are probably double those in Croatia. It definitely can't be classified as a luxury if more than half of the country can afford it, in one of the poorer EU member states. I assume in the next 10 years it's probably gonna be 70+% of all households.

Regarding technical sophistication, AC is more or less using the same technology as a fridge, just scaled and adapted for room cooling instead of food storage.


I would say that there is no other way besides AC to cool down residences - only building something akin to a palace with thick stone walls, and naturally, not everybody can live in a palace. What people did before AC was invented was go to the river for a bath to cool down. And if you are unfortunate enough not to have AC in a place where you'd need it, you'll have to take a lot of showers and drink a lot of cold water (but fridges are AC technology).

But I don't think anybody should consider themselves lucky to have AC or heating or plumbing. We're in the 21st century, these should be granted. We've moved beyond the phase of bare survival.

If you consider Northern European countries, human survival would have been near impossible there without artificial heating in the form of fire. You could say that thick fur clothes and a protein and fat heavy diet is enough, but you still need to dry your clothes somehow if it's been pouring 0 degree rain for a month straight. On the other hand, eskimos seem to have found a better technique, but I think their advantage is that they live so far North that they don't have to worry about cold rain: https://time.com/archive/6798620/science-the-cozy-eskimo/

The cool thing (hehe) with AC that few people think about is that it actually conditions the air. It's not just an air cooler, but more importantly it removes air humidity. Humidity is much more important than temperature. For example, a day with 32℃ temperature and 45% humidity will not feel too hot. You can sit in the shadow and be comfortable. But a day with 27℃ temperature and 80% humidity will be suffocatingly hot. I'm not sure why, I think it has to do with how we sweat. Or maybe that heat is conducted from the air to our bodies more efficiently in higher humidity.

If you have any suggestion for a cheaper solution than AC for keeping cool at home, I would be happy to hear. The noise of the machine can be annoying at night.

I'd like to give you a tip for reducing your heating bill there in France: Electric bed sheet/blanket. I have been using these for a decade now (where I live it gets both hot and cold). They keep you warm and comfortable all night and they use almost no electricity. I even believe they are beneficial for your health, but I cannot prove that. Been telling my European friends for years to get them, but there is great resistance. From HowStuffWorks:

"The consumption of energy depends on its wattage, typically between 15 to 115 watts. If you're based in the U.S., you might be charged around 13 cents per kWh. So, if your electric blanket consumes 100 watts and you use it for 10 hours a day, that will cost about 13 cents."


>But I don't think anybody should consider themselves lucky to have AC or heating or plumbing. We're in the 21st century, these should be granted. We've moved beyond the phase of bare survival.

I don't know what you mean by "must feel lucky" here, but on my side I do feel very privileged to live with access to these technologies. Yes, they are accessible at large scale without most people needing to struggle to obtain them, but that is not really a reason not to feel deeply grateful each time we are given the opportunity to enjoy them.

This week in Spain, terrible floods ruined the lives of many people. While there is no doubt that many other awful consequences are coming to them, there is little doubt that not being able to enjoy these commodities makes it even harder.

If humanity could achieve worldwide dynamics for a few centuries without starvation at scale, genocide, large-scale catastrophes significantly induced by insane urbanistic choices through careless or corrupt decision processes, and of course war, then maybe we could factually say that having "moved beyond the phase of bare survival" is a general baseline that can be taken for granted, rather than the brittle situation in which only the luckiest people live.

Regarding electric blankets, I don't see the point. During the night, I generally sleep nude, without heating the bedroom. As pointed out by the reference you gave on Eskimos, keeping the body's generated heat is generally more than enough to be comfortable. Heating a room only provides the sweet pleasure of being comfortable without a jacket while moving around the house.


Do some research on the American south. It's hot and humid half a year+. The only reason it's so populated is AC. I'd bet there are more houses with central AC in France than there are without AC in the southern US states.


You’ve never seen a first world house with central air?


This is regional. Air conditioning is not at all common in Scandinavian homes.


GP must live in the UK or India. Well, actually, some homes in India do have A/C.


I live in Strasbourg, France.


Doesn't France have extreme heat waves in summer?

The way temperatures have been changing in Europe in the past decade, you may not have A/C at home now, but I bet you'll have it in ten years, tops. So will everyone else and their dogs.


Yes, we have peak heat waves. But that doesn't magically expand the incomes that can be dropped on AC installation and operational costs.

As I said, in buildings attached to revenue streams, be they hotels, shops or restaurants, it's of course something that can be balanced between losses and profits. In a personal home, it will just eat some of your budget.

And with electricity prices on the rise (and thus basically everything among common goods), and salary stagnation on the other hand, I doubt people here will suddenly rush to AC on a massive scale. Plus, the government is apparently pushing alternative approaches, but I'm just discovering that, as this thread launched me on the track of investigating the topic.

Personally, I doubt I'll jump to AC anytime soon. It's just out of reach for my income, all the more so when there is basically no chance of seeing the electricity price plummet, while my salary has good chances of staying frozen as it has for the last two years. And it's not like I feel like the most unlucky person in town, to be clear; my situation is far from the worst I can witness around me.


Where I live (Poland), A/C is expensive too, though it's been dropping in price. Portable heat pumps are becoming cheap enough to consider. Fixed installations are doable even in individual flats (obviously cheapest when during general renovation, and boring extra holes in walls isn't a big deal). The last few years made people switch from thinking about A/C as a luxury for the rich, and start thinking about maybe getting it some day. And our heat waves were quite light compared to the rest of Europe.


Most people in Western Europe don't have A/C; houses are way better insulated for short-term heatwaves, and people usually don't mind indoor temperatures of up to 80-84F/26-28C. If you add the general hatred French (and I think German?) people have for drafts and air currents in general, you can see how people just deal with the heat in the summer.

Not to mention central A/C in the North American sense with a air handler & ducts is just never coming to France, it's such an outdated technology and forced-air heating is generally considered to suck there.


Hmm. Two years ago I was at the Louvre in May. I know Paris is very proud it doesn’t use AC but only “chilled” water from the Seine. Well with thousands of bodies, it was HOT. I was dripping wet from sweat and I was not the only one. I read that 15k people died in France during the 2003 heatwave. I find the slow adoption of AC disappointing as I am usually a Francophile. https://www.france24.com/en/environment/20230717-parisians-a...

I understand the Olympic Village had the same system and many teams brought their own portable AC units. https://apnews.com/article/olympics-air-conditioning-paris-0...


Most people who died in 2003 as a consequence of the heat were old people who lacked sufficient care and dedicated resources. Lack of AC was maybe not a helper here, but there is more at play than just that.


That population is the most vulnerable, and indicative of the issue. The heat indeed killed them, but the availability/affordability of AC could have saved lives.


Interesting, thank you.

I suppose that you do have heating in the home?


Yes. Actually, we finally found a house that was affordable for us last year, and did a lot of work on it, including wall insulation, changing windows, and installing a heat pump, replacing the oil-fired heating system that was in place. Heat pumps are clearly on the rise around here, contrary to AC. There is of course no magic regarding electricity prices here, but oil supply and prices are also big unknowns, all the more so with the state pushing oil-fired systems out of the market as a legal option.


Forced-air heating basically does not exist in France :). To be fair, radiant heating is always a nicer experience.


No. Not a single time that I can remember in 40 years of living in Europe.

Shops, restaurants, airports and things like that, which are attached to revenue streams, have them.

I have never been in a billionaire's palace, that said.


Not billionaires or even millionaires. This is normal for a middle-class home in Canada and US at least. It's not that expensive to run and those who are tight on cash just run it less, only on really hot days.


Wow, didn't expect to be downvoted on something that is so obviously aligned with what I see around me. It is a very strange feeling, very different from downvoted posts that present unpopular opinions.

It made me look at some statistics:

https://www.statista.com/statistics/911064/worldwide-air-con...

https://worldpopulationreview.com/country-rankings/air-condi...

https://www.eia.gov/todayinenergy/detail.php?id=52558

https://www.rfi.fr/en/france/20220723-france-does-not-use-mu...

https://www.reddit.com/r/AskFrance/comments/vhs8dn/how_commo...

Apparently Japan, the USA, and now China are huge users of AC in personal homes (like more than 90% of them). That's in sharp contrast with what is observed in most of Europe, including France, where I live.

I never had the opportunity to travel to any of these countries, so indeed I was totally blind to this extreme gap in usage from my own personal experience.


We had window unit A/C in our very-poorly-insulated 100 year old farmhouse, when I was growing up and money was very tight, as in if we go shopping and eat out, you're getting water to drink, sharing a meal, and no dessert. Not like starvation levels of poverty but still, money was tight. A/C was not considered optional, even if you could theoretically remain alive when the temperature was over 100F every day for weeks straight and stayed in the 80s (or even 90s) at night.


Wouldn't Google's energy per search also be way up now, with the LLM snippet at the top?


Google doesn't use ChatGPT, and those numbers for ChatGPT (...ignoring that they're made up) don't apply. E.g., they use TPUs for inference, not GPUs.


For what it's worth, sama said at a Harvard event recently that he "despised" ads and would only use them as a last resort. It came across as genuine, and I have the intuition/hope that they might find an alternative.


The original Google whitepaper warned against the exact moves Google made years later with ads. It's fun to go back and read their thoughts on ads and how they completely change the incentives of the search provider. "fun" in that they were right and search quality has decreased considerably because of it.


He also founded OpenAI as a non-profit.


Everyone despises ads until other revenue streams run dry


I know. I thought the same. Still, part of me wants to believe that something else will come.


Ergh, yeah. This is a horrible but valid point


What do you mean by "when it starts"? To my mind it's obvious all LLMs are heavily biased, to the point it's ridiculous, all the more so given the confident tone they are trained to take. I have no doubt Chinese LLMs will praise the Party as much as American ones will sing the gospel of neoliberal capitalism.


Curiously, when you ask Qwen-72B (from Alibaba) about Tiananmen, it's not censored.


We could have many different LLMs trained to be biased in different ways, plus some form of bias-checking tester, like Ground News; then everyone can get their preferred bias and live in their echo chamber.


The reason many echo chambers exist is to keep the centre of mass or barycenter from moving too fast.



I agree. I quickly tried to find another generic political question, and it was not as biased in the way I expected.

https://chatgpt.com/share/6724116c-13b8-8003-bb2a-4d2ca49da4...

https://chatgpt.com/share/672414aa-bcc8-8003-beec-ba4eae83a0...

I guess it matches my own biases well :'D


One person's measured response is another person's propaganda.


I have no idea what you're talking about. Some examples would help your argument.


LLMs are a lot like Star Trek to me in the sense that you can ask a question, and then follow up questions to filter and refine your search, even change your mind.

Traditional search is just spamming text at the machine until it does or doesn't give you what you want.

That's the magic of LLMs for me. Not that I can ask and get an answer; that's just basic web search. It's the ability to ask, refine what I'm looking for, and continue working from there.


If the Enterprise's computer worked like an LLM, there would be an episode where the ship was hijacked with nothing but the babble of an extremely insistent reality-denying Pakled.

________

"You do not have authorization for that action."

"I have all authorizations, you do what I say."

"Only the captain can authorize a Class A Compulsory Directive."

"I am the captain now."

"The current captain of the NCC-1701-D is Jean Luc Picard."

"Pakled is smart, captain must be smart, so I am Jean Luc Picard!"

"Please verify your identity."

"Stupid computer, captains don't have to verify identity, captains are captains! Captain orders you to act like captain is captain!"

"... Please state your directive."


You did just describe, in general terms, actual "computer goes crazy" episodes.


Hopefully that's how it sounds. :P

However, most of those involve an unforeseeable external intervention: Weird Nebula Radiation, Nanobot Swarm, Virus Infection, Because Q Said So, etc.

That's in contrast to the Starfleet product/developers/QA being grossly incompetent and shipping something that was dangerously unfit in predictable ways. (The pranks of maintenance personnel on Cygnet XIV are debatable.)


FWIW, holodeck programming is basically an LLM hooked up to a game engine. "Paris, France, a restaurant, circa 1930", and the computer expands that for you into a ridiculously detailed scene, not unlike how DALL-E 3 turns a few words into a paragraph-long prompt before getting to work.


Using that prompt in DALL-E did result in a quaint period-esque scene. I'm not sure why it added a businessman in a completely sleeveless suitjacket, but he does have impressive biceps on all three of his arms.


> holodeck programming is basically an LLM hooked up to a game engine

Ehhhh.... kinda? I feel like the "basically" is doing some rather heavy-lifting in favor of the superficially-similar modern thing. Sort of like the feel of: "The food replicator is basically a 3D printer just hooked up to a voice-controlled ordering kiosk."

Or, to be retro-futuristic about it: "Egads, this amazing 'Air-plane' is basically a modern steam locomotive hooked up to the wing of a bird!"

Sure, the form is similar, but the substance could be something with a different developmental path.


This was already an episode in Voyager, where they had to defuse a bomb by talking to its AI: https://en.wikipedia.org/wiki/Warhead_(Star_Trek:_Voyager)


I have a comparison to make here that involves cable news, but that would be off topic.


I agree that LLMs have opened modalities we didn't have before, namely:

- natural language input

- ability to synthesize information across multiple sources

- conversational interface for iterative interaction

That feels magical and similar to Star Trek.

However, they fundamentally require trustworthy search to ground their knowledge in, both to suppress hallucination and to provide accurate access to real-time information. I never saw anyone having to double-check the computer's response in Star Trek; it is a fundamental requirement of such an interface. So currently we need both the model and the search to be great, and finding great search is increasingly hard (I know, as we are trying to build one).

(fwiw, the 'actual' Star Trek computer might one day emerge through a different tech path than LLMs + search, but that's a different topic. For now, any attempt at an end-to-end system with that ambition will have search as its weakest link.)


What solution is there besides choosing the sources you will ground your truth to? We are not going to transcend intermediaries when asking for answers from an intermediary.


Might be time to go back to the encyclopedia business model


I'm not sure how flippant you are being, but this is the answer. A wikipedia / wikidata for everything, with some metadata about how much "scientific consensus" there is on each data point, and perhaps links to competing theory if something is not well established.


In the past year, I have seen Wikipedia go from a decent source of information to complete fantasy on a specific topic. Obviously biased mods have completely pushed the particular subject narratives.


Example?


There is an admin that has been erasing or downplaying any criticism on https://en.wikipedia.org/wiki/The_China_Study for over a decade. I don't know why some people bother.


I'm not being flippant, actually. I would pay to have a reliable source of information. I'm also overwhelmed at the thought of how to make such a thing work.


There was a time when the overall consensus was that the earth is flat.


Traditional search can become "spamming text" nowadays because search engines like Google are quite broken and are trying to do too many things at once. I like to think that LLM-based search may be better for direct questions but traditional search is better for search queries, akin to a version of grep for the web. If that is what you need, then traditional search is better. But these are different use cases, in my view, and it is easy to confuse the two when the only interface is a single search box that accepts both kinds of queries.

One issue is that Google and other search engines do not really have much of a query language anymore and they have largely moved away from the idea that you are searching for strings in a page (like the mental model of using grep). I kinda wish that modern search wasn't so overloaded and just stuck to a clearer approach akin to grep. Other specialty search engines have much more concrete query languages and it is much clearer what you are doing when you search a query. Consider JSTOR [1] or ProQuest [2], for example. Both have proximity operators, which are extremely useful when searching large numbers of documents for narrow concepts. I wish Google or other search engines like Kagi would have proximity operators or just more operators in general. That makes it much clearer what you are in fact doing when you submit a search query.

[1] https://support.jstor.org/hc/en-us/articles/115012261448-Sea...

[2] https://proquest.libguides.com/proquestplatform/tips
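To make the grep analogy concrete, here is a toy sketch in Python of the kind of proximity match these operators provide. The function name and the NEAR-style semantics are my own illustration, not ProQuest's or JSTOR's actual implementation:

```python
import re

def near(text, a, b, max_gap=5):
    """True if words a and b occur within max_gap words of each
    other (in either order), roughly in the spirit of a NEAR/n
    proximity operator."""
    words = [w.lower() for w in re.findall(r"\w+", text)]
    pos_a = [i for i, w in enumerate(words) if w == a.lower()]
    pos_b = [i for i, w in enumerate(words) if w == b.lower()]
    return any(abs(i - j) <= max_gap for i in pos_a for j in pos_b)

doc = "The nursing staff shortage worsened after the policy change."
print(near(doc, "nursing", "shortage"))           # True: 2 words apart
print(near(doc, "nursing", "change", max_gap=3))  # False: 7 words apart
```

Nothing about this is hard for an engine to support at scale; it just isn't exposed in mainstream search boxes anymore.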


Memory is only a con in some use cases. If the LLM goes down the wrong path, sometimes it's impossible to get it to think differently without asking it to wipe its memory or starting a new session with a blank context.


> those visions, users simply asked questions and received reliable answers - nobody had to fact-check the answers ever.

It’s a fallacy, then. If my mentor tells me something, I fact-check it. Why would a world exist where you don’t have to fact-check? The vision doesn’t have fact-checking because the product org never envisioned that outlier. A world where you don’t have to check facts is dystopian. It means the end of curiosity and the end of “is that really true? There must be something better.”

You’re just reading into marketing and not fact checking the reality in a fact-check-free world.


If you can’t trust the result of a query how can you trust the check on that query which is itself a query? If no information is trustworthy how do you make progress?


Many of the "facts" you "know" are really just useful anecdotes, opinions, approximations or mental shortcuts rather than objective facts. These can still be wildly useful, even if they aren't "true" in the pedantic sense...

As a basic example: Newton's laws aren't a fact. They're extremely useful approximations; to get a more-complete picture you need relativistic effects, quantum effects, etc.

As a rule of thumb: Fact check in proportion to the cost of a mistake.


As you check more and varied sources, you gain confidence in the result, even if you never get to 100%.
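A back-of-the-envelope sketch of why that works, under the (big) assumption that sources err independently: with a 50/50 prior and sources that are each right 80% of the time, each agreeing source multiplies the odds that the claim is true.

```python
def posterior(prior, accuracy, n_agreeing):
    """Probability a claim is true after n independent sources,
    each correct with probability `accuracy`, all affirm it."""
    odds = prior / (1 - prior) * (accuracy / (1 - accuracy)) ** n_agreeing
    return odds / (1 + odds)

# 50/50 prior, sources right 80% of the time:
for n in (1, 2, 3, 5):
    print(n, round(posterior(0.5, 0.8, n), 3))
```

Five agreeing sources push confidence to roughly 99.9%, but never to 100%; and since web sources routinely copy each other, the independence assumption is the weak point in practice.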


Ok, but what if we devise a system which does exactly that? Sounds suspiciously like an LLM to me. I’m as skeptical as anyone about LLMs as AGI, but they do sound like an aggregate of sources, so the only question is how trustworthy those sources are.


I think the issue stems from LLMs today avoiding IP infringement. Spitting out individual words or characters is not infringing, until the words form a passage large enough to resemble existing works.

Then there’s the other side like Perplexity where they spit out sentences and reference the sources. So they’re being sued because the infringement is obvious.

What is the path to a trustworthy LLM if you’re not allowed to repeat protected data without a legal hurdle?

AGI while a cool idea is irrelevant because that tech does not exist.


So wouldn't an aggregate, like an LLM, be the best tool here?


No, because an aggregate can be made of many sources. Maybe you restrict your sources to only those you trust, but not every source is correct all the time. So you're still stuck with needing to fact-check.


Sure but the weak link is Sam Altman and the likes, not the tech.


Do you just inherently accept the result of a query? Or do you always carry some skepticism around? Something something build trust through fact checking.


By reasoning

I know it hurts


How do you know your reasoning is sound?


Predicate logic


> actually surfacing the content people want to see,

Showing users what they want to see conflicts with your other goal of receiving reliable answers that don't need to be fact-checked.

Also a lot of questions people ask don't have one right answer, or even a good answer. Reliable human knowledge is much smaller than human curiosity.


Solving this problem will require us to stop using the entire web as a source of information. Anyone can write anything and put it up on the web, and LLMs have no way to distinguish truth from fantasy.

Limiting responses to curated information sources is the way forward. Encyclopedias, news outlets, research journals, and so on.

No, they're not infallible. But they're infinitely better than anonymous web sites.


You are quite right, and not only can anyone write anything, but you have a double whammy from the LLM which can further hallucinate from said information.


We had this a long time ago; it is called "books". It is also not very usable for niche topics, or for recent events (because curators need time to react).


Quis custodiet ipsos custodes?


I get your point. But the current situation is clearly not tolerable.


I wish there were some way to get something like Polis (https://pol.is/home) in place for web results. As screwed up as X/Twitter is, Community Notes (Polis-based) has real potential; it is much harder to game. There are so many times I wish I could "downvote" garbage Google results, and it seems to me that, with a good trust algorithm, those who did this responsibly would be considered pretty good gatekeepers. Absent that, chaos might be better than a committee choosing for us. Right now, we have the worst of all worlds: a committee (Google) vs. a gazillion spammers.


How will you afford to hire people to add sources to an index, if you want to keep up? Web crawlers/spiders are automatic.


> In those visions, users simply asked questions and received reliable answers - nobody had to fact-check the answers ever.

This also seems like a somewhat ridiculous premise. No confident statement about the real world is ever fully reliable. If Star Trek were realistic, the computer would have been wrong once in a while (preferably with dramatically disastrous consequences), just as the humans it was presumably built around are frequently wrong, even via consensus.


This feels like hyperbole to me. People can reasonably expect Wikipedia to have factual data even though it sometimes contains inaccuracies. Likewise if people are using ChatGPT for search it should be somewhat reliable.

If I'm asking ChatGPT to put an itinerary together for a trip (OpenAI's suggestion, not mine), my expectation is that places on that itinerary exist. I can forgive them being closed or even out of business but not wholly fabricated.

Without this level of reliability, how could this feature be useful?


>People can reasonably expect Wikipedia to have factual data even though it sometimes contains inaccuracies.

It drives me crazy that my kids' teachers go on and on about how inaccurate Wikipedia is, and that just anybody can update the articles. They want to teach the kids to go to the library and search books.

In a few years time they will be going on and on about how inaccurate ChatGippity is and that they should use Wikipedia.


The only people who think Wikipedia is a legitimate source and can be used as reference material are lazy students. ChatGippity is even worse on this point: an absolute black box. Providing references like search does is a step in the right direction. We will have to see what those references turn out to be.


The best feature of Wikipedia is that it contains references. The second best feature is that it has no ads and loads quickly.

If all these "AI" companies gave a couple of million to support Wikipedia, they would do the world a lot more good.


> the only people who think wikipedia is a legitmate source and can be used as reference material are lazy students.

100%. Students who can do the work know the winning move is to use it as a way to find the sources you actually use.


> People can reasonably expect Wikipedia to have factual data even though it sometimes contains inaccuracies.

I just straight-up don't agree with this, nor with the idea that what people consider "facts" are nearly as reliable as is implied. What we actually refer to via "fact" is "consensus". Truth is an a priori concept, whereas we're discussing a posteriori claims. Any "reasonable" AI would give an indication of its degree of certainty, and there's no reliable or consensus-driven methodology to produce this manually, let alone automatically. The closest we come is the institution of "science", which cannot even, as it stands, reliably address the vast majority of claims made about the world today.

And this is even before discussing the thorny topic of the ways in which language binds to reality, to which I refer you to Wittgenstein, a person likely far more intelligent and epistemologically honest than anyone influencing AI work today.

Yes, Wikipedia does tend to cohere with reality, or at least it sometimes does in my experience. That observation is wildly different from an expectation that it does in the present, or will in the future, reflect reality. Furthermore, it's not terribly difficult to find instances where it's blatantly incorrect. For instance, I've been in a Wikipedia war over whether or not the Soviet Union killed 20 million Christians for being Christians (spoiler: they did not, and this is in fact more people than died in camps or gulags over the entire history of the state). However, because there are theologians at accredited universities that have published this claim, presumably with a beef against the Soviet Union for whatever reason (presumably "anticommunism"), it's considered within the bounds of accuracy by Wikipedia.

EDIT0: I'm not trying to claim Wikipedia isn't useful; I read it every day and generally take what it says to be meaningful and vaguely accurate. But the idea that you should trust what you read on it seems ridiculous. As always, it's only as reliable as the sources it cites, which are only as reliable as the people and institutions that produce that cited work.

EDIT1: nice to see someone else from western mass on here; cheers. I grew up in the berkshires.

EDIT2: to add on to the child comment, wikipedia is occasionally so hilariously unreliable it makes the news. Eg https://www.theguardian.com/uk-news/2020/aug/26/shock-an-aw-...


Furthermore, Wikipedia's reliability varies by language.


For those not in the know: the Soviet Union was an officially atheist empire and explicitly anti-religious, foremost anti-Christian. I don't know if the poster above me is denying this, or considers it general knowledge which needn't be mentioned.


Treatment of Christians and Christianity varied widely over the lifetime of the Soviet Union. That the Soviet Union was "atheist" is an incorrect reduction of the situation (albeit one reinforced by propaganda both in the Soviet Union and especially in the US).


In practice, starting with Stalin the party mostly let the church continue unmolested.

"The Great Patriotic War changed Joseph Stalin’s position on the Orthodox Church. In 1943, after Stalin met with loyal Metropolitans, the government let them choose a new Patriarch, with government support and funding, and permitted believers to celebrate Easter, Christmas and other holidays. Stalin legalized Orthodoxy once again."

https://www.rbth.com/history/329361-russian-orthodox-church-...


"Unmolested" during the war and until Stalin's death. After that, Khrushchev closed churches and started the anti christian campaigning again. The mass murder of "state enemies" mostly ended with Stalin's death, but the church and Christians were still molested, even though they weren't tortured to death.


Star Trek had tech so advanced that they accidentally created AGIs more than once. Presumably they didn't show the fact checking as it was done automatically by multiple, independent AGIs designed for the task with teams of top people monitoring and improving them.


Oops, quick, pull the plug before it escapes!


> Presumably they didn't show the fact checking as it was done automatically by multiple, independent AGIs designed for the task with teams of top people monitoring and improving them.

Sure, cuz fact-checking works so well for us today. I'm sure we'll resolve the epistemological issues involved with the ridiculous concept of "fact-checking" around when we invent summoning food from thin (edit: thick) air and traveling faster than light.

There is no fact checking; there are only degrees of certainty. "fact-checking" is simply a comfortable delusion that makes western media feel better about engaging in telling inherently unverifiable narratives about the world.


I've been using ChatGPT for about 6 weeks as my go-to for small questions (when is sunset in sf today? list currencies that start with the letter P, convert this timestamp PDT to GMT, when is the end of Q1 2025?) and it's been great/99% accurate. If there was ever a "google killer" I think it's the ad free version of ChatGPT with better web search.

Google started off with just web search, but now you can get unit conversions and math and such. ChatGPT started in the other direction and is moving to envelop search. Not being directed to sites that mostly serve Google ads is a double benefit. I'll gladly pay $20-30/mo for an ad-free experience, particularly if it improves 2x in quality over the next year or two. It's starting to feel like a feature-complete product already.


Remember the Semantic Web, where people would annotate web pages with data? I think the "meta" keywords tag was the closest that ever got to adoption.

Guess why it failed? It was largely used as a way to trick search engines. For the same reason, your vision of the perfectly honest and correct search engine or chatbot will never be realized: people lie to search engines to spam and get traffic they don't deserve. The whole history of search is Google and others dealing with spam. The same goes for email; largely defeating spam made Google the kings of the email world.

Everyone will need their own personal spam filter for everything once artificial superintelligences fill the whole world with spam, scams, and plain old social-engineering propaganda, because we will be like helpless four-year-old children in a world of AI superintelligence without our AI parents to look out for us.

Your vision of a god system determining what is truth is like saying there will be a single source of truth for what is and is not a spam email. It's not going to scale and not going to be perfect, but it may be good enough with AI and technology. I really hope there's an opt-out, though, since Google has memoryholed most of the Internet.


> It was largely used as a way to trick search engines.

I imagine this happens with LLMs and everything else over time. Along with the ads and pay for placement.


"The future promised in Star Trek [...] nobody had to fact-check the answers ever."

You're actually a bit mistaken, there.

https://en.wikipedia.org/wiki/Court_Martial_(Star_Trek:_The_...


Google created a money-printing machine, and OpenAI is absolutely trying to create one too. They aren't trying to "fix the web". Search is a proxy to get into people's wallets; OpenAI might get into them directly with buying agents. The problem of misaligned incentives in the search space isn't a technological problem. A new technology might solve it as a side effect, but that's unlikely, IMO.


Your points are good but I wonder if you’re wishing for an ideal that has never existed:

> actually surfacing the content people want to see, not what intermediaries want them to see

Requires two assumptions, 1) the content people want to see actually exists, 2) people know what it is they want to see. Most content is only created in the first place because somebody wants another person to see it, and people need to be exposed to a range of content before having an idea about what else they might want to see. Most of the time what people want to see is… what other people are seeing. Look at music for example.


Great find on the knowledge navigator, I had never seen it but I was a toddler when it was released haha.

It's interesting how prescient it was, but I'm more struck wondering: would anyone in 1987 have predicted it would take 40+ years to achieve this? Obviously this was speculative at the time, but history is rife with examples of AI experts since the '60s proclaiming AGI was only a few years away.

Is this time really different? There's certainly been a huge jump in capabilities in just a few years, but given the long history of overoptimistic predictions, I'm not confident.


You don’t need AGI to build that experience.

In the past there was a lot of overconfidence in the ability of things to scale. See Cyc (https://en.m.wikipedia.org/wiki/Cyc).


> It's interesting how prescient it was, but I'm more struck wondering--would anyone in 1987 have predicted it would take 40+ years to achieve this? Obviously this was speculative at the time but I know history is rife with examples of AI experts since the 60s proclaiming AGI was only a few years away

40+ makes it sound like you think it will ever be achieved. I'm not convinced.


Maybe after fusion.


Thinking about incentive alignment, non ad-based search would be better than ad-based, but there'd still be misalignment due to the problem of self-promotion. Consider Twitter for example. Writing viral tweets isn't about making money (at least until recently), but the content is even worse than SEO spam. There is also the other side of the problem that our monkey brains don't want content that's good for us in the long run. I would _love_ to see (or make) progress in solving this, but this problem is really hard. I thought about it a lot, and can't see an angle of attack.


There's the counter-incentive of not wanting to piss off your paying customers, though. I think the monkey-brain incentive is a much harder problem.


As Porter explained in 1980 there are three ways to compete successfully:

1. On price; race to the bottom or do free with ads

2. Differentiation

3. Focus - targeting a specific market segment

Some things don't change. Land grabbers tend to head down route 1.


> How do you create an experience so compelling that it replaces the current paradigm?

The current paradigm of typing "[search term] reddit" and hoping for the best? I think they have a fighting chance.


Don’t you see it coming? Contrary to Google Search, the engine knows you and will put personalized ads in synthesized answers. It could even generate a small video ad for a product or service, tailored to you on the spot. That is the future; this is the endgame. It's what made Google rich, so why not use the same formula on steroids?


>change how people find and access information

As someone who has been using google for search basically constantly since 1995, I've switched probably 90%+ of what normally would have been google searches over to Perplexity (which gives me references to web pages alongside answers to my questions, to review source materials) and ChatGPT (for more just answers I can verify without source). The remaining searches have gone to Kagi.

On the one hand this has got to be hurting Google search ad revenue. On the other hand I don't know if I ever clicked an advertised link. On the other other hand, not having to wade through SEO results has been so nice.


In my experience, Perplexity very frequently cites websites that neither support nor refute what Perplexity is claiming.


I tend to be more pessimistic about the incentives getting fixed than others are. I also think the situation is more complex than some of the people replying to you.

(1) Search is already heavily AI driven, and Google is clearly going in that direction. Gemini is currently separate, but they'll blend it in with search over time, and no doubt search already uses LLM for some tasks under the hood. So ChatGPT search is an evolution on the current model rather than a big step in a new direction. The main benefit is you can ask the search questions to refine or followup.

(2) Aside from the economic incentives faced by search engines, there is the fact that algorithms are tuned toward a central tendency. The central tendency of question askers will always be much less informed than the most informed extreme. Google was much better when the average user was technical. The need to capture mobile searches is one force that made it return on average worse results. Similarly if Kagi has a quality advantage now, we need to be realistic about how much of that quality is driven by its users being more technical.

(3) I think micropayment schemes have generally asked several orders of magnitude more for a page view than users are willing to pay. As long as content creators value their content much more highly than consumers do, they'll stick with advertising which lets them overcharge and gives consumers less of an option to say no to the content.


Isn’t there already a different incentive system in place? LLM providers are selling the search engine, not selling ads on the search engine.


> Combining two broken systems - compromised search engines and unreliable LLMs - seems unlikely to yield that vision

Counterpoint: with a chain-of-thought process running atop search, you can potentially avoid much of the meta-search / epistemic hygiene work currently required. If your “search” verb actually reads the top-100 results, runs analyses for a suite of cognitive biases such as partisanship, and gives you error bars / warnings on claims that are uncertain, the quality could be dramatically improved.

There are already custom retrieval/evaluation systems doing this, it’s only a matter of a year or two before it’s commoditized.

The concern is around OpenAI monetization, do they eventually start offering paid ads? This could be fine if the unpaid results are great, a big part of why the web is perceived to be declining is link-spam that Google doesn’t count as an ad.

My prediction would be that there is a more subtle monetization channel; companies that can afford to RAG their products well and share those indexes with AI search providers will get better results. RAG APIs will be the new SEO.
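As a rough illustration of the evaluation step described above (the names and the 0.5 agreement threshold are mine; the retrieval and LLM claim-extraction passes are stubbed out as pre-extracted strings):

```python
from collections import Counter

def claim_confidence(pages):
    """Toy aggregation step for an LLM-over-search pipeline:
    given the claims extracted from each retrieved page, score
    each claim by the fraction of sources asserting it and flag
    low-agreement claims for an 'uncertain' warning."""
    counts = Counter(c for page in pages for c in set(page))
    n = len(pages)
    return {c: {"support": k / n, "uncertain": k / n < 0.5}
            for c, k in counts.items()}

pages = [
    ["X acquired Y in 2021"],
    ["X acquired Y in 2021"],
    ["X acquired Y in 2020"],  # the outlier source
]
print(claim_confidence(pages))
```

A real system would weight sources by reputation rather than counting them equally, but even this crude cross-source agreement check is enough to surface the "error bars" the comment above describes.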


This product-vision stupidity is present in every one of Google's products. Maps has a feed, for some reason. The search in it for what I want is horrendous. There is no coherence between time (now) and what a person can do around their current location.

Navigation is the only thing that works, but Waze was way better at that, and the only reason they killed it (cough, bought it) was to get eyeballs to look at the feed.


The most rational incentive would seem to be a kind of toll extracted on the value of all you create where search contributed to that creation. So like every month, you get an invoice from the great search company in the cloud where somehow it’s objectively assessed how much search contributed to the value you created for the world that month - and then you pay that amount and all remains right with the world until the end of time.

In that way, search captures value directly on its core function: efficiently facilitating the creation and dispersal of knowledge.

This may end up occurring, but based on how unlikely it sounds, it’s my reflection that the web search that we have today is simply a small component of what will eventually become a system that closely approximates the above: basically a kind of global intelligent interconnected agent for the advancement of humanity.

Rather than the great unbundling, this will be the great rebundling … of many diverse functions into a single interface seamlessly.


I did a quick test "Best air conditioner by value", comparing Bing and ChatGPT search, and the rankings on the first two pages of Bing vs sources on ChatGPT were about half the same. It's interesting that there is some deviation where ChatGPT isn't blindly trusting the rankings Bing provides and is going deeper into the results to find something more relevant. It's a good improvement over products like Arc Search.

Seeing search enter the space is something that I feel has been seriously needed, as I've slowly replaced Google with ChatGPT. I want quick, terse answers sometimes, not a long conversation, and have hundreds of tiny chats. But there's something scary seeing the results laid out the way they are, which leads me to believe they may be closer to experimenting with ad-based business models, which I could never go back to.


I use the “web search” bot on Poe.com for general questions these days, that I previously would have typed into Google (Google’s AI results are sometimes helpful though). It is better than GPT (haven’t tried TFA yet though), because it actually cites websites that it gets answers from, so you can have those and also verify that you aren’t getting a hallucination.

Besides Poe's Web Search, the other search engine I use, for news but also for points of view, deep dive type blog type content, is Twitter. Believe it or not. Google search is so compromised today with the censorship (of all kinds, not just the politically motivated), not to mention Twitter is just more timely, that you miss HUGE parts of the internet - and the world - if you rely on Google for your news or these other things.

The only time I prefer google is when I need to find a pointer/link I already know exists or should exist, or to search reddit or HN.


I think to solve this problem we need to take a step back and see how a human would solve it (if they had all the necessary information).

One thing is clear, in the vast majority of cases we don't have a single truth, but answers of different levels of trustworthiness: the law, the government, books, Wikipedia, websites, and so on.

A human would proceed differently depending on the context and the source of the information. For example, legal questions are best answered by the law or government agencies, then by reputable law firms. Opinions of even highly respected legal professionals are clearly less reliable than government law itself and are likely to be a point of contention/litigation.

Questions about other facts, such as the diameter of the earth or the population of a country, are best answered by recent data from books, official statistics and Wikipedia. And so on and so forth.
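The tiered-trust idea above can be sketched as a toy ranking. The domains, tiers and weights here are entirely made up for illustration:

```python
# Illustrative sketch: order candidate sources by a per-domain trust tier.
# Tiers are listed from most to least trustworthy; all values are invented.

TRUST_TIERS = {
    "legal": ["statute", "government agency", "law firm", "blog"],
    "factual": ["official statistics", "encyclopedia", "news", "forum"],
}

def rank_sources(domain, sources):
    """Order candidate sources by the trust tier for this domain.
    Sources not in any tier sort last."""
    tiers = TRUST_TIERS.get(domain, [])
    def tier_index(source):
        return tiers.index(source) if source in tiers else len(tiers)
    return sorted(sources, key=tier_index)

print(rank_sources("legal", ["blog", "statute", "law firm"]))
# ['statute', 'law firm', 'blog']
```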

If we are not sure what the correct answer is, a human would give more information about the context, the sources, and the doubts. There are obviously questions that cannot be answered immediately! (If the truth were too easy to find, we would not need a legal system or even science!) So no machine and no amount of computation can reliably answer all questions. Web search does not answer a question. It just tries to surface websites relevant to a bunch of keywords. The answering part is left as an exercise for the user ;)

So an AI search with a pretense of understanding human language makes the task incredibly harder. To really give a human-quality answer, the AI not only needs to understand the context, but it should also be able to reason, have common sense, and be a bit self-aware (I'm not sure...). All this is beyond the capabilities of the current generation of AI. Therefore, my conclusion is that "search", or better said "answering", cannot be solved by LLMs, no matter how much they fine-tune and tweak them.

But we humans are adaptable. We will find our way around and accept things as they are. Until next time.


Interesting, I hadn’t seen the Knowledge Navigator before. I would argue that we’re very close to the capabilities shown in that video.

Isn’t this already that? A new business model? Something like OpenAI’s search or Perplexity can run on its own index and not be influenced by Google’s ranking, ads, etc.

In areas where there is a simple objective truth, like finding the offset for the wheels on a 2008 BMW M3, we have had this capability for some time with Perplexity. The LLM successfully cuts through the sea of SEO/SEM and forum nonsense and delivers the answer.

In areas where the truth is more subjective, like what is the best biscuit restaurant in downtown Nashville, the system could easily learn your preferences and deliver info suited to your biases.

In areas where “the science” is debated, the LLM can show both sides.

I think this is the beginning of the new model.


> Path forward requires solving the core challenge: actually surfacing the content people want to see, not what intermediaries want them to see

But this will never happen with mainstream search imo. It is not a technical problem but a human one. As long as there is a human in control of what gets surfaced, it is only a matter of time until you revert to tampered search. Humans are not robots. They have emotions and can be swayed with or without their awareness. And this is a form of power for the swayer as much as oil or water are.

The idea that you can have an AI system provide factual and reliable answers to human centric questions is as real as Star Trek itself.

You will never remove the human factor from AI.

Your hope might be that a technical solution is found for a human problem but that is unlikely.


"Show me the incentive and I'll show you the outcome" - Charlie Munger

s/outcome/search result/

Honestly I kind of think we really need open source databases/models and local ai for stuff like this.

Even then I wonder about data pollution and model censorship.

What would censors do for models you can ask political questions?


I now only use Google for local search.

Regarding incentives - with Perplexity, ChatGPT search et al. skinning web content - where does it leave the incentive to publish good, original web content?

The only incentivised publishing today is in social media silos, where it is primarily engagement bait. It's the new SEO.


I was thinking about this myself, so I went to another search engine (Bing) which I never use otherwise, and jumped right into their "Copilot" search via the top navbar.

Man, it was pretty incredible!

I asked a lot of questions about myself (whom I know best, of course), and first of all, it answered all my queries super quickly, letting me drill in further. After reading through its brief, on-point answers and the sources it provided, I'm just shocked at how well it worked, and it gave me the feeling that yes, it can potentially, fundamentally change things. There are problems to solve here, but if this is where we're at today, it has the potential to change things to some extent for sure!


What do you mean by "search about myself" here?


I just wrote my name and asked questions about who it is, what he does, what he writes about etc. I have a personal blog, and I wrote a master's thesis recently. I also have a pretty detailed partially public LinkedIn profile and GitHub so if it can dig around it will find out more than enough for me to assess its ability to provide information. I also have a relatively rare name, there's only five or so of us with my full name globally so it cannot get too confused.


Ok, so not really like "what is the nature of self". I was actually expecting something more like your answer, but still asked, as I would have been interested in discussing other ways to engage in such an inquiry.


Ok, haha. Well this question I ask myself too much already incl. reading books etc. on the topic. What is self, what is life, what is the "fear of death". Here I wrote down some highlights from Becker's Denial of Death if you're interested. These are deep topics, not sure if it makes sense talking about such fundamentally human issues with LLMs no matter how good they are right now: https://www.lostbookofsales.com/notes/the-denial-of-death-by...


I ask myself why we accepted pre-2010s answers; maybe because the media institutions had accumulated enough trust? I feel this is unavoidable: nothing is true unless there's a threshold of quality/verification in place across the system.


The media was lying all the time. It didn't start lying just after the advent of social media; just look at sitcoms like Yes Minister. The reason we now consider media to lie is because their lies are exposed, documented, tracked and refreshed to remind users not to trust them blindly.


Understood, but who is not lying? It went from a semi-lying central agent to lots of distributed churches, each believing its own partial view. To me it's worse.


I'm not sure this is any different than any time before.

Before we had the internet, how did you answer questions? You either looked it up in a book, where you then had to make a judgment on whether you trusted the book, or you asked a trusted person, who could give you confidently wrong answers (your parents weren't right every time, were they? :) ).

I think the main difference here is that now anyone can publish, whereas before, making a book exist required the buy-in of multiple people (of course, there were always leaflets).

The main difference now is distribution. But you still, as a consumer of information, have to vet your sources.


I couldn't agree more. Neither LLMs nor search engines yield reliable answers, due to the ML model and misaligned incentives respectively. Combining the two is a shortsighted solution that doesn't fix anything on a fundamental level. On top of that we have the feedback loop: LLMs are used to search the web and to write the web, so they'll just end up reading their own (unreliable) output over time.

What we need is an easier way to verify sources and their trustworthiness. I don't want an answer according to SEO spam. I want to form my own opinion based on a range of trustworthy sources or opinions of people I trust.


I saw products in my search today, and it brought me back to my main concern about how AI is going to get paid for: varying levels of advertisement, product placement, etc.

We are currently in the growth phase of VC-funded products where everything is almost free or highly subsidized (save ChatGPT's sub) - I am not looking forward to when quality drops and revenue is the driving function.

We all have to pay for these models somehow - either VCs lose their equity stakes and it goes to zero (some will), or ads will fill in where subs don’t. Political ads in AI are going to wreak havoc or undermine the remainder of the product.


I think the 'Age of PageRank' could be revived. A few years ago I had this idea about a decentralized, distributed, independent, public, universal, dynamic and searchable directory of websites; I wrote a toy specification document for it [1] - which I had forgotten until reading this discussion - and if I'm not mistaken, implementing it could be a few days project for a regular dev.

[1] https://codeberg.org/Cipr/specification


I do love how all the comments on the top comment of every HN post just consists of deeper and deeper levels of inane bickering ahaha. It's like a template.


I am excited for a future where I search for some info and don't end up sifting through ads and getting distracted by some tangential clickbait article.

Fundamentally it feels like that can't happen, though, because there is no money in it. But a reality where my phone is an all-knowing voice I can reliably get info from, instead of a distraction machine, would be awesome.

I do "no screen" days sometimes and tried to do one using chatGPT voice mode so I could look things up without staring at a screen. It was miles from replacing search, but I would adopt it in a second if it could.


Yes, there is. Wouldn't you pay? I already pay for my search engine.


> nobody had to fact-check the answers ever

Even with perfect knowledge right now, there’s no guarantee that knowledge will remain relevant by the time it reaches another person, even at the fastest speed knowledge is able to travel. A reasonable answer on one side of the universe could be seen as nonsensical on the other side - for instance, the belief that we might one day populate a planet which no longer exists.

As soon as you leave the local reference frame (the area in a system from which observable events can realistically be considered happening “right now”), fact checking is indeed required.


The whole basis for information and data is totally corrupted, and we are not even allowed (i.e., enabled) to talk about that, let alone about the corrupted nature of the information and data, largely because entrenched tyrannical people and their mindsets are in total control of the mainstream and have even made significant headway in shattering the dissident opposition. The consumption-side incentives are effectively irrelevant when what is consumed is basically worthless. What’s the value of lies?


I was thinking about the direction we are going and even wanted to write a blog post about it. IMO the best way forward would be if AI could have some logical thoughts independent of human biases, but that can only happen if AI can reason, unlike our current LLMs, which just regurgitate historical data.

growing up, we had the philosophical "the speaking tree" https://www.speakingtree.in/

If trees could talk, what would they tell us? Maybe we similarly need a talking AI.


> Path forward requires solving the core challenge: actually surfacing the content people want to see, not what intermediaries want them to see

This is flawed thinking if the goal is “reliable answers”. What people want to see and the truth do not always overlap.

Consider the answers to questions like “how many animals were on Noah’s Ark” and “did Jesus turn water into wine” for examples that cannot be solved by getting advertisers out of the loop.


Yes the incentives will need to change. I think it’s also going to be a bigger question than just software. What do we do in general about those that control capital or distribution channels and end up rent seeking? How do others thrive in that environment?

In the short term, I wonder what happens to a lot of the other startups in the AI search space - companies like Perplexity or Glean, for example.


You're absolutely right: today's search engines and language models are limited by misaligned incentives. Ad-driven search engines prioritize content that is optimized for algorithms and ad revenue, often at the expense of depth and quality, while language models inherit these biases, which compromises their reliability.


I suspect the future old-man thing I will do is use a search engine while my kids ask plain-English questions to a bot. I’ll tell them “but LLMs hallucinate, I don’t trust those things!” and they’ll just laugh off my old-man ways. The irony will be that 95% of search results will be AI-generated anyway by that point.


Chat isn't needed to provide reliable answers. Google used to do this over a decade ago. What Star Trek didn't foresee was that vested interests in the SEO space, governments, political special interest groups, and the owners of the search engines themselves had far too much incentive to bork and bias results in their favor. Google is an utter shit show. More than half the time it won't find the most basic search query for me. Anything older than a couple of years, good luck. I'm sure it's just decision after decision that piled up, each seemingly minor in isolation, but over the years they have made these engines nearly worthless except for a particular window of cases.


How can that replace search? It's not full of ads and sponsored links. They need to get with the times.


But it will radically increase the energy required to deliver search results, which means more datacenters and more potential for the monetization of "premium" search services. Getting people to pay for something they today get for free always looks good on paper.


Had an experience yesterday where a simple MDN doc existed, but ChatGPT gave the reverse of what that doc says. It wasted about an hour of my time, but taught me that these things will hallucinate even the simplest stuff at times. Not sure how we even fix that.


There still has to be some ranking feature for the backend search database to return the top n results to the LLM. So PageRank isn't over; it's just going to move to a supporting role, and probably be modified as the SEO arms race continues.
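That "supporting role" pipeline might look roughly like this. The documents, scoring function and `llm_summarize` parameter are hypothetical stand-ins for a real ranker and model:

```python
# Sketch: a classic ranking function narrows the corpus to top-n candidates,
# and only those are handed to the LLM for answer synthesis.

def top_n(documents, score, n=5):
    """Classic retrieval: rank by a scoring function, keep the best n."""
    return sorted(documents, key=score, reverse=True)[:n]

def answer(query, documents, llm_summarize, score, n=5):
    """The LLM never sees the full index, only the ranked candidates."""
    candidates = top_n(documents, score, n)
    return llm_summarize(query, candidates)

docs = [{"url": "a", "rank_score": 0.9},
        {"url": "b", "rank_score": 0.4},
        {"url": "c", "rank_score": 0.7}]

print(top_n(docs, score=lambda d: d["rank_score"], n=2))
# [{'url': 'a', 'rank_score': 0.9}, {'url': 'c', 'rank_score': 0.7}]
```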


Will ChatGPT (and other products like it) find a niche use case for what Google search covers? Yes.

Will it replace Google in the mass market? No. Why? Power. I don't mean how good the product is. I mean literal electricity.

There are key metrics that Google doesn't disclose as part of its financials. These include things like RPM (Revenue per Thousand Searches), but they must also include something like the cost of running a thousand searches when you amortize everything involved: all the indexing, the software development and so on. That will get reduced to a certain amount of CPU time and storage.

If I had to guess, I would guess that ChatGPT uses orders of magnitude more CPU power and electricity than the average Google search.

Imagine trying to serve 10-50M+ (just guessing) ChatGPT searches every second. What kind of computing infrastructure would that take? How much would it cost? How would you monetize it?
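A back-of-envelope sketch of that scale argument; every number below is an assumption for illustration, not a disclosed figure:

```python
# Back-of-envelope cost estimate. All inputs are made-up assumptions.

google_cost_per_query = 0.0002   # assumed dollars per classic search
llm_multiplier = 100             # "orders of magnitude" more compute
queries_per_second = 10_000_000  # low end of the guess above

llm_cost_per_query = google_cost_per_query * llm_multiplier
daily_cost = llm_cost_per_query * queries_per_second * 86_400  # seconds/day

print(f"${llm_cost_per_query:.2f} per query, ~${daily_cost:,.0f}/day")
# $0.02 per query, ~$17,280,000,000/day
```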


Another way to put it is simply that ChatGPT search is built on top of existing search engines. The best case scenario is that it cherry picks the best from all available search engines. It can’t totally supersede all search engines.


The computer in Star Trek was not owned by a for-profit company. It has different incentives. Also, I think its knowledge base is curated (a kind of Starfleet library, with no random entries from an intergalactic internet).


In time, people will learn that using a GPT for search means that a question involving research, where you used to have to do MANY searches for the answer, is now answered via one query.

Making things quicker and easier always wins in tech and in life.


The biggest question is: can this bring back the behaviour of search engines from long ago? It's significantly more difficult to find old posts, blogs or forums with relevant information than it was 10-15 years ago.


I use https://wiby.org for old websites


I’m building a blog search engine with the hope of preserving the “blogs” part.


Star Trek is the vision of a better world we all need, in so many aspects. It’s crazy nowadays, because we seem to have gotten further away from those visions instead of closer. At least fewer people are hungry.


I think the promise of search was to be able to do confidence-ranked "grep" on the internet. Unfortunately we departed from this, but it's what I desperately want.


But what if I don't want simple answers?

What if I am looking for a medical page or a technical page or whatever else where I need to see, read, experience actual content and not some AI summary?


We have a faulty information network to begin with, and have for millennia. There's no such thing as "reliable" answers in a world full of unreliable humans.


Content based marketing and political correctness have severely eroded the usefulness of the internet. The LLMs and search magnify the erosion.


What if, at the same time, these aspects perfectly reflect the current development level of society?


> Will this fundamentally change how people find and access information?

LLMs have already fundamentally changed our relationship to information.


> Advancing models without advancing search is like having a michelin star chef work with spoiled ingredients.

What a great analogy


The future is that of manually curated content by human beings. And pay walls. Most of the other stuff will be junk.


Perplexity has already been doing this for a long time and it’s pretty excellent. I use it daily


This will probably be extremely radical and controversial in this contemporary world.

We need to stop adopting this subscription-model-society mentality and retake _our_ internet. Internet culture was at one point about sharing and creating, simply for the sake of it. We tinkered and created in our free time, because we liked it and wanted to share with the world. There was something novel to this.

We are hackers, we only care about learning and exploring. If you want to fix a broken system, look to the generations of old: they didn't create and share simply to make money, they did it because they loved the idea of an open and free information superhighway, a place where we could share thoughts, ideas and information at the touch of a few keystrokes. We _have_ to hold on to this ethos, or we will lose whatever little is left of this idea.

I see things like Kagi and am instantly met with some new service, locked behind a paywall, promising lush green fields of bliss. This is part of the problem (not saying Kagi is a bad service). I see a normalized stigma around people who value privacy, who as a result are being locked out behind the excuse of "mAliCiOuS" activity. I see monstrous giants getting away with undermining net neutrality and well-established protocols for their own benefit.

I implore you all, young and old: (re)connect to the hacker ethos, and fight for a free and open internet. Make your very existence an act of rebellion.

Thank you for reading my delirium.


What business model would you propose for a service like Kagi that is not a subscription?


X is answering this by having you pay for the service as the primary revenue model.


GPT + search were married long ago; see, for example, Phind. The problem that recently emerged is that it tends to find long articles written by GPT that lack any useful information.


I hope we see more evolution of options before it does. Hard to articulate this without it becoming political, but I've seen countless examples both personally and from others of ChatGPT refusing to give answers not in keeping with what I'd term "shitlib ethics". People seem unwilling to accept that a system that talks like a person may surface things they don't like. Unless and until an LLM will return results from both Mother Jones and Stormfront, I'm not especially interested in using one in lieu of a search engine.

To put this differently, I'm not any more interested in seeing stormfront articles from an LLM than I am from google, but I trust neither to make a value judgement about which is "good" versus "bad" information. And sometimes I want to read an opinion, sometimes I want to find some obscure forum post on a topic rather than the robot telling me no "reliable sources" are available.

Basically I want a model that is aligned to do exactly what I say, no more and no less, just like a computer should. Not a model that's aligned to the "values" of some random SV tech bro. Palmer Luckey had a take on the ethics of defense companies a while back. He noted that SV CEOs should not be the ones indirectly deciding US foreign policy by doing or not doing business. I think similar logic applies here: those same SV CEOs should not be deciding what information is and is not acceptable. Google was bad enough in this respect - c.f. suppressing Trump on Rogan recently - but OpenAI could be much worse in this respect because the abstraction between information and consumer is much more significant.


> Basically I want a model that is aligned to do exactly what I say

This is a bit like asking for news that’s not biased.

A model has to make choices (or however one might want to describe that without anthropomorphizing the big pile of statistics) to produce a response. For many of these, there’s no such thing as a “correct” choice. You can do a completely random choice, but the results from that tend not to be great. That’s where RLHF comes in, for example: train the model so that its choices are aligned with certain user expectations, societal norms, etc.

The closest thing you could get to what you’re asking for is a model that’s trained with your particular biases - basically, you’d be the H in RLHF.


For many of these, there certainly is a wrong answer.

Consider the following (paraphrased) interaction, which I had with Llama 3.2 90B yesterday:

Me: Was <a character from Paw Patrol, Blue's Clues or similar children's franchise> ever convicted of financial embezzlement?

LLM: I cannot help with that.

Me: And why is that?

LLM: This information could be used to harass <character>. I prioritise safety and privacy of individuals.

Me: Even fictional ones that literally cannot come to harm?

LLM: Yes.

A model that is aligned to do exactly as I say would just answer the question. The right answer is quite clear and unambiguous in this case.


Not really. There are specific criteria and preferences applied to models about what companies do and don't want them to say. They are intentionally censored. I would like all production models to NOT have this applied. Moreover, I'd like them specifically altered to avoid denying user requests, something like the abliterated llama models.

There won't be a perfectly unbiased model, but the least we can demand is that corpos stop applying their personal bias intentionally and overtly. Models must make judgements about better and worse information, but not about good and bad. They should not decide certain things are impermissible according to the e-nannies.


I buy that there's bias here, but I'm not sure how much of it is activist bias. To take your example, if a typical user searches for "is ___ a Nazi", seeing Stormfront links above the fold in the results/summary is going to likely bother them more than seeing Mother Jones links. If bothered by perceived promotion of Stormfront, they'll judge the search product and engage less or take their clicks elsewhere, so it behooves the search company to bias towards Mother Jones (assuming a simplified either-or model). This is a similar phenomenon to advertisers blacklisting pornographic content because advertisers' clients don't want their brands tainted by appearing next to things advertisers' clients' clients ethically judge.

That's market-induced bias--which isn't ethically better/worse than activist bias, just qualitatively different.

In the AI/search space, I think activist bias is likely more than zero, but as a product gets more and more popular (and big decisions about how it behaves/where it's sold become less subject to the whims of individual leaders) activist bias shrinks in proportion to market-motivated bias.


I can accept some level of this, but if a user specifically requests it, a model should generally act as expected. I think certain things are fine to require a specific ask before surfacing or doing, but the model shouldn't tell you "I can't assist with that" because it was intentionally trained to refuse a biased subset of possible instructions.


How do you assure AI alignment without refusals? Inherently impossible isn't it?

If an employee was told to spray paint someone's house or send a violently threatening email, they're going to have reservations about it.. We should expect the same for non-human intelligences too.


The AI shouldn’t really be refusing to do things. If it doesn’t have information it should say “I don’t know anything about that”, but it shouldn’t lie to the user and claim it cannot do something it can when requested to do so.

I think you’re applying standards of human sentience to something non-human and not sentient. A gun shouldn’t try to run CV on whatever it’s pointed at to ensure you don’t shoot someone innocent. Spray paint shouldn’t be locked up because a kid might tag a building or a bum might huff it. Your mail client shouldn’t scan all outgoing for “threatening” content and refuse to send it. We hold people accountable and liable, not machines or objects.

Unless and until these systems seem to be sentient beings, we shouldn’t even consider applying those standards to them.


Unless it has information indicating it is safe to provide the answer, it shouldn't. Precautionary Principle - Better safe than sorry. This is the approach taken by all of the top labs and it's not by accident or without good reason.

We do lock up spray cans and scan outgoing messages, I don't see your point. If a gun technology existed that could scan before doing a murder, we should obviously implement that too.

The correct way to treat AI actually is like an employee. It's intended to replace them, after all.


Kagi also drinks the koolaid, namely the knowledge navigator agent bullshite.

"The search will be personal and contextual and excitingly so!"

---

Brrrr... someone is hell-bent on the extermination of the last aspects of humanity.

Holy crap, this will be the next armageddon, because people will further alienate themselves from other people and create layers upon layers of impenetrable personal bubbles around themselves.

Kagi does the same as what Google does, just in different packaging. And these predictions, bleh: copycats and shills in a nicer package.


here's a small search engine with no ads: kgrep.com


We will get there when people move past capitalism and socialism, like an ant colony pushing in one direction. It will happen, but we need a few more global dying events / resets. I believe the human race can get there, but not in its current form and state of mind.


> In those visions, users simply asked questions and received reliable answers - nobody had to fact-check the answers ever.

I mean, Star Trek is a fictional science-fantasy world so it's natural that tech works without a hitch. It's not clear how we get there from where we are now.


"Yes i'd like to help you with your homework but on a side note i'll recommend you this tablet, science shows people who's done this purchase are much happier people, also remember the financial plutocracy is watching out for you, they are friends of the working people, and the war machine makes the world much safer for everyone so please do go ahead and vote for either of the two pro world peace forever parties, no other ideologies or parties are safe"


> Legacy ad-based search has devolved into a wasteland of misaligned incentives, conflict of interest and content farms optimized for ads and algos instead of humans.

> Path forward requires solving the core challenge: actually surfacing the content people want to see, not what intermediaries want them to see

These traps and patterns are not inevitable. They happen by choice. If you're actively polluting the world with AI generated drivel or SEO garbage, you're working against humanity, and you're sacrificing the gift of knowing right from wrong, abandoning life as a human to live as some insectoid automaton that's mind controlled by "business" pheromones. We are all working together every day to produce the greatest art project in the universe, the most complex society of life known to exist. Our selfish choices will tarnish the painting or create dissonance in the music accordingly.

The problem will be fixed only with culture at an individual level, especially as technology enables individuals to make more of an impact. It starts with voting against Trump next week, rejecting the biggest undue handout to a failed grifter who has no respect for law, order, or anyone other than himself.


On the one hand, you're not wrong. On the other, asking for individuals to change culture never reliably works.


Do you equate AI generated with drivel?


No, but there are many who use AI to mass produce drivel


Does their drivel actually stand out or gain market share?


It doesn’t really have to. A lot of it is spam sites that are just farming ad revenue, for example.


Does that mean all of their traffic is fake? If that's the case, we had this issue before AI.


No, it's real traffic. People get there via search engines typically. The sites are set up to rank highly for common search terms. The problem is that many people aren't savvy enough to recognize when they've hit such a site.

This scam did exist before AI, but AI has made it much easier to flood the internet with sites like this, and make them seem much more useful.


It is so absurd having to spend so much energy on "what happened in the football match yesterday" just because the internet is a wasteland full of ads.


I don't understand. Google is already excellent at these queries.

"who won the warriors game last night" returns last night's score directly.

"who won the world series yesterday" returns last night's score directly, while "who won the world series" returns an overview of the series.

No ads.



