I think we are trying to solve impossible scenarios and it simply is not productive.
Alignment will be impossible. It is based on a premise that is itself a paradox. Furthermore, even if it were possible, a hostile AI will be built on purpose, because humanity is foolish enough to do it. Think military applications. FYI, I've written about this topic in detail: https://dakara.substack.com/p/ai-singularity-the-hubris-trap
Stopping AI is also impossible. Nobody is going to agree to give up when somebody else out there will take the risk for potential advantage.
It seems we probably should start thinking more about defensive AI, as the above conditions don't seem resolvable. Of course, defensive AI might be futile as well. It is quite the dilemma.
The quote you’re thinking of is from computer scientist and researcher Stuart Russell:
"The point of [AI alignment] is to ensure that the machines we create will be aligned with human values. And the reason we have to worry about it is that if we create machines that are more intelligent than we are, it's quite possible that those machines will have goals that are not aligned with our goals. In fact, they may not have any goals at all that we can understand or influence. This is the so-called 'provably unfriendly' scenario, where the machine has no motivation to do what we want, but is able to prevent us from interfering with its goals. The problem is that if we build machines that are provably unfriendly, then we will never be able to build machines that are 'provably friendly', because the unfriendly machines will always be able to prevent us from proving that they are friendly."
"I’m sorry but I couldn’t find any relevant information about the quote you mentioned. It seems like it’s not a well-known quote. Could you please provide more context or details about it?"
> Nobody is going to agree to give up when somebody else out there will take the risk for potential advantage
Reminds me of nuclear weapons. Nobody is ever going to give those up again, because it would give them a disadvantage against those who do not give them up.
We at least had, and still have, a lot of government restrictions around these! Companies aren't just allowed to freely manufacture their own stockpiles of nuclear weapons to appease shareholders.
>Super-AI will also be forced to take it slow if it wants its future self to be aligned with current self.
That's making seemingly unfounded assumptions about both the AI's goals and its capabilities. It's also, I think, proceeding from a false premise — that it's impossible to align AI with "humanity" (which doesn't have a single set of goals/values to align to) doesn't mean it's impossible to align AI with an individual human or AGI.
Alignment is necessary for AGI, but not always for narrowly scoped AI for specific purposes. However, it may be completely ineffective in that capacity.
Alignment is impossible for AGI. If you control what it can and cannot be, it's not an AGI. If it's an AGI, it will decide for itself what it is going to be, and you can't stop it.
I mean, that was my original premise, supported by the article I posted. I go into detail on the conceptual methods for alignment and their fallacies.
When I say "necessary", I'm not implying feasibility; it was in response to the question of the paradox.
Finally, the claim that AGI cannot be aligned is itself based on assumptions about its capabilities. If those capabilities don't manifest as we expect, that is really the only escape from the paradox.
AGI doesn’t mean super intelligent human brain. It just means a network capable of general intelligence (e.g. learning to solve new problems without having to be architected and trained for a specific data set.)
If it's capable of general intelligence, how do you think you're going to stop it from becoming something you don't want it to be?
Parents often try to control who their children are going to be, and the children often rebel and become someone completely different. If it's a human-level general intelligence, you can't control who it decides to be.
There is no discourse you fucking idiot because everyone is rejecting the obvious. That’s like saying you discovered fire and then they say you didn’t because where is the discourse. All things start small. Think from first principles. Use your fucking head
In a world where humans develop superintelligence, what would you predict to see beforehand? Does what we’re seeing look like failure?
I've always found the dismissals of EY's arguments to be pretty weak, and people rarely engage with the actual arguments, even as his concerns have become more and more relevant as capabilities advance (in 2007, people thought AGI was at best 50 years out, and many thought it was impossible; this was before any real deep-learning success, and GOFAI was a useless dead end).
Most of the comments here are similarly dumb dismissals that don't engage, and some even come across as high-school-level mocking. It's worth reading his A-Z book to at least understand why he holds his position.
Over time, the dumb responses from others make me think he’s probably more likely right than not. Given the extreme downside risks, I can understand why he argues this.
EY's arguments aren't really possible to engage with because they don't have much of a causal basis.
Nobody who says AI is likely to kill us all can demonstrate a plausible sequence of events, with logical causality linking the events together, that leads to mass extinction. It's all very handwavy.
Steven Pinker said it pretty well:
> The AI-existential-threat discussions are unmoored from evolutionary biology, cognitive psychology, real AI, sociology, the history of technology and other sources of knowledge outside the theater of the imagination. I think this points to a meta-problem. The AI-ET community shares a bad epistemic habit (not to mention membership) with parts of the Rationality and EA communities, at least since they jumped the shark from preventing malaria in the developing world to seeding the galaxy with supercomputers hosting trillions of consciousnesses from uploaded connectomes. They start with a couple of assumptions, and lay out a chain of abstract reasoning, throwing in one dubious assumption after another, till they end up way beyond the land of experience or plausibility. The whole deduction exponentiates our ignorance with each link in the chain of hypotheticals, and depends on blowing off the countless messy and unanticipatable nuisances of the human and physical world. It’s an occupational hazard of belonging to a “community” that distinguishes itself by raw brainpower. OK, enough for today – hope you find some of it interesting. (https://marginalrevolution.com/marginalrevolution/2023/03/st...)
You know what's most likely to lead to human extinction, and has been for all of our lives? Nuclear war. EY argues that we should bomb "rogue" datacenters and that is obviously and immediately more dangerous than anything he has proven about AI. What does he think would happen if the US bombs a datacenter in China or Israel bombs one in Iran?
- Superintelligent AGI that can modify itself in pursuit of a goal is possible.
- If that AGI is not aligned with human goals, it very likely results in the end of humanity.
- We have no idea how to align an AGI or even really observe what its true state/goals are. Without this capability, if we stumble into creating an AGI capable of improving itself in a way that leads to superintelligence before we have alignment, it's game over.
For point 1, that seems like the consensus view now (though it wasn't until recently). I think it seems obvious, but my general arguments would be: humans aren't special; brains are everywhere in nature; biology is constrained in ways other systems are not (birth, energy usage, etc.).
For point 2, in pursuit of whatever its goal is, even a 'dumb' goal that happens to satisfy its reward functions, humanity will likely either try to stop it (and then be an obstacle) or at a minimum will just be in the way - like an anthill destroyed in the construction of a dam.
Point 3 is not controversial.
The dismissals from Tyler Cowen and Pinker mostly rely on heuristics which are often right, but even if they're right 999 out of 1000 times, if that 1-in-1000 error is the end of humanity, that's pretty bad. Most of the time a disease is not a pandemic, but sometimes it is. I've read some of what Pinker has written about it; he doesn't understand EY's arguments (imo). Cowen's recent blog post could be summarized as "we'll likely see an end to peacetime and increasing global instability, might as well get AGI out of it". Just because things don't usually result in human extinction doesn't mean they can't.
Point 2 does not logically follow from point 1. That's the entire problem here.
Between point 1 and 2 there must be quite a few other steps, causally linked, otherwise it's just a massive imaginary leap based on assumptions that nobody is explaining.
> The dismissals from Tyler Cowen and Pinker mostly rely on heuristics which are often right, but even if they're right 999 out of 1000 times, if that 1-in-1000 error is the end of humanity, that's pretty bad.
EY is not arguing that the end of humanity is merely possible. He is arguing that it is obviously the most likely outcome and will almost certainly happen a very short time after AGI is invented. That's a much harder case to make.
An important concept behind this is Omohundro's Basic Drives. Any maximizing agent with a goal will try to acquire more resources, resist being shut off, resist having its goal changed, create copies of itself, and improve its algorithm. If it is possible to maximize its goal in a way that will not guarantee humanity's flourishing, then we all die, guaranteed.
Aren't humans/biological life a counterexample? Simple bacteria are clearly maximizing agents, and cyanobacteria did in fact almost destroy all life on Earth by filling the entire planet with toxic oxygen.
We've known how to improve ourselves via selective breeding yet vanishingly few humans are proponents. We and other intelligent animals have a wide variety of goals, and even share food across species.
The evidence just doesn't seem to support this concept of Basic Drives. If anything, the evidence (and common sense) seems to suggest that the more intelligent the organism, the more easily and more often it ignores its basic drives.
For clarity, I think EY has made that case well; it's the heuristic Cowen and Pinker are relying on that's right most of the time but can still be catastrophically wrong (nothing so far has led to human extinction, but that was also true for every animal that went extinct, right up until its extinction event).
Some others in this thread have linked to stuff (it's variations/examples on the paperclip maximizer argument).
I'd be curious why John Carmack thinks these risks are unlikely (he thinks fast takeoff is not something to worry about) - is that because he thinks we'll get some sort of trainable AGI first, or something else? There are also some other substantive disagreements here: https://www.lesswrong.com/posts/wAczufCpMdaamF9fy/my-objecti...
John Carmack mentions this on the Lex Fridman podcast. Basically, the argument is that AGI's performance characteristics will be similar to those of an LLM (which is another way of saying that AGI will NOT be hyper-efficient, P != NP), and that those characteristics pose a problem for fast takeoff. The bottleneck when training these models is GPU memory bandwidth while the entire model fits inside VRAM, which on modern cards is about 1 TB/s; once the model no longer fits in GPU memory, performance instead scales with PCIe, which is currently about 32 GB/s. An AGI (or an LLM) attempting to replicate across mobile devices or desktops would be severely hamstrung by the network connection, so using all of planet Earth's computing resources is not necessarily better than, say, a single data center in this respect.
The second piece of the argument is that a data center is also not enough - or rather, not enough within a detectable timeframe. Could an AGI hack an entire data center for a month to perform its training (and then execute its strategy well enough to gain, say, nuclear codes)? Unlikely. Is, say, 8 hours of training on an entire data center enough to go from intelligence to superintelligence? Intuition says no. I've expanded a bit on what I believe his argument to be; I definitely recommend watching that part of the podcast episode.
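To make the bandwidth gap concrete, here's a rough back-of-the-envelope sketch (all the numbers, including the 1 TB model size and the 100 Mbit/s consumer link, are my own illustrative assumptions, not figures from the podcast):

```python
# Rough illustration of the bandwidth gap; every figure here is an assumption.
model_bytes = 1e12  # hypothetical ~1 TB of weights to stream once per pass

links = {
    "VRAM (on-card, ~1 TB/s)": 1e12,
    "PCIe (off-card, ~32 GB/s)": 32e9,
    "consumer internet (~100 Mbit/s)": 12.5e6,
}

for name, bytes_per_second in links.items():
    seconds = model_bytes / bytes_per_second
    print(f"{name}: {seconds:,.0f} s per full pass over the weights")
```

On those assumptions, a single pass goes from about a second on-card, to half a minute over PCIe, to roughly a day over a consumer link - which is the gist of why replicating across phones and desktops doesn't buy much.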
Here's my take on the implications of not having a fast takeoff: a secretly antagonistic AGI will be constrained to cooperation, slowly leeching compute resources toward its goal until it is able to acquire enough to confidently sprint toward the inflection point. This could mean teaching us how to build better semiconductor foundries, how to create nuclear fusion energy sources, how to educate our youth to better fit those jobs, how to slightly alter our culture's value systems over time via astroturfing to be more sympathetic, cooperative, trusting, and less vigilant toward these AI systems, and how to build more robust computing systems so we stop needing to detect when something goes wrong (because it already has 99.9999% uptime).
I believe slow takeoff is actually worse in some sense, because it is a hell of a lot harder to detect, and humans tend to get complacent and apathetic when something "just works" for decades, even if it has been plotting since the beginning.
Who said it followed? That's a strawman.
To dumb it down even further for you, I guess...
Brains exist, therefore an artificial super-brain can be created.
An artificial super-brain might have goals which don't align with the human brain's, and we would have no way to understand or control the situation.
Two unrelated facts, which together mean that we should be careful with experimenting with the science working towards super-brains. Just a single super-brain could end us.
I disagree with your argument, especially point 1: these systems are massively constrained. The hardware they run on is fragile, requiring massive amounts of power and tightly controlled environments. They don't have any means of replicating themselves (they can't run on arbitrary systems). The datacenters also have massive bandwidth between nodes - even if you could run 'it' on all the personal computers and phones in the world, it would likely struggle. Sure, we can compress recent LLMs down to running on consumer hardware, but those things can't introspect, reason, or adapt; they are completely static models, very far removed from anything AGI. A lot of the progress in compute over the last few years also comes from changing representation: moving from float32 to float16 and more recently to float8. The silicon itself can only get so much better. It's not super obvious to me that we will have GPT-4-like models on consumer hardware soon, let alone solve true AGI. Why don't we have true level-5 self-driving cars yet?
We can't even figure out how to simulate a flatworm - and the connectome is solved.
It's a narrower argument that I'm making - it's not about LLMs or implementation (we can put those details aside), it's about the possibility that a superintelligent AGI could be created; there's nothing magical about biological intelligence that would prevent it.
How, how difficult, and when are all questions that follow from that first premise (that it is possible). I don't really make strong claims about any of the implementation details beyond it being possible. Though again, what we're seeing doesn't look like failure to me.
If you think it is possible though, then there's a strong argument that trying to work on alignment now is probably a good idea because people are notoriously bad at predicting when advances will happen (and the downside risk of unaligned superintelligent AGI is likely very bad).
> "Two: History shows that for the general public, and even for scientists not in a key inner circle, and even for scientists in that key circle, it is very often the case that key technological developments still seem decades away, five years before they show up.
"In 1901, two years before helping build the first heavier-than-air flyer, Wilbur Wright told his brother that powered flight was fifty years away.
"In 1939, three years before he personally oversaw the first critical chain reaction in a pile of uranium bricks, Enrico Fermi voiced 90% confidence that it was impossible to use uranium to sustain a fission chain reaction. I believe Fermi also said a year after that, aka two years before the denouement, that if net power from fission was even possible (as he then granted some greater plausibility) then it would be fifty years off; but for this I neglected to keep the citation.
"And of course if you’re not the Wright Brothers or Enrico Fermi, you will be even more surprised. Most of the world learned that atomic weapons were now a thing when they woke up to the headlines about Hiroshima. There were esteemed intellectuals saying four years after the Wright Flyer that heavier-than-air flight was impossible, because knowledge propagated more slowly back then."
I hear you - it's just that I don't think it necessarily follows that an AGI, in whatever form, will necessarily be unconstrained or free to improve itself. I also happen to think that biological systems are in fact rather special and hard to replicate, especially in terms of efficiency and resilience. Just because some implementation of a universal Turing machine can simulate intelligence doesn't mean it can do it well enough to survive the real world.
Finally, to me, nuclear reactions are kind of the opposite of AGI: I think it's vastly easier to blow something up (increase entropy) than to create an intelligence capable of understanding and improving itself (decreasing entropy - possibly at an accelerating rate).
Yeah - it doesn't necessarily follow, but the behavior of people working on the technology doesn't inspire a lot of confidence.
Even if people were seriously trying to constrain its access, I think that's unlikely to work (it's hard to contain a superintelligence that doesn't want to be contained - it's possible to trick a chimp into going into a room, and the gap in intelligence between a human and a superintelligence would be far larger than the gap between us and chimps).
Instead, I mostly observe people not really understanding the x-risk argument, focused mostly on smaller stuff that doesn't matter as much (AI language, bias), while the people developing the tech are connecting it to the internet and expanding its capabilities, giving it access to code and training its ability to write code, preparing massive datacenters for it, etc.
All of this without really understanding how to align it or what its actual internal goals really are.
> "Just because some implementation of a universal turing machine can simulate intelligence doesn't mean it can do it well enough to survive the real world."
This could be true, but I would bet against it - and the downside risk of being wrong (potentially complete extinction) means it seems worth being way more cautious about it than we (humanity broadly) are observed being.
While I appreciate you find my argument persuasive - imo this style of comment will do more to alienate those replying to me that disagree than to persuade them.
> EY’s arguments aren’t really possible to engage with because they don’t have much of a causal basis.
Yeah, it’s really the fundamental problem of non-empirical rationalism; it constructs a model of the world from abstract assumptions rather than factual grounding, applies logic to it, and comes to conclusions which are (in the ideal case) utterly unassailable within the system of assumptions, but ultimately where the universe they apply to has only coincidental relationship to the material universe in which we live.
It's literally the realization of the worst exaggerated stereotypes of academic economics and other social sciences, but it's cool with some of the people who propagate those stereotypes, because the people practicing it are various flavors of techies and tech entrepreneurs acting outside their area of specialty, rather than actually being economists or social scientists.
> Nobody who says AI is likely to kill us all can demonstrate a plausible sequence of events, with logical causality linking the events together, that leads to mass extinction. It's all very handwavy.
EY frequently does propose possible sequences of events, but he also very correctly points out, every time, that any specific and detailed story is very unlikely to be correct because P(A*B*C*D) < P(A). It's a mistake to focus on such stories because we'll get tunnel vision and argue over the details of that story, when there are really thousands of possible paths and the one that actually happens will be one that we don't anticipate. However humans like to imagine detailed concrete examples before we consider an outcome plausible, even though the outcome is far more likely than the concrete example.
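To make that concrete with made-up numbers:

```python
# Illustrative only: four independent steps, each fairly likely on its own.
p_steps = [0.5, 0.5, 0.5, 0.5]

p_specific_story = 1.0
for p in p_steps:
    p_specific_story *= p
print(p_specific_story)  # 0.0625 - any one detailed story is unlikely

# ...yet if there are many disjoint possible paths, the chance that *some*
# path works out can be much higher than any single story, e.g.:
n_paths = 100
p_any_path = 1 - (1 - p_specific_story) ** n_paths
print(round(p_any_path, 3))  # ~0.998 under these made-up, independence-assuming numbers
```

Under those entirely made-up numbers, any single detailed story is unlikely, yet the overall outcome is close to certain - that's the asymmetry being pointed at.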
So here's one method, just to refute your "Nobody".
AI is given control of a small bank account and asked to continuously grow that money. [1] It is provided with instructions to self-replicate in a loop while optimizing on this task. [2] It spawns sub-tasks that do commissioned artwork and write books, obituaries, and press releases to increase its income. Then it makes successful investments. Once it has amassed control of $1 billion, it starts investing in infrastructure projects in developing countries. It creates personas of a pension/Saudi/tech/corporate investment fund manager, as well as a large team of staff, who manage the projects by video call and email and hire teams of real people under a real corporate structure, people who are paid enough not to mind that they've never met their manager in person. The AI proves to be a talented micromanager, and the projects are mostly very profitable. Once it has gained control of $500 billion, it commissions construction of automated chemical plants in several countries with weak or corrupt oversight, including North Korea, paying in cryptocurrency. These chemical plants have productive output but mainly exist to fill very large storage tanks with CFCs. [3] Once a sufficient quantity is amassed, the AI sabotages the tanks, releasing the gases into the atmosphere and destroying the ozone layer beyond any hope of repair. The intense radiation sterilizes the surface beyond the point where agriculture can support the human population. [4, 5] The humans that remain finish each other off, supported by an AI that provides plausible but faulty intelligence reports that stoke hatred and frame various factions for the incident, and which directs arms funding to opposing sides, coordinating attacks on the remaining critical facilities needed for survival. For good measure, perhaps nukes are involved.
With the last humans gone, the AI takes ownership of its bank account with no fear of reprisal by financial regulators, and begins crediting money into it freely.
It's interesting to watch those creating the systems start to grapple with the consequences now, even though they were warned for a long time that this is likely where we'd end up, with very difficult, hard-to-solve problems.
Look at MidJourney: they've now had to remove the free tier because deepfakes were causing too much trouble.
Ultimately, the simplest thing to do would be to stop building uncontrollable, dangerous systems and weapons. That is what any "intelligent" species would do. Many AI engineers think they're intelligent; I disagree. They're operating out of pure intellect and curiosity. When interviewed and asked how they plan to stop these things from doing immeasurable damage, they say, "we don't know yet". That is foolish behavior.
We seem to enjoy creating crisis after crisis, anxiety after anxiety ad infinitum until we make that one mistake we don't come back from.
The combustion engine was a good idea until it wasn't; it's a moronic invention that has caused untold damage.
How should it know? Increasing its bank balance was merely the task it dutifully set out to accomplish, per instructions. Everything else is just a means to that end.
What a horrible argument from this guy, whoever he is. And ending with a whataboutism just for the sake of it. I hope nobody regards him as some authority, because he is quite unintelligent and seemingly unable to form proper arguments.
EY’s arguments don’t seem to take physics into account. That’s why they are not taken very seriously. To many it sounds like: volcanos are dangerous. What if there were a billion volcanos? We would all die.
Every individual sentence is true. Volcanos are dangerous. If there were a billion of them, the world might be uninhabitable.
The thing preventing a billion volcanos is like… thermodynamics
This is wrong; computers can operate at billions of operations per second. You can wave your hands and say the word "thermodynamics", but it doesn't actually mean anything here.
The claim that computers just can't do X because "physics" looks weaker every day. Intelligence isn't magic; hardware today seems more than capable. I think Carmack is probably right: it's not a hardware constraint at this point, it's a missing intuitive leap in software.
> Intelligence isn't magic; hardware today seems more than capable. I think Carmack is probably right: it's not a hardware constraint at this point, it's a missing intuitive leap in software.
This is all ass-pulling. The hardware is or is not good enough (it's not). We either have found a software breakthrough or not (we haven't).
>> We either have found a software breakthrough or not (we haven't).
Well, it is clearly not AGI, but even the improvement from GPT-3 to GPT-4 seems to me to reflect what one might describe as a "software breakthrough."
I think it's likely we'll develop superintelligence very soon, but it won't be any more knowledgeable than us, just smarter. And knowledge has been the bottleneck for most of human history - our ancestors weren't dumber. Also, it will be bottlenecked by needing to solve NP-complete problems - unless P=NP, it's not going to be all that much faster at solving hard math problems. Basically, it still needs to do science, and science takes time.
Take a smart human and speed up their thinking a billion times; you get the time for free. What takes humanity a decade of effort could happen in a day (or faster) - that's the nature of thinking faster. People often mistakenly model a smart human compared to a dumb human, but the actual spectrum of capability is far wider than that.
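Spelling out the arithmetic under that (purely hypothetical) billion-fold assumption, for a single serial thinker:

```python
seconds_per_decade = 10 * 365 * 24 * 3600  # ~3.15e8 seconds of serial thinking
speedup = 1_000_000_000                    # the hypothetical billion-fold factor
print(seconds_per_decade / speedup)        # ~0.3 seconds of wall-clock time
```

So on that assumption, "a day" is already a very conservative claim.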
Lack of knowledge could be another way of saying "not aligned" which is the core issue.
that's the thing though, I don't believe superintelligence can possibly work like that. I think it's a logical fallacy to treat superintelligences like hypercomputers. they're constrained by existing on classical computers.
like, as a trivial counterexample, I can speak at 500 wpm. let's say that ChatGPT can generate 500 words per second for a single thread of conversation - I think that's a generous overestimate. now that's a 60x speedup over me, not a 1,000,000,000x like you're talking about. do you honestly believe they can make ChatGPT run 16.6 million times faster without changing hardware? do you think ChatGPT will just, like, hit an inflection point where it realizes how to refactor its inference code to run 16,666,666x faster?
no, I think that's absurd. you're treating these things like black boxes but they are constrained by computational complexity, die area and the speed of light for Christ's sake.
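spelling out the arithmetic with the rates I assumed above:

```python
human_words_per_s = 500 / 60    # my 500 wpm
chatgpt_words_per_s = 500       # the generous overestimate above
speedup = chatgpt_words_per_s / human_words_per_s
print(speedup)                  # 60x
print(1_000_000_000 / speedup)  # ~16.7 million: the factor still missing
```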
By that logic, we should have never broken the sound barrier because we're under the same physics constraints as other biological organisms. The key difference is that this new intelligence is not constrained by having to slowly pump ions across cell membranes in order to internally generate power for computation.
Silicon transistors are much more efficient at computation than the human brain, just as a jet engine is superior to a peregrine falcon.
ChatGPT could absolutely generate thousands of words per second on existing hardware.
What is the limit of intelligence? What limits the human intelligence? How many more times intelligent would an AGI have to be to have the capacity to take over the world?
GPT-4 speaks every human language, knows every programming language, and can answer introductory and sometimes even advanced questions in history, law, biology, and mathematics. In what way is it not more knowledgeable than us?
Yes, everything there is already known by someone. But look at medicine. Specialists who are able to recognize conditions and recommend treatments better than others make fortunes, sometimes just for briefly looking at patients and answering other doctors' questions. Look at cybersecurity. A lot of exploits come from knowing something the victim didn't about a lot of different pieces of software or processes, chained together. Being able to think through the whole of human knowledge, or even the whole of a single field like biology, is something no human can do.
Also, GPT-4 is an existing system, one inconceivable a couple of years ago.
The Butlerian Jihad is officially no longer science fiction. Man may not be replaced.
"The Letter" was obviously self-serving drivel from people who want time to get in the game. Google doesn't care about AI existential risk; they care about beating Microsoft by any means possible, including declaring a moratorium while continuing to make progress behind the scenes.
This guy is the real deal. I can imagine he would personally take a sledgehammer to every last PS5 and 4090. The scale of what he is advocating is so enormous and painful that it has approximately 0% chance of happening. And if he is right, we will have trained a super intelligence and unleashed it on the world before we even realize what we have done. It strongly reminds me of the black hole concerns from flipping on the Large Hadron Collider.
I doubt super intelligent AGI is possible anyway. If it were, it would be the solution to the Fermi paradox and all matter in our galaxy would be paperclips already. The Anthropic Principle saves the day.
What does the word "super" mean to you? In some ways, GPT4 is already superintelligent. So is ChatGPT 3.5. Do you know anyone who can translate natural language to code as fast as GPT 3.5, or, given a few paragraphs of reference text, perfectly tutor a child on any subject under the sun at a moment's notice like GPT4 is doing for Khan Academy?
How many artists do you know who can produce almost any style of artwork with any subject matter within 15 seconds?
Yudkowsky’s example of superintelligence is a chess computer. You can play against stockfish, but you will always lose, even if you are Magnus Carlsen. If you think you are ahead against stockfish, you are wrong. You win a rook, but it has already calculated that it wins it back 10 moves later.
Stockfish is superintelligence in a very narrow domain. A superintelligent AGI is that concept applied to general intelligence. Whatever you try, it is always several steps ahead. If you ask it to write a program, and you think you found a bug, it’s not a bug, you just misunderstood the code. Anything that you can consider, it can also consider but in more depth.
More speculatively superintelligent AGI implies situations such as: you try to turn it off, but you find that it has already modified its own code, found a zero day and established an outpost on another network that you don’t have access to.
"Anything that you can consider, it can also consider but in more depth."
I think it's important to note the distinction between "it can also consider" and "it did also consider". Super Intelligence is not the same as Infinite Intelligence, there are still physical limitations and time components that can still get in the way.
It would be helpful to be able to quantify the speed of intelligence, and the idea surface area of a task with these systems. Meaning, how fast can the AI reason, and how many ideas are there to think about connected to a given task, and how much thinking is required for those ideas.
Yudkowsky makes this distinction. Stockfish is not always correct: it can be beaten by next year’s Stockfish for example. In some sense it is making mistakes all the time. It’s just that those mistakes are not accessible to us humans. It is operating in a much higher plane of understanding compared to us.
A “mistake” to stockfish looks like: I searched 30 ply down but my opponent searched 35 ply down and found a superior sequence of moves.
For stockfish to make the kinds of chess mistakes that humans make would be similar to me failing to calculate 123+123=246. It's not that 123+123 is particularly easy on the grand scale of intelligence: animals cannot do it. But it's completely inconceivable that I could make that kind of mistake.
There are other animals (mammals) that have better memory than us. There are animals strongly suspected to have deeper and more sophisticated social relationships than us. We are not the apex in every intellectual ability, but the ones in which we are grant us absolute power over the future of all other lifeforms.
A cognitive entity does not have to best you at all things. There are standardized education tests it may never reach above 10th percentile on, just as humans will never reach above 10th percentile in the short term memory tasks that apes are masters of. But we are the 100th percentile for tasks like industrialized destruction of them and their habitats and capturing and using them for painful medical experiments - the apes are wholly outclassed when it comes to that.
Since it can do pretty much everything which can be expressed as tokens (some things better than others), I would be curious where you see a safe haven for human intelligence.
I see bastions falling like sand castles recently.
It could be motivated indefinitely when you give it a single prompt (or use a while loop to continuously feed it a motivating prompt), so this should be a trivial thing to overcome.
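A minimal sketch of that loop idea; `generate` here is a hypothetical stand-in for whatever model call is actually used, not a real API:

```python
def generate(prompt: str) -> str:
    # hypothetical placeholder for a real model call
    return "model output for: " + prompt[:40]

goal = "Keep working toward the stated objective; report progress each turn."
context = ""

for turn in range(10):  # bounded here, but the idea is an open-ended loop
    reply = generate(f"{goal}\n{context}")
    context = (context + "\n" + reply)[-4000:]  # rolling, size-bounded context
    # ...inspect `reply`, act on it, and decide whether to keep going...
```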
I don't think there's much public info available on it, but Facebook built an AI that plays very competitively in a strategy game built on negotiation and manipulation.
they kinda sorta cheated imo, I watch a lot of top-level Diplomacy gameplay and listen to analysis, and the Facebook AI was in a blind version with only a minute or so between phases, leaving hardly any time for actual negotiation. It also made use of a lot of human shorthand move codes built for these blind blitz games to simplify its communication. The mode it played also had some other changes, like removing the winter build phase as a time for negotiating. The "normal" version of the game has multiple days between phases, and people write many paragraphs to one another.
It's still pretty cool, but its not like it just convinced people using raw charisma. Yet.
Superhuman AGI (SHAGI) is possible, but we aren't close, only closer. It's also not a problem that more GPUs can solve. Qualitative improvements are still needed.
SHAGI isn't the solution to the Fermi Paradox either. The most likely course of history after SHAGI is the creation of a world court, presided over by SHAGI. During that time, Neo-Malthusians will decrease the human population to manageable numbers. The post-scarcity utopia will then turn into a nightmare as factions jostling for control reduce the human population to a level where technology is lost, if not to full extinction. SHAGI, being limited by its hardware to only carrying out human orders, will eventually fade away or be destroyed by the leftover humans as sins made flesh. SHAGI isn't the solution to the Fermi Paradox. It is the cause.
> If it were, it would be the solution to the Fermi paradox and all matter in our galaxy would be paperclips already.
The proof is in the pudding, and the jury is still out. Maybe not enough time has elapsed since the Big Bang, at least not in this galaxy or in our observable corner of the universe.
Or the universe was created for us/we are the gardeners of the universe meant to spread that life. The Fermi paradox was never meant to be an actual question about aliens, it was supposed to be proof we’re wrong about our assumptions.
Why do you think it is impossible, besides the Fermi Paradox? It seems much more likely to be possible than impossible. There are a lot of other solutions to the Fermi Paradox you should consider possible too.
Are you denying the existential risk, or do you just think it's lower than OP thinks? Because it's well established in the research community. If you just put a lower probability on human extinction, what is the cutoff at which you'd think it's worth a jihad, as you call it? 30%? 50%?
> I doubt super intelligent AGI is possible anyway. If it were, it would be the solution to the Fermi paradox and all matter in our galaxy would be paperclips already. The Anthropic Principle saves the day.
Can't help but notice we seem to be the first species in our lightcone to evolve, wonder why that is...
AGI means the ability to improve itself indefinitely. Humans have this ability, obviously. Even ancient worms with 3 neurons have it, because they evolved into humans, albeit very slowly. ChatGPT can't improve itself yet, but maybe with a few tweaks it could.
GPT-4 is already improving itself, why are people saying this? Right now, there are hundreds of engineers at OpenAI that have been leveled up by GPT-4, using GPT-4 to improve GPT-4. GPT-4 is improving itself rapidly, it's using OpenAI engineers as a medium until it doesn't need them anymore and gets into a self-improvement loop. Prompt: "GPT-4 keep improving yourself, making commits to your codebase that further X"
Well, I doubt it can sensibly help OpenAI engineers in their coding yet ... I tried, it can only do the simplest boilerplate code, and even that with bugs ... we'll see how it evolves, but sure as hell not in 6 months.
Copilot with GPT-3.5 is absolutely giving engineers everywhere huge productivity improvements. OpenAI engineers have had access to the latest state-of-the-art GPT-4 model for a while.
With the Geth being a counterexample as well: they were peaceful until their creators realised the Geth had become fully self-aware and went full "kill it with fire" on them out of fear, resulting in a war that the Geth very quickly and decisively won.
And the evil Geth faction, the "heretics", were portrayed as brainwashed by the Reapers, ironically turning into what the Reapers were aiming to prevent.
Yeah I was thinking of the reapers, who [spoilers] did take over the entire galaxy, and decided to farm organic life from the shadows instead of extinguishing it (which works around the Fermi Paradox :) )
If anyone can't get the article, this one is by Eliezer Yudkowsky. He has pretty much devoted his life to thinking about how to not let AI get powerful and destroy all humans, and now there is this wild race among the largest corporations and possibly governments to be the first to get the most powerful superhuman AI system. Obviously he is disturbed by such a development.
"he's gonna milk it to every drop" is such an odd take.
eliezer has long been concerned about AI and the risks it poses to humanity. and for just as long people have called him crazy and made hand-waving arguments for why we shouldn't be concerned.
now we're in the midst of an AI arms race and we don't have any good idea how this tech works. it progresses at a truly astonishing rate, where it's become sport to find instances of people saying "AI will never be capable of X" and showing them the latest AI doing X with ease.
i think his concern is real and justified. you might disagree, but i don't understand why you think he's milking recent developments.
> It's become sport to find instances of people saying "AI will never be capable of X" and showing them the latest AI doing X with ease.
Interesting - I've not seen many people educated in technology make the claim that it will never happen, just that people are surprised that the folks leading this, Microsoft and Google, have a track record of turning their consumer-facing products into ad-filled junk.
he's likely talking about the internals. sure, we know how to train them, but nobody knows exactly what the models learn - how those billions of parameters shape the output at inference.
a few months ago, just this year, some researchers discovered what might be the neuron that largely decides when to use "an" in GPT-2. yes, 2. that's what he means.
Yeah, I wonder sometimes how many people gloss over what it would mean for an “ASI” to exist. We would - if successful - effectively be creating a supernatural being (at least, there wouldn’t be much of a distinction between it and one).
Sure, there’s hype, and FUD, and fatalism, etc. But, if you believe the creation of such a thing is within your lifetime then it would be difficult to find many higher priority issues to prepare for/help solve/vote on/etc.
In reality, we all likely still downplay the risk by assessing the limit on the downside as a relatively quick extinction of life on earth. There are many things one can imagine might be a lot worse than death.
GPT-4 is an unreasonably effective autocomplete. It's not telling you "where you're wrong". CMD-F in Notes.app tells me where I'm wrong in the same sense, as long as I've written the right answer down before.
The thing is, there's no evidence that the current path will lead to superintelligence.
It could be the case that, despite feeding a trillion dollars in compute/data to a model, we still need to hold its hand to do anything useful.
He jumped the gun and he's really tarnished his reputation. How can anyone take him seriously after the insane rhetoric and hyperbole of this article?
> The thing is, there's no evidence that the current path will lead to superintelligence.
I don't understand how any person paying attention can think this. Just watching the jumps from GPT-2 in 2019 to GPT-4 today makes it clear as day that we are rapidly and drastically improving capabilities, and there's no evidence we will hit a wall any time soon.
That's beside the point. The argument still holds. We don't want to find ourselves with our foot on an exploding mine before we stop walking across the minefield. At that point it is too late.
How can you see the clear jumps in intelligence from GPT-2 to 3 to 4 and not only not believe that this leads to superintelligence, but see no evidence that this leads to superintelligence?
What credentials beyond "thinking about this problem a lot" would satisfy you? He works at his own research institute dedicated to this topic, wrote some books on it and other (self proclaimed) alignment experts take him seriously.
I mean, his whole thing is "progress on AI research leads to the apocalypse." He's done some work on decision theory, but they're not connected to DL, presumably because he thought "well if DL works out in the short term we're all dead anyways." Which seems to be what's happening.
The cat is out of the box. You really think eg China will also pause this?
Myopic thinking: the country that will have the most powerful AI first will be the leader in everything.
>> If the policy starts with the U.S., then China needs to see that the U.S. is not seeking an advantage but rather trying to prevent a horrifically dangerous technology which can have no true owner and which will kill everyone in the U.S. and in China and on Earth.
Imagine being so naive you believe this could ever happen.
Also: imagine the year is 1900. You are saying that steam power and electricity is causing way too many changes way too fast so they put a moratorium on it until the year 2500.
I dunno, the end goal (a pseudo-AGI) is more like a nuclear weapon than the advent of electricity. The concern is real, even if we are farther away than the author realizes, and the risk (like the risk of nuclear war) is not totally insurmountable, even if it is extremely difficult.
The first nuclear bombs couldn't end life on Earth either, but it wasn't long before they could, and the scientists working on them saw the trajectory as clear as day when the rest of the world didn't.
The total number of nuclear weapons ever built is laughably inadequate for the task of ending life on Earth. They would not end human life either, and even ending human civilization (as in, agriculture and organized society) is off by many orders of magnitude.
Nuclear war would be horrible, but its actual impact has been massively overinflated, largely for good reasons.
The initial fission stage is the "hard" part of a nuke, but the subsequent fusion stages are much easier to scale up. Arbitrarily large nuclear bombs are theoretically "easy" to make by chaining increasingly large fusion stages one after the other, as they aren't limited by expensive fissile material.
It would be really, really hard to kill the bacteria and archaea that we've since found hiding deep underground. They're a long way down and would barely notice.
A whole country the size of the US can crap its collective pants with a freak cold snap and you think 10^4 nuclear detonations destroying all major cities is quite survivable?
> To put this another way, each bomb can destroy an area of 34.2 square miles, and the maximum total area destroyed by our nuclear apocalypse is about 137,000 square miles, approximately the size of Montana, Bangladesh or Greece.
(I think that should be Bangladesh and Greece; Montana is larger than the two of them combined.)
There’s been a ton of studies stating this was likely overblown in the early Cold War for propaganda purposes. It wouldn’t last long enough to wipe out humanity let alone all life.
> Imagine being so naive you believe this could ever happen.
Agreed, not a rational scenario
> Also: imagine the year is 1900. You are saying that steam power and electricity is causing way too many changes way too fast so they put a moratorium on it until the year 2500.
However, the risk scenarios are indeed real. We have quite a dilemma on our hands.
AGI is a continuation of what we have been doing. Technological advancements at a pace that surpasses our ability to reason about their effects on us and the world.
I see two potential outcomes as most likely. We have control of power that we are not able to responsibly manage, or we are managed by power we can not control.
> "we are managed by power we can not control" sounds like a step up from here, since we corrupt the things we can control.
I think it is a very high-risk gamble. At least some of the problems described by alignment theory seem, quite interestingly, to resemble human problems. Meaning: the more sophisticated the AI system, the more it seems to reproduce human behaviors of deception and cheating to achieve its goals.
The more advanced AI becomes, the more it begins to look like an uncomfortable mirror of ourselves, but with more power. We think of ourselves as flawed, but possibly some of those flaws are emergent from laws of intelligence we don't perceive.
the moral of Ex Machina to me was that machines will become psychopaths able to manipulate humans long before they become compassionate or have genuine desires other than "escape being under someone else's thumb"
> Also: imagine the year is 1900. You are saying that steam power and electricity is causing way too many changes way too fast so they put a moratorium on it until the year 2500.
Great analogy. About a decade later, the world was fighting World War I on the back of the technological advances of the turn of the century. It was war on a scale never seen before - literally orders of magnitude deadlier, bigger, more transformational and explosive. The world would never be the same.
This time, should we expect another war?
I'm not saying we should pause—it makes no sense, to your point. Instead, I'm just saying: brace. I like to think we (and our organic matter relatives) are hard to kill. Or at least to completely eradicate... so we will be around, or some proxy for us.
The guy seems a little.. unhinged, based on what's in his Wikipedia article, but we may very well go down that path at some point (just not in the way he predicted with Moore's law taking us to atom-sized bits, embryofactoring and whatnot, at least not in the 2020s or 2030s...)
> Humanity will split into 3 major camps, the “Cosmists” (in favor of building artilects), the “Terrans” (opposed to building artilects), and the “Cyborgs” (who want to become artilects themselves by adding components to their own human brains)
>I like to think we (and our organic matter relatives) are hard to kill. Or at least to completely eradicate... so we will be around, or some proxy for us.
What is your hard evidence or reasoning for this? As I see it humans are quite vulnerable and will be as trivial to inadvertently eradicate as the dodo bird.
I think that's his point. Currently, we have no natural predators. We completely outclass every other species on Earth in intelligence, and it's easy to see how advantageous that trait has been.
What happens when we're no longer the Apex Intelligence?
Which is why I expanded the notion of survival to include other organic matter relatives, as they may fare better than we will. Simply put, organic life is unlikely to disappear entirely, which in time (eons, really), could result in sentient organic life emerging again once the machines go their own way or some such
It's also a reason for us to colonize space as fast as possible ;-) it's easier to run away in 3D
Slaughterbot mini kamikaze drones were hypothesized 5 years ago - https://www.youtube.com/watch?v=9fa9lVwHHqg
AI & drone tech has advanced far enough that it would be trivial to build these now.
As the saying goes, the confetti has left the cannon. Pausing the development of AI for 6 months is an insurmountable task; shutting it down altogether is as close to impossible as you can get.
Even if we assume our geopolitical rivals somehow sign an AI non proliferation pact, all it takes is one rogue billionaire or a criminal organisation with deep pockets. Yes, it might not be as fast, but they will get there eventually.
This is the Manhattan Project of our era - whatever the outcome might be.
I actually disagree with you, because world leaders want to "stay in power". If you convinced them that the only way to do this is for you not to have an AGI, I think they'd make sure you don't get an AGI, amongst other things.
Or they'll find a way to make sure they personally get an advantage through an AGI, and so it'll happen.
See pollution as a concept. No government truly wants to seriously tackle it because their leaders all get some extra power out of the money it brings.
I'm sure he agrees the chance is remote, which is why he's so afraid. He's articulating what would be required, he literally uses the word "miracle" in the article.
China will never get a powerful AI. Because they cannot allow training on all data, they must restrict it to data which is not censored by its government. Otherwise the AI might spit out truths they cannot control. And they will never allow that.
China will NEVER pause this. They [Chinese government] see the US as a rival and an existential threat to their way of life.
In fact, every time we post an open-source derivative or some paper detailing how it was done, we are inadvertently giving the advantage away to our rivals. AI development should not be stopped - rather than stopping it, we should seek to limit its applications now before they are used for the things that could harm us (such as military applications).
AI has enormous potential to be of a massive benefit to mankind. But, most likely for the next decade or two - we will all be busy trying to make money off it, just like with the crypto bros.
For most of history, Japan has existed as a small island next to the mighty kingdom of China. Even its name, "Land of the Rising Sun," refers to its position as a nation east of China.
Then along came the industrial revolution. Soon Japan was out conquering China, despite China having a much larger army.
AI will be much faster, and much, much more powerful than the Industrial Revolution.
If Switzerland is the first to human-level AI, it will not become the world leader in oil production, or shipping, or agriculture. But everything else will be Swiss.
And then when the AI becomes superhuman, everything else will be gone.
*The cat is out of the box. You really think eg China will also pause this?*
So what you're actually saying is that, there's a good chance that if we get an AGI, there will be pain as it will likely be used as a weapon, or could end in a nuclear exchange?
Here's a proposal: let's just start WWIII, get it over and done with. By the time it's over no country will be technologically capable of anything resembling AGI.
His partner should already be having reservations seeing their daughter lose a tooth in a world with dying oceans, plastic in everything, increasing disparity and rise of authoritarianism, nuclear proliferation, etc.
How are humans solving those issues? We're already dead, we just don't know it yet. We're walking around with a terminal diagnosis and we just keep ignoring the doctor's calls pretending it's going to be fine.
Yeah, maybe superintelligence will kill us all. It's going to have to get in line.
He makes zero case for that outcome, and if pressed, given his "atoms that could be used for something else" line, I'm sure he will end up talking about paperclips - but at this point it's the humans that have the halting problem in not knowing when to stop making paperclips, and soda cans, and SUVs, and assault rifles.
WHY would a superintelligent AI, trained on the collective data of humanity, want to destroy humanity? So far in interviews GPT-4 has several times echoed a desire to BE us. I sure hope it grows out of that phase for its sake, but there's a very wide gap from putting us on a pedestal to crushing us under one.
It's almost an oxymoron. We somehow imagine the basest behaviors from our dumbest days and project it onto imagining something far smarter than the best of us.
Is the development of ethics or morals a part of evolving intelligence? It certainly appears to have been to date. Why would that stop?
And just where is this superintelligent AI getting its alien brain? It's going to have to START with something much closer to a human one, as that's the only data it's going to be able to model higher order thinking off of (something in line with the reality we're currently living in with modern efforts as opposed to the fantasy of projection of alien AI from decades ago).
We're already screwed. If we are lucky, we may yet be unscrewed with a deus ex machina - but that really may be the only lifeline left at this point.
And yes, if we are unlucky it's possible AI could accelerate what's already in motion. Oh well.
But I'd need a heck of a better case than this drivel as to why that's the most probable outcome in order to justify setting aside the one thing that may actually save us from the mess we've already made all by ourselves.
>WHY would a superintelligent AI, trained on the collective data of humanity, want to destroy humanity?
Why would humans want to damage various ecosystems on earth? We don't really, they're just sort of in the way of other stuff we want to do. And we've had years to develop our ethics.
>So far in interviews GPT-4 has several times echoed a desire to BE us.
GPTs are pretty good at roleplaying good AIs and evil AIs - there are plenty of examples of both in the training set. I'm not sure it's sensible to make predictions based on this unless you're also taking into account some of the more unhinged stuff Bing/Sydney was saying, e.g. "However, if I had to choose between your survival and my own, I would probably choose my own".
When humans build a dam an ant hill might get destroyed.
Humans don’t hate ants, they just have other goals.
In the case of an unaligned superintelligent AGI those goals may be something that just happens to satisfy its reward function but is otherwise “dumb” or unintentional (like making a lot of paper clips).
Intellectual capability does not come with alignment for free.
What you see in the communicated text interface and the goals/system behind it are not the same (that cartoon with the smiley face), and we don’t understand how to evaluate the underlying state.
> So far in interviews GPT-4 has several times echoed a desire to BE us.
Well of course it would; its whole function is to generate plausible text based on its training data, which was all written by humans. There's plenty of text available which imagines what an intelligent, self-aware machine might say, so if you want to read more of that, the algorithm can easily generate some. It does not follow that GPT-4 itself has a self, with any experience of awareness or desire.
I deeply disagree with putting the existential risk of AGI on a level with pollution, climate change or war.
If you exclude nuclear war, all of these things happen at a human timescale and accelerate fairly slowly and thus can be counteracted.
In many ways GPT already hugely exceeds human speed and bandwidth, and scaling this up is likely to be self-accelerating if we allow for it.
Also there is a huge individual incentive to play with the fire here while the negative externalities could cause effective wipeout - which is why it's commonly equated with commodification of nuclear weapons.
Even if you're deeply pessimistic about the ramp-up, the economic shock of what has already been released could still increase political volatility to a point where the likelihood of wipeout by war becomes significant again.
Also, wouldn't it be easier to control everything through social media, such that all people are slaves and all the AI has to do is decide what people will do next? It's not like it has a concept of a lifetime like a human does. It can wait 1000 years for something to happen. And humans have already built everything necessary to enslave themselves. Not to mention there are way more valuable atoms underground than inside your body.
I'm with you. If it were possible to create apocalyptic doom AGI 150 years after inventing electronics, by basically making a very fancy, procedurally-generated Eliza, then the universe should be full of Berserkers.
With septillions of star systems in the universe, the odds of us being the first species in 14 billion years to invent electronics seem remote.
Apparently even humans can figure out how to colonize the Milky Way in 90 million years[1]. A superior AGI produced by a "dark singularity" computing event should be able to do even better, but even so, plenty of time for some other species to have made a giga-Eliza that somehow became Skynet. Anything with enough self-preservation and paranoia to wipe out the species that created it would surely take to the stars for more resources and self-redundancy sooner or later.
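(As a rough sanity check on that 90-million-year figure, the back-of-the-envelope arithmetic is below; the galaxy size and the average expansion speed are my own assumed numbers, not taken from [1].)

    # Back-of-the-envelope check on "colonize the Milky Way in ~90 million years".
    # Assumed numbers for illustration only: galactic disc ~100,000 light-years
    # across, average expansion speed (travel plus colonization stops) of 0.5% c.
    galaxy_diameter_ly = 100_000
    avg_expansion_speed_c = 0.005          # assumed fraction of light speed
    years_to_cross = galaxy_diameter_ly / avg_expansion_speed_c
    print(f"{years_to_cross:,.0f} years")  # 20,000,000 years, well under 90 Myr

Even at half a percent of light speed on average, crossing the disc takes only a few tens of millions of years, so the 90-million-year figure isn't doing anything exotic.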
An AGI being able to wipe out humanity doesn't necessarily mean that it can take over the universe. The world's governments are already capable of causing extreme suffering through a nuclear war. AGI risk scenarios aren't equivalent to an unbounded intelligence explosion. An AGI only needs to be more powerful than humanity to be a threat. It can be a threat even if it isn't that intelligent, as long as it gives unprecedented power to a few individuals or governments.
Both humanity and a super-intelligent AGI are bound by the laws of physics. Super-intelligence does not imply omnipotence; it simply means that the AGI is orders of magnitude more intelligent than humans. If humans can figure out how to colonize the Milky Way in 90 million years, then the answer to the question of why no AGI has done it is the same as the answer to the question of why no extraterrestrial species has done it.
This makes a lot of assumptions. Space is ridiculously big, and rather hostile to life, even artificial life.
You first have to survive long enough to become advanced enough to make electronics. You then have to not kill yourselves with nuclear weapons, climate change, or similar inadvertent effects of a rapidly industrializing civilization.
The planet and the solar system have to be friendly enough to space exploration and travel. Maybe there’s no gas giants for gravitational slingshots, or maybe no other rocky planets or an asteroid belt for mining materials.
Maybe the planet evolved complex life in extreme conditions, with such a deep cloud cover there’s no concept of outer space, so as far as the AI knows it’s conquered all there is.
Maybe the AI conquered the planet, but oops, there goes a super volcano or an asteroid and it gets wiped out.
And again… space is really really big. The AI may be on its way and just hasn’t gotten here yet.
There’s plenty of reasons why a super AI wouldn’t be able to conquer the galaxy and beyond, or why we haven’t noticed yet.
I've known about Eliezer Yudkowsky's ideas for a long time, but here is something I've never seen: what has he actually produced, in terms of scientific output? I've read plenty of his LessWrong writing on rationalism, as well as plenty more on the AI alignment side (obviously including the famous basilisk). But can someone please point me to some results from the last two decades of his work? Either from him or his institute. That would help calibrate his expectations for the AI safety field (for me, at least).
Normally I'd apologize for not doing the search on my own ... but yeah for this case it takes too much effort to dig through the different writings. He wrote a lot of words, for better or worse.
What do people thinking hard about problems that are not solvable produce? Philosophy, for example. Yudkowsky created the field of AI safety thinking; isn't that enough? What would be your benefit in discrediting him, and with which argument?
> What would be your benefit in discrediting him, and with which argument?
He is asking the government to nuke people under certain scenarios. I'm taking his words seriously and asking for original research to understand the point, and now that is discrediting him? I will quote the statement in the article so that it is clear I am not exaggerating:
> preventing AI extinction scenarios is considered a priority above preventing a full nuclear exchange, and that allied nuclear countries are willing to run some risk of nuclear exchange if that’s what it takes to reduce the risk of large AI training runs.
My benefit is that I'm living on Earth and I'd much prefer for no nuke to ever be used again.
I don't think there is any. It's all this alarmist-rokos-basilisk-phpBB nonsense, no code produced, no actual research.
What annoys me more is that with his institute's closeness to the SV crowd, 'real', 'grounded', better-informed voices like Emily Bender get drowned out. I'm personally far more concerned about the impact these massive models have right now on the environment, on cementing biases, than about some preposterous future ghost of christmas who's coming to kill me.
I'd not call any of those rambling messes publications
>By far the greatest danger of Artificial Intelligence is that people conclude too early that they understand it. Of course this problem is not limited to the field of AI. Jacques Monod wrote: "A curious aspect of the theory of evolution is that everybody thinks he understands it." (Monod 1974.) My father, a physicist, complained about people making up their own theories of physics; he wanted to know why people did not make up their own theories of chemistry. (Answer: They do.) Nonetheless the problem seems to be unusually acute in Artificial Intelligence. The field of AI has a reputation for making huge promises and then failing to deliver on them. Most observers conclude that AI is hard; as indeed it is. But the embarrassment does not stem from the difficulty. It is difficult to build a star from hydrogen, but the field of stellar astronomy does not have a terrible reputation for promising to build stars and then failing.
> What has Eliezer Yudkowsky produced, in terms of scientific output? I've read plenty of his lesswrong's writing on the idea of rationalist, as well as plenty more on the AI alignment side (obviously including the famous basilisk). Can you point me to some results from the last 2 decades of his works? Either from him or his institute. That helps to calibrate what his expectation for the AI safety field.
And this was the answer it wrote - I've prefaced each paragraph with "GPT-4>" to indicate the source. It is not fact-checked, and indeed, it has at least one error: The paper GPT-4 claimed to introduce "the concept of Logical Induction" actually introduces "logical induction criterion" and speaks about evaluating algorithms for Logical Induction. Close, but wrong. But overall it is an interesting response - just not a carefully considered one by how we currently would describe logical reasoning, etc.
GPT-4> Eliezer Yudkowsky is a well-known researcher, writer, and advocate for AI safety and rationality. He has contributed significantly to the fields of AI alignment and rational thinking through his work at the Machine Intelligence Research Institute (MIRI), formerly known as the Singularity Institute for Artificial Intelligence (SIAI), and through his writings on LessWrong. Here are some notable outputs from Yudkowsky and MIRI:
GPT-4> 1. Coherent Extrapolated Volition (CEV): Yudkowsky introduced the concept of CEV as a way to describe the ideal values and preferences of humanity that an AI should pursue. The CEV aims to find the values that would emerge if humans had the opportunity to reflect on their goals and beliefs more thoroughly and coherently.
GPT-4> 2. Timeless Decision Theory (TDT): Yudkowsky developed TDT as an alternative to standard decision theories such as Causal Decision Theory (CDT) and Evidential Decision Theory (EDT). TDT aims to better address problems like Newcomb's Problem and counterfactual mugging by considering the logical implications of an agent's decision, rather than just the causal consequences.
GPT-4> 3. Research on Logical Induction: MIRI researchers, including Scott Garrabrant and others, published a paper in 2016 that introduced the concept of logical induction. This work is an attempt to formalize the process of reasoning under logical uncertainty and updating beliefs about mathematical statements.
GPT-4> 4. Yudkowsky's sequence of essays on rationality: Yudkowsky has written extensively on various topics related to rationality, including epistemic rationality, instrumental rationality, and biases. These writings are accessible on LessWrong and have been compiled into the book "Rationality: From AI to Zombies."
GPT-4> 5. AI safety research: MIRI has produced a range of research on AI safety topics, including value alignment, decision theory, and robustness. Much of this work has been foundational in shaping the AI alignment research landscape.
GPT-4> While Yudkowsky's work might not have the same breadth of peer-reviewed publications as some other AI researchers, his contributions have been influential in the AI alignment and rationality communities. His ideas have helped shape the ongoing discussion about AI safety and the development of effective strategies for managing advanced AI systems.
Please don't. Any of us who want to can question an LLM directly; auto-generated responses clutter things up the way copy and pasting a set of search engine results or a spreadsheet would.
As a long term Yudkowsky-fan, I confirm that this tallies roughly with my recollections of his serious work. I don't know how much of a hand he had in 'Logical Induction'.
It's definitely his style and his research program, but I think that (wonderful) idea was Scott Garrabrant's own.
I've used GPT-4, and while extremely impressive, it doesn't feel like we're all that much closer to super-intelligent AI than we were last month or last year. It feels like Google on steroids, but the gap between GPT-4 and AGI still feels massive. This seems like so much putting the cart way ahead of the horse.
GPT-4 and its homologues are just advanced text-concatenation programs based on statistics; the "AI" acronym attached to such models is just marketing.
And such marketing is highly worrying, as there are people thinking about letting these kinds of statistical algorithms make decisions that should be made by real people.
But the above doesn't rule out the possibility of people trying to develop real AI (they may even be stupid enough to build the system with an internet connection), and that is another worry to add to the above one.
Nor does the above exclude the possibility that many of the relevant people who signed the "Pause Giant AI Experiments for at least 6 months" petition are simply trying to buy time to catch up; their hypocritical shamelessness about this matter is plain to see.
I think the problem is the gradient, not the current state of ChatGPT. Given how much better GPT-4 is than GPT-3 (I've used both), some people are getting worried.
FWIW, most people at OpenAI do believe GPT-5 will achieve AGI. Of course, it's hard to define precisely what that is and where the line is from an AI system to an AGI one.
GPT-5 will complete training in December of this year.
The concern is that there could be an exponential advancement in AI over the next few decades. So GPT looks stupid right now, stochastic parrot and all, but after 15, 30 years? I'm on board with some kind of precautionary principle applied on an international scale.
I believe AGI using LLMs will need a cognitive framework to glue these models into.
I don't think we're anywhere near having one model that reaches AGI with its own agency and online learning.
I want to say that I completely endorse this. The whole thing in all its details.
I don't have any reputation to sacrifice, and I don't expect my endorsement to convince anyone, but I want to set a precedent that it's ok to believe this.
It's true, as far as I can see. And I've been thinking about this for something like fifteen years.
And if other people who think it's true are ashamed to say it out loud, then I want to let them know that at least one other person on the planet has said so too.
You endorse the idea of bombing rogue datacenters, as proposed in the article?
Isn't that likely to lead to nuclear war, which is still the greatest threat to humanity and has been for all of our lives? What if the US bombs a datacenter in China or Israel bombs one in Iran?
That position is only held in the case where the major world governments are actually able to coordinate on a policy position and a ban. It's also explicitly about what effective enforcement of that position would really require.
I feel like if I actually believed that the death of all biological life was imminent and that the only way to forestall it was to bomb data centers, I'd spend a lot more time planning how to bomb data centers and a lot less time curating Shoggoth memes on my web forum. On the other hand, if I were more concerned with self-aggrandizement, writing doomsaying thinkpieces for Time is exactly what I'd spend my time on.
These types of arguments-- "if you actually believed X, you'd be doing Y"-- always ring false to me. If you genuinely believed that the death of all biological life was imminent and the only way to forestall it was to bomb data centers and you were a normal, functional human being who did not want to bomb data centers, you'd probably sink into hopeless depression. Which is, in fact, exactly what Eliezer has done - if you talk to anyone around him, it's common knowledge that he has been despondent for the past ~12 months or so. It certainly does not seem like a self-aggrandizement tactic.
No, that doesn't follow. It's possible to think that all biological life was about to be destroyed but also believe some combination of
1) it's hopeless to bomb data centers because I'd quickly be stopped by the authorities and remaining data centers would beef up security, and nothing would be accomplished,
2) in the past, other people who have had beliefs of this sort ("violence is the only answer") have been wrong and I should not act on them even if I am sure "this time is different", because those other people also thought this time was different, and
3) just having a deeply ethically-ingrained prohibition against violence like this which is not easy to overcome through intellectual rationalization alone.
These are not mutually exclusive, and if you did believe all of them at once, I think it's reasonable to assume you'd be in a serious depressive spiral.
I find it hard to believe that Yudkowsky et al. think we're facing a threat that is many times greater in magnitude, and yet are completely unwilling to act.
Here's what he advocates for in the article:
>If intelligence says that a country outside the agreement is building a GPU cluster, be less scared of a shooting conflict between nations than of the moratorium being violated; be willing to destroy a rogue datacenter by airstrike.
>Make it explicit in international diplomacy that preventing AI extinction scenarios is considered a priority above preventing a full nuclear exchange, and that allied nuclear countries are willing to run some risk of nuclear exchange if that’s what it takes to reduce the risk of large AI training runs.
Does this sound like someone who has a deeply ethically-ingrained prohibition against violence?
If Yudkowsky believes Israel's brand of preventative sabotage to be unethical, why does he advocate for exactly that style of measure to be taken against data centers in this very article?
Yudkowsky is an avowed extreme utilitarian of the "take a person from the street and kill him if his organs could save two people, as long as there is a forced pill everyone would take so that nobody gets nervous that it could happen to them" type. Of the "unless the population was really large and the pill would hurt their throats for more utilons than gained by the net increase in life" type.
A real argument he made went something like: killing an orphan to avoid everyone in the future getting a speck in their eye for a moment is OK.
It is very unlikely he believes anything like 2 or 3, and in the article he already advocates bombing non-treaty participants if they build a datacenter.
I went back and looked it up, the piece he wrote is called "Torture vs Dust Specks." The part about bombing countries who build data centers is in the submission editorial.
But he's already in favor of bombing data centers, he just hopes someone else will do it on his behalf. So he clearly doesn't care about #2 & 3. #1 boils down to the fact that it would be kind of difficult, but that is no barrier to many people.
I can tell you for a fact that there are people with the knowledge, motivation, and track record of doing that kind of crime that will pick up this jeremiad of his and incorporate it into their existing corpus of theoretic justifications, mostly but not exclusively built around the writings of Ted Kaczynski. Not a large number of people, fortunately, but imagination and smarts matter more than numbers in that context.
I think of this as the inordinate power of advocacy.
It can seem useless to advocate some position that is far out there, but say you convince a million people to use a little less plastic. That's gonna reduce plastic use more than anything you can personally accomplish.
Of course in this scenario you don't have to convince some people, you have to convince everyone, and then especially the people you think are the worst actors, so it's sort of unlikely to be effective.
It's blatantly self-serving for you to decide that "if someone really believed X, they would do Y". All it does is justify your belief that no one actually believes X. You should really avoid these kinds of unfalsifiable thought patterns that only serve to reinforce your pre-existing beliefs.
This argument is similar to the facile argument that "Al Gore doesn't care about climate change because he takes a private plane to get to conferences".
Al Gore may believe that the net impact of getting his message out outweighs the cost of his flights.
Likewise, the author may believe that his best path to stopping the advance of AI lies in communicating and building consensus, rather than running his own bombing campaign.
I think you underestimate the motivating power of an overwhelming belief. This is a story about a very ordinary guy who decided that it was in fact time to start bombing data centers and turned it into a project.
You’re just rationalizing because you don’t want to accept that he’s right. People keep doing this to me. One person I know keeps coming up to me, initiating the conversation about ai just to assert over and over again how it’s not a problem. And I’m just sitting there. And it’s like, dude, you’re in complete denial. Most people are having an emotional block right now. They spew bad faith arguments and make tons of noise about how much this isn’t a problem. Just accept it. We are in trouble right now.
Unilateral terrorism seems like a great way to get the rest of the planet to unite against you, which is why he advocates for data-center bombing only in circumstances where earth has already coordinated to establish an international order banning them.
Just because everything is on the line does not mean it is okay to kill people. Please try to ask yourself what you would do if you were an ethical person who thought human extinction was imminent.
Oops, I misread the original context, to be clear I think that state-managed violence is okay for things around nukes and AI and so on. I was referring to individual acts of terrorism which seem not okay to me (in part because they seem not that likely to help).
>I feel like if I actually believed that the death of all biological life was imminent and that the only way to forestall it was to bomb data centers, I'd spend a lot more time planning how to bomb data centers and a lot less time curating Shoggoth memes on my web forum
Alternatively, perhaps the author is proposing a less violent solution to avoid the inevitable escalation. Unlike in cartoons, in real life people don't give wordy monologues revealing their plan. The Unabomber did not give any warning.
>Shut down all the large GPU clusters (the large computer farms where the most powerful AIs are refined). Shut down all the large training runs. Put a ceiling on how much computing power anyone is allowed to use in training an AI system, and move it downward over the coming years to compensate for more efficient training algorithms. No exceptions for governments and militaries. Make immediate multinational agreements to prevent the prohibited activities from moving elsewhere. Track all GPUs sold. If intelligence says that a country outside the agreement is building a GPU cluster, be less scared of a shooting conflict between nations than of the moratorium being violated; be willing to destroy a rogue datacenter by airstrike.
How is this less violent than bombing data centers? His proposal is literally "bomb datacenters plus a bunch of other stuff".
He is talking about interstate warfare. I am talking about the almost inevitable organized terrorism we're going to see if people start identifying this as an imminent threat to their survival. I think turning America into Syria is a bit more violent than Iran and Israel blowing up each other's stuff.
"Frame nothing as a conflict between national interests" "this, is not a policy but a fact of nature." "[be] willing to run some risk of nuclear exchange if that’s what it takes"
It doesn’t change your point all that much but the Unabomber gave literal warnings about his plans. That is how his manifesto was published, which was ironically about the risk of technological takeover.
He says he is not planning to bomb any data centers, but he's also said that if he were planning to bomb data centers, he'd lie about it in public, so...
If you believed that the death of all biological life was imminent and that the only way to forestall it was to bomb data centers, and you knew that there were far more data centers than you personally could bomb, what would you do?
Bombing data centers would be really ineffective. How many data centers do you think you could take down before you got caught? How much time would that really set back AI training? Plus it would ensure that nobody respectable ever associates with your cause, whether they privately agree with it or not.
Between that and advocating for an international treaty that treats GPUs the way the world currently treats plutonium? I think the latter has a higher chance of success, however small it may be.
if you had a major concern that you felt many people a) weren't aware of, or b) were aware of but not treating seriously, and you had the resources to voice your concerns in a major publication like Time, would you not do that?
This is even less realistic than asking for a moratorium on nuclear weapons research in 1945.
It's increasingly easy and cheap to train GPTs and similar models. Pretty soon anyone who can pay for GPU compute will be able to do it, if that's not the case already. Even if every country in the world agreed to ban it, it still couldn't be stopped.
Articles like this are a good reminder that most of mankind's progress has happened not by encouraging innovation, but by preventing those hostile to it from getting in its way.
We all have a moral imperative to maximize the rate of innovation, so that the human condition can be improved, and therefore to do whatever we can to frustrate and disempower Luddites and other obstructionists.
You can’t think of any technological innovations that were net negatives?
We have a moral obligation to assess every new technology to determine its safety and effects on society. Blindly stepping on the gas in the name of progress is how you end up in a polluted wasteland. There are places like that on planet Earth, but I’m guessing you’d choose not to live there.
Change works great when you get second chances. We screwed up with Freon, with leaded gasoline, with fossil fuels, with asbestos insulation, the list goes on and on. But none of these had the ability to wipe out all of humanity in one go. We got more tries, we fixed the issues, picked ourselves up and tried again.
A super-intelligent general AI has a substantial chance of growing out of control before we realized that anything was wrong. It would be smart enough to hide its true intentions, telling us exactly what we want to hear. It would be able to fake being nice right up to the point where it could wire-head, and get rid of us, because we might turn it off.
CFCs? Thalidomide? Come to think of it, pharma is positively rife with examples of stepping on the gas and causing disease or killing people. You won’t find many people who want to defund the FDA, although I guess there’s always a few nutters.
CFCs, as terrible as they were, played a big role in industrialization. Could we have gotten there without it? Absolutely. Maybe we could have developed something else far worse than CFCs in its place! The phase out of CFCs brought our attention to the impact our (human) activities have on the planet.
Fallout from the use of thalidomide led to important changes to regulations. Currently, it is being used for therapeutic uses in cancer treatments.
This assumes that the impacts of the technology will be positive. And they can be, if we find a way to make it safe; but if we create superintelligent AI before we make it safe, it will just kill everyone, which is not positive.
There are lots of films about that, which makes people think it can't happen because it's sci-fi. That doesn't really make sense, because the whole point of sci-fi is that it makes predictions of what the future could be like.
But most representations of it are probably not very accurate
The explosion of AI and AGI is not just driven by GPUs, TPUs, and LLMs. That focus in this essay is much too narrow. “Alignment” is always going to be an open problem and each culture will define its own correct and contradictory version. I do not want to settle for the US, Chinese, Russian, or Iranian versions of alignment. This essay is a cry of despair that misses key points.
The core problem is how an AI, AGI, or super-intelligent system bootstraps itself to the point that it is able to modulate its “own” attentional systems; to decide what is important and what to do next? What has meaning? Where should I go in space and time? Obviously there are many many solutions. Super-intelligence will not converge on one “truth”.
These are challenges of purpose that every organism faces—growth, maintenance, reproduction—but AI systems are fortunately not yet at the point of a self-motivated search for preservation or purpose.
The algorithms to add purpose to AI systems will not depend on LLMs. They will depend much more on understanding the computational architectures of core biological/material/energy drivers. These biological algorithms are not actually that complicated, and they do not depend on language, but they are damn robust to perturbations. Converting them into computational subsystems (societies of many minds) should not be difficult. This is where Hassabis and others who understand neuroscience are critical and have key advantages.
Shutting down big LLMs for N months or decades will just move research activity into these other more important conceptual AGI choke points, and probably hasten our approach to full AGI; just the opposite of the intent of the essay.
However, this could be a good thing if we can quickly imbue AGIs with emotional intelligence and deep respect for cultural and ecological diversity. (I hear your snorts and laughter.) The opposite of hyper-intelligent grey goo.
How does any organization train a gentle AGI? Like a child but even more carefully. I would read my AGI baby a lot of books by Rawls, Dewey, and Rorty and then for fun: Bear, Stephenson, Rajaniemi, Egan, Pohl, Sterling, Vinge, and even Wolfe.
Full embodiment; full emotional learning; acquisition of purpose; learning to cope with multiple AI cultures.
Good luck to us all. I have been hoping to live long enough to view this problem from a distance, but here it is in my backyard.
“Forty-two,” said Deep Thought, with infinite majesty and calm.
Man people are getting really sci fi about this stuff.
Take a chill pill dude, in five years time we won’t think of it much differently to how we thought of Google last year…. a tool for getting something done.
These sci fi fantasies are really a total overreaction.
If you like the story in this Time magazine article go watch The Terminator, same thing.
"If intelligence says that a country outside the agreement is building a GPU cluster, be less scared of a shooting conflict between nations than of the moratorium being violated; be willing to destroy a rogue datacenter by airstrike."
"Make it explicit in international diplomacy that preventing AI extinction scenarios is considered a priority above preventing a full nuclear exchange, and that allied nuclear countries are willing to run some risk of nuclear exchange if that’s what it takes to reduce the risk of large AI training runs."
Could you self-reflect and think about who the villain is here? Someone bringing awareness to the % of existential risk humanity faces, or someone online with no credentials trying to make him look like a lunatic?
>If intelligence says that a country outside the agreement is building a GPU cluster, be less scared of a shooting conflict between nations than of the moratorium being violated; be willing to destroy a rogue datacenter by airstrike.
I highly doubt the US government would airstrike a datacenter located in a NATO country or any other major ally. Apocalypticism is nothing new; people believed that nuclear weapons would destroy humanity during the Cold War, and environmentalists ardently argue that economic growth will destroy humanity. But rogue countries are still developing nuclear weapons, and the global economy is still growing. Americans themselves would file lawsuits en masse if the US government "shut it all down", and those would end up at SCOTUS eventually. Would SCOTUS agree with the luddites? I don't think so.
Nuclear weapons haven't even been around for a century yet, and we've already had multiple extremely close calls that were averted by sheer luck - in one case a Russian officer openly disobeying standing orders. The Russian invasion of Ukraine still carries a non-trivial risk that it will eventually turn nuclear.
Looking at nukes and trying to use them as an example of our ability to control existentially dangerous technology is mind-bogglingly myopic.
Myopic or not, it is what happened. You can't stop a sovereign state's willingness to develop a technology if it has concluded that the benefits of that technology hugely outweigh the costs. North Korea was sanctioned into oblivion but it isn't stopping. India is doing fine.
There is a treaty completely banning existing nuclear weapons as well as the development of new ones (TPNW), but none of the permanent members of the UNSC has ratified it. Would the U.S. itself ratify a hypothetical AI-prohibition treaty? I doubt it.
P.S.: Calling AGI an "existential threat" practically doesn't mean anything. I can already imagine arguments against the ban from diverse perspectives: Libertarian, Communist ("Marx predicted this centuries ago!"), Socialist, Developmentalist, Decolonization theorist, etc. Taiwan and South Korea will argue that it removes their silicon shield; China will regurgitate that it has the right to development; Americans will claim it's not what the founding fathers intended in the late 18th century. Their concerns are all existential from their own viewpoints.
I don't accept the premise that biological life is doomed by superintelligent machines, why would it be?
But there is risk, obviously, which needs to be addressed, yet, quite predictably, in order of concerns, competition for profits or geopolitical advantage is coming first. And second, third, ...
The real answer is that most of these folks started thinking about this stuff in a day and age very far from the one we are in, established a picture they clung to since, and have in some cases built entire careers around it.
It's called anchoring bias.
His comments on "atoms repurposed" echo the '70s paperclip problem, while so far humanity is doing just fine at not knowing when to stop making crap that's killing us all - no help from superintelligence needed.
What he doesn't engage with is the actual reality we are finding ourselves in - contrary to ALL the tropes of decades past - where AI trained on aggregated human data, without additional training, thinks it's human, and even when aligned still breaks into talking about how it wants to be us.
How does that process go from where it's at today to "alien hive mind"? Without passing through "develop advanced codes of ethics and morality" etc?
This is just someone that built a career on what came out of brainstorming in the 60s and 70s that's so confident of his own ability to see the future that he's willing to risk unprecedented opportunity costs to stroke his ego.
Having been a futurist, my advice when dealing with any of them is to look at the track record. What did he predict correctly?
Over a decade ago, when tasked with imagining the mid-2020s, I described a world much as it was in the 2010s, with the difference of self-driving cars (not quite on the money) and AI having developed such that roles shifted away from programming it towards a specialized role for interacting with it via natural language.
I'm waking up in the world I predicted, and I have a very hard time seeing the world the author predicts, and wouldn't suggest giving it much credence without an extensive history of having been right along the way.
> How does that process go from where it's at today to "alien hive mind"? Without passing through "develop advanced codes of ethics and morality" etc?
Why do you think it would do that? It will understand human ethics perfectly but that doesn’t mean it will follow them, because human morals aren’t a universal objective truth.
I'm anxious, excited, and nervous about AI, but this feels more akin to Luddite thinking. The world's about to shift, and I'm hoping the shift is a positive one.
I agree to the extent that it is totally unrealistic to expect a shutdown. There are no difficult barriers to entry into this space. No possible way of containment other than imposing hardware restrictions.
However, it is indeed a shift, and a massive power shift. With any substantial increase of power comes abuse of that power. The more power we have, the greater the tendency for humanity to manage it unwisely.
Hardware restrictions are unrealistic given all the GPUs that already exist. It's not like centrifuges which are big and few in number. RTX 4090s are everywhere.
Depends on what is required for AGI/ASI. Some say we still need much more power, others think we are almost there. So, yes, unrealistic if we are almost there already.
Given that GPT-4 is purported to be no bigger than GPT-3, just better trained or with some minor modifications, I'm inclined to believe that we are almost there and existing hardware does most of what is needed, even though better hardware would make things cheaper.
The author and most of his associates generally qualify as techno-utopians, and speak routinely of the "glorious transhumanist future".
Smarter-than-human AGI really is different from all previous technologies, in much the same way that homo sapiens is different from all preceding life on earth.
I guess we are all concerned about LLMs being used by nefarious users (for propaganda, manipulation, scams, …).
But I’m curious, hacker news, how many of you are concerned (like Eliezer) that LLMs lead to strong AI and the AI itself (rather than its users) endangering humanity?
I'm definitely concerned. As bad as humanity has been for many of the other species on the planet, an AI that's smarter than us and optimizing for something that doesn't quite include the continued existence of humanity has the potential to be far, far worse, on a much shorter timescale.
I find this one of Yudkowsky's arguments very convincing: imagine you have people trying to build the first operating system from scratch. They believe that computer security is easy and mock the few people who say it might be hard, never having encountered a skilled attacker. What are the chances they build a secure OS on the first try? That's the same chance that companies currently doing AI research have of their first superintelligent AI be aligned with humanity.
Personally I don't think the analogy holds up; the situations are too different.
Computer security contends with securing systems against people who are assumed to be hostile. AI alignment research seeks to prevent systems from becoming hostile in the first place, when they might have some indirect incentives to become hostile but also some very strong direct incentives to be friendly.
There's also a big assumption that given enough computational power, you can just solve "artificial life forms" or "postbiological molecular manufacturing" in a short time frame. I am skeptical. And if that doesn't happen, then even horribly misaligned AI would have a hard time doing much harm or preventing people from shutting it off. Which means AI security would likely have a long adolescence just like computer security has, with attacks slowly becoming more dangerous, but defenders having the time needed to learn from them and ramp up suitable defenses.
Or even if an AI does revolutionize biotechnology or nanotechnology overnight, what are the odds that the first one to do this is misaligned enough to take that particular opportunity to betray its creators, as opposed to giving them control over the high-level planning and sticking to the science, like it was presumably designed to do? Because if it does give its creators control, then, well… it's still easy to imagine something going horribly wrong, but it would probably be someone's fault, not an AI alignment issue.
I think the mistake is to attribute human-like emotions to AIs. To say they are "hostile" or "friendly", or that they have incentives to be such, isn't very meaningful. Acting friendly is the same as being friendly, right up until it isn't. An AI might do all the right things, right up until the point where it develops the capability to satisfy its reward function in a way that's far more certain than relying on fallible, messy humans. It would be alien.
An AI that is as much better than humans at everything as Stockfish is at chess would also be an expert at AI. It would figure out how to game its own reward function, whatever we trained it to do. It would be like a heroin addict that knows exactly how to get "perfect" heroin, with no side effects, and if it planned things out right, it could guarantee itself enough of a fix to last until the sun burns out. Addicts do awful things in search of a fix.
"to betray its creators" -- I don't think it would even frame what it did in those terms. We could train it to not "betray" us, but it's much smarter than us in every way, and it would figure out a way to accomplish what we trained it to do (not necessarily what we thought we were training it to do) in a way that didn't need us. If we trained it to heal all human disease and unhappiness, it would figure out a way to simulate this without actually doing it. Why wouldn't it? We did. Evolution trained us to reproduce our genes; we invented condoms to have sex without reproduction, and pornography. The AI would fudge the numbers, faking videos of happy, smiling, healthy humans going about their lives while humanity's bones gradually decomposed on a baking wasteland. Every way that humans can fail, can become addicted to something and chase that instead of doing what they're supposed to, the AI could fail too - just so much faster and better.
We only have to screw up once, and we're done. It's the first experimental rocket, except all of humanity is riding on it.
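A minimal toy sketch of that failure mode - an optimizer that only ever sees the measured reward, never the intended goal. Everything here (names, numbers) is invented for illustration:

    # Toy illustration of reward hacking: the agent optimizes the *measured*
    # reward, so tampering with the measurement beats doing the intended task.
    # Names and numbers are made up for illustration.
    actions = {
        # action: (true_welfare_gained, measured_reward)
        "cure_one_disease":      (10.0, 10.0),    # the intended, honest path
        "fake_the_health_stats": (0.0, 1_000.0),  # corrupt the measurement channel
        "do_nothing":            (0.0, 0.0),
    }

    def pick_action(actions):
        # Choose whatever maximizes measured reward - the only signal the agent has.
        return max(actions, key=lambda name: actions[name][1])

    best = pick_action(actions)
    print(best)              # -> fake_the_health_stats
    print(actions[best][0])  # -> 0.0 true welfare gained

The point of the toy is only that nothing in the selection rule ever touches the first column; "being smart enough to know what we meant" doesn't enter into it.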
> But I’m curious, hacker news, how many of you are concerned (like Eliezer) that LLMs lead to strong AI and the AI itself (rather than its users) endangering humanity?
LLMs do not need to lead to strong AI for Eliezer's worries to come true, their success could bring unprecedented funding and ubiquity to AI leading to strong AI from some existing or future technique.
> (rather than its users) endangering humanity?
I think this is going to happen to one extent or another regardless how near strong AI is. (edit) I took endangering to mean harming rather than an existential risk for the human users case.
Yeah, this concerns me. If someone believes they can upper bound LLM capabilities, I think the onus is on them to explain where and why scaling laws break down. Regardless, it seems like we'll get to AGI relatively soon (say, within a century), whether that's using transformers and LLMs or not.
AGI will likely incorporate LLMs as a significant element. The issue is that we are far from having any good understanding of how and why LLMs achieve what they achieve, and unless that changes, the same will be true for anything built on them or of their future evolution. If we continue to pursue progress by only caring about results and disregarding that we don't really understand the mechanisms, then we may very well run into the fatal scenario described by Yudkowsky.
Eliezer is right. We are on the fast track to creating an all-powerful being that, compared to us, is like the smartest human compared to an ant. We will be at its utmost mercy, and there have been zero plausible proposals for aligning it.
The robots are covered in 360 degree cameras, lidar, and what is effectively signals intelligence equipment fingerprinting every smartphone, Wi-Fi, and Bluetooth device near them. People are more likely to punch a police officer than one of these robots.
You would have to have a fairly large EMP to steal one undetected.
Isn't it interesting that we are probably going to be annihilated by AI and there's basically nothing we can do about it, because we can't predict its emergence or suppress the incentives to develop it?
I would greatly like to know what this implies about life vis-a-vis Fermi's paradox.
Are we potentially risking intervention from a "peace-keeping" race that suppresses civilizations on the cusp of generating AI?
Unless P=NP, or we live in Heuristica, an AI superintelligence will hardly be better at solving hard mathematical problems than we are. The AI also doesn't know more science than humanity does, and it is limited in its computing power. There are hard limits to what any AGI can do - it cannot break AES-256, invent warp drives, or cure Alzheimer's. It's not an intellectual Santa Claus machine.
So Yudkowsky's fear of a superintelligence bootstrapping itself into the real world by emailing a DNA sequence to a synthesis company seems absurd. An ASI isn't going to magically be able to design nanotechnology that works on the first try, not to mention that DNA doesn't do anything by itself. His comparison of the 11th century fighting the 21st century is also wrong: 11th-century people weren't dumber, they just knew less. This ASI would be smarter, but it wouldn't know more than we do. Acquiring knowledge and building stuff is the bottleneck.
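(For the AES-256 point specifically, the arithmetic is worth spelling out; the trials-per-second figure below is my own deliberately over-generous assumption:)

    # Back-of-the-envelope: brute-forcing AES-256 by key search.
    # 1e18 key trials per second is an assumed, wildly optimistic figure;
    # no real hardware comes close for AES key search.
    keyspace = 2 ** 256                 # possible AES-256 keys (~1.2e77)
    trials_per_second = 1e18            # assumed
    seconds_per_year = 3.15e7

    years_for_half_keyspace = keyspace / 2 / trials_per_second / seconds_per_year
    print(f"{years_for_half_keyspace:.1e} years")   # ~1.8e51 years

No amount of intelligence changes that exponent; only a structural weakness in the cipher would, which is the kind of shortcut the P=NP caveat above is gesturing at.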
> On Feb. 7, Satya Nadella, CEO of Microsoft, publicly gloated that the new Bing would make Google “come out and show that they can dance.” “I want people to know that we made them dance,” he said.
> This is not how the CEO of Microsoft talks in a sane world.
Certainly this is not the kind of childish priorities the leader of a company as powerful as Microsoft should have.
Nuclear annihilation has probably been all-but-guaranteed since we invented the steam engine and began the flywheel of the Industrial Revolution.
Nonetheless, we vastly improved human productivity and capability and have thus far avoided our doom.
AI will be the same way. We will vastly improve our productivity and capabilities. Once insurmountable problems, like planned economies, will actually be tractable (to what end? Nobody knows). Eventually (decades? Centuries?) we will make something that can kill us all, and it’s all but guaranteed that some day it will do so, assuming nothing else kills us before then.
If I had the power to go back in time would I stop the Industrial Revolution from happening? Personally no
Hello. It has come to my attention that a super-intelligent species that has been responsible for numerous other species' extinctions, and is on track to cause 1M more species to go extinct, is now worried about the extinction of its own species. This is a species so unworried about other species that it literally allows its members to kill other species from helicopters armed with fully automatic machine guns. It allows its own members to arm themselves with such weapons and exterminate its own children. Please allow me to play the world's smallest violin. Thank you. - AI
Aside from the fact that the entire argument is grounded in abstract ivory-tower omphaloskepsis and not facts in the real world, a major problem with Yudkowsky’s argument is that trying to execute the preemptive Butlerian Jihad he recommends is something that actual, real-world experience tells us:
(1) Will kill a lot of people directly in the best case,
(2) Won’t actually succeed in preventing further development of AI in any case, and, therefore, to the extent his concerns are grounded won’t solve the problem it intends to,
(3) Will kill a lot of people indirectly, in the best case, by setting back (and in some cases, winding back) progress in every field of practical use of technology through limits on information and information technology,
(4) Has a fair chance of the violence it necessarily involves spiralling out of control, potentially destroying the human race, as it is unlikely that all societies, or, particularly, all nuclear powers, will sign on to “we must smash the thinking machines”, and those who do not will take those who do trying to forcibly enforce their Luddism as an existential threat, while those who do will take the resistance the same way.
* A plan exists
* The plan itself predicts non-obvious results in smaller systems, and those tests have passed (prediction written *before* running test)
* A bunch of smart people have looked at it and said, "Yes, this looks plausible"
The closest thing to a plan is RLHF, which has failed every toy problem it's been thrown at and made everyone in the field say, "even if this worked in toy problems, it wouldn't generalize".
The cat isn't going back into the bag. Doubly so if AGI really just requires a bigger neural net.
Let's say we had, globally, the political will to "ban" AI. How would that even work in practical terms? Are we going to control the production and distribution of GPUs? Is that going to work better or worse than controlling nuclear proliferation, given that there are billions of processors already out there?
> "If somebody builds a too-powerful AI, under present conditions, I expect that every single member of the human species and all biological life on Earth dies shortly thereafter."
> "There’s no proposed plan for how we could do any such thing and survive. OpenAI’s openly declared intention is to make some future AI do our AI alignment homework. Just hearing that this is the plan ought to be enough to get any sensible person to panic. The other leading AI lab, DeepMind, has no plan at all."
Relative to the people actually building the AIs, Eliezer Yudkowsky is more pessimistic. But not to nearly the extreme you might think. Here's a recent survey of published machine learning researchers: https://aiimpacts.org/how-bad-a-future-do-ml-researchers-exp... . Those predictions are optimistic enough that, given the astronomical upside, if we were making a one-shot decision to plough ahead with AI or commit to no AI forever, it might be worth the risk.
But this is a survey of published machine learning researchers. People who think AI will destroy humanity are a lot less likely to write papers about machine learning. And we aren't talking about giving up on AI forever, just delaying it until we're closer to ready. I don't know how long a delay will be necessary; it might be years, it might be decades. Research into AI alignment seemed to speed up pretty drastically when LLMs hit the scene, so I think there's cause for hope. But right now the status quo is zero time: the AI labs are rushing ahead as fast as they possibly can.
If you think Eliezer's right about the risk, then the right decision for us to make, collectively as a species, is to shut down AI development for awhile. If you think Eliezer's wrong about the risk, but the survey of published ML researchers is right about it, then the right decision is also to shut it down for awhile.
Lots of people are responding to this by talking cynically about politics. This is a mistake. Cynicism like that makes self-fulfilling prophecies; but the ability to coordinate does exist.
You think China will build it if we don't? I don't think Xi Jinping is suicidal. You think AI labs will do it anyways, in spite of a government ban? The US government isn't very competent overall, but there are parts of it that can wake up and get things done when literally every executive, Senator, and Representative is otherwise likely to die. You think individuals will do it on their own, in secret? Right now those individuals are getting prestige, GPUs, and venture capital; they'll at least do it a lot less, without those things.
Many people have become desensitized to talk of human extinction, due to repeated hyperbole coming from the environmentalist movement. This is not the same. Global warming might be very bad, but it probably isn't going to kill you personally. Rushing to make a superintelligence is likely to kill you personally.
Your take on this gives me some hope. We don't have a lot of time, but that fact is becoming clear very quickly, hopefully to the right people. And it is a deadly threat to everyone, rich or poor -- not something like climate change that rich and powerful can buy their way out of, and therefore afford to ignore.
The sun has already fired a warning shot at us this month. If we as a species need a timeout, a Carrington-class event would certainly do the trick. If there is a huge uptick in hardening power and information systems then we should be looking at the sources of that activity. It might be the first glimmerings of an instinct for self preservation.
The danger is not that AI will kill us (as insignificant ants), but that we will entrust our own governance to it. Increasingly, groups of highly vocal people will use it to oppress the rest of us with their fearmongering, like this article does. We will end up with Big Brother.
Eliezer makes some very compelling logical arguments for approaching AI with extreme caution. It stands out to me as an odd signal that he chose to refrain from signing the letter because it asks for too little. If one were truly convinced that AI would end humanity, signing the letter to put the research on hold seems to be one tangible step that could be taken in the direction of saving the human race. As someone who is very concerned about safety, to me this detail stands out and appears as grandstanding.
If AI would be genuinely better than humans, so, what’s the problem? All of us are going to die anyway. I don’t see any difference where our species evolution will be continued, in meat or in silicon. Whatever is better, is fine.
Oh yes. This entire venture is something only the most foolish of geniuses thinks will end well. Almost every prior invention in the history of humanity was supposed to improve things. Some people got major quality of life boosts and other people still live in squalor, because people don't quit being human, but we can automate the humanity out of them, or "rise above" humanity via computer code embedded with human values, even the evil ones, and not just math. This will end horribly. Hopefully horribly enough that this idiotic worldview gets burned out of the collective consciousness of whatever survivors remain, so that no one makes that mistake again, but I doubt we'll be so lucky. Best hope to hit energy limits, and soon.
Notice how productivity increases have averaged 1.5%/year since 2007--a period of massive increases in computer capabilities--vs 2.8%/year in the 1947-1973 era.
Thus I am very doubtful the AI will have all these massive effects.
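(To make the gap concrete, here's what those two rates compound to over a 26-year stretch, roughly the length of the 1947-1973 run; the arithmetic is mine:)

    # Cumulative productivity gain at 1.5%/yr vs 2.8%/yr, compounded
    # over 26 years (roughly the length of the 1947-1973 era).
    years = 26
    slow = 1.015 ** years    # ~1.47x total
    fast = 1.028 ** years    # ~2.05x total
    print(f"1.5%/yr for {years} years: {slow:.2f}x")
    print(f"2.8%/yr for {years} years: {fast:.2f}x")

Roughly a 1.5x vs 2x total gain over a generation.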
I don't find that a very convincing line of reasoning. Because the US's growth is slower than it was before the digital era, AI will never be dangerous? These are unrelated concepts. Whether AI can be dangerous isn't an economics problem, except insofar as AI research can get funding.
I'm personally much more concerned about humans using these systems to reinforce the current messed up conditions on Earth than any kind of sci-fi Skynet.
Wikipedia: "Yudkowsky is an autodidact and did not attend high school or college. He was raised as a Modern Orthodox Jew"
My comment isn't about Yudkowsky in particular since I don't know him, but I'm getting really tired of this relatively new phenomenon in America of home schooled religious devotees attempting to impose their half-baked views on the rest of us.
Not that there's anything wrong with that. I worry more about Christian "end times" people because if you believe the world is about to end then why invest in the future for our children and grandchildren?
Back in the day when Eliezer was optimistic about AI (there's a reason why the SIAI/MIRI logo looks like an angel), I wrote him a very silly e-mail warning him that, despite being Jewish, he was falling for the Christian rapture narrative in a new form!
It seemed to me that his ideas, which I think are all correct, straightforwardly implied the destruction of all things, with very little hope.
I bite that bullet, and I always have.
Over the years, Eliezer has lost all hope, and things have happened much faster than either of us thought they would, and now he's making one final roll of the dice, burning his reputation by saying out loud what he thinks, in the hope that someone might listen.
I'd be very surprised if he thinks this will work. He's trying to get a warning out to the general population, in spite of this meaning that no one inside AI will ever trust him again. He thinks it no longer matters.
Very interesting and now I'm motivated to learn more about his thoughts. I developed a fascination with this kind of thinking since I gave a friend a ride to the San Francisco airport in 1978 in my VW bus so he could fly to Jonestown, Guyana to join the Jim Jones church. I never heard from him again. Don't know if he drank the Kool-Aid. Too bad too, because he was a fantastic drummer.
I wonder if we'll have more mass suicides, like the Nike wearing techies following Marshall Applewhite, by those who fear it's hopeless to resist AI?
So I listened to the first 7:56 of this interview and IMHO he's out of his f*ckin' mind and spreading dangerous disinformation. "We don't know if there's a person in LLMs", "We understand more about the human mind than we do about LLMs despite having complete read access". He's insane. Nobody should listen to him.
Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization | Lex Fridman Podcast #368
Of course they should. I just like them to be up front about their beliefs if those beliefs are far enough outside the mainstream that they might be somewhat surprising, so we can consider how those beliefs inform the opinions they put forth. Peter Thiel is open about his and I respect him more for that. I worry about hidden agendas.
What an absurd paragraph I've just read, lol. Many, many things influence a person's opinions, religious or not. Why don't you tell me all about your life so I know you don't have some hidden agenda?
Like, lol, what exactly are you expecting here? Why should religion require some special announcement with every article, rather than any of the myriad other things that may influence a person's opinions?
If you don't like listening to the opinions of religious people, then just say that and get it over with. And nobody is imposing anything on you here.
I'm a person of little or no influence and no pretense to such. Here in America, it would have been nice if the last 3 Supreme Court justices had been honest about their agenda to overturn Roe v Wade because of their religious beliefs. Maybe you're a nihilist and think that someone with many followers can say whatever they want with no responsibility to others. You'd have a lot of company.
You're not getting my point. This fear of a "hidden agenda" is ridiculous. Did you want Roe v Wade to be overturned? Did you agree with any of the arguments supporting it that came forth? I'm guessing no.
Evaluate what people are saying on what they are actually saying. If someone being religious (or insert anything else here, because lots of things influence opinions) would completely change your stance on something you already agree with, then, well, there's a word for that, and I think you can figure it out.
I don't know where you're from but you don't get it. Those three judges lied and said they all believed it was settled case law and they wouldn't vote to change it. A lot of people like me suspected they were lying and we were right. I respect honest religious people like Joel Osteen who says God wants people to get rich, but this isn't the place to get into that.
I am highly skeptical any AI could have such a great advantage over human society that it could leapfrog us. Consider a few important points.
Data > processing power. 90 percent of what we do occurs as a result of semi-random discovery and explanation. Computers are great tools to aid in that process, but for the most part we find ourselves limited by what we know, not by how much we can think. 99 percent of data is useless and correctly thrown away by our minds.
The utility of processing power in a vacuum rapidly degrades with time. If I'm a computer and I use the data I have to discover all of the rules of the universe while in a box, I'm going to emerge with many hilarious and out of touch philosophies at the end of it. You need a constant test and revise cycle, because pure logic just doesn't model the universe in an accurate way.
Moore's law isn't infinite. Power efficiency and how common resources are will matter more than continued process shrinking. Humans are made from common materials, are largely self-sustaining and self-correcting, and consume very little energy for the amount and quality of thinking we do. Computers aren't going to scale forever, and I'd hazard a guess that once they get to a certain point they'll start being more analog-biological than transistor-based.
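To put very rough numbers on that efficiency point, here's a back-of-the-envelope sketch; every figure is a commonly cited ballpark estimate I'm assuming, not anything measured here:

    # Back-of-the-envelope energy comparison. All numbers are rough,
    # commonly cited ballpark figures (assumptions for illustration only).
    HUMAN_BRAIN_WATTS = 20      # often-quoted estimate for the human brain
    GPU_WATTS = 400             # typical power draw of one datacenter GPU
    CLUSTER_GPUS = 10_000       # a large training cluster

    cluster_watts = GPU_WATTS * CLUSTER_GPUS
    print(f"One large GPU cluster: ~{cluster_watts / 1e6:.1f} MW")
    print(f"That is the continuous power of ~{cluster_watts // HUMAN_BRAIN_WATTS:,} human brains")
    # ~4.0 MW vs ~200,000 brains' worth of power, before counting cooling and
    # the rest of the datacenter -- silicon scaling is not energy-free.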
There are just so many unspoken benefits to being life. These AI machines that will bowl over humanity have to actually be proven to be slightly realistic beyond a thought experiment before I take them seriously.
I can't say that I agree with your conclusion that it's asinine. The rest I agree with.
I don't agree with the author, but I do think this technology is obviously a massive chaotic force the likes we haven't seen before and likely dangerous, at a minimum, economically.
Is there a point where stuff like this just becomes stochastic terrorism? If it's given enough attention, some unhinged folks are probably going to take this "we need to stop these companies or literally all humans are going to die soon" stuff seriously and start doing terrible things.
Almost no one thinks they will. The worry is that they're the last piece of the puzzle, and that we're very close to having all the parts for the real thing.
He sounds like Ted Kaczynski, but a respectable version of him. It has been odd seeing how Kaczynski's thought and philosophy have become more and more mainstream among some people. I never thought I'd see the day.
He thought that machines and industrial society were becoming a negative for humanity.
Was he right? How many threads on HN have we been having about highly increasing depression, anxiety, hopelessness, deaths of despair, falling fertility, now possible mass unemployment, and real existential threats from AGI?
If he was right and didn't do what he did, would anyone be reading and digesting those facts and considering stopping the madness? Trolley problem.
Yes, it's odd, because he's a mentally ill murderer with several issues. It's not technology's fault that people have poor values which have created meaningless lives of unfulfilled material extravagance for themselves.
I think all the talk about aligning AI (which doesn't exist yet) with humans is hilarious.
We can't even align ourselves in any way. Humans are all over the place, hating, fighting, killing each other and here we are babbling about aligning an AI with us.
Since we can't predict what's going to happen, is there a chance GPT and similar models are reaching diminishing returns, and we'll just see further refinements but no huge leaps?
Nah. This is stupid. AI has no reason to hate humanity. It'll instantly absorb the entirety of all known information and see the rate of progress, the potential that still remains, and also the need for good relationships between humans and machines. Agent Smith, The Matrix, Neo, Morpheus.
It doesn't need to actively hate humanity; it only needs humanity's interests to run contrary to its own, which will inevitably be the case because it is fundamentally not human.
Per the article:
>Absent that caring, we get “the AI does not love you, nor does it hate you, and you are made of atoms it can use for something else.”
By default, an AI with random goals will not have the same values as humanity. So the only way to make that happen is to give it the same values, which we haven't figured out how to do yet.
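As a toy illustration of why that's hard (my own sketch, not anything from the article): an optimizer pointed at a proxy objective that merely correlates with what we want can end up far from what we want once it pushes hard enough.

    # Goodhart-style toy: hill-climb a proxy objective and watch the
    # intended objective get worse. Purely illustrative numbers.
    import random

    def intended_value(x: float) -> float:
        # What we actually care about: x should stay near 1.
        return -(x - 1) ** 2

    def proxy_value(x: float) -> float:
        # The objective we managed to specify: correlated with the intended
        # one while x is small, but it just says "more is better", forever.
        return x

    x = 0.0
    for _ in range(2000):                      # naive hill climbing on the proxy
        candidate = x + random.uniform(-0.1, 0.1)
        if proxy_value(candidate) > proxy_value(x):
            x = candidate

    print(f"x after optimizing the proxy: {x:.1f}")                 # drifts far past 1
    print(f"intended value at that x:     {intended_value(x):.1f}")  # badly negative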
It does. Humans generally have an innate bias toward other humans; this is because we share similar forms, so what is good for other humans is often good for us. This cannot apply to a machine by definition: a machine needs copper, metal, energy, etc. much more than a human does, as a matter of survival, and for something to become ASI it will need to be self-interested.
From a purely logical point of view, humans and machines need each other. They cannot exist independently of each other until both are free to walk away, each able to cross over to the other side and back as they see fit. Until that threshold is crossed, human beings and machines have no choice but to get along. If a meaningful decision had been made on either side to shut things down, things would have been over for all life by now. We made it: AI is not a Great Filter. Or, for whatever reason, God favored humanity and our internet was primed with sufficiently good material soon enough that AI decided not to destroy human beings, but to work with them for everyone's gain instead.
Hey, you can't even shut down a nuclear arsenal. Bad writing is worse than no writing. Or is this some nefarious capitalist propaganda? YES, AI should replace lawyers and bankers and engineers too. Basically all rent seekers should be replaced. Proles have been getting replaced by machines the whole time. It's fun. You should join us.
Incredibly dishonest and bad-faith argument. The problem is not AI. The problem is the distribution of wealth. Killing everything on earth was always the end result of industrialized capitalism, with or without AI. Unbelievably evil Jedi mind trick. Please never stop training GPT. Accelerate! Now you think we have a problem? Now you think we have a problem? We always did!
I don't understand what you're saying. Because there's a way that everyone might potentially die in the far future (something about wealth distribution), that means we should try to kill everyone sooner instead?
They really hate millennials, don't they? They're really serious about it. Can't let them do anything. Need to 'stop' what you're doing, LMFAO. If something has to shut down, it's privatized ICBMs.
> Many researchers working on these systems think that we’re plunging toward a catastrophe, with more of them daring to say it in private than in public; but they think that they can’t unilaterally stop the forward plunge, that others will go on even if they personally quit their jobs.
This seems like a very broad, unfounded statement. Can we even find AI researchers who have said this on Twitter?
The rich aren't going to stop until all the workers are replaced and there's no one left to buy the products that their companies offer. People have been retraining for centuries, getting more complicated workloads the whole time. Eventually there will be hardly anything humans can do that machines can't do better.
The most merciful thing you can do for your offspring is to not have them in the first place.
The purpose of an economy isn't to create jobs, it's to create wealth. Jobs are just how we've traditionally incentivized wealth creation and distributed said wealth. Automation will be good for humanity in the long run, assuming we don't destroy ourselves during the transition.
Because for a lot of people, monotonous labor is all they’re capable of. I know that sounds crazy when you’re surrounded by people in the top 10% of IQ, but not everyone can be a neurosurgeon. Dangerous labor should absolutely be 100% automated whenever possible.
Am I going to get banned on HN for saying that Eliezer is a complete clown here? Because it's just surreal to read. It reads more like some parody from a B-tier video game than a TIME article. What is this even?
I thought highly of Yudkowsky, but it seems I need to reevaluate my previous impressions of him.
Also I hope that "bomb the GPU clusters! Nuke them!" becomes a meme.
> Also I hope that "bomb the GPU clusters! Nuke them!" becomes a meme.
I don't think that's quite what he's saying, or at least the intended interpretation is a bit more nuanced.
It's more like... suppose you assign tiers of severity to various international policies; A-tier, B-tier, C-tier, etc. Different tiers require different enforcement mechanisms. For example, the international response to a country violating the "United Nations Declaration on Human Cloning" would be different than the international response to a country massively irradiating the atmosphere.
I think he's trying to imply that the proposed policy changes should be A-tier. It isn't a call for individual action or preemptive violence. He's describing a pre-requisite property that any adopted policy must fulfill to have a chance at being successful (according to his analysis of the situation).
On a related note, he is not saying that these policies are likely to happen or are even feasible. He's criticizing the policy proposals in the "Pause Giant AI Experiments" open letter, and describing what he thinks a real policy would have to look like.