Hacker News | joaogui1's comments

I think the whole situation, where they got serious investment from SBF and then he was indicted, pushed them into commercialising their tech so they could rely on more standard sources of funding.


Are you implying that our brain learns through Machine Learning?


Tautologically. Machine learning is a blanket term for a wide range of approaches that allow machines to mimic the brain's capability to learn, at least in function if not form.

The brain may employ wildly different machine learning algorithms from those currently in vogue, but whatever algorithms the brain is using must be machine learning algorithms. Unless, of course, you define "machine" to exclude the brain, in which case they're just learning algorithms. Regardless, an architecture exists that matches the brain's capabilities on not only finite but surprisingly limited hardware.


In the very vaguest sense of the word, absolutely. Not that there is a literal transformer algorithm, but there is some evolved learning algorithm in the neurons of the brain that is at least a distant cousin of what we're doing today.


OpenAI keeps innovating on being more closed than the other companies


I mean, you didn't mention autoregressive models anywhere in your comment, whereas the post is about the connection between diffusion and autoregressive modelling. Also, it's a blog post; if the author had figured out a speed-up or an improved method, it would probably have been a paper.


What did Canada and UK do?


Criminalize acts that would clearly be protected speech in the US: the trucker convoy in Canada (even just voicing support for it), and mean tweets prosecuted as hate speech in the UK (see the recent riots, and also JK Rowling).


gun control


Depends on a ton of stuff, really: the size of the model, how long you want to train it, and what exactly you mean by "like Hacker News or Wikipedia". Both Wikipedia and Hacker News are pretty small by current LLM training-set standards, so if you train only on, for example, a combination of those two, you would likely end up with a model that lacks most of the capabilities we associate with large language models nowadays.
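To make "pretty small" concrete, here is a back-of-envelope sketch in Python. The figures are rough public estimates, not exact counts: English Wikipedia is on the order of a few billion tokens, while recent frontier models are reported to train on tens of trillions.

```python
# Back-of-envelope comparison: Wikipedia vs. a modern LLM training set.
# Both numbers are rough public estimates, assumed for illustration only.

wikipedia_tokens = 5e9    # English Wikipedia: ~5 billion tokens (rough)
frontier_tokens = 15e12   # Llama 3 reportedly trained on ~15 trillion tokens

fraction = wikipedia_tokens / frontier_tokens
print(f"Wikipedia is roughly {fraction:.2%} of a frontier training set")
```

So even before considering model size or training time, the corpus alone is orders of magnitude short of what today's large models see.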


XLA tends to be better optimized for TPUs and PyTorch is better with GPUs, but I believe you can choose a backend when using Nx.


The ads are definitely coming, given their pitch deck for the data partnerships: https://www.adweek.com/media/openai-preferred-publisher-prog...


Gemini 1.5 Ultra was never announced


I think (iii) is about models trained using Gemma output, while the "For clarity" part says that the Output itself is not a Model Derivative

