How excellent for a quantized 27GB model (the Q6_K_L GGUF quantization type uses 8 bits per weight in the embedding and output layers, since they're sensitive to quantization)
> If I know the exact title of the video, it finds it. Everytime.
This is unfortunately not true. I have a small channel, and there have been times when searching for the exact title of one of my videos did not return it in the results at all (with or without quotes). I can't reproduce it now because the search algorithm has started liking me.
I agree that the booking experience for the DB-SNCF cooperation trains sucks from the DB end, but the underlying blame arguably lies with SNCF, which insists on compulsory reservations, something that goes against the philosophy of train travel in Germany. On the other hand, in my experience DB offers cheaper tickets for these cooperation trains most of the time.
But these trains are a special case; in other cases DB is clearly far more pleasant.
This is not primarily a problem with the cooperation trains; I have the same situation with trains within Germany. DB-Navigator only tells you whether a train is bookable at the very end, right before payment. Before that, it might show "there is high demand", but this is rather useless, especially when you have a school kid and want to book a train at the beginning or end of the school holidays, when every train is in high demand. Your only option with DB-Navigator is to play whack-a-mole: run through all the booking steps to the very last one, over and over, until you find a train you can actually book.
In the SNCF app I have this information right away; that is what makes the difference for me.
I think it's partially excusable. Most LP solvers target large-scale instances, but ones that still fit in RAM: think single-digit millions of variables and constraints, maybe a billion nonzeros at most. PDLP is not designed for this type of instance and gets trounced by the best solvers at this game [1]: more than 15x slower (shifted geometric mean) while being 100x less accurate (1e-4 tolerances where other solvers work with 1e-6).
PDLP is targeted at instances for which factorizations won't fit in memory. I think their idea for now is to give acceptable solutions for gigantic instances when other solvers crash.
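To illustrate the factorization-free point: PDLP is built on (restarted) PDHG, which only touches the constraint matrix through matrix-vector products, so nothing ever needs to be factorized or even stored beyond the matrix itself. A toy sketch of plain PDHG on a tiny equality-form LP, leaving out everything that makes the real solver practical (restarts, scaling, adaptive step sizes, proper stopping criteria):

```python
import numpy as np

# Toy LP: min c^T x  s.t.  Ax = b, x >= 0.
# The optimum is x = (1, 0) with objective value 1.
c = np.array([1.0, 2.0])
A = np.array([[1.0, 1.0]])
b = np.array([1.0])

# PDHG step sizes must satisfy tau * sigma * ||A||^2 < 1.
op_norm = np.linalg.norm(A, 2)
tau = sigma = 0.9 / op_norm

x = np.zeros(2)
y = np.zeros(1)
for _ in range(20000):
    # Primal step: gradient step on the Lagrangian, then project onto x >= 0.
    x_new = np.maximum(0.0, x - tau * (c - A.T @ y))
    # Dual step with the usual extrapolation 2*x_new - x.
    y = y + sigma * (b - A @ (2 * x_new - x))
    x = x_new

print(x)  # close to [1, 0]
```

Note that the only operations on A are `A @ v` and `A.T @ y`, which is exactly why this family of methods scales to instances where a Cholesky or LU factorization would blow up memory.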
Indeed, those are the "big four" solver businesses in the West, and probably also the most reliably good solvers. But by the time Gurobi withdrew from the benchmarks (a few weeks ago), COpt was handily beating them in the LP benchmarks and closing in on them in the MIP benchmarks. Solver devs like to accuse each other of gaming benchmarks, but I'm not convinced anyone is outright cheating right now. Plus, all solver companies have poached quite a bit from each other since CPLEX lost all its devs, which probably levels the playing field. So overall, I think the Mittelmann benchmarks still provide a good rough estimate of where the SOTA is.
Their numerical results on GPUs, compared to Gurobi, are quite impressive [1]. In my opinion (unless I'm missing something), the key benefits of their algorithm are the ability to leverage GPUs and the fact that there's no need to store a factorization in memory. However, if the goal is to solve a small problem on a CPU, one that fits comfortably in memory, there may be no need for this approach.
I agree that their results are impressive. Just to be clear, however:
1. They compare their solver with a 1e-4 error tolerance to Gurobi with 1e-6. This may seem like a detail, but in the context of how typical LPs are formulated, this is a big difference. They have to do things this way because their solver simply isn't able to reach better accuracy (meanwhile, you can ask Gurobi for 1e-9, and it will happily comply in most cases).
2. They disable presolve, which is 100% reasonable in a scientific paper (makes things more reproducible, gives a better idea of what the solver actually does). If you look at their results to evaluate which solver you should use, though, the results will be misleading, because presolve is a huge part of what makes SOTA solvers fast.
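To make point 1 concrete, here is a back-of-the-envelope illustration (the scale is made up, and the exact residual definition varies by solver): with a *relative* feasibility tolerance eps, a constraint system whose right-hand side has magnitude ||b|| can be violated by roughly eps * (1 + ||b||) in absolute terms.

```python
# Hypothetical scale: a production LP with demands in the 100k range.
b_norm = 1e5
for eps in (1e-4, 1e-6):
    # Absolute constraint violation a relative tolerance eps can hide.
    print(f"eps={eps:.0e}: absolute slack up to ~{eps * (1 + b_norm):.1f}")
```

So 1e-4 can leave a demand constraint off by ~10 units where 1e-6 leaves it off by ~0.1, a 100x difference in how much "infeasibility" the reported solution may contain.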
Hmm... I am reading [1] right now. Looking at Table 7 and Table 11 in [1], they report comparison results with Gurobi presolve enabled and 1e-8 error. Am I missing anything?
Their performance isn't quite as good as Gurobi's barrier method, but it's still within a reasonable factor, which is impressive.
Regarding presolve: When they test their solver "with presolve", they use Gurobi's presolve as a preprocessing step, then run their solver on the output. To be clear, this is perfectly fair, but from the perspective of "can I switch over from the solver I'm currently using", this is a big caveat.
They indeed report being 5x slower than Gurobi at 1e-8 precision on Mittelmann instances, which is great. Then again, Mittelmann himself reports them as 15x off COpt, even when allowed to do 1e-4. This is perfectly explainable (COpt is great at benchmarks; there is the presolve issue above; the Mittelmann instance set is a moving target), but I would regard the latter number as more useful from a practitioner's perspective.
This is not to diminish PDLP's usefulness. If you have a huge instance, it may be your only option!
The three linked papers seem to be old, but the broader impact section mentioned cupdlp, which is more recent and has interesting numerical comparisons with commercial solvers: https://arxiv.org/abs/2311.12180, https://arxiv.org/pdf/2312.14832. It is CPU vs GPU, though, not sure how fair it is.
I'm generally strongly in favor of technical systems for making speeding impossible -- for example, I don't think it should be legal to sell cars that can go 100 km/h on city streets. But this implementation seems designed to elicit backlash. Beeping is bad, and setting the tolerance at zero is not friendly given typical driver behavior. I'd find it much more sensible to have "smooth" systems, such as a constant "buzzing" sound whose volume increases with the square of the speed excess, or translating pedal pressure into speed in such a way that convexly increasing amounts of pressure are required to go further above the limit.
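The quadratic-volume idea above can be sketched in a few lines; the scale constant (full volume at 20 km/h over) is entirely made up for illustration:

```python
def warning_volume(speed_kmh: float, limit_kmh: float,
                   full_volume_excess: float = 20.0) -> float:
    """Warning volume in [0, 1]: silent at or below the limit, growing
    with the square of the excess, saturating at `full_volume_excess`
    km/h over. All constants are illustrative, not from any real system."""
    excess = max(0.0, speed_kmh - limit_kmh)
    return min(1.0, (excess / full_volume_excess) ** 2)

print(warning_volume(50, 50))  # 0.0: silent at the limit
print(warning_volume(55, 50))  # 0.0625: barely audible at 5 over
print(warning_volume(60, 50))  # 0.25
print(warning_volume(75, 50))  # 1.0 (capped)
```

The point of the quadratic shape is exactly what the comment argues: small excesses produce feedback too gentle to be annoying, while large ones quickly become impossible to ignore.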
As far as I understand, because employment is at-will and firing is trivial, most employees in the U.S. do not even have an employment contract! This was mind-blowing to me when I took my first U.S. job, where there was no contract (!!), only a 500-word "offer letter". I guess the reasoning is that if there were ever any conflict between employee and employer, it would be settled by ending the employment relationship. So there is no point in the employer promising anything (e.g. a number of vacation days), since the employer can costlessly renege on any such promise.