More

occamrazor · 2025-04-04T15:22:18 1743780138

Apple group of companies, rather than a specific legal entity. A common distinction in financial journalism.

jp57 · 2025-04-04T19:32:51 1743795171

Pretty sure the labor disputes in question are with a specific legal entity, the US corporation, and not any of its subsidiaries.

occamrazor · 2025-04-01T09:42:16 1743500536

FYI: Lex, not Rex.

But a Rex Talionis is an interesting concept too.

lores · 2025-04-01T09:43:17 1743500597

Oops, brain fart. Fixing it, thanks!

occamrazor · on June 28, 2024

On prompts only, with answers presumably from the teacher model (Gemini).

It was not trained or RLHFd on Arena replies or user preferences.

lhl · on July 1, 2024

Yes, answers were distilled from a much stronger model. On the one hand, you can argue that this is exactly what the LMSYS, WildBench etc datasets are for (to improve performance/alignment on real-world use cases), but on the other hand, it's clear that training on the questions (most of which are repeatedly used by the (largely non-representative of general population) users of the ChatArena for comparing/testing models) makes ChatArena ELO less useful as a model comparison tool and artificially elevates Gemma 2's ChatArena score relative to its OOD performance.

At the end of the day, by optimizing for leaderboard scoring, it makes the leaderboard ranking less useful as a benchmark (Goodhart's law strikes again). The Gemma team obviously isn't the only one doing it, but it's important to be clear-eyed about the consequences.

occamrazor · on June 18, 2024

How so? Maybe in the past, but nothing announced today is open.

occamrazor · on June 12, 2024

For Tao, spending 10x time on aproof still means spending 5x less time than an average postdoc. He is incredibly fast and productive.

occamrazor · on April 12, 2024

On all flights I have taken the flight attendants count the passengers on board before taxiing and check the lavatories. This method wouldn't have worked even with available free seats.

nyjah · on April 12, 2024

Just had to go back to the gate the other day because the manifest didn’t match the count. I can see them overlooking this with flippant flight attendants, especially on smaller flights, but on my flight we were next in line for take off when they sent us back to the gate. So we almost made it to the sky. Also, I don’t know what happened, they checked one seat and then we were off. Missed connecting flight because of it.

t0mas88 · on April 12, 2024

This is also for weight and balance / performance calculations. If the calculations are done for 150 passengers and you count only 149 (or 151), you can't legally take off without new paperwork.

occamrazor · on April 4, 2024

Zopf means “braid” and it also denotes a medium-size bread type, made with some milk and glazed with yolk, shaped like a braid, traditionally eaten on Sunday.

occamrazor · on March 27, 2024

I like:

- “open weights” for no training data and no restrictions on use,

- “weights available” for no training data and restrictions on use, like in this case.

occamrazor · on March 27, 2024

It‘s more likely to be incompetence than malice: even their 73.7% is closer to 72% than to 74%.

occamrazor · on March 21, 2024

Scholar is heavily used internally, it’s unlikely to be discontinued even if it has never brought any money to Google.

biofox · on March 21, 2024

Google Reader and RSS feeds were also heavily used internally :(

tibbydudeza · on March 21, 2024

What replaced it - Google Wave ???.

kadoban · on March 21, 2024

This is about a PDF reader though, not Google Scholar itself.