Hacker Newsnew | past | comments | ask | show | jobs | submit | occamrazor's commentslogin

Apple group of companies, rather than a specific legal entity. A common distinction in financial journalism.


Pretty sure the labor disputes in question are with a specific legal entity, the US corporation, and not any of its subsidiaries.


FYI: Lex, not Rex.

But a Rex Talionis is an interesting concept too.


Oops, brain fart. Fixing it, thanks!


On prompts only, with answers presumably from the teacher model (Gemini).

It was not trained or RLHFd on Arena replies or user preferences.


Yes, answers were distilled from a much stronger model. On the one hand, you can argue that this is exactly what the LMSYS, WildBench etc datasets are for (to improve performance/alignment on real-world use cases), but on the other hand, it's clear that training on the questions (most of which are repeatedly used by the (largely non-representative of general population) users of the ChatArena for comparing/testing models) makes ChatArena ELO less useful as a model comparison tool and artificially elevates Gemma 2's ChatArena score relative to its OOD performance.

At the end of the day, by optimizing for leaderboard scoring, it makes the leaderboard ranking less useful as a benchmark (Goodhart's law strikes again). The Gemma team obviously isn't the only one doing it, but it's important to be clear-eyed about the consequences.


How so? Maybe in the past, but nothing announced today is open.


For Tao, spending 10x time on aproof still means spending 5x less time than an average postdoc. He is incredibly fast and productive.


On all flights I have taken the flight attendants count the passengers on board before taxiing and check the lavatories. This method wouldn't have worked even with available free seats.


Just had to go back to the gate the other day because the manifest didn’t match the count. I can see them overlooking this with flippant flight attendants, especially on smaller flights, but on my flight we were next in line for take off when they sent us back to the gate. So we almost made it to the sky. Also, I don’t know what happened, they checked one seat and then we were off. Missed connecting flight because of it.


This is also for weight and balance / performance calculations. If the calculations are done for 150 passengers and you count only 149 (or 151), you can't legally take off without new paperwork.


Zopf means “braid” and it also denotes a medium-size bread type, made with some milk and glazed with yolk, shaped like a braid, traditionally eaten on Sunday.


I like:

- “open weights” for no training data and no restrictions on use,

- “weights available” for no training data and restrictions on use, like in this case.


It‘s more likely to be incompetence than malice: even their 73.7% is closer to 72% than to 74%.


Scholar is heavily used internally, it’s unlikely to be discontinued even if it has never brought any money to Google.


Google Reader and RSS feeds were also heavily used internally :(


What replaced it - Google Wave ???.


This is about a PDF reader though, not Google Scholar itself.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: