Hacker News new | past | comments | ask | show | jobs | submit | helloplanets's comments login

I wonder how much fine tuning against something like Stockfish top moves would help a model in solving novel middle game positions. Something like this format: https://database.lichess.org/#evals

I'd be pretty surprised if it did help in novel positions. Which would make this an interesting LLM benchmark honestly: Beating Stockfish from random (but equal) middle game positions. Or to mix it up, from random Chess960 positions.

Of course, the basis of the logic the LLM would play with would come from the engine used for the original evals. So beating Stockfish from a dataset based on Stockfish evals would seem completely insufficient.


I am quite confident that an LLM will never beat a top chess engine like Stockfish. An LLM is a generalist -- it contains a lot of world knowledge, and nearly all of it is completely irrelevant to chess. Stockfish is a specialist tuned specifically to chess, and hence able to spend its FLOPs much more efficiently towards finding the best move.

The most promising approach would be tune a reasoning LLM on chess via reinforcement learning, but fundamentally, the way an LLM reasons (i.e. outputting a stream of language tokens) is so much more inefficient than the way a chess engine reasons (direct search of the game tree).


This is partly completely misleading and partly simplified, when it comes to SOTA LLMs.

Subject–Verb–Object triples, POS tagging and dependency structures are not used by LLMs. One of the fundamental differences between modern LLMs and traditional NLP is that heuristics like those are not defined.

And assuming that those specific heuristics are the ones which LLMs would converge on after training is incorrect.


Following the parent comment's idea, it'd end up in a table being the best choice 100% of the time.

Because the underlying assumption is that accessibility and the ability to grasp the data that is being conveyed isn't completely dependent on the audience. If I happen to prefer a static chart, an animated chart might still convey the intended thing in a stickier way, to a wider audience.


Fully agree. He must have been itching to use the term "chart junk" :-)


Had to make this into a country ditty on Suno.

https://suno.com/song/556b62fc-bcd8-4757-951a-018c5cf0a60b


That was.. surprisingly good TBH.


Yeah it's why I'll never write a track to be published, ever again. I never made any money as a musician, and now anyone can put together stuff that people want to hear.

I imagine it's similar to amateur artists with SD, and social media managers with LLMs.

This is the song I did about the South Korean president coup/whatever (lyrics mostly in Lojban) https://m.soundcloud.com/djoutcold/i-aint-even-writing-music...

I wrote lots of music, tho.

Hold music for my pbx. https://m.soundcloud.com/djoutcold/bew-hold-music


Never had a comment translated into song. That was awesome! Thank you! :)


In the narrowest version of the definition:

> The Registered Skilled Reporter (RSR) is NCRA's new designation that will recognize those stenographic professionals who are looking to validate their beginning level of competency.

> You have to pass three five-minute Skills Tests (SKT), which evaluate your skills level in three areas: Literary at 160 wpm, Jury Charge at 180 wpm, Testimony/Q&A at 200 wpm.

https://www.ncra.org/certification/NCRA-Certifications/regis...


Stenography is a little different to regular typing isn't it?


This is the National Court Reporter's Association; participating membership is for, as you say, stenographic court reporters and [stenographic] captioners, CART providers, and the like. Their membership FAQ mentions transcriptionists as eligible for associate membership -- not independently, though, only in a role supporting stenographic professionals (so really more like scopists, I believe).

Note also that the speeds listed are described as beginning level. Contrast this with the speed contest(1) featuring literary (i.e., a speech or some kind of governmental literature or something to that effect) read at 200-220 WPM, jury charge (instructions) read at 200-260 WPM, and Q&A ([witness] testimony) read at 280 WPM.

These are the kind of speeds that have been typical of stenographers pretty much as long as it's been a thing, even when it was done with a pen rather than a steno machine -- well, back into the 19th century at least; I personally can't speak to the performance of the earlier shorthand systems off the top of my head.

Those "beginning" speeds are about at the top of what most of the best longhand typists can do at any serious length (see, for example, hi-games.net typing leaderboard for a 5-minute(2) vs 10-second(3) test).

As to the WPM cutoff to be considered a "typist"? I mean, it's not like it's a professional credential or anything. Anyone can be a typist if they're typing, I suppose, or if they choose to take it seriously enough. Even the de facto standards of job requirements are nothing much to go by: The typing speeds listed as required in the job postings for nearly all customer service, tech support, general office, and other such jobs, quite frankly, range from underwhelming to laughable. Even transcriptionists (longhand, as in not steno, and offline, as in not real-time) don't need to type more than about 80 WPM to find work in the field, if even that much. In my view, 80 WPM is still an awfully tedious sort of speed, but I understand it's more commonly considered a respectable one, and more than adequate for most tasks, so I guess I'd be fine with that number if I had to pick one.

I'll also throw in with jb-wells above and say that anyone who's touch-typing (and preferably making progress into the triple digits, or so far as their own ability will allow) might as well be considered a typist -- or anyone who managed to convince someone to pay them to type things at any speed.

1. https://www.ncra.org/home/the-profession/Awards-and-contests...

2. https://hi-games.net/typing-test,300/

3 https://hi-games.net/typing-test,10/


> In 2023, Apple shipped 234.6 million iPhones, capturing 20.1% market share and growing 3.7% year over year, according to IDC data. [0]

So, probably not 525.6 million iOS devices a year, but safe to assume it's going to be 300+ million for 2025.

35k devices an hour, give or take.

[0]: https://www.forbes.com/sites/johnkoetsier/2024/01/16/apple-1...


As medwezys pointed out, you forgot iPads. That’s another 40-70M units per year.

My numbers are a rough estimate from memory, but they’re not wildly off.

300M or 500M, the point remains: it’s an absolutely staggering scale and cannot be moved elsewhere in any short period of time. Setting up comparable production would take many years, just as it did the first time.

I imagine Apple/Foxconn have already begun this work. The unexpected shutdown or impediment of US/CN trade is a risk that must be accounted for, given the situation with Taiwan.


Apple has more devices than iPhones, so the OPs numbers are not unbelievable


> Case in point In Germany the Polizei will swat and arrest you if you post a meme on social media that angers someone's dignity. That's not a joke that actually happens.

Source?


There is no source. The commenter is merely subject to online propaganda.


> The developer of Balatro made an award winning deck builder game by not being aware of existing deck builders.

He was aware of deck builders and was directly inspired by Luck be a Landlord, but he was not aware of just how massive the genre is.

Direct quote from the developer:

> The one largest influence on Balatro was Luck Be a Landlord. I watched Northernlion play for a few videos and loved the concept of a non-fanatsy themed score attach roguelike a ton, so I modified the card game I was working on at the time into a roguelike.

> I cut myself off from the genre at that point intentionally, I wanted to make my own mistakes and explore the design space naively just because that process is so fun. I hear the comparison to Slay the Spire a lot but the truth is that I hadn't played that game or seen footage of it when I designed Balatro, not until much later.

https://www.reddit.com/r/Games/comments/1bdtmlg/comment/kup7...


Exactly, more info in this interview on Bloomberg on 7-feb-2025: https://www.bloomberg.com/news/newsletters/2025-02-07/maker-...



Pretty sure yours is the more common take, rather than the opposite. Or at least a view that is very well represented by just about any news outlet. There's also a popular comedy movie about it.

So, you're not exactly going against the grain here.


Here's the original patent for 'revolving stairs' from 1859: https://patents.google.com/patent/US25076A/en


Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: