
There is C0, a stripped-down version of C popular in academia [1]. Great for teaching because it's conceptually simple and easy to write a compiler for. But with a couple of additions (like sized primitive types) it might match what you are imagining

1: https://c0.cs.cmu.edu/docs/c0-reference.pdf
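For a feel of it, here's a rough sketch of what C0 code looks like (going off the CMU reference above; the //@ contract annotations are checked by the C0 tools, and a single int type is all you get, which is where additions like sized primitives would come in):

    int fast_pow(int base, int exp)
    //@requires exp >= 0;
    {
        int result = 1;
        while (exp > 0)
        //@loop_invariant exp >= 0;
        {
            if (exp % 2 == 1) { result = result * base; }
            base = base * base;
            exp = exp / 2;
        }
        return result;
    }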


Netflix obviously wanted a Korean show, otherwise they wouldn't have funded Koreans to make the show in Korea, they would have funded Americans to make the show in Hollywood.

I applaud Netflix for actually embracing diversity of content and having great shows produced outside Hollywood, and the results speak for themselves (not just Squid Game).


Yeah, we really shouldn't ask a South Korean show to depict American problems

The opt-out is to use the unicode windows APIs (the functions ending in "W" instead of "A"). This also magically fixes all issues with paths longer than 260 characters (if you add a "\\?\" prefix or set your manifest correctly), and has been available and recommended since Windows XP.

I'm not sure why the non-unicode APIs are still so commonly used. I can't imagine it's out of a desire to support Windows 98 or Windows 2000.
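A minimal sketch of that combination (wide "W" API plus the \\?\ prefix; the path here is just an example):

    #include <windows.h>

    int main(void) {
        /* The \\?\ prefix lifts the 260-character MAX_PATH limit for W APIs */
        HANDLE h = CreateFileW(
            L"\\\\?\\C:\\some\\very\\long\\path\\file.txt",
            GENERIC_READ, FILE_SHARE_READ, NULL,
            OPEN_EXISTING, FILE_ATTRIBUTE_NORMAL, NULL);
        if (h == INVALID_HANDLE_VALUE) return 1;
        CloseHandle(h);
        return 0;
    }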


_Or_ set your application to use UTF-8 for the "A" APIs. Apparently this is supported as of a Windows 10 update from 2019. [1]

[1] https://learn.microsoft.com/en-us/windows/apps/design/global...
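If you go that route (the opt-in itself happens through the application manifest described at the link, not in code), a quick runtime sanity check looks roughly like this:

    #include <windows.h>
    #include <stdio.h>

    int main(void) {
        /* CP_UTF8 is 65001; if the manifest opt-in took effect, the "A" APIs
           will now interpret char* arguments as UTF-8 */
        if (GetACP() == CP_UTF8)
            printf("active code page is UTF-8\n");
        else
            printf("active code page is %u\n", GetACP());
        return 0;
    }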


It should have been supported approximately 20 years earlier than that. I was coding against Win32 looong before 2019 and wondering for years why they wouldn't let you.

An explanation I heard ~10 years prior is that doing so exposed bugs in CRT and nobody wanted to fix them.


> An explanation I heard ~10 years prior is that doing so exposed bugs in CRT and nobody wanted to fix them.

What I've heard is that the issue is not with the CRT, but with applications using fixed-size byte buffers. IIRC, converting from UTF-16 to any of the traditional Windows code pages requires at most two bytes for each UTF-16 code unit, while the UTF-8 "code page" can need three bytes. That would lead to buffer overflows in these legacy applications if the "ANSI" code page was changed to UTF-8.
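That two-vs-three-byte difference is easy to demonstrate with a small sketch (the euro sign is one UTF-16 code unit but three UTF-8 bytes):

    #include <windows.h>
    #include <stdio.h>

    int main(void) {
        const wchar_t *w = L"\u20AC";  /* one UTF-16 code unit */
        /* ask how many bytes the UTF-8 conversion needs (including the NUL) */
        int n = WideCharToMultiByte(CP_UTF8, 0, w, -1, NULL, 0, NULL, NULL);
        printf("UTF-8 needs %d bytes\n", n);  /* prints 4: 3 + terminating NUL */
        return 0;
    }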


Not sure what that has to do with CRT, given that it isn't part of Win32.

The CRT, in the form of the msvcrt.dll file, has had a de-facto presence in Windows since the end of the 1990s. Later on, since 2018 or so, CRT availability was formalized in the Windows API in the form of the ucrtbase.dll module.


The one bundled with Windows wasn't. However, the same "feature" exists in redistributed versions of msvcrt.

Which doesn't change the fact that Win32 doesn't depend on it.

It is extremely hard to create an application that doesn't depend on the CRT on Windows. The CRT provides the tables for SEH exception handlers and provides the default handlers. Win32 headers have hard dependencies on the handler tables the CRT provides, so you need to go quite a bit out of your way and hack deep into the Win32 headers. Loading DLLs etc. may also call CRT functions.

You can read Mingw64 source to see how many hacks they had to do to make it work.


I have been developing for Microsoft platforms since MS-DOS 3.3. Win16 and Win32 development without any function from the standard C library has been a thing for decades, for those doing C development without caring about portability, like demoscene competitions and gaming.

Using C++ is another matter.
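For illustration, a rough sketch of that kind of CRT-free Win32 program (assuming MSVC with /NODEFAULTLIB and /ENTRY:RawEntry; the entry point name is made up):

    #include <windows.h>

    /* custom entry point: no CRT startup code, so no main/argv handling,
       and we must exit via ExitProcess ourselves */
    void __stdcall RawEntry(void) {
        HANDLE out = GetStdHandle(STD_OUTPUT_HANDLE);
        DWORD written;
        WriteFile(out, "no CRT here\r\n", 13, &written, NULL);
        ExitProcess(0);
    }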


That's the "vcruntime" not the "ucrt". There has been a distinction since the ucrt was made an official part of the OS.

It's very easy to make a win32 program without the ucrt filesystem APIs so long as you don't mind being platform-specific (or making your own cross-platform wrappers).


It's still an important piece of the app compatibility story.

Does that mean that in this UTF-8 mode, GetCommandLineA would, when the full-width double quote occurs in the command line, return the UTF-8 bytes for that double quote, rather than steamrolling it to an ASCII double quote with the WorstFit mapping?

Yes, I wanted to suggest the same. I modified some old tools I wrote 15 years ago to do that a while ago. Not because I was aware of any vulnerability, but because a few places still used char* and I figured this would basically make it never fail with any weird filenames regardless of the code page.

So now it seems even if you think your app is fully Unicode, still do this just in case? :)


It sounds like something Cygwin ought to do across their ecosystem.

As mentioned elsewhere in this discussion, 99% of the time the cause is likely the use of standard C functions (or C++ `std::string`…) instead of MS's nonstandard wide versions. Which of course is a ubiquitous practice in portable command-line software like curl.

A lot of the details are in the linked curl HackerOne report: https://hackerone.com/reports/2550951

So the culprit is still the software writer. They should have wrapped the C++ library for OS-specific behavior on Windows, because they are publishing buggy software and calling it cross-platform.
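Something like the following is presumably what "wrapping" means here (a hedged sketch, not curl's actual code; the open_utf8 name is made up): keep UTF-8 char* in the portable layer and convert to UTF-16 only at the Win32 boundary.

    #include <windows.h>

    /* Accept a UTF-8 path and convert it for the wide "W" API.
       A real implementation would size the buffer dynamically instead
       of assuming MAX_PATH. */
    HANDLE open_utf8(const char *utf8_path) {
        wchar_t wide[MAX_PATH];
        if (!MultiByteToWideChar(CP_UTF8, 0, utf8_path, -1, wide, MAX_PATH))
            return INVALID_HANDLE_VALUE;
        return CreateFileW(wide, GENERIC_READ, FILE_SHARE_READ, NULL,
                           OPEN_EXISTING, FILE_ATTRIBUTE_NORMAL, NULL);
    }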

curl was first released in 1996, shortly after Windows 95 was born, and it runs on numerous Windows versions even today. So, how many different versions shall be maintained? Are you going to help maintain one of these versions?

On top of that, how many new gotchas do these “modern” Windows functions hide, and how many fix cycles are required to polish them to the required level?


If we're talking about curl specifically, I absolutely think they would (NOT "should") fix/workaround it if there are actually common problems caused by it.

Yes, it would have required numerous fix cycles, but curl in my mind is such a polished product that they would have bitten the bullet.


You're right, if the problems created by this are big enough, the team will fix them without any fanfare and whining.

However, in neither case is this a shortcoming of curl. They'd be responding to a complicated problem caused by the platform they're running on.


Why would/should they? I've never paid for curl. Who even develops it? Sounds like a thankless job to fix obscure worstfit bugs.

> Why would/should they?

Because they care. That's it.

> I've never paid for curl.

I'm sure the people who develop it don't want money and fame; they're just doing what they like. However, curl offers commercial support contracts if you need them.

> Who even develops it?

Daniel Stenberg et al. Daniel can be found at https://daniel.haxx.se.

> Sounds like a thankless job to fix obscure worstfit bugs.

It may look thankless, but it's not. curl is critical infrastructure at this point. While https://xkcd.com/2347/ applies squarely to cURL, it's actually nice that the lead developer is making some money out of his endeavor.


Why would they develop curl at all by your logic?

They fix bugs because they simply want their product to be better, if I were to take a guess? Like, I'm sure curl's contributors have worked on OS-specific problems before, and this wouldn't be the last time.

> to fix obscure worstfit bugs.

Again, my premise is "if there are actually common problems caused by it". This specific bug doesn't sound like that, at least not for now.


You also have to use wmain instead of main, with a wchar_t argv, otherwise the compiled-in argument parser will be calling the ANSI version. In other words... anyone using MSVC and the cross-platform, standardised, normal C system is hit by this.

Oh, and wmain is a VisualC thing. It isn't found on other platforms. Not standardised.
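A minimal sketch of the usual workaround (wmain behind an #ifdef, since it's MSVC-only):

    #include <stdio.h>
    #include <wchar.h>

    #ifdef _WIN32
    /* MSVC-specific wide entry point: argv arrives as UTF-16, untouched by
       the ANSI/WorstFit conversion */
    int wmain(int argc, wchar_t *argv[]) {
        for (int i = 0; i < argc; i++)
            wprintf(L"arg %d: %ls\n", i, argv[i]);
        return 0;
    }
    #else
    int main(int argc, char *argv[]) {
        for (int i = 0; i < argc; i++)
            printf("arg %d: %s\n", i, argv[i]);
        return 0;
    }
    #endif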


Writing cross platform code which consistently uses UCS-2 wchar_t* on Windows and UTF-8 char* on UNIX-like systems sounds like absolute hell

A wchar_t "native" libc implementation would be an interesting thing.

> I'm not sure why the non-unicode APIs are still so commonly used.

Even argv is affected on Windows. That's part of the C and C++ standard, not really a Windows API. Telling all C/C++ devs they need to stop using argv is kind of a tough ask.


I think the issue is that native OS things like the Windows command line, say, don't always do this. Check the results of their 'cd' commands with Japanese Yen characters introduced. You can see that the path descriptor somehow has updated to a directory name with a Yen sign (or a wide backslash) in it, while the file system underneath has munged them and put them into an actual directory. It's precisely the problem that you can't control the rest of the API surface to use W that is the source of the difficulties.

Using \\?\ has a downside: since it bypasses Win32's path processing, it also prevents relative paths like d:test.txt from working. Kind of annoying on the command line with tools like 7z.exe.

> I'm not sure why the non-unicode APIs are still so commonly used. I can't imagine it's out of a desire to support Windows 98 or Window 2000.

Nowadays, it's either for historical reasons (code written back when supporting Windows 9x was important, or even code migrated from Windows 3.x), or out of a desire to support non-Windows systems. Most operating systems use a byte-based multi-byte encoding (nowadays usually UTF-8) as their native encoding, instead of UTF-16.


I share your recommendation of always using PWSTR when using Windows APIs.

> I'm not sure why the non-unicode APIs are still so commonly used

I think it's because the rest of the C world uses char* with UTF-8, so that is what people are habituated to. Setting the ACP to CP_UTF8 would have solved a lot of problems, but I believe that's only been supported for a short period of time, bafflingly.


> I'm not sure why the non-unicode APIs are still so commonly used.

Simple: portable code meant to run on Unix (where UTF-8 is king) and Windows -> want to use UTF-8 codepage on Windows and the "A" APIs.


The other opt-out might be to opt into UTF-8 support for the "A" functions.

> paths longer than 260 characters (if you add a "\\?\" prefix or set your manifest correctly)

A long-ago-released build of Windows 10 did this automatically, so there's no need for adjustments anymore; 32k is the max....

...except for Office! It can't handle long paths. But Office has always been hacky (the title bar, for example).


Also most of the popular NAS setups. I'm pretty happy self-hosting everything on my Unraid server. I just checked: all of the tools in the article are in Unraid's App center (except Streamyfin, which is a phone app). And if something isn't, you can add Docker containers or VMs as necessary.

I use Pinchflat mostly for archiving channels that occasionally take videos down. It technically has an inbuilt video player, but that's more for "debugging". The intended non-archival use case is to have the download folder picked up by your Plex or Jellyfin and watch it there.

Of course that deprives you of interaction with the actual YouTube content. But for support, $1/month on the creator's Patreon goes further, and for engagement you can join the creator's Discord. The only thing you are really missing out on is feeding the YouTube algorithm.


APPX (the installer format used by the Windows Store) and its successor MSIX contain decent security improvements, including filesystem and registry virtualisation and a capability-based permission system.

After the limited success of the Windows Store you can now get the same in standalone installers. It has been adopted by approximately nobody.


I feel like calling "shows panic when presented with liquids to drink" a fear of water is a perfectly fine shorthand, even if it might not be a literal fear of all forms of water, only of water you are supposed to drink.

We got a certificate with a company that's the local equivalent of an LLC, and have seen certificates issued to private individuals. As far as I can tell it's up to the issuer who they support in their verification process. Many of them are pretty inflexible, with somewhat arcane processes designed decades ago (and ancient website portals to match), so your experience may vary.

An additional detail is that there are two levels of code signing certificates: normal and EV (extended validation). EV certificates make Windows completely drop the low-reputation screen and cause many antivirus solutions to trust you, but they are expensive and a bit of a pain to get. Normal certificates are cheaper and comparatively easier to acquire, but only give partial benefits (a less scary screen from Windows, some leeway from antivirus).


In Europe, statistically nearly every car older than 10 years is manual. With the recent improvements in automatics they are finally catching on in Europe.

You may be able to find an old automatic or CVT car in Europe, but you would need to look for a while.


Volvo made automatic versions of its cars from the late sixties onwards. A colleague had an automatic 244 in the early 1980s.

Here in Norway automatics are not finally catching on; they are ubiquitous, and the driving instructors' association is campaigning to get the rules changed so that you can pass the driving test on an automatic and later simply do a conversion course to be allowed to drive a manual.

And of course it will all be moot quite soon because EV sales already outnumber ICE sales nearly nine to one here. Volkswagen and Hyundai no longer sell ICE cars in Norway. Volkswagen still sells ICE vans and pickups though.


That's because Europeans prefer small cars, in which automatics really sucked until maybe 15-20 years ago. They were inefficient and heavy, and the engines in those cars didn't have a ton of excess power.

It's a different story in large sedans/wagons.


Volvo is one of the exceptions though. I have friends whose parents are Volvo people. Their cars have been automatic since the '80s.

I take it you meant to write every car older than 10 years is a manual?

Thanks, I obviously need more coffee. Fixed above
