I was actually expecting the author to take photos of the old laptop's screen and OCR those. This was far more entertaining.
I do wonder though... perhaps it would have been faster to have the larger fonts and let the transfer take 24 minutes. It probably took longer than 24 minutes to write the updated OCR software.
Author here. Seeing how difficult it was to get a reliable OCR transcription with commercial software from a pristine, computer-generated representation of the text, I suspect trying to OCR photos would be even less reliable :)
I simplified some things for brevity in the write-up. I did indeed try a bunch of fonts/font sizes (trying a single page at a time and manually inspecting the results) without much improvement.
There is nothing pristine about images transmitted over Fax. It's such a grotty old technology with loads of aliasing issues. A modern cell phone picture of a word screen full of hex would almost certainly be easier to OCR.
The transcription errors I was getting were not consistent. Like, D would be O or 0 or D, with no apparent rhyme or reason to it. And the turnaround time on each fax attempt was long enough that I focused on doing the image recognition myself instead.
This was a phenomenal effort and such a joy to read. Based on how much work this was, these were probably some very important sound files that mean a lot to someone in your family, so thanks for your hard work getting them off the laptop.
My goofy idea was using the font OCR-A but you'd be very lucky if that Mac came with that.
Well, mounting the disk itself. If it was simple to get an image of the disk, the author could have used the same method to just get the files they wanted.
Couldn't you just do a bunch of different faxes, perhaps in different fonts or different font sizes, which would lead to different randomly distributed errors? Then you can do OCR for all of them, and just take the median of the result, and get exponentially less error.
Did you try seeing how well ChatGPT was at OCRing the images? Though since it is HEX characters it might not do so good. I've found it to be very reliable at OCRin e.g. photos of receipts.
A laptop of this age should have a serial port. It's possible even with something like Win95 to run a null modem and tcp/IP over that and SMB to copy files to a semi modern OS.
I do wonder though... perhaps it would have been faster to have the larger fonts and let the transfer take 24 minutes. It probably took longer than 24 minutes to write the updated OCR software.
But, where's the fun in that?