K/simple: a tiny K interpreter for educational purposes by Arthur Whitney

userbinator · on Jan 18, 2024

J, the predecessor to K, also by the same author, is written in nearly the same style, but shorter:

https://www.jsoftware.com/ioj/iojATW.htm

(Discussed previously: https://news.ycombinator.com/item?id=25902615 )

There's something very satisfying about how this style seems to "climb the abstraction ladder" very quickly, but all of those abstractions he creates are not wasted and immediately put to use. I think much of the amazement and beauty is that there isn't much code at all, and yet it does so much. It's the complete opposite of the bloated, lazy, lowest-common-denominator trend that's been spreading in many other languages' communities.

mhuffman · on Jan 18, 2024

Whitney seems to have an obsession, and a real knack, for writing or rewriting array languages.

sxp · on Jan 17, 2024

I was surprised at how verbose and commented the code was. Then I read the note saying that Whitney's original code was in `ref/` and the two files in the root were annotated by other kparc members.

Keyframe · on Jan 17, 2024

Whitney is an unrealized IOCCC champion.

gaze · on Jan 17, 2024

I know you're joking but people who write C like him would argue it's not obfuscated. It's certainly not intentionally written to be more difficult to read, and those who are practiced in the art often say that it's _easier_ to read.

kelas · on Jan 17, 2024

> Whitney is an unrealized IOCCC champion.

no, that'd be fabrice bellard, who is actually a "realized" ioccc champ.

atw doesn't do obfuscated c. you are led astray.

Keyframe · on Jan 17, 2024

You know it's in jest, of course. However, he doesn't write C either - and you know that better than us. It's a DSL with its own idioms and quirks that just happens to be in C. Often hearing about "evils" of preprocessor (which I do and don't agree with) I wonder if he ever considered anything else, aside from C, that's also low level. Heck, even asm can be macro'd away.

huhtenberg · on Jan 17, 2024

> atw doesn't do obfuscated c.

He writes the way most of us code-golf. Makes you wonder how good his golfing would be if he tried.

dchest · on Jan 17, 2024

Pity that he writes in C; if he wrote in JavaScript, he wouldn't need a minifier.

up2isomorphism · on Jan 17, 2024

Sorry, you didn't get him. He does not do it for the sake of making it harder to read, nor would he entertain a language like JavaScript.

dchest · on Jan 17, 2024

it was a joke. i'm fascinated by his approach.

shakti.com has a suspiciously familiar looking snippet of javascript

was it him?

eatonphil · on Jan 17, 2024

Ah, thank you for clarifying this. I was confused why it said "by Arthur Whitney" when the contributors do not seem to be Arthur Whitney.

kelas · on Jan 17, 2024

"contributors" is essentially me. what needs to be clarified?

it is by arthur whitney.

anything else?

biosed · on Jan 17, 2024

I recall a chap called Geocar demoing kOS, in a very impressive fashion. (I additionally recall an individual had to awkwardly hold the mic for the whole presentation)

Did anything ever come of kOS?

chrispsn · on Jan 17, 2024

Everything I've found that's public is documented here:

https://gist.github.com/chrispsn/da00835bb122c42f429a084df83...

The kparc.com links are down though.

skruger · on Jan 18, 2024

For people discovering k for the first time, I wrote https://xpqz.github.io/kbook as a gentle intro.

graemep · on Jan 18, 2024

I am interested, but one thing I have still not quite understood is where to use this family of languages.

As far as I can tell they are best for data analysis but not for heavy numerical computation because most or all of them lack GPU support. Is that right or have I got it wrong?

Are there any other use cases?

scrawl · on Jan 18, 2024

q/kdb+ is used in finance (banking + funds) for heavy numerical computation every day. high-volume realtime data straight from markets, and petabyte/trillion-row historical DBs. it runs on CPU but computation easily parallelizes over cores/clusters.

regarding use cases, see https://kx.com/resources/use-cases/

graemep · on Jan 18, 2024

Thanks, that gives me a better feel for it. Mostly analytics, good with large datasets, but probably not great for things where you get a big gain from GPU?

scrawl · on Jan 18, 2024

what tasks are you thinking? i'm not a gpu expert

q is good with bulk operations on compact arrays; these are cache-friendly and the interpreter can utilize cache-level parallelism. and with q it's convenient to go from idea -> MVP in short time. it's a high-level language with functional features so expressing algos and complex logic is natural.

but it's interpreted and optimized for array ops. so really latency-critical (e.g. high-freq trading) or highly scalar logic will be done with C++. the trade-off is convenience of development.

cachvico · on Jan 18, 2024

Hedge funds use ML

vessenes · on Jan 17, 2024

I have a longstanding fascination with K and other "modern" APL derivatives.

There are a few intersecting truisms about coding that I believe: one is that people's working memory varies: some have an immense amount, some less. Humans definitely process spatially better than in time series (e.g. comparing side by side rather than turning over a page.)

This implies you should prefer succinct code and languages because they are less memory load for engineers working on them.

At the same time, a corollary is that a smaller standard library / language is generally better, in that less needs to be learned by an engineer for full coverage of the language.

Another truism is that some people's processing speed is higher than others, and in general I think of the combination of working memory + speed as roughly equivalent to "g", general intelligence.

K occupies this weirdo place though, because it's absolutely succinct, a very small language as counted by number of atoms supported by the interpreter, and also incredibly hard to scan.

One of the K intros I read mentioned that the language is designed to be something that takes down your thinking; essentially the idea is that the workflow is "drink coffee with fellow PhDs, annotate on the chalkboard, and then when ready, capture it directly." This seems about right to me with my own K/J/Q experiences -- the bulk of the time is spent thinking about structuring a problem solution.

I compare this to go, a language I love for its long-term readability and maintainability, where I spend a lot of time writing boilerplate and dealing with errors in-situ.

At any rate, somehow there's a sort of event horizon of terse solution making where you come out the other side and need a 170 IQ to feel comfortable, and Mr. Whitney lives where he lives, and I live where I live. :)

People complaining about how ugly the C code is here are definitely missing the point: he has bent C's preprocessor to his will in order to encapsulate how he thinks about coding: essentially functional, vectorized. It's using C to write a DSL for solving programming problems interesting to Arthur Whitney.

I think it's fascinating on those terms. In a world where you have to read 10,000 lines of code from 100 developers, the C is terrible, and hard to parse. In a world where you will mostly write code to a style you've honed over 40+ years, it's super expressive, minimal, pared down to what matters, and probably fits his brain perfectly.

userbinator · on Jan 18, 2024

One analogy I like to tell people who are overcome with shock and horror at APL-family languages is to compare it to someone used to Latin-family human languages looking at something like Chinese for the first time: it's likewise totally "unreadable" at first glance, but then you realise that over a billion people can read and write that language fluently every day, many of whom may also struggle with a Latin-family language as they've never seen one before.

I'm not convinced that APL is "incredibly hard to scan" for someone who is familiar with it; it's just a matter of experience. While I'm by no means experienced in APL either, a visually similar thing I did frequently in my younger days was reading x86 instructions not in a disassembler nor hexdump, but displayed as CP437. It was not hard, and I can still remember ┤, PQRS, ═!, and ├ as "MOV AH", "PUSH AX; PUSH CX; PUSH DX; PUSH BX", "INT 21", and "RET" respectively. Here's an example:

    ò║•═!├Hello world!$

Edit: looks like HN swallowed a byte, but you can sort of see what I mean.
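
The byte-to-glyph mapping described above can be checked with a short Python sketch. It assumes Python's built-in `cp437` codec and the usual 8086 encodings (B4 for MOV AH with an immediate, 50 to 53 for PUSH AX/CX/DX/BX, CD 21 for INT 21h, C3 for RET). The names `snippets` and `rendered` are illustrative and not from the comment.

```python
# Decode a few 8086 opcode bytes as CP437 text, the way a DOS text viewer shows them.
snippets = {
    "MOV AH, imm8 (opcode byte only)": bytes([0xB4]),
    "PUSH AX; PUSH CX; PUSH DX; PUSH BX": bytes([0x50, 0x51, 0x52, 0x53]),
    "INT 21h": bytes([0xCD, 0x21]),
    "RET": bytes([0xC3]),
}

# Map each mnemonic to the glyphs its bytes produce under code page 437.
rendered = {name: code.decode("cp437") for name, code in snippets.items()}

for name, glyphs in rendered.items():
    print(f"{glyphs!r:8} <- {name}")
```

The printable ASCII range (such as `PQRS` and `!`) is unchanged by CP437; only bytes above 0x7F turn into box-drawing characters.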

icsa · on Jan 18, 2024

Another analogy

Would you rather write/read:

DIVIDE X BY 5 GIVING Y

or

y=x/5; / this would be considered "noisy" by C programmers, relative to COBOL

The first is COBOL (designed to make code easier for "normal" people to read). The second is C/Java/Python/JavaScript (which looks more like the math that we learn in grade school).

k/APL/J simply moves further in the direction of the algebraic notation you already know. The difference is more operations/algorithms.

When you read the one-character symbols in K as algorithms versus characters, it makes much more sense. In addition, you can read "faster" in k than in other languages, relative to the functionality being expressed.

When I review C/Java/C++, I print out the source code and write the equivalent k code in the margin. The compression ratio is typically 10-20X. Doing so speeds up my work significantly when I go back over the reviewed code.

RodgerTheGreat · on Jan 18, 2024

As a moderately experienced K programmer, I find K easy enough to "scan". Common idioms immediately stand out as recognizable "words" that are suggestive of what a routine does before you fully parse it:

@& (filtering)

@< or @> (sorting)

@\: (folding)

,/ (flattening)

+\ (a running total)

etc.
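
For readers without K, here is a rough Python sketch of what four of these idioms compute. It is a paraphrase under my own reading of the symbols (`&` as "where", `<` as "grade up", `,/` as "join over", `+\` as "plus scan"), not output from any K interpreter, and the K spellings in the comments are illustrative. The `@\:` idiom is left out.

```python
from itertools import accumulate, chain

x = [30, 10, 40, 20]

# x@&x>15  (filtering): "where" turns a boolean mask into indices, "@" indexes with them
mask = [v > 15 for v in x]
where = [i for i, keep in enumerate(mask) if keep]
filtered = [x[i] for i in where]

# x@<x  (sorting): grade-up is the permutation of indices that sorts x
grade_up = sorted(range(len(x)), key=lambda i: x[i])
ascending = [x[i] for i in grade_up]

# ,/  (flattening): join-over concatenates the items of a list of lists
flat = list(chain.from_iterable([[1, 2], [3], [4, 5]]))

# +\  (running total): plus-scan keeps every partial sum
running = list(accumulate(x))

print(filtered, ascending, flat, running)
```

Each idiom is two or three characters in K and a line or more here, which is the sense in which they read as "words".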

It's also easy to notice, e.g. in K3, a reserved name like "_f" and immediately know you're looking at a recursive procedure.

omaranto · on Jan 18, 2024

In what sense is @\: folding? Doesn't it take a list of functions on the left and a thing on the right, and returns the list of values obtained by applying each function to the thing?
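
The behaviour described in this question (a list of functions on the left, one value on the right, a list of results back) can be sketched in Python as below. The helper name `apply_each_left` is hypothetical, and this illustrates the description only, not any particular K dialect.

```python
def apply_each_left(functions, value):
    """Apply every function in `functions` to the same `value`, collecting the results."""
    return [f(value) for f in functions]

# Four functions, one argument, four results.
results = apply_each_left([min, max, sum, len], [3, 1, 2])
print(results)
```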

graemep · on Jan 18, 2024

Another is regular expressions. They look entirely incomprehensible if you see one with no prior knowledge, but once you get used to them they are fairly easy to understand.

ParetoOptimal · on Jan 17, 2024

For me Haskell, point free functions, and lenses are a happy medium :)

blibble · on Jan 17, 2024

my fascination with Q/kdb/K/... disappeared once I had to debug it in production

you want a stack trace? tough luck, you'll get back:

`type

and that's it

icsa · on Jan 17, 2024

Kx has much improved Q/k's debugging capabilities - including stack traces.

see: https://code.kx.com/q/basics/debug/

scrawl · on Jan 17, 2024

backtrace has been supported for some time. the debug facilities are nice

https://code.kx.com/q/basics/debug/#stack-frames

blibble · on Jan 18, 2024

2020? their main customers are banks!

at some point we might upgrade to java 8!

vessenes · on Jan 17, 2024

Oh absolutely. Pg talks at some point about how LISP macros can create a sense of godlike power / mania, and I think there's some of that in APL-land too. The true titans don't need to debug in the traditional sense - a whole program fits on a single screen, they can load it in their head, reason about it, and see what happened.

Non-titanic engineers accidentally generate bugs in these languages with high "floor" IQ requirements, and debugging someone else's K code is terrible. Debugging your own K code is terrible, not least because when you get help you will feel very, very stupid indeed :)

alfiedotwtf · on Jan 18, 2024

Awesome comment.

In your opinion, what would be a language that comes closer to K’s functionality but at the same time be understandable to mere mortals?

omaranto · on Jan 18, 2024

In my opinion, the language that comes closest to K's functionality and is also understandable by mere mortals is K itself. It is obviously extremely close to K's functionality and is a very simple language, the only reason it doesn't seem simple is that most people are used to verbose languages. A couple of days of practice is enough to make K readable, in my experience.

Also, I am constantly amazed at how concise K is, easily rivaling not only conventional languages but also much larger array languages like APL or J. Arthur Whitney's taste in selecting primitives is out of this world.

RodgerTheGreat · on Jan 18, 2024

I'm extremely biased in recommending it, but Lil is semantically very similar to Q, entirely free, and intended to be beginner-friendly: https://beyondloom.com/tools/trylil.html

It's not as powerful or concise as K, but it gives you some of the flavor of an array language tucked inside what resembles an ordinary imperative/functional scripting language.

alfiedotwtf · on Jan 18, 2024

Awesome, thanks

gsinclair · on Jan 18, 2024

I recommend checking out uiua.org for fun. The docs are well written and the concepts, while foreign to most, are ultimately accessible and interesting.

k and uiua are in different branches of the APL family.

lkuty · on Jan 18, 2024

I recommend checking BQN at https://mlochbaum.github.io/BQN/ and the YouTube channel code_report by Conor Hoekstra (and also "Composition Intuition by Conor Hoekstra | Lambda Days 2023"). It is well documented.

alfiedotwtf · on Jan 18, 2024

Will do, thanks!

jimberlage · on Jan 18, 2024

> …fits his brain perfectly.

This is super accurate, and interesting in the context of the repo. It’s clearly not the most accessible language or style - I wonder if someone who has hyper optimized for themselves in the way Arthur has can write an effective teaching language. It requires a theory of mind for minds that are probably quite different in the general case.

zozbot234 · on Jan 17, 2024

I don't know about Golang but there are languages where this sort of highly domain-optimized, super terse syntax could be embedded as a domain-specific sublanguage, in a significantly less hackish way than what C allows.

vessenes · on Jan 17, 2024

I'm not that knowledgeable, and def not a C apologist, but what language are you thinking of? To my eyes, that is some heavy abuse of the preprocessor in a way that I don't think almost any modern "safe" language could possibly countenance.

I read that code like he got the expressivity benefits of a lisp macro system with close-to-the-metal C speeds in like 100 lines of code. I'm curious what else could do this.

bluejekyll · on Jan 17, 2024

Rust’s macro system is safe and hygienic, people have implemented lisps in it. I just did a google search to find an example, so I have no idea how well supported this is, https://github.com/JunSuzukiJapan/macro-lisp

aquestion · on Jan 17, 2024

The K language seems to be limited to CPU only. Yes, it can call out to GPU (https://code.kx.com/q/interfaces/gpus/) but K only runs on the CPU. This strikes me as odd for an array language.

eismcc · on Jan 17, 2024

shakti has GPU support now, i believe.

geph2021 · on Jan 17, 2024

Anyone know what is happening, or happened, with shakti db[1]?

It's been years since Arthur started this other k-variant.

1- https://shakti.com/

vessenes · on Jan 17, 2024

It's around, and they recently stopped providing free download links for recent versions, from which I take it that they have a reasonable enterprise sales program rolling now.

The feature split on the free / enterprise edition holds a lot back; my vibe on the free version was it was just enough to validate that shakti is performant, and then they want you to pay.

1vuio0pswjnm7 · on Jan 18, 2024

Here's the code

https://raw.githubusercontent.com/kparc/ksimple/main/a.c

https://raw.githubusercontent.com/kparc/ksimple/main/a.h

max_ · on Jan 17, 2024

What do they mean by "atw-style" ?

tosh · on Jan 17, 2024

Arthur Whitney

https://en.wikipedia.org/wiki/Arthur_Whitney_(computer_scien...

https://queue.acm.org/detail.cfm?id=1531242

https://hn.algolia.com/?q=arthur+whitney

tromp · on Jan 17, 2024

I'm guessing it's his initials; Arthur T. Whitney or perhaps ArThur Whitney, and possibly his account name.

nuclearnice3 · on Jan 17, 2024

Sounds right.

Additional examples at

https://github.com/louyx/aplus/blob/master/src/a/k.h

https://code.jsoftware.com/wiki/Essays/Incunabulum

How would you characterize that?

Heavy use of the C preprocessor and C defaults to embed a functional programming language. Language with a small number of core functions and ability to apply functions to lists of atoms. Aesthetically favoring short identifiers and minimal whitespace to create high semantic density. Eschew comments.

gipp · on Jan 17, 2024

You know how when you first start learning to code, the kids who really "get it" right away start off thinking shorter code = smarter code = better code?

k always seemed like a bunch of those kids managed to become highly accomplished and brilliant engineers without ever breaking that terrible habit. Is there actually a reason to write these array languages (and interpreters for them, apparently) this way, or is it just a cultural difference?

lmm · on Jan 18, 2024

> k always seemed like a bunch of those kids managed to become highly accomplished and brilliant engineers without ever breaking that terrible habit. Is there actually a reason to write these array languages (and interpreters for them, apparently) this way, or is it just a cultural difference?

They're more readable and less buggy that way. But unfortunately most programmers would rather spend 10 days reading 100,000 lines than 4 days reading 1,000 lines.

tom_ · on Jan 17, 2024

An occasional HN poster did a presentation a few years ago about his compiler, written in a similar sort of style: https://news.ycombinator.com/item?id=13638086

steveBK123 · on Jan 17, 2024

less code less bug

5jt · on Jan 19, 2024

Whitney and I both worked in the 1970s for I.P. Sharp Associates, which used an email system written beautifully in APL by Leslie Goldsmith. Most user names were simply our initials. Ian Sharp was IPS, Leslie LHG, and Arthur ATW. (Middle name Taylor.) I’ve been SJT ever since (including to two wives) but was not smart enough to grab the domain: see 5jt.com.

yard2010 · on Jan 17, 2024

This looks like a cult I would happily dive into. Does anyone care to ELI5? I tried to read the code but did not understand a single line

eismcc · on Jan 17, 2024

Arthur Whitney is showing how to build an array language using C in array language form.

AW is known for kdb+ which is often used in finance due to its extreme performance properties and ability for quants to quickly explore ideas.

Personally, to get a better handle on how array languages worked, I implemented KlongPy which is a python implementation of Klong, which descends from K (which AW wrote).

You have to play with this stuff to understand it intuitively.

bwanab · on Jan 17, 2024

Ha! For those of us who were in the Morgan Stanley Fixed Income group for many years in the 1990s and 2000s AW is known for APLus which was used extensively for modeling and application work.

CraigRo · on Jan 18, 2024

There were a lot of different and kind of weird attempts at this in the industry. The basic problem is that recalculating DAGs and Monte Carlo simulations using a ton of floating point math was extremely expensive. Excel was way too slow, C++ required expert programmers and much better organization of programming teams to figure out how to coordinate cached intermediate values. MS went with APL, which reduced a lot of the boilerplate and allowed for relatively easy memoization. GS went with SecDB which treated computation as memoized DAGs. Not sure what the others did.

sudosysgen · on Jan 18, 2024

FWIW, kdb+ is not that extremely performant - there's a lot of things that could be faster, and a lot of limitations that mean that you often would be better off not using a DB at all (or using another DB and just pulling everything you might need into memory). There is/was a tradeoff in that many things that would make it faster would require more code, and a cool thing about q/kdb+ is that it takes so little code you don't have I$ issues, but I think that's a tradeoff that doesn't make as much sense anymore in 2023.

What it's really great for is that it's really neatly integrated into the q language, which is great for exploratory programming, and it's fast enough not to get in the way.

mst · on Jan 18, 2024

> a cool thing about q/kdb+ is that it takes so little code you don't have I$ issues

I'm not sure what an I$ issue is - any chance of a bit more explanation?

(also I'd love to hear about the tradeoff part in more detail but that's a bigger ask)

durumu · on Jan 18, 2024

I$ is short for instruction cache ($ -> "cash" -> cache). Since kdb has so little code, more of it fits into the instruction cache at once.

omnicognate · on Jan 18, 2024

I've encountered this idea that k's terseness somehow improves instruction cache use before. Can you explain further? It seems nonsensical, since instruction caching is about machine code, not source code. Why should it use the instruction cache better than any other JIT? Or is it interpreted, in which case "the terseness of the language improves cache use" might seem more of an admission than a boast... :-)

mlochbaum · on Jan 18, 2024

I say it's nonsensical (and yes, Ks are bytecode interpreted). https://mlochbaum.github.io/BQN/implementation/kclaims.html#...

scrawl · on Jan 18, 2024

they mean CPU instruction cache

eismcc · on Jan 18, 2024

Thanks for the insights. Not to overdo self-promotion, but aside from learning, the main reason I made KlongPy was to allow for optionality with the ecosystem. Use Klong for array operations and other libraries for standard stuff.

jhbadger · on Jan 17, 2024

Basically it is an APL-like language that is used (or was, I think it has moved on to a successor language called q now) in the proprietary kdb+ system used by some financial companies. Even if you aren't into finance it is a fun language to play around with.

https://en.wikipedia.org/wiki/Kdb%2B

gitonthescene · on Jan 17, 2024

One of us.. one of us..

But seriously, if you haven’t given it a try you should. You won’t be disappointed.

horsellama · on Jan 18, 2024

array languages are very fascinating and I'd love spending more time learning about them

I read that they are (K in particular) used in finance...would you reckon it would be easier to find a job in that field having that on the CV? I know I cannot compete on the C++/Python side but maybe as a skilled K developer I could sneak in

nilamo · on Jan 18, 2024

I started with the header file. It was a gentle warning of the madness in "a.c". At least it's well commented.

p0w3n3d · on Jan 17, 2024

I believe that all one letter names for a language are already taken... Even Ć

p0w3n3d · on Jan 18, 2024

I've just learned that Ć is no longer a viable name (thankfully). Now it's Fusion Programming Language (or fut?) https://github.com/fusionlanguage/fut

up2isomorphism · on Jan 18, 2024

My god, AW is even willing to #include <stdio.h> now!

gavinray · on Jan 17, 2024

[flagged]

dang · on Jan 18, 2024

"Please don't post shallow dismissals, especially of other people's work. A good critical comment teaches us something."

https://news.ycombinator.com/newsguidelines.html

up2isomorphism · on Jan 17, 2024

Might not, since with this kind of thing you most likely either give up in a couple of minutes (which is not heinous, since it does not waste your time anyway) or you just read it.

gitonthescene · on Jan 17, 2024

The distinction between “seeing” code and “reading” it is right on point.

itishappy · on Jan 17, 2024

Ha! The guy who wrote this is the same guy who invented ~~APL~~ a number of APL inspired languages (Edit: He did not invent APL. Thanks for the corrections!), so I suspect he may just be built different.

https://www.jsoftware.com/ioj/iojATW.htm

Have you seen anything written in K itself? Here's a program to calculate primes:

    2_&{&/x!/:2_!x}'!R
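
For comparison, a minimal Python sketch of the same idea as I read it: for each candidate x, take the remainders of x by 2..x-1 and keep x when none of them is zero. This paraphrases the trial-division approach and is not a symbol-for-symbol translation of the K; `primes_below` is an illustrative name and `r` stands in for the bound `R`.

```python
def primes_below(r):
    """Return the primes less than r by naive trial division."""
    # all(...) over an empty range is True, so 2 is kept without a special case.
    return [x for x in range(2, r) if all(x % d for d in range(2, x))]

print(primes_below(20))
```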

jhbadger · on Jan 17, 2024

I'm sure ATW knows a lot of APL, but APL itself was created by Ken Iverson when ATW was just a small child. https://en.wikipedia.org/wiki/Kenneth_E._Iverson

microtherion · on Jan 17, 2024

APL was invented by Kenneth Iverson, the other person mentioned on the page, not Arthur Whitney.

I was not convinced of the readability of APL, but compared to its successors which tried to stick to ASCII, I've learned to appreciate the merits of an extended character set.

anthk · on Jan 18, 2024

A program written in Unix dc(1) would look as weird as that.

buescher · on Jan 18, 2024

How would you implement the same functionality?

kelas · on Jan 17, 2024

you must have seen a lot of code.

anonzzzies · on Jan 17, 2024

Your code better? What metric?