More

francislavoie · 2025-03-17T06:28:55 1742192935

But isn't using LLM for that really expensive? Seems wasteful.

adrianmonk · 2025-03-17T16:02:57 1742227377

I wouldn't use LLMs, but if I did, I would try to get the LLM to write parser code instead.

If it can convert from one format to another, then it can generate test cases for the parser. Then hopefully it can use those to iterate on parser code until it passes the tests.

In a sense, asking it to automate the work isn't as straightforward as asking it to do the work. But if the approach does pan out, it might be easier overall since it's probably easier to deploy generated code to production (than deploying LLMs).

genewitch · 2025-03-17T06:43:19 1742193799

My desktop GPU can run small models at 185 tokens a second. Larger models with speculative decoding: 50t/s. With a small, finetuned model as the draft model, no, this won't take much power at all to run inference.

Training, sure, but that's buy once cry once.

Whether this means it's a good idea, I don't think so, but the energy usage for parsing isn't why.

smokel · 2025-03-17T07:02:57 1742194977

A simple text parser would probably be 10,000,000 times as fast. So the statement that this won't take much power at all, is a bit of an overstatement.

boxed · 2025-03-17T07:18:25 1742195905

50 tokens per second. Compared to a quick and dirty parser written in python or even a regex? That's going to be many many orders of magnitude slower+costlier.

anthk · 2025-03-17T10:29:15 1742207355

awk would run millions times faster, not to mention mawk and awka.

berkes · 2025-03-17T08:24:12 1742199852

In order to make the point that

> energy usage for parsing isn't why

You'll need to provide actual figures and benchmark these against an actual parser.

I've written parsers for larger-scale server stuff. And while I too don't have these benchmarks available, I'll dare to wager quite a lot that a dedicated parser for almost anything will outperform an LLM magnitudes. I won't be suprised if a parser written in rust uses upwards of 10k times less energy than the most efficient LLM setup today. Hell, even a sed/awk/bash monstrosity probably outperforms such an LLM hundreds of times, energy wise.

chgs · 2025-03-17T09:10:00 1742202600

How many times would you need to parse to get an energy saving on using an lm to parse vs using an llm to write a parser, then using the parser to parse.

RUnconcerned · 2025-03-17T10:01:53 1742205713

It sounds like you need to learn how to program without using a LLM, but even if you used one to write a parser, and it took you 100 requests to do so, you would very quickly get the desired energy savings.

This is the kind of thinking that leads to modern software being slower than software from 30 years ago, even though it is running on hardware that's hundreds of times faster.

anthk · 2025-03-17T10:31:04 1742207464

People not using The AWK Programming Language as a reference to parse stuff and maybe The C Programming Language with AWKA (AWK to C translator) and a simple CSP library for threading yeilds a disaster on computing.

LLM's are not the solutions, they are the source of big troubles.

GTP · 2025-03-17T09:48:19 1742204899

> using an llm to write a parser

You're assuming OP needs an LLM to write a parser, since they mentions writing many during their career they probably don't need it ;)

chgs · 2025-03-17T16:36:30 1742229390

I was thinking more of when a sufficiently advanced device would be able to “decide” the task would be worth using its own capabilities to write some code to tackle the problem rather than brute force.

For small problems it’s not worthwhile, for large problems it is.

It’s similar to choosing to manually do something vs automate it.

berkes · 2025-03-17T12:43:53 1742215433

I didn't use an LLM back then. But would totally do that today (copilot).

Especially since the parser(s) I wrote were rather straightforward finite state machines with stream handling in front, parallel/async tooling around it, and at the core business logic (domain).

Streaming, job/thread/mutex management, FSM are all solved and clear. And I'm convinced an LLM like copilot is very good at writing code for things that have been solved.

The LLM, however, would get very much in the way in the domain/business layer. Because it hasn't got the statistical body of examples to handle our case.

(Parsers I wrote were a.o.: IBAN, gps-trails, user-defined-calculations (simple math formulas), and a DSL to describe hierarchies. I wrote them in Ruby, PHP, rust and perl.)

Hercuros · 2025-03-17T07:06:21 1742195181

It’s not just about the energy usage, but also purchase cost of the GPUs and opportunity cost of not using those GPUs for something more valuable (after you have bought them). Especially if you’re doing this at large scale and not just on a single desktop machine.

Of course you were already saying it’s not a good idea, but I think the above definitely plays a role at scale as well.

SlightlyLeftPad · 2025-03-17T07:35:57 1742196957

You’re right, I could be trying to get Crysis to run at 120 fps.

tmtvl · 2025-03-17T10:40:36 1742208036

If you have spare GPU time you could donate it to projects like Folding@Home.

anthk · 2025-03-17T10:32:34 1742207554

My Atom n270 netbook with mawk and a few lines parsing the files with a simple regex will crush down your GPU+LLM's on both time and power usage.

genmon · 2025-03-17T06:47:15 1742194035

My assumption is that models are getting cheaper, fast. So you can build now with OpenAI/Anthropic/etc and swap it out for a local or hosted model in a year.

This doesn't work for all use cases but data extraction is pretty safe. Treat it like a database query -- a slow but high availability and relatively cheap call.

Cthulhu_ · 2025-03-17T09:14:10 1742202850

While it will become cheaper, it will never be as fast / efficient as 'just' parsing the data the old-fashioned way.

It feels like using AI to do computing things instead of writing code is just like when we moved to relatively inefficient web technology for front-ends, where we needed beefier systems to get the same performance as we used to have, or when cloud computing became a thing and efficiency / speed became a factor of credit card limit instead of code efficiency.

Call me a luddite but I think as software developers we should do better, reduce waste, embrace mechanical sympathy, etc. Using AI to generate some code is fine - it's just the next step in code generators that I've been using throughout all my career IMO. But using AI to do tasks that can also be done 1000x more efficiently, like parsing / processing data, is going in the wrong direction.

relistan · 2025-03-17T10:16:15 1742206575

I know this particular problem space well. AI is a reasonable solution. WHOIS records are intentionally made to be human readable and not be machine parseable without huge effort because so many people were scraping them. So the same registrar may return records in a huge range of text formats. You can write code to handle them all if you really want to, but if you are not doing it en masse, AI is going to probably be a cheaper solution.

Example: https://github.com/weppos/whois is a very solid library for whois parsing but cannot handle all servers, as they say themselves. That has fifteen + years of work on it.

LadyCailin · 2025-03-17T10:46:25 1742208385

But.. that’s exactly what this thread is about. RDAP is the future, not WHOIS.

relistan · 2025-03-17T11:00:57 1742209257

Yes, exactly. Read what I was responding to.

notpushkin · 2025-03-17T11:11:53 1742209913

I think you’re both right, and also both are missing the point.

Using LLMs to parse whois data is okay in the meantime (preferably as a last resort!), but structuring the data properly in the first place (i.e. RDAP) is the better solution in the long run.

relistan · 2025-03-18T08:32:44 1742286764

I’m not missing that point at all. I’m 100% on board.

ohmygoodniche · 2025-03-17T11:03:02 1742209382

Requesting that people think before transferring mission critical code into the hands of LLMs is not being a Luddite lol.

Can you imagine how many ridiculous errors we would have if LLMs structured data into protobufs. Or if they compiled software.

It's more than 1000x more wasteful resources wise too. The llm swiss army knife is the Balenciaga all leather garbage bag option for a vast majority of use cases

GTP · 2025-03-17T09:40:23 1742204423

Still, I wouldn't use an LLM for what's essentially a database query: by their very nature, LLMs will give you the right answer most of the times, but will sometimes return you wrong information. Better stay on a deterministic DB query in this case.

lucianbr · 2025-03-17T11:11:23 1742209883

As usual, arguments for LLMs are based on rosy assumptions about future trajectory. How about we talk about data extraction at that point in the future when models are already cheap enough. And in the meantime just assume the future is uncertain, as it obviously is.

permo-w · 2025-03-17T16:39:18 1742229558

deepseek API costs are quite literally pennies per million tokens

francislavoie · 2025-03-07T15:25:10 1741361110

I use xplorer2, works pretty well. I don't do anything fancy with it, but I like the two-panel layout and the "scrap" panel for quick links

francislavoie · 2025-02-17T21:39:09 1739828349

Production ready, 10 year old project. It's not new, it's older than Kubernetes even.

francislavoie · 2025-02-17T20:01:23 1739822483

Logs are just written out to stderr by default, and when you use systemd it ingests it automatically. If you run it with alpine, it's up to you to wire that up.

francislavoie · 2025-02-17T19:44:06 1739821446

You can shorten it:

    @oldPath path_regexp ^/old/([\w-]+)
    rewrite @oldPath /new/{re.oldPath.1}.php

Having matching and handling be separate steps are a huge benefit to composability of config, you can have pluggable matchers and handlers.

Re you issue with netdata, ask on the forums for help.

francislavoie · 2025-02-17T19:42:34 1739821354

Yeah, tech debt sucks a lot.

francislavoie · 2025-02-17T19:34:19 1739820859

Caddy v0/1 was just built up naturally bit by bit, never had a proper design/structure around its config. Caddy v2 was a complete rewrite from the ground up, including thinking a lot more about config design, so it had to be a breaking change for the project's future to thrive.

francislavoie · 2025-02-17T19:33:05 1739820785

Yes, it has events on cert issuance/renewal you can hook into with Go plugins (which can trigger your python script). But generally you want all Caddy instances to share storage via storage modules (e.g. Redis or Consul or a DB, or synced filesystem) and Caddy manages locks through the storage so they don't step on eachother's toes.

francislavoie · 2025-02-17T19:29:00 1739820540

Yes, stable and production ready

francislavoie · 2024-12-02T20:11:08 1733170268

Hahaha, this is basically some clever clickbait. Tl;dr Paul is retiring and Jack is taking sole ownership of Fathom. Nothing really changes.