It's worth pointing out that what you are fondly remembering from a consumer standpoint was an absolute nightmare from the publishers' side, and not just out of pure greed, though that definitely plays a role.
Game consoles were (and still are to some degree), by and large, toys. Toys that parents buy for their children with the expectation that the kids can mostly be left to their own devices with them. The ESRB/PEGI/etc. ratings systems were put in place so that parents can trust that they know what's in the toy without having to sit over the kids' shoulders every single minute they are playing. In a sense it's not unlike Mattel spending a lot of energy making sure their dolls and action figures don't pose any choking hazards.
Allowing modding breaks that system, and by extension the accompanying trust. This is a big deal for a toy manufacturer. It's also why Hot Coffee was such a mess despite the content not being normally accessible. Parents don't want to have to care about technicalities.
People like to think of this situation as a "think of the children"-type of hand-wringing, but it's actually more of a "think of the parents", who happen to be the ones with money.
Again, not discounting the greed and DRM aspects of this, and it definitely sucks pretty hard for adult users of the systems, but it's far from all there is to it.
> Spark doesn't even have geospatial data types and the open source packages that add support leave a lot to be desired.
Could you say more about this? I'm curious whether you've compared Apache Sedona [0] and what specifically you found lacking. I currently work at Wherobots [1], founded by the creators of Apache Sedona, and would love to hear any feedback.
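(For anyone unfamiliar, here's roughly what Sedona layers on top of Spark SQL -- a minimal PySpark sketch, assuming the Sedona jars are on the Spark classpath; the registration call varies a bit between Sedona versions:)

```python
from pyspark.sql import SparkSession
from sedona.register import SedonaRegistrator  # older API; newer releases use SedonaContext

spark = SparkSession.builder.appName("geo-demo").getOrCreate()
SedonaRegistrator.registerAll(spark)  # registers ST_* geospatial types and functions with Spark SQL

# Point-in-polygon check using the registered SQL functions.
spark.sql("""
    SELECT ST_Contains(
        ST_GeomFromWKT('POLYGON ((0 0, 0 10, 10 10, 10 0, 0 0))'),
        ST_Point(5.0, 5.0)
    ) AS inside
""").show()
```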
A 3,200-megapixel custom CCD array cooled to about -100 °C sounds amazing.
The 3.5-degree FoV works out to roughly a 700 mm focal length equivalent on a full-frame camera, which makes this relatively wide-field by telescope standards, allowing it to capture more of the sky per shot. But the array is 64 cm wide, so the actual focal length is around 10 meters.
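Quick sanity check on those numbers (the 700 mm "equivalent" falls out if you take the 3.5° across the ~43 mm full-frame diagonal; the same formula scaled to the 64 cm focal plane gives the actual focal length):

```python
import math

def focal_length_mm(sensor_size_mm, fov_deg):
    # size = 2 * f * tan(fov / 2)  =>  f = size / (2 * tan(fov / 2))
    return sensor_size_mm / (2 * math.tan(math.radians(fov_deg) / 2))

print(round(focal_length_mm(43.3, 3.5)))  # ~709 mm for a full-frame diagonal -> the "700 mm equivalent"
print(round(focal_length_mm(640, 3.5)))   # ~10,500 mm for the 64 cm focal plane -> roughly 10 m actual
```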
I was confused as hell for a long time when I first got into ML, until I figured out how to think about tensors in a visual way.
You're right: fundamentally, ML is about vector and matrix operations (1D and 2D). So why do most ML programs work with 3D, 4D, and, in a transformer, sometimes up to 6D tensors (?!)
One reasonable guess is that the third dimension is time. Actually, it's not. It turns out that time is pretty rare in ML, and it's only (relatively) recently that it has been introduced into e.g. video models.
Another guess is that it represents "time" in the sense of how transformers work: they generate a token, then another given the previous one, then a third given the first two, etc. That's a certain way of describing "time". But it turns out that transformers don't need an extra dimension for this. 2D is enough, because tokens are 1D -- if you represent tokens over time, you get a 2D output. So even with a cutting-edge model like a transformer, you still only need plain old 2D matrix operations. The attention layer creates a mask, which also ends up being 2D.
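For instance, a causal attention mask for a sequence of T tokens is just a T x T lower-triangular matrix (illustrative PyTorch, sizes made up):

```python
import torch

T = 5                                # number of tokens generated so far ("time" steps)
mask = torch.tril(torch.ones(T, T))  # row t has ones for positions <= t: token t sees only the past
print(mask)
# The token embeddings themselves stack into a 2D (T x d_model) matrix,
# so both the data and the attention pattern stay plain old 2D.
```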
So then why do models get to 3D and above? Usually batching. You get a certain efficiency boost when you pack a bunch of operations together. And if you pack a bunch of 2D operations together, that third dimension is the batch dimension.
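A quick sketch of what that looks like in practice (numpy, with made-up sizes): the leading axis is the batch, and the 2D matmuls happen independently along it.

```python
import numpy as np

a = np.random.randn(32, 64, 128)  # 32 independent 64x128 matrices
b = np.random.randn(32, 128, 16)  # 32 independent 128x16 matrices
out = a @ b                       # one call does 32 separate 2D matmuls -> shape (32, 64, 16)
assert out.shape == (32, 64, 16)
```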
For images, you typically end up with 4D, with the convention N,C,H,W, which stands for "Batch, Channel, Height, Width". It can also be N,H,W,C, which is the same thing but packed in memory as red green blue, red green blue, etc. instead of all the red pixels first, then all the green pixels, then all the blue pixels. This matters in various subtle ways.
I have no idea why the batch dimension is called N, but it's probably "number of images".
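The same image batch in both layouts, just to make the memory-order difference concrete (PyTorch, made-up sizes):

```python
import torch

imgs_nchw = torch.randn(8, 3, 224, 224)                 # N,C,H,W: 8 RGB images, whole channel planes first
imgs_nhwc = imgs_nchw.permute(0, 2, 3, 1).contiguous()  # N,H,W,C: R,G,B interleaved per pixel in memory
print(imgs_nchw.shape, imgs_nhwc.shape)                 # same data, different memory layout
```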
"Vector" wouldn't quite cover all of this, and although "tensor" is confusing, it's fine. It's the ham sandwich of naming conventions: flexible, satisfying to some, and you can make them in a bunch of different varieties.
Under the hood, TPUs actually flatten 3D tensors down into 2D matrix multiplications. I was surprised by this, but it makes total sense. The native size for a TPU is 8x128 -- you can think of it a bit like the native width of a CPU, except it's 2D. So if you have a 3x4x256 tensor, it actually gets flattened out to 12x256, then the XLA black box magic figures out how to split that across a certain number of 8x128 vector registers. Note they're called "vector registers" rather than "tensor registers", which is interesting. See https://cloud.google.com/tpu/docs/performance-guide
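The flattening itself is just a reshape of the leading dimensions (numpy sketch of the 3x4x256 example; how XLA then tiles it onto the 8x128 registers is the black-box part):

```python
import numpy as np

x = np.arange(3 * 4 * 256).reshape(3, 4, 256)  # a 3x4x256 tensor
flat = x.reshape(-1, 256)                      # leading dims collapsed -> 12x256
assert flat.shape == (12, 256)
# From here it's an ordinary 2D operand; the compiler decides how to carve
# the 12x256 block across the hardware's 8x128 vector registers.
```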
When you are not experienced on the platform you are building on, all debugging is heroic.
It's really strange that they would invest all this time as if they were keen to understand why it happened, but then be satisfied when jiggling the handle solved it. The real cause is ultimately a process one (developer discipline) or a pipeline one (bad build/test/deployment processes). They got close to realizing it and then blamed the language instead of what appear to be bad practices on their dev team.
If you pull the llama.cpp repo and use its convert/quantize tools on the PyTorch version of the models uploaded to Hugging Face, the resulting GGUF files will load just fine into ollama (point a Modelfile's FROM line at the GGUF and run ollama create).
> The numerical notation of 4 is IV in Roman numerals.
Using "IIII" instead of "IV" isn't even necessarily wrong. Rome was a big empire with a widely-distributed populace that lasted for a thousand years. The usage of numerals changed over time and according to context:
"While subtractive notation for 4, 40 and 400 (IV, XL and CD) has been the usual form since Roman times, additive notation to represent these numbers (IIII, XXXX and CCCC)[9] continued to be used, including in compound numbers like 24 (XXIIII),[10] 74 (LXXIIII),[11] and 490 (CCCCLXXXX).[12] The additive forms for 9, 90, and 900 (VIIII,[9] LXXXX,[13] and DCCCC[14]) have also been used, although less often. The two conventions could be mixed in the same document or inscription, even in the same numeral. For example, on the numbered gates to the Colosseum, IIII is systematically used instead of IV, but subtractive notation is used for XL; consequently, gate 44 is labelled XLIIII."
As for clock faces, the explanation that I always heard was that it simplified the manufacturing process to use IIII rather than IV; something about making better use of materials to have one fewer V and one more I.
There seems to be a lot of confusion between malware and vulnerabilities. None of the vendors mentioned in this subthread detects malicious code, only vulnerabilities.
Good as they may be at detecting vulnerabilities, you are still unprotected from malicious code planted in your code bases.
Mhm, but it kinda hurts to see that AMD is able to push out APUs powering the likes of a PlayStation 5 with everything on a single chip, while on desktop you need to buy the CPU and a chunky GPU separately.
Off the top of my head, here are some reasons people still want a headphone jack:
* Invested in quality audio equipment that they'd like to continue using without a crappy dongle
* Bluetooth dongles don't have the same level of integration that EarPods do, for instance (e.g., button presses don't work the same way)
* Lightning or USB-C headphone adapters are easy to lose track of
* Lightning or USB-C headphone adapters are awkward sticking out of your phone and risk breaking the port
* Dislike the environmental impact of trashing perfectly fine equipment
* Dislike the environmental impact of moving to a model of disposable equipment with non-replaceable batteries
* Tired of the inconvenience of dropping wireless earphones and not being able to find them (e.g., this happens frequently on flights¹)
* Weary of dealing with Bluetooth issues (e.g., my AirPod Pros randomly disconnect from my iPhone with regularity and I have to click multiple things to switch from my phone to my work laptop)
It's fine if these don't apply to you, but they're a little more substantive than the Luddite hand-wringing you suggest. And for these people, the trade-off of a slightly slimmer phone wasn't worth it. That's not to say there aren't advantages to wireless headphones, but supporting one doesn't mean having to preclude the other.
Thanks for that idea, I use Ollama as my main LLM driver, but I still use OpenAI, Anthropic, and Mistral commercial API plans. I access Ollama via a REST API and my own client code, but I will try their UI.
re: cancelling ChatGPT subscription: I am tempted to do this also except I suspect that when they release GPT-5 there may be a waiting list, and I don’t want any delays in trying it out.
I think it is even easier right now for companies to self-host an inference server with basic RAG support:
- get a Mac Mini or Mac Studio
- just run ollama serve
- run ollama web-ui in docker
- add some coding assistant model from ollamahub with the web-ui
- upload your documents in the web-ui
No code needed: you have your self-hosted LLM with basic RAG giving you answers with your documents in context.
For us the deepseek coder 33b model is fast enough on a Mac Studio with 64 GB of RAM and can give pretty good suggestions based on our internal coding documentation.
The pace of progress here is pretty amazing. I loved how easy it is to get llamafile up and running, but I missed feature-complete chat interfaces, so I built one based on it: https://recurse.chat/.
I still need GPT-4 for some tasks, but in daily usage it has replaced much of my ChatGPT use, especially since I can import all of my ChatGPT chat history. I'm also curious to learn what people want to do with local AI.
Context: Furbys were the toy for a year or two, and were actively marketed as learning from speech, had an active mic, and did adjust their speech based on what they heard, "learning" to speak English from Furbish. [^1]
It's not so different from the fundamental fear of Alexa/Assistant/microphones that's fairly well diffused now.
Except the Furby actively claimed to learn how to speak based on your speech, and had a built-in feedback loop to make it appear as such.
In retrospect it looks like it was more "shift the mix towards English based on how much you've heard" than "add words you heard to your speech patterns".
Most likely it has badly conditioned embedding vectors for those particular tokens, leading the network to edge into numerically unstable territory; once you get some sort of underflow or NaN, they tend to propagate and invalidate the entire output. If there are any batchnorm or other operations that mix values between different entries in a batch, you could even cause other peoples' sessions to return junk values!
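You can see that kind of cross-batch contamination with a toy batchnorm (PyTorch; purely illustrative, not the actual model in question):

```python
import torch
import torch.nn as nn

bn = nn.BatchNorm1d(4)   # in training mode, mean/var are computed across the batch
x = torch.randn(3, 4)    # a batch of 3 independent "requests"
x[0, 2] = float("nan")   # one request goes numerically bad

print(bn(x))             # the NaN poisons the batch statistics for that feature,
                         # so every row in the batch now has a NaN in that column
```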
If you want to go deeper with RFID and can spend a bit more (~$50), I am pretty happy with the knockoff Proxmark3 Easy [0] I got on eBay. (Do some research to find a good seller, as I have heard some sellers ship bad units.) It can do both 125 kHz and 13.56 MHz RFID/NFC and is easier to use than some of the Android apps for cracking Mifare keys.
For the price, it is great for more complex attacks and has almost all the features of a full Proxmark RDV4 (minus BLE and a battery).
Hah. I didn't think this was quite HN worthy at this point - the code is still a mess, and has plenty of bugs.
It is however the wm I actually use since I got frustrated with bspwm and did a very minimalist rewrite of TinyWM [1] in Ruby [2] and expanded it from there. It was painful the first few days until I'd had time to add multiple desktops and the start of a tiling mode. But at this point, it's "almost" pleasant for me.
The warnings are real, though, apart from the initial hyperbole - this is likely to break for you in all kinds of horrible ways still.
[EDIT2: As for a real example of the ways it can/will break: Right now, for some reason, after a restart without changing a single line, it's failing to grab super+buttons to move/resize windows; it doesn't really matter as I have keybindings for it and mostly use those, but, yeah, I'm doing something wrong somewhere and last time this happened the problem just went away by itself]
I use very few applications beyond (my own) terminal, (my own) polybar replacement, (my own) file manager, and a browser, and so once Chrome and my own apps mostly started working ok I've had very little incentive to make sure it behaves nicely with anything else and I know the distinction between different EWMH window types is incomplete and broken - just not in ways that usually affect my own use.
EDIT: Also a couple of quick notes on "design choices", to the extent this has been "designed" rather than accreted: as you can see in e.g. desktop.rb, in quite a few places I've chosen to query and filter the top-level WindowManager class for a list of windows matching a given set of criteria. I did that on an experimental basis, out of the idea that it was a waste of time to track precise state across multiple objects given that the total set of windows will always remain "small".
So the Window objects know the bare minimum state of the underlying X11 window that they need to track, and the WindowManager class knows how to map an X11 window id to a Window object.
Other than that I avoid tracking state as much as possible, and even where I track state I try to avoid the need for full precision.
The one other place where I must track state is the layout classes. Currently that's only the basic tiling layout, which keeps a tree (currently a binary tree, but the array of nodes is there because at one point I thought I might allow more nodes) where the leaves keep track of which nodes have a defined place in the layout.
But even there, on updating the layout, I then crudely diff the currently known set of windows vs. the set of windows the layout believes are present and remove/add as needed.
I make a best effort to place new windows on map requests, but if anything breaks for any reason, the layout will catch up automatically and new windows will be placed. Given that I use this daily while it is in flux and often broken because I've made a change and am testing it "live" on my desktop, this has turned out to be very helpful on occasion.
I can't make up my mind if this method (of constantly querying for attributes of every window) is a crude hack or quite elegant, but it works.
Many other things are not "designed". This is mostly very far from a good example of how to actually do things. E.g. "find_closest" used to pick a window to move to when you want to move directionally in a tiling layout is pretty much guaranteed to be possible to do much better with fewer lines of code once I actually get around to sitting down and thinking about it - the current version was cobbled together in a hurry because I was tired of not having the functionality and it "mostly works" as expected even though it's stupid.
The neutral element for + is 0 (x + 0 = x for any x).
The neutral element for * is 1 (x * 1 = x for any x).
Furthermore, you have arithmetic properties like x * 0 = 0 for any x (annulation) or (x + y) * z = (x * z) + (y * z) for any x, y, z (distributivity).
Similarly:
The neutral element for OR is false (x OR false = x for any x).
The neutral element for AND is true (x AND true = x for any x).
Furthermore, x AND false = false for any x, and (x OR y) AND z = (x AND z) OR (y AND z) for any x, y, z.
So OR works very much like + algebraically, and AND works very much like *.
When using 0 and 1 for false and true, AND is exactly the same as multiplication, and OR is like addition with saturation arithmetic (i.e. 1 + 1 = 1).
The common precedence rules stem from those parallels.
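The parallels are easy to check exhaustively (Python sketch):

```python
from itertools import product

for x, y, z in product([False, True], repeat=3):
    assert (x or False) == x                             # false is neutral for OR, like 0 for +
    assert (x and True) == x                             # true is neutral for AND, like 1 for *
    assert (x and False) == False                        # annulation, like x * 0 == 0
    assert ((x or y) and z) == ((x and z) or (y and z))  # distributivity of AND over OR

# And precedence mirrors arithmetic: AND binds tighter than OR, just as * binds tighter than +.
assert (True or False and False) == (True or (False and False))  # evaluates to True
```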
On the truly lightweight end, I find Apple's Shortcuts to be effective for ad-hoc personalized creations. I wanted a simple journaling app that would let me just talk, transcribe what I said, and store it with a timestamp in a text file. I realized I could do all of that quite easily with Shortcuts: I trigger it, talk as long as I want, tap the screen, it transcribes (I call an API for better quality), then appends the result to a note with a timestamp. Fast, easy, and it's been reliable. No in-app purchases or ads, either.
Great read! This reminds me of a macOS app I made for my wife a few years back. It keeps track of the opening hours of all her favorite shops, and she can click a menu bar icon to see how long until each one closes today. It also warns if it's currently peak/rush hour for the shop, since she prefers to go when it's less crowded.
It's a simple Qt app that uses a text file for data storage. I wrote it after noticing that she had trouble remembering which shops are open when. I asked her what to call it, and she said "Gladiolus, like the flower" so I named it Gladiolus.
I can say for sure I've never had a more appreciative client as a programmer than the one user of Gladiolus :^)
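The core logic really is tiny; here's a purely illustrative Python sketch of the idea (the real thing is a Qt app with its own text-file format, and the shops and hours below are made up):

```python
from datetime import datetime, time

# One entry per shop: closing time and the rush-hour window she prefers to avoid.
SHOPS = {
    "Bakery":  {"closes": time(18, 0), "rush": (time(11, 30), time(13, 0))},
    "Grocery": {"closes": time(21, 0), "rush": (time(16, 30), time(18, 30))},
}

now = datetime.now()
for name, info in SHOPS.items():
    closing = datetime.combine(now.date(), info["closes"])
    minutes_left = max(0, int((closing - now).total_seconds() // 60))
    busy = info["rush"][0] <= now.time() <= info["rush"][1]
    print(f"{name}: closes in {minutes_left} min" + (" (rush hour right now)" if busy else ""))
```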
> "GM, for instance, at one time picked up the entire cost of funding health insurance premiums of its employees, their survivors and GM retirees, as the US did not have a universal health care system."
This is an article about automakers. The reason they picked up future healthcare costs is that they're future healthcare costs, which lets the bosses pay themselves bonuses from current profits and then the company can go bankrupt from unfunded future obligations after they've moved on to another company. The reason isn't that the US doesn't have a universal healthcare system; even if it did, they could have provided supplemental insurance etc., and would still have wanted to, because that too is a future cost instead of a present-day one.
The reason that qualifier is there is as a dig against the US healthcare system, in a way that aligns with particular partisans. The opposing partisan might have inserted something like "as the US has high healthcare costs as a result of regulatory dysfunction" though of course neutrality would have been to say neither of them because it's an article about automakers rather than healthcare systems.
And yet it's there, and that kind of thing is all over the place.
Android was, I think, even better in the early days.
It had 4 hardware buttons: back, menu, home and search. They lost menu and search along the way. Sure, they weren't always needed, but they were common enough functions to warrant dedicated buttons. Now everything is different and you never know how to pop up a menu (hamburger menu? where? or is it a swipe?), even though it is always the same functionally.
Not that; it's about how you navigate through screens in the same app and across multiple apps. Since the back gesture and back stack are universal and all apps present activities added to the back stack, you can seamlessly move between apps using "intents", which are abstracted messages that multiple apps understand, and even tunnel through an arbitrary number of apps and back up through each of their screens as if they were all in the same application.
An example of an intent is "can anyone open this PDF file?" or "the user wants to pick a file, can any app do that and let me know what they picked?" -- those file pickers could also be from your network browser app, or Dropbox, etc.
And since activities are serializable (well, technically the intents that led to those activities being started), Android can do this without requiring the apps deep in the app stack to be running. It can freeze those apps to conserve resources and restart them at the specific activity when the user returns, if necessary.
iOS does have limited inter-app linking (it's the tiny little back arrow and the previous app name that you'll sometimes see at the very top of the screen), but since back gestures aren't universal, the only reliable way to activate that feature is to tap it, since the app may not even understand back gestures, let alone do the extra work of relinquishing control when their own local back stack is empty.
I watched some government sale where they posted a PDF of vehicles for sale that had been forfeited.
The VINs were there, but parts of them were blacked out.
It was a PDF. I copy-pasted the text behind the black box and got the full VIN.