Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

[flagged]


I figured it might, but I think that this is a top of mind question for people and would be nice to make clear in the comments of the post too. So often there’s some theoretical improvement on multiplication that isn’t actually practical. Regardless, they don’t seem to have posted results for CUDA, which is arguably more important than CPU multiplication which is what they tried


Probably why they tried to address it in the abstract already


Let's have more comments from domain experts comparing algorithms in the abstract to cuBLAS, and less like these. Thanks!

If they're wrong to speculate, well, there's a whole paper you can just go skim to find the bit that rebuts them.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: