| | Before AI's Kepler Moment (ashvardanian.com) |
| 3 points by BenGosub 2 days ago | past | 1 comment |
|
| | Before AI's Kepler Moment (ashvardanian.com) |
| 2 points by ashvardanian 2 days ago | past | discuss |
|
| | Beyond OpenMP in C++ and Rust: Taskflow, Rayon, Fork Union (ashvardanian.com) |
| 126 points by ashvardanian 7 days ago | past | 31 comments |
|
| | CUDA Hello World: Done Less Wrong (ashvardanian.com) |
| 3 points by ashvardanian 9 days ago | past | discuss |
|
| | A String Library Beat OpenCV at Image Processing by 4x (ashvardanian.com) |
| 5 points by ternaus 13 days ago | past | discuss |
|
| | Processing Strings 109x Faster Than Nvidia on H100 (ashvardanian.com) |
| 6 points by binarymax 14 days ago | past |
|
| | Processing Strings 109x Faster Than Nvidia on H100 (ashvardanian.com) |
| 34 points by samspenc 14 days ago | past | 3 comments |
|
| | Processing Strings 109x Faster Than Nvidia on H100 (ashvardanian.com) |
| 216 points by ashvardanian 15 days ago | past | 26 comments |
|
| | Stringzilla v4 Introduces 500 GigaCUPS Edit Distance on GPUs (ashvardanian.com) |
| 12 points by _false 17 days ago | past |
|
| | Fork Union: Beyond OpenMP in C++ and Rust? (ashvardanian.com) |
| 7 points by ashvardanian 33 days ago | past | 1 comment |
|
| | Fork Union: Beyond OpenMP in C++ and Rust? (ashvardanian.com) |
| 3 points by MeetingsBrowser 4 months ago | past |
|
| | Fork Union: Beyond OpenMP in C++ and Rust? (ashvardanian.com) |
| 7 points by polyrand 4 months ago | past |
|
| | Calling CUDA in 3000 Words (ashvardanian.com) |
| 1 point by ashvardanian 6 months ago | past |
|
| | The Longest Nvidia PTX Instruction (ashvardanian.com) |
| 2 points by thunderbong 8 months ago | past |
|
| | The Longest Nvidia PTX Instruction (ashvardanian.com) |
| 8 points by ashvardanian 8 months ago | past |
|
| | CPU Ports and Latency Hiding on x86 (ashvardanian.com) |
| 2 points by ashvardanian 8 months ago | past |
|
| | Parsing JSON in C and C++: Singleton Tax (ashvardanian.com) |
| 2 points by ashvardanian 9 months ago | past |
|
| | GCC Compiler vs. Human – 119x Faster Assembly (2023) (ashvardanian.com) |
| 2 points by benchmarkist 10 months ago | past |
|
| | The Next 31 Years of Developing Unum (ashvardanian.com) |
| 1 point by ashvardanian 10 months ago | past |
|
| | Over-Engineering 5x Faster Set Intersections in SVE2, AVX-512, & Neon (ashvardanian.com) |
| 3 points by ashvardanian on Sept 16, 2024 | past | 1 comment |
|
| | 35% Discount on Keyword Arguments in Python (ashvardanian.com) |
| 2 points by ashvardanian on Sept 9, 2024 | past |
|
| | The Painful Pitfalls of C++ STL Strings (ashvardanian.com) |
| 2 points by nalgeon on Aug 9, 2024 | past |
|
| | The Painful Pitfalls of C++ STL Strings (ashvardanian.com) |
| 1 point by ashvardanian on Aug 8, 2024 | past |
|
| | Binding a C++ Library to 10 Programming Languages (ashvardanian.com) |
| 3 points by nalgeon on June 16, 2024 | past | 1 comment |
|
| | NumPy vs. BLAS: Losing 90% of Throughput (ashvardanian.com) |
| 2 points by mpweiher on March 15, 2024 | past |
|
| | NumPy vs. BLAS: Losing 90% of Throughput (ashvardanian.com) |
| 2 points by polyrand on March 14, 2024 | past |
|
| | NumPy vs. BLAS: Losing 90% of Throughput (ashvardanian.com) |
| 4 points by ashvardanian on March 13, 2024 | past | 4 comments |
|
| | Mastering C++ with Google Benchmark (ashvardanian.com) |
| 4 points by signa11 on Feb 22, 2024 | past |
|
| | Mastering C++ with Google Benchmark (2022) (ashvardanian.com) |
| 2 points by todsacerdoti on Feb 21, 2024 | past |
|
| | What's Wrong with C++ Strings? (ashvardanian.com) |
| 7 points by ashvardanian on Feb 12, 2024 | past |
|