Hackernews posts about AVX512
AVX512 is a set of instructions that provides advanced vector processing capabilities for x86-64 processors, allowing for significant performance improvements in certain computing-intensive applications.
- Bit-permuting 16 u32s at once with AVX-512 (bitmath.blogspot.com)
- Bit-permuting 16 u32s at once with AVX-512 (bitmath.blogspot.com)
- Sign operation using VFIXUPIMM in AVX-512 (wunkolo.github.io)
- GCC Compiler vs. Human – 119x Faster Assembly (2023) (ashvardanian.com)
- Intel Publishes Fast AVX-512 Sorting Library, 10~17x Faster Sorts in NumPy (www.phoronix.com)
- tolower() with AVX-512 (dotat.at)
- The least interesting part about AVX-512 is the 512 bits vector width (mastodon.gamedev.place)
- Parsing integers quickly with AVX-512 (lemire.me)
- AVX512 intrinsics for JDK’s Arrays.sort methods (github.com)
- Zen5's AVX512 Teardown and More (www.numberworld.org)
- Intel Releases x86-SIMD-sort 2.0 With Faster AVX-512 Sorting, New Algorithms (www.phoronix.com)
- AMD Ryzen 7040 Series Shows Great AVX-512 Performance for Laptops / Mobile (www.phoronix.com)
- How to Use AVX512 in Golang (gorse.io)
- Intel AVX10: The Successor to AVX-512 (www.phoronix.com)
- Dynamic bit shuffle using AVX-512 (lemire.me)
- GCC Lands AVX-512 Fully-Masked Vectorization (www.phoronix.com)
- Fast modular squaring with AVX512IFMA (2018) [pdf] (eprint.iacr.org)
- Zen5's AVX512 Teardown and More (Without Redacted Content) (numberworld.org)
- AMD 4th Gen EPYC 9654 “Genoa” AVX-512 Performance Analysis (www.phoronix.com)
- Intel Downfall: New Vulnerability Affecting AVX2/AVX-512 (www.phoronix.com)
- x86 amd64 SIMD instruction list: SSE to AVX512 (www.officedaytime.com)
- Dav1d 1.1 Released with More AVX-512 Improvements (www.phoronix.com)