- Jul 27, 2020
- 23,101
- 16,265
- 146
Here's your chance to spam a thread with all manner of AVX-512 related stuff, including
technical discussion, benchmarks,
software including libraries explicitly designed to take advantage of AVX-512 acceleration,
your personal experiences enjoying AVX-512 acceleration,
your future hardware/software wish list with AVX-512 support,
if you dropped the big bucks on Threadripper or server CPUs mainly for AVX-512 etc.
I'll start off with some links:
The origins of AVX-512: https://tomforsyth1000.github.io/papers/LRBNI origins v4 full fat.pdf
Benchmarking the performance and energy consumption of the AVX512 and VNNI instruction sets: https://addi.ehu.es/bitstream/handle/10810/58088/TFG_Jon_Arriaran.pdf?sequence=2
https://albertvilella.substack.com/p/intels-avx-512-use-cases-part1 (sadly requires subscription for full read)
technical discussion, benchmarks,
software including libraries explicitly designed to take advantage of AVX-512 acceleration,
your personal experiences enjoying AVX-512 acceleration,
your future hardware/software wish list with AVX-512 support,
if you dropped the big bucks on Threadripper or server CPUs mainly for AVX-512 etc.
I'll start off with some links:
The origins of AVX-512: https://tomforsyth1000.github.io/papers/LRBNI origins v4 full fat.pdf
Benchmarking the performance and energy consumption of the AVX512 and VNNI instruction sets: https://addi.ehu.es/bitstream/handle/10810/58088/TFG_Jon_Arriaran.pdf?sequence=2
Using the most unhinged AVX-512 instruction to make the fastest phrase search algo
Disclaimers before we start For those who don’t want to read/don’t care that much, here are the results. I hope after seeing them you are compelled to read. TL;DR: I wrote a super fast phrase search algorithm using AVX-512 and achieved wins up to 1600x the performance of Meilisearch. The source...
gab-menezes.github.io
Caching Energy Efficiency Data – Mobile and AVX-512
A few days ago, we looked at the power costs associated with moving data on some desktop parts.
chipsandcheese.com
https://albertvilella.substack.com/p/intels-avx-512-use-cases-part1 (sadly requires subscription for full read)
Fabian Giesen (@rygorous@mastodon.gamedev.place)
By request, my usual "the least interesting part about AVX-512 is the 512 bits vector width" infodump in thread form. So here goes, a laundry list of things introduced with AVX-512 that I think are way more important to typical use cases than the 512-bit vectors are:
mastodon.gamedev.place
Optimize for Intel® AVX-512 with or without Intel® AVX-512 hardware
Optimizing vectorization becomes increasingly important as new technology like AVX-512 becomes available. Software must be both efficiently threaded and efficiently vectorized to fully utilize modern hardware.
www.intel.com
GitHub - twest820/AVX-512: AVX-512 documentation beyond what Intel provides
AVX-512 documentation beyond what Intel provides. Contribute to twest820/AVX-512 development by creating an account on GitHub.
github.com
An Intro to AVX-512 Assembly Programming
Brief history of AVX-512, interesting feature sets, how-to tutorials and an introduction to assembly programming using Intel AVX-512 instructions.
www.physicsforums.com
Parallel Programming on a CPU with AVX-512
In this article, I describe a program that uses Intel AVX-512 assembly instructions and includes a comparison of the results from both programs.
www.physicsforums.com
AVX-512 Programming: Extracting Column Subtotals from a Table
In this Insights article I'll present an example that shows how Intel® AVX-512 instructions can be used to read a whole row of data in a single operation, and then generate subtotals.
www.physicsforums.com
Vectorization Opportunities for Improved Performance with Intel® AVX-512
Examples of How Intel® Compilers Can Vectorize and Speed Up Loops
www.codeproject.com
AVX512/VBMI2: A Programmer’s Perspective
SingleStore is a modern relational database for cloud and on-premises that delivers immediate insights for modern applications and analytical systems. Book a demo or trial today!
www.singlestore.com
FFmpeg developers claim 94x speed boost using handwritten AVX-512 code
The FFmpeg team recently highlighted how AVX-512 instructions can deliver a significant performance boost in video processing workloads. According to a slide presented by one of the...
www.techspot.com
Last edited: