Search results

Ryzen: Strictly technical

Linking to a different performance issue with non-temporal memory access just seemed to be asking for misunderstandings, and apparently also led to one. At least I can't see anything suggesting that AotS is compiled with Intel's compiler. I'm also interested in a source on the false dependency...
- icelight_
- Post #1,340
- Mar 30, 2017
- Forum: CPUs and Overclocking
Ryzen: Strictly technical

The linked Stack Overflow post doesn't really seem to be relevant to the Ryzen issue.
- icelight_
- Post #1,338
- Mar 30, 2017
- Forum: CPUs and Overclocking
Ryzen: Strictly technical

I'm not quite sure I follow. Why wouldn't it be possible to dispatch one complete AVX op (2 uops) and half of the next one (+ 1 = 3 uops)? Assuming fetch/decode is fast enough, there should be enough uops buffered, and if not, the wider dispatch wouldn't have helped anyway.
- icelight_
- Post #1,329
- Mar 30, 2017
- Forum: CPUs and Overclocking
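To make the question in the post above concrete, here is a minimal sketch that counts dispatch cycles for a stream of two-uop ops, with and without letting an op straddle two dispatch groups. The function name `dispatch_cycles`, the width of 3, and the op count are illustrative assumptions taken from the post's own arithmetic, not a description of how Ryzen or any real core dispatches.

```python
import math

def dispatch_cycles(num_ops, uops_per_op, width, allow_split):
    """Cycles needed to dispatch num_ops ops, each made of uops_per_op uops.

    Hypothetical model: `width` uops can be dispatched per cycle and the
    buffer in front of dispatch is always full (fetch/decode keeps up).
    """
    if allow_split:
        # Uops are packed back to back, so an op may straddle two cycles.
        return math.ceil(num_ops * uops_per_op / width)
    # Only whole ops may be dispatched together in one cycle.
    ops_per_cycle = width // uops_per_op
    if ops_per_cycle == 0:
        raise ValueError("a single op does not fit in one dispatch group")
    return math.ceil(num_ops / ops_per_cycle)

# Six 2-uop ops through an assumed 3-wide dispatch stage.
split = dispatch_cycles(6, uops_per_op=2, width=3, allow_split=True)
whole = dispatch_cycles(6, uops_per_op=2, width=3, allow_split=False)
print(split, whole)
```

Under these assumptions, allowing "one and a half" ops per cycle uses every dispatch slot, while requiring whole ops leaves one slot idle each cycle.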
Ryzen: Strictly technical

Why would using AVX issue more uops than normal operation? At least on Intel, most AVX instructions decompose into one or maybe two uops while performing 4 times the work (for 4x32 vectors). For equal throughput, pressure on the uop queue and retire queue would be reduced a lot. The bottleneck would...
- icelight_
- Post #1,326
- Mar 30, 2017
- Forum: CPUs and Overclocking
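The uop-count argument in the post above can be checked with simple arithmetic. The sketch below is only that: the helper `uops_needed`, the element count of 1024, and the one-or-two uops per instruction are assumptions for illustration, following the post's figures rather than any measured decode behaviour.

```python
import math

def uops_needed(elements, lanes, uops_per_instruction):
    """Uops required to process `elements` values, `lanes` values per instruction."""
    instructions = math.ceil(elements / lanes)
    return instructions * uops_per_instruction

# Scalar code: one element per instruction, one uop each (assumed).
scalar = uops_needed(1024, lanes=1, uops_per_instruction=1)
# 4-wide vector code decoding to a single uop per instruction.
vector_one_uop = uops_needed(1024, lanes=4, uops_per_instruction=1)
# 4-wide vector code decoding to two uops per instruction.
vector_two_uops = uops_needed(1024, lanes=4, uops_per_instruction=2)

print(scalar, vector_one_uop, vector_two_uops)
```

Even in the two-uop case, the vector version needs half as many uops as the scalar one for the same amount of work, which is the reduced queue pressure the post describes.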
Ryzen: Strictly technical

Intel also uses statically partitioned buffers according to the Intel 64 and IA-32 Architectures Optimization Reference Manual chapter 2.6.1. AMD actually shares more than Intel, namely the load queue and the ITLB.
- icelight_
- Post #1,010
- Mar 19, 2017
- Forum: CPUs and Overclocking
