I think future models should separate out the "Intelligence" and "Knowledge" parts, rather than ever-increasing model sizes. The knowledge part can be terabytes and should not require training; it only needs processing into a machine-optimized format/database. Something like Brain + Library/...
A Ryzen 5950X does 7 tokens/sec for 8B and 4 tokens/sec for 14B models. LM Studio seems to limit the CPU thread count to 16; as I remember, I was able to set 32 earlier.
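Those numbers line up with CPU token generation being memory-bandwidth-bound rather than compute-bound. A rough sketch of the ceiling, assuming dual-channel DDR4-3200 (~51.2 GB/s theoretical peak) and roughly Q4 quantization (~0.56 bytes/parameter including overhead) — both figures are assumptions, not measurements:

```python
# Back-of-envelope: each generated token streams roughly the whole model
# through memory once, so tokens/sec <= bandwidth / model size in bytes.

def est_tokens_per_sec(params_billions: float,
                       bytes_per_param: float,
                       bandwidth_gbps: float) -> float:
    """Upper-bound estimate assuming one full pass over the weights per token."""
    model_bytes = params_billions * 1e9 * bytes_per_param
    return bandwidth_gbps * 1e9 / model_bytes

BW = 51.2          # assumed dual-channel DDR4-3200 peak, GB/s
BYTES_PER_PARAM = 0.56  # assumed Q4-ish quantization with overhead

for params in (8, 14):
    ceiling = est_tokens_per_sec(params, BYTES_PER_PARAM, BW)
    print(f"{params}B model: ~{ceiling:.1f} tok/s theoretical ceiling")
```

That gives roughly 11 tok/s for 8B and 6.5 tok/s for 14B as ceilings; observed 7 and 4 tok/s are plausible once real-world bandwidth falls short of peak, which also suggests why more than 16 threads wouldn't help much here.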
I think AMD missed a big free marketing opportunity with the delay of RDNA4. I would assume RDNA4 would have run circles around the 7900 XTX running local LLMs.
I don't disagree with your statement. However, I guess what we view as acceptable speed is different :) . For an 8B model I am getting 40 tokens/sec; for 14B, 7 tokens/sec.
The thing I love most about these models is not the actual answer but the thought process. It is very insightful.
I guess you used an x64 guest OS on an x64 host OS. For the most part this will run at native/near-native speed (it is virtualization, not emulation). However, it is not the same when you run an x64 OS on top of an ARM OS, where x64 instructions need to be translated to ARM instructions.
AMD should go one up on NVidia by adding custom fixed-function hardware to do 8x frame-gen :) . This would do similar damage to NVidia as Intel's QuickSync did to AMD's hopes for APUs in video compression.