SSE Optimization on AMD CPU
I have written an assembler function in SSE to calculate Vector multiply Matrix .It works fine with an Intel Processor , cost only 30% time compare to the FLU assembler by VC8. But as to my AMD CPU(AthlonX2 3600+). It cost about double time than FLU. I tried 3DNOW,which worked even worse. Does AMD SIMD just work slow?
Can some one help me? Any suggestion is welcomed.
Re: SSE Optimization on AMD CPU
I am no expert, but you probably have to take into account the fact that AMD K-8's SSE unit is much slower than Intel C2D's, since it can process only 64-bit per clock cycle. Also, memory access pattern can be very influential factor. I was toying with some asm routines in linux kernel and have managed to accelerate them on K-8/K-10 just by removing a couple of pre-fetches that were supposed to lift performance on Intel...
|Tags: amd, athlonx2 3600, cpu, intel, sse|
|Thread Tools||Search this Thread|
|Similar Threads for: "SSE Optimization on AMD CPU"|
|Thread||Thread Starter||Forum||Replies||Last Post|
|Looking for some web optimization tools||Nimmee||Technology & Internet||4||28-08-2013 10:52 AM|
|Optimization in MySQL||Calan||Software Development||4||21-12-2010 01:35 AM|
|Does TechTool Pro fits for MAC optimization||Vandam||Windows Software||5||09-02-2010 11:09 AM|
|Best PC Optimization Tools||Maq.H||Reviews||2||21-01-2010 01:57 AM|
|Oracle optimization HELP!!!||spectre||Tips & Tweaks||1||21-06-2008 04:13 PM|