Results 1 to 3 of 3

Thread: SSE Optimization on AMD CPU

  1. #1
    Join Date
    Oct 2008
    Posts
    50

    SSE Optimization on AMD CPU

    Hi all

    I have written an assembler function in SSE to calculate Vector multiply Matrix .It works fine with an Intel Processor , cost only 30% time compare to the FLU assembler by VC8. But as to my AMD CPU(AthlonX2 3600+). It cost about double time than FLU. I tried 3DNOW,which worked even worse. Does AMD SIMD just work slow?

    Can some one help me? Any suggestion is welcomed.

  2. #2
    Join Date
    Mar 2008
    Posts
    198

    Re: SSE Optimization on AMD CPU

    I am no expert, but you probably have to take into account the fact that AMD K-8's SSE unit is much slower than Intel C2D's, since it can process only 64-bit per clock cycle. Also, memory access pattern can be very influential factor. I was toying with some asm routines in linux kernel and have managed to accelerate them on K-8/K-10 just by removing a couple of pre-fetches that were supposed to lift performance on Intel...

  3. #3
    Join Date
    Jan 2008
    Posts
    1,521

    Re: SSE Optimization on AMD CPU

    It would be easier to help if you post the source code

Similar Threads

  1. Looking for some web optimization tools
    By Nimmee in forum Technology & Internet
    Replies: 4
    Last Post: 28-08-2013, 10:52 AM
  2. Optimization in MySQL
    By Calan in forum Software Development
    Replies: 4
    Last Post: 21-12-2010, 01:35 AM
  3. Does TechTool Pro fits for MAC optimization
    By Vandam in forum Windows Software
    Replies: 5
    Last Post: 09-02-2010, 11:09 AM
  4. Best PC Optimization Tools
    By Maq.H in forum Reviews
    Replies: 2
    Last Post: 21-01-2010, 01:57 AM
  5. Oracle optimization HELP!!!
    By spectre in forum Tips & Tweaks
    Replies: 1
    Last Post: 21-06-2008, 04:13 PM

Tags for this Thread

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
Page generated in 1,711,661,365.90279 seconds with 16 queries