Hello,

CUDA was discussed before, check the archives.

Best regards,
Tomasz Janeczko
amibroker.com

On 2010-08-22 21:52, TA wrote:


Thanks again. My last question. Have you tested AB to hand over some calculations to GPU (e.g., using CUDA)? Since GPU has its own FPU and memory. Sorry for all these layman question! I am really fascinated by all these stuff. TIA

*From:* [email protected] [mailto:[email protected]] *On Behalf 
Of *Tomasz Janeczko
*Sent:* Sunday, August 22, 2010 12:30 PM
*To:* [email protected]
*Subject:* Re: [amibroker] OT: mutlicore cpu

Hello,

"why you bought an i7 (6 core) cpu?"

Several reasons - my previous computer was 4 year old and it was 2 core. Buying anything less than i7 would not give me any visible gain. In, although new machine is faster, in everyday tasks it makes little difference (especially with the fact that Windows and other softwares add bloat faster than hardware evolves.) As to technical reasons - among other things - to do actual tests. 4 years ago I have written portions of AFL engine using OpenMP (parallel library) to test actual, real-world performance of parallel (multi-core) code vs single-core on AMD Athlon64x2 (2core) Last year, I bought i7 to re-run those tests on latest hardware. The conclusion is the same, fine-grain parallelism (the one that OpenMP supports) with 3:1 memory to FPU ratio makes no sense performance-wise. You need much more FPU/CPU calcs per single memory access to make it worthwhile.

With regards to buying new hardware: If you develop software, you need to have several platforms to test on to ensure smooth operation on every popular hardware. I am testing AmiBroker on everything starting from Intel Celeron 600MHz (10-year old notebook), AMD Athlon XP (single core), AMD Athlon64x2 (dual core), Intel Core 2 Duo (2 core), and ending with Intel i7 920 (6core).

Best regards,
Tomasz Janeczko
amibroker.com

On 2010-08-22 20:43, TA wrote:

    Thanks for clarification. Would you mind sharing with us why you bought an 
i7 (6 core) cpu? TIA

    *From:* [email protected] <mailto:[email protected]> 
[mailto:[email protected]] *On Behalf Of *Tomasz Janeczko
    *Sent:* Sunday, August 22, 2010 11:24 AM
    *To:* [email protected] <mailto:[email protected]>
    *Subject:* Re: [amibroker] OT: mutlicore cpu

    Hello,

    Video creation software is completely different. They do a lot of math *per 
pixel* (it means that lots of FPU operations are needed for single pixel), for
    example many algorithms use 8x8 pixel blocks 64*(4 bytes per pixel)= 
256bytes and do complex transform such as cosine transform. It means that lots 
of FPU
    instructions are done on very small blocks of memory that completely fit on 
Level 1 CPU cache and thus they are not able to saturate memory bandwidth.
    They literally do dozens of FPU ops per single RAM access. In fact they 
barely need to touch RAM at all. It is completely opposite to how AFL works, 
where
    there is usually 3 times more memory accesses (2 reads + one write) than 
FPU operations.

    And yes I do know what processors are on the market. In fact I do have i7 
(6 core) (I am writing this post using it).

    The only reason for multi threading in case of AFL would be not speed but 
asynchronous/parallel execution (ability to run AFL that takes long time in 
parallel
    (without blocking) other AFLs).

    Best regards,
    Tomasz Janeczko
    amibroker.com

    On 2010-08-22 01:19, TA wrote:

        TJ

        On many occasions you have written that the reason that you have not 
implement multicore usage in AB is that single symbol  data saturates the on-die
        cache and memory bandwidth. The tests that I have seen for video 
creation shows the apps that take advantage of multicore processes finish the 
tasks
        faster. Do you know why that is? Do you know why AMD & Intel are 
creating six core cpus (soon 8 core cpus)?TIA




Reply via email to