Re: [fpc-pascal] FPC Graphics options?

Nikolay Nikolov Fri, 19 May 2017 04:17:45 -0700


On 05/19/2017 02:11 PM, Nikolay Nikolov wrote:

On 05/19/2017 03:54 AM, Ryan Joseph wrote:
On May 18, 2017, at 10:40 PM, Jon Foster<jon-li...@jfpossibilities.com> wrote:
62.44      1.33     1.33 fpc_frac_real
26.76      1.90     0.57 MATH_$$_FLOOR$EXTENDED$$LONGINT
10.33      2.12     0.22 FPC_DIV_INT64
Thanks for profiling this.
Floor is there as I expected and 26% is pretty extreme but the othersare floating point division? How does Java handle this so much betterthan FPC and what are the work arounds? Just curious. As it stands Ican only reason that I need to avoid dividing floats in FPC like theplague.
Java is a JVM, which generates bytecode, which isn't CPU specific andcomes with a JIT compiler, which compiles the bytecode to native code,when the program is run, so it can always make use of the instructionset, supported by the CPU you're using. But, of course, launching theapplication becomes much slower. In FPC, if you want to use SSE andavoid the x87 FPU, you have to compile with a specific compileroptions and forfeit the option for your executable to run on non-SSEcapable CPUs, because FPC generates native code. If you want to keepcompatibility and support modern instruction set extensions, you needto compile different executables for different instruction sets andmake a launcher .exe, which detects the CPU type and runs theappropriate executable. The default options for the i386 compiler isto target the Pentium CPU, which does not have SSE. This gives mostcompatibility and least performance, but that's what's appropriate formost users, because for most desktop applications, CPU speed is nolonger an issue. Only very specific tasks, such as software 3Drendering need high CPU performance, and people doing that stuff,usually know very well their compiler options and how to enablesupport for modern instruction extensions for maximum performance. Ofcourse, people coming from a Java background might not be used at allto having to do this kind of stuff, but it's really not that hard.

With all that said, I'm not saying that FPC still doesn't have room foroptimization, only the difference shown shouldn't be this huge, if youuse the capabilities of modern CPUs. fpc_frac_real is slow on modernCPUs, because it uses slow x87 code, instead of SSE. FPC_DIV_INT64 isslow, because it does 64-bit division on 32-bit CPUs, using an algorithmthat does use only 32-bit instructions. The fact that this procedure isa bottleneck in your code means that your code will benefit immensely ifcompiled for x86_64, which has a native 64-bit division instruction.


Nikolay
_______________________________________________
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/cgi-bin/mailman/listinfo/fpc-pascal

Re: [fpc-pascal] FPC Graphics options?

Reply via email to