Hi Thomas,

in case you'd like to add it to your benchmark results, below is a run
of trx-bench on a Debian stable (jessie) system on a G-T40E CPU @ 800MHz.

What I find surprising is that it appears to be comparable to the Atom
D2500 @ 1.86GHz, despite the Atom running at more than twice the clock
rate...

For more details, see the attachments.

Regards,
        Harald
-- 
- Harald Welte <[email protected]>           http://laforge.gnumonks.org/
============================================================================
"Privacy in residential applications is a desirable marketing option."
                                                  (ETSI EN 300 175-7 Ch. A6)
processor       : 0
vendor_id       : AuthenticAMD
cpu family      : 20
model           : 2
model name      : AMD G-T40E Processor
stepping        : 0
microcode       : 0x5000101
cpu MHz         : 800.000
cache size      : 512 KB
physical id     : 0
siblings        : 2
core id         : 0
cpu cores       : 2
apicid          : 0
initial apicid  : 0
fpu             : yes
fpu_exception   : yes
cpuid level     : 6
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov 
pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb 
rdtscp lm constant_tsc rep_good nopl nonstop_tsc extd_apicid aperfmperf pni 
monitor ssse3 cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a 
misalignsse 3dnowprefetch ibs skinit wdt arat hw_pstate npt lbrv svm_lock 
nrip_save pausefilter vmmcall
bogomips        : 2000.01
TLB size        : 1024 4K pages
clflush size    : 64
cache_alignment : 64
address sizes   : 36 bits physical, 48 bits virtual
power management: ts ttp tm stc 100mhzsteps hwpstate

processor       : 1
vendor_id       : AuthenticAMD
cpu family      : 20
model           : 2
model name      : AMD G-T40E Processor
stepping        : 0
microcode       : 0x5000101
cpu MHz         : 800.000
cache size      : 512 KB
physical id     : 0
siblings        : 2
core id         : 1
cpu cores       : 2
apicid          : 1
initial apicid  : 1
fpu             : yes
fpu_exception   : yes
cpuid level     : 6
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov 
pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb 
rdtscp lm constant_tsc rep_good nopl nonstop_tsc extd_apicid aperfmperf pni 
monitor ssse3 cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a 
misalignsse 3dnowprefetch ibs skinit wdt arat hw_pstate npt lbrv svm_lock 
nrip_save pausefilter vmmcall
bogomips        : 2000.01
TLB size        : 1024 4K pages
clflush size    : 64
cache_alignment : 64
address sizes   : 36 bits physical, 48 bits virtual
power management: ts ttp tm stc 100mhzsteps hwpstate

[+] Testing: GSM xCCH (non-recursive, flushed, not punctured)
[.] Input length  : ret = 224  exp = 224 -> OK
[.] Output length : ret = 456  exp = 456 -> OK
[.] Pre computed vector checks:
[..] Encoding: OK
[..] Decoding base: 
[..] Decoding SIMD: 
[..] Code N 2
[..] Code K 5
[..] Decoding: OK

[.] BER tests:
[..] Testing base:
[..] Input BER.......................... 0.023499
[..] Output FER......................... 0.004800
[..] Testing SIMD:
[..] Input BER.......................... 0.023390
[..] Output FER......................... 0.000000

[.] Performance benchmark:
[..] Testing base:
[..] Encoding / Decoding 10000 bursts on 1 thread(s):
[..] Elapsed time....................... 3.427718 secs
[..] Rate............................... 1.330331 Mbps
[..] Testing SIMD:
[..] Encoding / Decoding 10000 bursts on 1 thread(s):
[..] Elapsed time....................... 0.425603 secs
[..] Rate............................... 10.714210 Mbps
[..] Speedup............................ 8.053792

[+] Testing: GSM TCH/AFS 7.95 (recursive, flushed, punctured)
[.] Input length  : ret = 165  exp = 165 -> OK
[.] Output length : ret = 448  exp = 448 -> OK
[.] Pre computed vector checks:
[..] Encoding: OK
[..] Decoding base: 
[..] Decoding SIMD: 
[..] Code N 3
[..] Code K 7
[..] Decoding: OK

[.] BER tests:
[..] Testing base:
[..] Input BER.......................... 0.023450
[..] Output FER......................... 0.000300
[..] Testing SIMD:
[..] Input BER.......................... 0.023444
[..] Output FER......................... 0.000000

[.] Performance benchmark:
[..] Testing base:
[..] Encoding / Decoding 10000 bursts on 1 thread(s):
[..] Elapsed time....................... 11.293366 secs
[..] Rate............................... 0.396693 Mbps
[..] Testing SIMD:
[..] Encoding / Decoding 10000 bursts on 1 thread(s):
[..] Elapsed time....................... 0.686461 secs
[..] Rate............................... 6.526227 Mbps
[..] Speedup............................ 16.451577

--- Floating point to integer conversions
-- Testing 40000 iterations of 12480 values
- Measuring conversion time
- Elapsed time base...                 2.548129 secs
- Validating SSE conversion results... PASS
- Measuring conversion time
- Elapsed time SSE ...                 0.960012 secs
- Quotient...                          2.654268

-- Testing 40000 iterations of 3120 values
- Measuring conversion time
- Elapsed time base...                 0.640163 secs
- Validating SSE conversion results... PASS
- Measuring conversion time
- Elapsed time SSE ...                 0.191270 secs
- Quotient...                          3.346908

--- Skipping integer-float conversions (SSE4 only)

Convolving 156 * 4 samples for 10000 iterations
-----------------
Filter length 4
Base real-complex elapsed time... 0.372130 secs
SSE real-complex elapsed time... 0.241270 secs
Quotient 1.542380

Base complex-complex elapsed time... 0.521219 secs
SSE complex-complex elapsed time... 0.382408 secs
Quotient 1.362992

Filter length 8
Base real-complex elapsed time... 0.573190 secs
SSE real-complex elapsed time... 0.326938 secs
Quotient 1.753207

Base complex-complex elapsed time... 0.873947 secs
SSE complex-complex elapsed time... 0.602019 secs
Quotient 1.451693

Filter length 12
Base real-complex elapsed time... 0.776067 secs
SSE real-complex elapsed time... 0.396321 secs
Quotient 1.958178

Base complex-complex elapsed time... 1.226749 secs
SSE complex-complex elapsed time... 0.752869 secs
Quotient 1.629432

Filter length 16
Base real-complex elapsed time... 0.977423 secs
SSE real-complex elapsed time... 0.497255 secs
Quotient 1.965637

Base complex-complex elapsed time... 1.579161 secs
SSE complex-complex elapsed time... 0.922173 secs
Quotient 1.712435

Filter length 20
Base real-complex elapsed time... 1.179777 secs
SSE real-complex elapsed time... 0.585324 secs
Quotient 2.015596

Base complex-complex elapsed time... 1.931793 secs
SSE complex-complex elapsed time... 1.104140 secs
Quotient 1.749591

Filter length 24
Base complex-complex elapsed time... 2.284363 secs
SSE complex-complex elapsed time... 1.242570 secs
Quotient 1.838418

Filter length 32
Base complex-complex elapsed time... 3.125769 secs
SSE complex-complex elapsed time... 1.562846 secs
Quotient 2.000049

Filter length 64
Base complex-complex elapsed time... 5.945989 secs
SSE complex-complex elapsed time... 2.844491 secs
Quotient 2.090353

Filter length 128
Base complex-complex elapsed time... 11.598979 secs
SSE complex-complex elapsed time... 5.407983 secs
Quotient 2.144788
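
For anyone comparing these numbers against another machine, the rate and
speedup lines in the two convolutional-code benchmarks above can be
cross-checked from the elapsed times alone. The sketch below is not part of
trx-bench. It assumes the rate is bursts times output bits per burst (456
for xCCH, 448 for TCH/AFS 7.95) divided by elapsed time, and that the
speedup is base time divided by SIMD time. The function names are
hypothetical.

```python
# Cross-check of the rate and speedup figures in the log above.
# Assumption (not taken from the trx-bench source): the rate is
# bursts * output bits per burst / elapsed seconds, and the speedup
# is base elapsed time / SIMD elapsed time.

def rate_mbps(bursts, bits_per_burst, elapsed_secs):
    """Throughput in Mbit/s for a run of `bursts` bursts."""
    return bursts * bits_per_burst / elapsed_secs / 1e6


def speedup(base_secs, simd_secs):
    """How many times faster the SIMD run was than the base run."""
    return base_secs / simd_secs


# (output bits per burst, base elapsed secs, SIMD elapsed secs),
# copied from the benchmark output above.
runs = {
    "GSM xCCH": (456, 3.427718, 0.425603),
    "GSM TCH/AFS 7.95": (448, 11.293366, 0.686461),
}

for name, (bits, base, simd) in runs.items():
    print(name)
    print("  base rate: %.6f Mbps" % rate_mbps(10000, bits, base))
    print("  SIMD rate: %.6f Mbps" % rate_mbps(10000, bits, simd))
    print("  speedup:   %.6f" % speedup(base, simd))
```

Under those assumptions the printed values should agree with the logged
rates and speedups to within rounding of the elapsed times.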
