Hi Doug, I've personally grown fond of the perf tool that should be available for about any reasonable linux distro out there [1]; it's usage would be something like
sudo sysctl kernel/perf_event_paranoid=-1 ### note: I'm telling you to allow any user to introspect process behaviour. perf record -a python signal_source_to_head_to_null_sink.py perf report And the result would be something that gives you the time spent in individual symbols (aka. functions): 30.65% sig_source_c3 libgnuradio-analog-3.7.6.so.0.0.0 [.] gr::analog::sig_source_c_impl::work 3.99% head2 libc-2.20.so [.] __memcpy_sse2_unaligned 1.58% swapper [kernel.kallsyms] [k] intel_idle 1.28% python libc-2.20.so [.] _IO_getc 1.05% swapper [kernel.kallsyms] [k] __hrtimer_start_range_ns 0.83% swapper [kernel.kallsyms] [k] _raw_spin_unlock_irqrestore 0.80% python libpython2.7.so.1.0 [.] 0x00000000001109e4 0.69% python libpython2.7.so.1.0 [.] 0x0000000000110a34 0.65% head2 libgnuradio-runtime-3.7.6.so.0.0.0 [.] gr::block_executor::run_one_iteration 0.58% python libpython2.7.so.1.0 [.] PyObject_Malloc 0.53% python libpython2.7.so.1.0 [.] 0x000000000007eeab 0.53% python libpython2.7.so.1.0 [.] PyEval_EvalFrameEx 0.52% python libpython2.7.so.1.0 [.] PyDict_Next 0.47% swapper [kernel.kallsyms] [k] menu_select 0.44% swapper [kernel.kallsyms] [k] __schedule 0.44% python libpython2.7.so.1.0 [.] 0x0000000000110d94 0.42% head2 libpthread-2.20.so [.] pthread_mutex_unlock 0.41% swapper [kernel.kallsyms] [k] cpu_startup_entry 0.40% python ld-2.20.so [.] do_lookup_x and so forth. Clearly, my signal source is dominating the flow graph's CPU load. Memcpy comes in second, but strangely it uses the unaligned version, though it definitely will only copy at 64bit aligned addresses (complex float items, and the buffer should be page aligned). hm... Greetings, Marcus [1] disclaimer: I don't consider any distro to be *sane*, and few to be *reasonable* On 03/18/2015 12:06 AM, Anderson, Douglas J. wrote: > Hi all, > > I just finished writing a flowgraph with a few custom C++ blocks, but > when I connect it to a USRP N210 at about 25MS/s it's not too hard to > find a combo of parameters that will cause a sea of DDDDDDDDDDs to > come flooding into the term. > > I think there are some areas I can improve in my blocks but I want to > make sure I'm focusing on the worst-performing areas first. > > So my question is, is there an easy (or hard, I don't really care) way > to figure out which block in a flowgraph is causing the dropped > samples? I already checked that it's not an internet issue. > > Thanks in advance! > -Doug > > > _______________________________________________ > Discuss-gnuradio mailing list > [email protected] > https://lists.gnu.org/mailman/listinfo/discuss-gnuradio
_______________________________________________ Discuss-gnuradio mailing list [email protected] https://lists.gnu.org/mailman/listinfo/discuss-gnuradio
