Re: [Discuss-gnuradio] Re-writing blocks using intel libraries

discuss-gnuradio

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Discuss-gnuradio] Re-writing blocks using intel libraries

From:	Tom Rondeau
Subject:	Re: [Discuss-gnuradio] Re-writing blocks using intel libraries
Date:	Wed, 12 Dec 2007 10:19:15 +0000
User-agent:	Thunderbird 2.0.0.9 (Windows/20071031)

Martin Dvh wrote:

Eric Blossom wrote:

On Tue, Dec 11, 2007 at 03:41:46PM -0800, Eugene Grayver wrote:
Please see answers in-line.

Thanks!
General curiosity questions:
  Are you using oprofile to measure performance?
I am a bit of a maverick, and for various reasons am using a pure C++environment. I hacked my own 'connect_block' function (can;t wait forv3.2, where these will be part of native gr).
The trunk contains C++ code for connect, hier_block2, etc.  Some of
the pieces that are still missing include C++ support for the USRP
daughterboards, but Johnathan Corgan is working on that now.
I am measuring the performance using a custom block (gr_throughput)
that simply reports the average number of samples processed per
second.
What h/w platform are you running on / tuning for?
The platform is currently Intel Xeon or Core2 Duo.

  You're not trying to run your app on a cache-crippled machine like a
  Celeron, are you?  ;)

No, very high end.

  Which blocks are causing you the biggest problem?

I got a 2x improvement on all the filtering blocks.
If these are FIR filters, were you using gr_fft_filter_{fff,ccc}
or the gr_fir_filter* blocks?  The FFT one's are _much_ faster with a
break-even point around 16 taps IIRC.
About a 40% improvement for sine/cosine generation blocks.  This
includes gr_expj, gr_rotate.
No surprise there, and that's a great example of SIMD code that should
be in GNU Radio.
  Are your problems caused primarily by lack of CPU cycles, cache
  misses or mis-predicted branches?
I am not sure, since I am not at all a software expect (mostly dsp/comm).My guess is that the SSE instructions are not being used (or not used to afull extent). Even the 'multiply' block is VERY slow compared to a vectorx vector multiplication in the Intel library.
OK.
Some of the gr_blocksprocess each sample using a separate function call (e.g.for (n=0; n<noutput_samples; n++)
        scale(in[n])

Replacing this with a single vectorized function call is much faster.
OK.
We would not accept the changes.
That's what I expected. We'll try to contribute the more dsp-centricblocks such as demodulators.
That would be great!  Or if you want to code up an SSE Taylor series
expansion for sine/cosine good to 23-bits or so, we'd love that too ;)

I am working on this in the little spare time I have.
I already got a SSE taylor series for atan2, working in gnuradio.
The atan2 needs some code cleanup and wrapper code to switch implementations (if 
(processor=X86, processor supports_SSE2)=>optimized else generic)
The sin/cos is far from ready.

Greetings,
Martin


Martin,

Bob put in a fast atan function (general/gr_fast_atan2f.cc) about a yearago. Have you looked in this, and is the Taylor performance better?


We really need a faster sin/cos. Glad to hear you're working on it.

Tom

[Prev in Thread]

Current Thread

[Next in Thread]

[Discuss-gnuradio] Re-writing blocks using intel libraries, Eugene Grayver, 2007/12/11
- Re: [Discuss-gnuradio] Re-writing blocks using intel libraries, Eric Blossom, 2007/12/11
  - Re: [Discuss-gnuradio] Re-writing blocks using intel libraries, Eugene Grayver, 2007/12/11
    - Re: [Discuss-gnuradio] Re-writing blocks using intel libraries, Dan Halperin, 2007/12/11
    - Re: [Discuss-gnuradio] Re-writing blocks using intel libraries, Eric Blossom, 2007/12/11
    - Re: [Discuss-gnuradio] Re-writing blocks using intel libraries, Eric Blossom, 2007/12/11
    - Re: [Discuss-gnuradio] Re-writing blocks using intel libraries, Martin Dvh, 2007/12/11
    - Re: [Discuss-gnuradio] Re-writing blocks using intel libraries, Tom Rondeau <=
    - Re: [Discuss-gnuradio] Re-writing blocks using intel libraries, Martin Dvh, 2007/12/12
    - Re: [Discuss-gnuradio] Re-writing blocks using intel libraries, Matt Ettus, 2007/12/11

Prev by Date: [Discuss-gnuradio] scope with cygwin
Next by Date: [Discuss-gnuradio] can BasicRx daughterboard receive FM and AM broadcast?
Previous by thread: Re: [Discuss-gnuradio] Re-writing blocks using intel libraries
Next by thread: Re: [Discuss-gnuradio] Re-writing blocks using intel libraries
Index(es):
- Date
- Thread