freepooma-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [pooma-dev] KCC versus icc


From: Richard Guenther
Subject: Re: [pooma-dev] KCC versus icc
Date: Wed, 26 Feb 2003 20:27:12 +0100 (CET)

Hi!

I remember problems with the inliner, i.e. it refused to inline
some of the expression template machinery. You might want to search
for an option letting you tune the inlining behavior or try profile
directed optimizations. With standard -O3 icc is not always faster
than gcc3.2.2 with -O3.

Richard.

On Tue, 25 Feb 2003, Paul A. Renard wrote:

> Hope I'm asking the correct crowd...
>
> Given the following:
>
>  const int N=128;
>  Array<2,complex<float> > u(N,N);
>  Iota<2>::Iota_t ij(iota(u.domain());
>  Iota<2>::Index_t I(ij.comp(0));
>  Iota<2>::Index_t J(ij.comp(1));
>  Array<1,complex<float> > cx(N), cy(N);
>
>  // Values for u, cx, cy are filled elsewhere.
>
>  // Then the following is called:
>  void compute(){
>   u *= cx(I)*cy(J);    // runs 4X slower with icc than KCC
>  }
>
>  When I time this routine, I find that it runs about 4X slower when compiled 
> with
> Intel's icc (Version 7, -O3 -DNOPAssert -DNOCTASSERT) than with KCC (version 
> 4.0f,
> +K3 -DNOPAssert, -DNOCTAssert).  As expected, the KCC version runs as fast as
> hand-written loops.
>
> Do others observe this same sluggish behavior with icc?  Am I missing some 
> obvious
> compile flag? 
>
> thanks
> Paul
>

reply via email to

[Prev in Thread] Current Thread [Next in Thread]