performance problem with unique on sparse arrays

octave-maintainers

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

performance problem with unique on sparse arrays

From:	John W. Eaton
Subject:	performance problem with unique on sparse arrays
Date:	Sun, 28 Feb 2010 12:33:51 -0500

The unique function performs poorly on sparse arrays:

  octave:1> x = sprand (1e5, 1, 0.3);
  octave:2> tic; unique (x); toc 
  Elapsed time is 10.8679 seconds.
  octave:3> tic; unique (full (x)); toc
  Elapsed time is 0.036339 seconds.

I "fixed" this problem with the following patch so that if unique is not
operating on rows and we don't need the index vectors, it just
converts the nonzero elements to a full vector instead of working on
the sparse array directly:

  http://hg.savannah.gnu.org/hgweb/octave/rev/1f11fabfa349

The real performance problem in unique appears to be the following two
operations:

  match = (y(1:n-1) == y(2:n));
  y(idx) = [];

It would be good to find a way to improve these, then maybe my quick
fix to unique could be removed and performance would also be improved
for the case when index values are requested.

Comments?

jwe

[Prev in Thread]

Current Thread

[Next in Thread]

performance problem with unique on sparse arrays, John W. Eaton <=
- Re: performance problem with unique on sparse arrays, David Bateman, 2010/02/28
  - Re: performance problem with unique on sparse arrays, John W. Eaton, 2010/02/28

Prev by Date: Re: new snapshot?
Next by Date: svds.m change to avoid test failure
Previous by thread: conv2 performance
Next by thread: Re: performance problem with unique on sparse arrays
Index(es):
- Date
- Thread