Re: Question about data structures

guile-user

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Question about data structures

From:	Taylan Kammer
Subject:	Re: Question about data structures
Date:	Mon, 23 Nov 2020 04:42:37 +0100
User-agent:	Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.5.0

On 22.11.2020 19:48, Zelphir Kaltstahl wrote:

Hello Guile Users!

I have a question about data structures.

[...]

How do you approach this problem? Is it a problem at all?

First of all, be cautious about premature optimization. In many casesit's best to just write the code the most straightforward way possiblewith the tools at hand, and not bother with optimization unless itactually proves to be an issue. Are you going to be processing fileswith millions of lines? Thousands of lines but on a very weak CPU?Does it matter if your program takes 0.1 seconds or 2 seconds to run?

Now the actual answer, in case you need to optimize, or just want tolearn more:

All data structures that offer a sequential list of elements have tomake some trade-offs between the performance of various operations, aswell as the implementation complexity. Linked lists (i.e. "lists" inScheme) are very simple, and a few operations are cheap as well, butthey have the shortcomings you've described plus some more.

Since your main concern seems to be appending, you could simply use alinked list where you keep a reference to the last cons pair (tail) ofthe list, so appending is simply a matter of a 'set-cdr!' operation onthe tail.

Python lists, JDK's ArrayList, and .NET ArrayList, among probably manyother "list" or "array" data structures in popular languages nowadaysuse a relatively straightforward data structure that is backed by anactual array which can have empty slots (e.g. your Python list with 3elements might be backed by an array of size 10), and is reallocatedwhenever there's no space left. This means that appending an element atthe end is usually dirt cheap, until there's no space left, at whichpoint the append operation is much heavier for one call, then thefollowing calls are dirt cheap again, until it's full again...

Inserting an element at the beginning or middle is also relativelyexpensive with those implementations, since all elements need to beshifted forward to make space for the new element. (Although this mightbe done with an operation like C's memcpy which is still actually veryfast.)


It's called a "dynamic array" by Wikipedia:

    https://en.wikipedia.org/wiki/Dynamic_array

If you want to go on an adventure, you could implement a Scheme datastructure called DVector that implements this strategy, using plainScheme vectors for the backing array.

The VList has also been mentioned in this thread, but from what I cantell it doesn't seem to offer a very efficient append operation.



- Taylan

[Prev in Thread]

Current Thread

[Next in Thread]

Question about data structures, Zelphir Kaltstahl, 2020/11/22
- Re: Question about data structures, divoplade, 2020/11/22
  - Re: Question about data structures, Zelphir Kaltstahl, 2020/11/22
    - Re: Question about data structures, divoplade, 2020/11/22
    - Re: Question about data structures, Zelphir Kaltstahl, 2020/11/27
  - Re: Question about data structures, kwright, 2020/11/22
- Re: Question about data structures, Tim Van den Langenbergh, 2020/11/22
- Re: Question about data structures, Taylan Kammer <=
  - Re: Question about data structures, John Cowan, 2020/11/22
  - Re: [EXT] Re: Question about data structures, Thompson, David, 2020/11/23
- Re: Question about data structures, Neil Jerram, 2020/11/23
- Re: Question about data structures, Dr. Arne Babenhauserheide, 2020/11/23

Prev by Date: Re: Guile dynamic FFI, C function expecting pointer
Next by Date: Re: Question about data structures
Previous by thread: Re: Question about data structures
Next by thread: Re: Question about data structures
Index(es):
- Date
- Thread