reproduce-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[task #15699] Unified format for software source code archival


From: Mohammad Akhlaghi
Subject: [task #15699] Unified format for software source code archival
Date: Thu, 18 Jun 2020 21:14:35 -0400 (EDT)
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:77.0) Gecko/20100101 Firefox/77.0

URL:
  <https://savannah.nongnu.org/task/?15699>

                 Summary: Unified format for software source code archival
                 Project: Reproducible paper template
            Submitted by: makhlaghi
            Submitted on: Fri 19 Jun 2020 02:14:33 AM BST
         Should Start On: Fri 19 Jun 2020 12:00:00 AM BST
   Should be Finished on: Fri 19 Jun 2020 12:00:00 AM BST
                Category: Software
                Priority: 5 - Normal
                  Status: Postponed
                 Privacy: Public
        Percent Complete: 0%
             Assigned to: None
             Open/Closed: Open
         Discussion Lock: Any
                  Effort: 0.00

    _______________________________________________________

Details:

Currently the software source codes are mostly stored in the same tarball that
they were originally distributed: most are '.tar.gz', some are '.tar.xz', and
some '.tar.lz' (mostly those I have had to re-package for special reasons),
and some even in '.zip'!

Since we are moving to use our own repository for software source codes by
default (task #15686), I am proposing to adopt a special storage format also.


In particular, I have been really impressed by Lzip
<http://www.nongnu.org/lzip/lzip.html> due to its elegance (arguably the
smallest program in Maneage!), and thus future-proof-ness, and excellent
compression ratio for source code: a '.tar.lz' is usually almost half the size
of a '.tar.gz' file. Its is such an elegant format, that you can even use its
manual to write the decompression algorithm! You don't even need to have the
Lzip software! Because of all this, it is also currently the first program
that Maneage builds.

So I am proposing to convert all source code that goes into the official
Maneage software repository to '.tar.lz' (like on Zenodo
<https://doi.org/10.5281/zenodo.3883409>, or our own maneage.org
<http://git.maneage.org/tarballs-software.git/tree/>) . Ofcourse the existing
source codes will remain untouched, this proposal only affects new software to
be added.

Furthermore, Antonio (author of Lzip who has guided me a lot over the last
four years), mentioned that "Another possible future problem is that GNU tar
does not create POSIX (ustar) archives by default (I think)". 

So besides adopting a fixed compression standard, we should also decide on a
good way to run GNU Tar to be as future-proof as possible.

Please share your thoughts here to hopefully converge on a good solution soon.




    _______________________________________________________

Reply to this item at:

  <https://savannah.nongnu.org/task/?15699>

_______________________________________________
  Message sent via Savannah
  https://savannah.nongnu.org/




reply via email to

[Prev in Thread] Current Thread [Next in Thread]