lzip
Developer(s) | Antonio Diaz Diaz |
---|---|
Initial release | 2008 |
Repository | none |
Written in | C++ or C |
Operating system | Unix-like, Windows, Android |
Type | Data compression |
License | GPLv2+ (Free software) |
Website | www |
Filename extension |
.lz |
---|---|
Internet media type |
application/lzip |
Magic number | 0x4C, 0x5A, 0x49, 0x50 |
Developed by | Antonio Diaz Diaz |
Type of format | Data compression |
Open format? | Yes |
lzip is a free, command-line tool for the compression of data; it employs the Lempel–Ziv–Markov chain algorithm (LZMA) with a user interface that is familiar to users of usual Unix compression tools, such as gzip and bzip2.
Like gzip and bzip2, concatenation is supported to compress multiple files, but the convention is to bundle a file that is an archive itself, such as those created by the tar or cpio Unix programs. Lzip can split the output for the creation of multivolume archives.
The file that is produced by lzip is usually given .lz
as its filename extension, and the data is described by the media type application/lzip
.
The lzip suite of programs was written in C++ and C by Antonio Diaz Diaz and is being distributed as free software under the terms of version 2 or later of the GNU General Public License (GPL).
History
7-Zip was released in 2000; a tool employing LZMA first became available on Unix-like operating systems in 2004 when a port of the command-line version of 7-Zip (p7zip) was released. In the same year, the LZMA SDK became available, which included the program called “lzma_alone”; less than a year later, Lasse Collin released LZMA Utils, which at first only consisted of a set of wrapper scripts implementing a gzip-like interface to lzma_alone. In 2008, Antonio Diaz Diaz released lzip, which uses a container format with checksums and magic numbers instead of the raw LZMA data stream, providing a complete Unix-style solution for using LZMA. Nevertheless, LZMA Utils was extended to have similar features and then renamed to XZ Utils.[1]
Features
File integrity
lzip is capable of creating archives with independently decompressible data sections called a "multimember archive" (as well as split output for the creation of multivolume archives).[2] For example, if the underlying file is a tar archive, this can allow extracting any undamaged files, even if other parts of the archive are damaged.
As for the file format, special emphasis has been put on enabling integrity checks by means of an integrated 32-bit checksum for each compressed stream;[3] this is used in combination with the lziprecover program to detect and reconstruct damaged data. This recovery tool can merge multiple copies of an archive where each copy may have damage in a different part of the file.[2]
Parallelism
lzip has two parallel interfaces provided in the default distribution.[4]
plzip
compresses any file in a parallel way. Using it withtar
is insufficient, since the conventionaltar
program needs the entire stream before a file to locate it for decompression, resulting in non-parallel extraction.tarlz
combinestar
andlzip
into a parallel archiver much like modern archivers like RAR or 7-Zip. The solid compression blocks align withtar
file boundaries, so extracting a file only requires decompressing that particular member block.
Adoption
Availability
In popular Linux distributions, lzip can usually be installed from official package repositories.[5][6][7]
Cygwin offers lzip as a maintained optional package (Archive category of its setup installer), and its GNU tar utility program has support for .lz archives (with --lzip option for creation). MinGW-w64 distributes lzip through a maintained package in MSYS2 (pacman -S lzip).
Support
- The GNU Autotools support lzip. Adding
dist-lzip
toAM_INIT_AUTOMAKE
will build lzip-ed tarballs.[8] - Versions 1.23 and newer of GNU tar support using lzip to handle compressed files transparently.[9]
- GNOME's archiving tool, Archive Manager, supports lzip files.
Application
- The Linux distribution Dragora GNU/Linux-Libre employs lzip for its software packages.
- Lzip is used to distribute the Time Zone Database from IANA,[10] and the GNU version of the Linux kernel.[11]
- The European Parliament publishes complete dumps of its database in JSON format compressed with lzip.[12]
- Lzip is used in NASA's Planetary Data System (PDS).[13]
See also
References
- ^ Brian Lindholm (May 2009), "New Options in the World of File Compression", Linux Gazette (in German), no. 162, retrieved 2011-01-07
- ^ a b Antonio Diaz Diaz (2011-12-20). "Lzip Manual: Introduction".
Lzip can produce multimember files and safely recover, with lziprecover, the undamaged members in case of file damage. Lzip can also split the compressed output in volumes of a given size, even when reading from standard input. This allows the direct creation of multivolume compressed tar archives.
- ^ Antonio Diaz Diaz (2011-12-20). "Lzip Manual: Introduction".
As a self-check for your protection, lzip stores in the member trailer the 32-bit CRC of the original data and the size of the original data, to make sure that the decompressed version of the data is identical to the original.
- ^ "The lzip format". www.nongnu.org.
- ^ "Debian -- Package Search Results -- lzip". packages.debian.org.
- ^ "Fedora Package Database -- lzip". Archived from the original on 2011-07-17. Retrieved 2011-01-08.
- ^ "Search". software.opensuse.org.
- ^ "List of Automake options (automake)". www.gnu.org.
- ^ "GNU tar 1.32: 8.1.1 Creating and Reading Compressed Archives". www.gnu.org.
- ^ "IANA — Time Zone Database". www.iana.org.
- ^ "Index of /gnu/linux-libre/4.x/4.12-gnu". ftp.gnu.org.
- ^ "Dumps | [ParlTrack]". parltrack.org.
- ^ "NASA PDS". pds.nasa.gov.