savannah-hackers
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Savannah-hackers] submission of MG4J - savannah.nongnu.org


From: vigna
Subject: [Savannah-hackers] submission of MG4J - savannah.nongnu.org
Date: Tue, 04 Feb 2003 14:59:04 -0500
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.2) Gecko/20021202

A package was submitted to savannah.nongnu.org
This mail was sent to address@hidden, address@hidden


Sebastiano Vigna <address@hidden> described the package as follows:
License: gpl
Other License: 
Package: MG4J
System name: mg4j
Type: non-GNU

Description:
MG4J (Managing Gigabytes for Java) is a collaborative effort aimed at providing 
a free Java implementation of inverted-index compression techniques; as a 
by-product, it offers several general-purpose optimised classes, including fast 
&amp; compact mutable strings, bit-level I/O, (possibly signed) minimal perfect 
hashing, etc.

Generating full-text inverted indices for very large sets of documents
(say, beyond dozens of millions) is a nontrivial task. MG4J tries to make the 
techniques described in the book Managing Gigabytes, by Ian    Witten, Alistair 
Moffat and Timothy Bell, accessible without having to deal with bit-level 
operations in a clean, object-oriented environment.

You can find APIs, etc. at http://vigna.dsi.unimi.it/MG4J/

Other Software Required:
The COLT distribution (http://tilde-hoschek.home.cern.ch/~hoschek/colt/)
fastUtil (http://vigna.dsi.unimi.it/fastUtil/)

Other Comments:






reply via email to

[Prev in Thread] Current Thread [Next in Thread]