[Egothor-tech] Questions about Egothor

Leo Galambos Leo.Galambos at egothor.org
Thu Mar 10 19:50:28 GMT 2005


Marvin Humphrey wrote:

>> The major difference is that Lucene uses a classic algorithm for 
>> index actualization while Egothor uses something more sophisticated. 
>> That is why Egothor may operate on huge document collections more 
>> effectively.
>
>
> Could you explain some of the differences?


Egothor counts which parts of an index are too slow and optimizes them 
preferably, so that queries run quickly and smoothly. Lucene does not 
know how much the parts are optimized, so that it always optimizes the 
full index. Obviously, Lucene's strategy costs more time and I/O 
operations. Moreover, Egothor manages less parts of an index and can 
query them concurrently, Lucene queries the parts sequentially.

So the summary is: If you had a dynamic collection (=not static) and you 
wanted to solve queries fast, Lucene would have to optimize the index 
more frequently - and it would cost time and I/O operations. Other 
differencies are not important for you unless you develop a distributed 
search engine or your collection is pretty huge, i.e. the whole web, one 
half of the web,... or so :).

Cheers,
Leo

-- 
Leo Galambos
Faculty of Mathematics and Physics, DSE
Malostranske namesti 25
Prague 1
CZE

http://kocour.ms.mff.cuni.cz/~galambos/




More information about the Egothor-tech mailing list