[Egothor-tech] Questions about Egothor
Leo Galambos
Leo.Galambos at egothor.org
Thu Mar 10 19:50:28 GMT 2005
Marvin Humphrey wrote:
>> The major difference is that Lucene uses a classic algorithm for
>> index actualization while Egothor uses something more sophisticated.
>> That is why Egothor may operate on huge document collections more
>> effectively.
>
>
> Could you explain some of the differences?
Egothor counts which parts of an index are too slow and optimizes them
preferably, so that queries run quickly and smoothly. Lucene does not
know how much the parts are optimized, so that it always optimizes the
full index. Obviously, Lucene's strategy costs more time and I/O
operations. Moreover, Egothor manages less parts of an index and can
query them concurrently, Lucene queries the parts sequentially.
So the summary is: If you had a dynamic collection (=not static) and you
wanted to solve queries fast, Lucene would have to optimize the index
more frequently - and it would cost time and I/O operations. Other
differencies are not important for you unless you develop a distributed
search engine or your collection is pretty huge, i.e. the whole web, one
half of the web,... or so :).
Cheers,
Leo
--
Leo Galambos
Faculty of Mathematics and Physics, DSE
Malostranske namesti 25
Prague 1
CZE
http://kocour.ms.mff.cuni.cz/~galambos/
More information about the Egothor-tech
mailing list