[Egothor-tech] posting info and stemming

Peter Halacsy peter at halacsy.com
Sat Nov 27 18:17:05 GMT 2004


One more question. At indexing I'd like to store the stemmed and 
original form of a word on the same position. Suppose my stemmer stems 
the word "goes" to "go".

the input is :

as far as it goes -> as far as it goes/go

The query would be stemmed the same way. You search with: "goes" The 
sentence above should be ranked higher than the others containing the 
word "go"

Why to store in the same position? To be able to handle phrase and 
proximity searches as well.

I know I can do it: I should set the token's reloffset to zero if the 
token is a stemmed form of an original?

peter

**


More information about the Egothor-tech mailing list