[Egothor-tech] posting info and stemming
Peter Halacsy
peter at halacsy.com
Sat Nov 27 18:17:05 GMT 2004
One more question. At indexing I'd like to store the stemmed and
original form of a word on the same position. Suppose my stemmer stems
the word "goes" to "go".
the input is :
as far as it goes -> as far as it goes/go
The query would be stemmed the same way. You search with: "goes" The
sentence above should be ranked higher than the others containing the
word "go"
Why to store in the same position? To be able to handle phrase and
proximity searches as well.
I know I can do it: I should set the token's reloffset to zero if the
token is a stemmed form of an original?
peter
**
More information about the Egothor-tech
mailing list