[Egothor-tech] Duplicate default pages
Stuart David Lewis [sdl]
sdl at aber.ac.uk
Mon Apr 4 13:55:49 BST 2005
>>>http://www.aber.ac.uk/en/student/travel/
>>>http://www.aber.ac.uk/en/student/travel/index.php
>>>
>>>Could Egothor be improved to know that these are the same?
> I think you could use the AntiSpam feature, which is implemented
> in Oracul (see the AntiSpam topic in the twiki).
> www.badboyz.example.com:80 - 2 # you can also increase the values
> domain www.goodboys.example.com:80 + 2 # the same can be applied on specific URLs
> url http://www.badboyz.example.com:80/stupidpage.html = 0
> url http://www.goodboys.example.com:80/greatresource.html + 5
Sorry to be stupid, but how does this affect the output? Are entries
with higher scores ranked above those with lower scores? This might
solve the problem where there are many results, but what if there
are only two results (/ and /index.xyz)?
How easy would it be to write a script to iterate over the index file(s) and
remove entries ending in one of a pre-defined set of default pages (e.g.
index.* or default.*) if there is a matching entry with just the
trailing slash? Just a thought.
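
As a rough sketch of the idea above, assuming the indexed URLs can be
exported as a plain list of strings (I do not know the actual index file
format, so this does not read or write it). The pattern list and the
function names are hypothetical, chosen only for illustration:

```python
import fnmatch
from urllib.parse import urlsplit

# Hypothetical set of default page patterns; adjust to match the web server.
DEFAULT_PAGE_PATTERNS = ("index.*", "default.*")


def directory_url_for(url):
    """Return the trailing-slash URL if `url` names a default page, else None."""
    parts = urlsplit(url)
    if parts.query:
        # index.php?id=3 may be a different page, so leave it alone.
        return None
    directory, _, filename = parts.path.rpartition("/")
    if not filename:
        # Already a directory URL (or no path at all).
        return None
    name = filename.lower()
    if any(fnmatch.fnmatchcase(name, p) for p in DEFAULT_PAGE_PATTERNS):
        return f"{parts.scheme}://{parts.netloc}{directory}/"
    return None


def drop_duplicate_default_pages(urls):
    """Drop default-page URLs only when the matching '/' entry is also present."""
    known = set(urls)
    kept = []
    for url in urls:
        directory_url = directory_url_for(url)
        if directory_url is not None and directory_url in known:
            continue  # duplicate of an entry with just the trailing slash
        kept.append(url)
    return kept


if __name__ == "__main__":
    sample = [
        "http://www.aber.ac.uk/en/student/travel/",
        "http://www.aber.ac.uk/en/student/travel/index.php",
    ]
    for url in drop_duplicate_default_pages(sample):
        print(url)
```

Note that an index.* entry is kept when there is no matching entry with
just the trailing slash, so a page is never removed unless its duplicate
is still present.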
Stuart