[Egothor-tech] Capek and HTTP redirect

Leo Galambos leo.galambos at mff.cuni.cz
Mon Aug 8 16:14:02 BST 2005


Filip Koczorowski wrote:

> How does Capek the Site Crawler handle HTTP redirect codes (301 & 302)?
>
> I'm trying to run the robot on a set of redirecting links. Each link
> is a HTTP 302 response code with a short page containing valid link.
> It seems Capek however ignores the page when it gets non-200 HTTP code.
>
> Is there something wrong with my robot's configuration or does Egothor
> (Capek?) need a patch to handle redirect codes in a proper way?


Hi,

if the server sends 3xx ERR code, it would also send some link in HTTP
header (field "Location" IMHO). Such a link is followed, but the body of
the document is discarded - as Stuart noted.

Cheers,
Leo


More information about the Egothor-tech mailing list