[Egothor-tech] query problem, no results

HM hm at hmLyons.com
Fri Nov 19 21:02:19 GMT 2004


Hi All,

I've successfully crawled a site and indexed that site. Problem is, when I run the query,
everything looks okay comming from the System.out but, it never returns any hits. The
System.out looks like,

query-local:
skipping input as property query.string has already been set.
using query:  portal
using barrel: index
Query: portal
0
<?xml version="1.0" encoding="UTF-8"?>
<query><group required="no" prohibited="no" unknown="no" excluded="no"><term required="yes"
prohibited="no" unknown="yes" excluded="no" value="&lt;WORD&gt;portal" control="no" idf="-1.0"
boost="1"/></group></query>
BUILD SUCCESSFUL (total time: 0 seconds)



I figure it's likely to be either that the index didn't work properly (although the file sizes
in the 'index' folder look promising) or that I'm querying wrong.

One strange thing, when I change the barrel property in my ant script to a non-exsistant
directory, or leave it out all together, the System.out from the query task looks the same as
when I set the barrel to the 'index' folder.

Can anyone point me in the right direction?

(ant build.xml and rules follow)

My ant build.xml is,

<?xml version="1.0" encoding="UTF-8"?>
<project name="EgoThor tasks"
         default="crawl" basedir=".">

  <!-- Set up properties containing important project directories -->
  <property name="source.root" value="src"/>
  <property name="class.root" value="classes"/>
  <property name="lib.dir" value="lib"/>
  <property name="data.dir" value="data"/>
  <!-- <property name="index.barrel" value=""/> -->
  <property name="query.string" value="portal"/>

  <!-- Set up the class path for compilation and execution -->
  <path id="project.class.path">
      <!-- Include our own classes, of course -->
      <pathelement location="${class.root}" />
      <!-- Include jars in the project library directory -->
      <fileset dir="${lib.dir}">
        <include name="*.jar"/>
      </fileset>
  </path>

  <target name="crawl" description="Crawls the site">
    <delete includeEmptyDirs="true" failOnError="false">
        <fileset dir="linkdb"/>
        <fileset dir="corpus"/>
        <fileset dir="scheduler"/>
    </delete>
    <java classname="org.egothor.robot.Capek" fork="yes">
        <classpath refid="project.class.path"/>
        <!-- <arg value="-Degothor.rules.file=rules"/> -->
        <jvmarg value="-Degothor.server.pause=1ms"/>
        <arg value="http://192.168.1.205:80/Wiki/Wiki.jsp"/>
    </java>
  </target>

  <target name="index" description="Indexes the site">
    <java classname="org.egothor.apps.Michelangelo" fork="yes">
        <classpath refid="project.class.path"/>
    </java>
  </target>

  <taskdef name="querylocal" classname="org.egothor.ant.QueryLocal"
classpathref="project.class.path" />

  <target name="query-local">
    <input message="Enter query:" addproperty="query.string"/>
    <querylocal query="${query.string}" barrel="${index.barrel}" />
    <querylocal query="${query.string}" />
  </target>

    <target name="test-config" description="Validates the config for the site">
    <java classname="org.egothor.robot.Config" fork="yes">
        <classpath refid="project.class.path"/>
        <arg value="rules"/>
    </java>
  </target>


</project>


and rules is,
loop		2
valid http://192\.168\.1\.205.*


also, in the index folder,
state
1/

and the 1/ folder,
doc.btm
doc.dta
ils.dta
prx.dta
trm.dta
properties
doc.idx
trm.idx
doc.mta
depthrand.sep





-HM


More information about the Egothor-tech mailing list