Dumper utility
If you want to dump out the content of a barrel, you can use org.egothor.test.Dumper program.
$ java org.egothor.test.Dumper /tmp/txt/index/1/
0 {L=Location of this doc is on my HDD..., T=Title of the document..., S=Some summary...}
1 {L=Location of this doc is on my HDD 2..., T=Title of the document 2..., S=Some summary 2...}
<WORD>document 2
0 1: 3
1 1: 3
<WORD>is 1
0 1: 1
<WORD>my 2
0 1: 2
1 1: 2
<WORD>this 2
0 1: 0
1 1: 0
<WORD>was 1
1 1: 1
The barrel above has 2 documents and 5 tokens. For instance, the token
this appears in two documents. In documents no. 0 and 1, it appears just one time at offset position 0.
You can also print out some of the data showed above. This can be achieved by a switch which sets the respective flags for printing out:
- documents - switch "D",
- inverted lists - switch "L",
- tokens/words - switch "W",
- list of occurencies - switch "P".
If the flag is in lower case, the respective category is not printed.
$ java org.egothor.test.Dumper -DlWp /tmp/txt/index/1/
0 {L=Location of this doc is on my HDD..., T=Title of the document..., S=Some summary...}
1 {L=Location of this doc is on my HDD 2..., T=Title of the document 2..., S=Some summary 2...}
<WORD>document 2
<WORD>is 1
<WORD>my 2
<WORD>this 2
<WORD>was 1
See also: Expand
-- LeoGalambos - 15 Mar 2004