r3 - 27 Apr 2004 - 13:21:14 - LeoGalambos

Dumper utility

To dump the contents of a barrel, use the org.egothor.test.Dumper program.

$ java org.egothor.test.Dumper /tmp/txt/index/1/
0 {L=Location of this doc is on my HDD..., T=Title of the document..., S=Some summary...}
1 {L=Location of this doc is on my HDD 2..., T=Title of the document 2..., S=Some summary 2...}
<WORD>document 2
0 1:  3
1 1:  3
<WORD>is 1
0 1:  1
<WORD>my 2
0 1:  2
1 1:  2
<WORD>this 2
0 1:  0
1 1:  0
<WORD>was 1
1 1:  1

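The barrel above has 2 documents and 5 tokens. Each <WORD> line gives a token followed by the number of documents that contain it. Each line below it gives a document number, the number of occurrences in that document and, after the colon, the position of each occurrence. For instance, the token "this" appears in two documents: in both document 0 and document 1 it occurs exactly once, at position 0.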
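
The line layout described above can be read back with a few lines of Java. This is only a minimal sketch: the class PostingLine is hypothetical and not part of Egothor, and it assumes that a line with several occurrences lists its positions separated by whitespace after the colon, which the sample output does not show.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Egothor: holds one parsed occurrence line.
class PostingLine {
    final int docId;
    final int frequency;
    final List<Integer> positions;

    PostingLine(int docId, int frequency, List<Integer> positions) {
        this.docId = docId;
        this.frequency = frequency;
        this.positions = positions;
    }

    // Parses a line such as "0 1:  3" under the assumed layout
    // "<document number> <occurrence count>: <position> <position> ...".
    static PostingLine parse(String line) {
        int colon = line.indexOf(':');
        if (colon < 0) {
            throw new IllegalArgumentException("not an occurrence line: " + line);
        }
        String[] head = line.substring(0, colon).trim().split("\\s+");
        if (head.length != 2) {
            throw new IllegalArgumentException("expected two numbers before ':' in: " + line);
        }
        int docId = Integer.parseInt(head[0]);
        int frequency = Integer.parseInt(head[1]);

        // Assumption: positions are whitespace-separated after the colon.
        List<Integer> positions = new ArrayList<>();
        String tail = line.substring(colon + 1).trim();
        if (!tail.isEmpty()) {
            for (String p : tail.split("\\s+")) {
                positions.add(Integer.parseInt(p));
            }
        }
        return new PostingLine(docId, frequency, positions);
    }

    public static void main(String[] args) {
        PostingLine p = parse("0 1:  3");
        // prints: doc=0 freq=1 positions=[3]
        System.out.println("doc=" + p.docId + " freq=" + p.frequency + " positions=" + p.positions);
    }
}
```

The parser rejects lines without a colon, so <WORD> lines and document lines have to be filtered out before calling it.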

You can also print only part of the data shown above. A switch sets the flags that select which categories are printed:

  • documents - switch "D",
  • inverted lists - switch "L",
  • tokens/words - switch "W",
  • list of occurrences - switch "P".
If a flag is given in lower case, the respective category is not printed.

$ java org.egothor.test.Dumper -DlWp /tmp/txt/index/1/
0 {L=Location of this doc is on my HDD..., T=Title of the document..., S=Some summary...}
1 {L=Location of this doc is on my HDD 2..., T=Title of the document 2..., S=Some summary 2...}
<WORD>document 2
<WORD>is 1
<WORD>my 2
<WORD>this 2
<WORD>was 1
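
The upper/lower-case convention of the switch can be sketched as follows. This is an illustration only, not Dumper's actual code: the class DumpFlags and its Category enum are hypothetical, and the sketch assumes that a category whose letter is absent from the switch stays enabled, since running Dumper without any switch prints everything.

```java
import java.util.EnumSet;
import java.util.Set;

// Hypothetical illustration of the flag convention; not taken from Egothor.
class DumpFlags {
    enum Category {
        DOCUMENTS('D'), INVERTED_LISTS('L'), WORDS('W'), OCCURRENCES('P');

        final char letter;

        Category(char letter) {
            this.letter = letter;
        }
    }

    // Turns a flag string such as "DlWp" (without the leading '-') into the
    // set of categories to print. Upper case enables, lower case disables.
    static Set<Category> parse(String flags) {
        // Assumption: everything is printed unless explicitly switched off.
        Set<Category> enabled = EnumSet.allOf(Category.class);
        for (char c : flags.toCharArray()) {
            Category category = byLetter(Character.toUpperCase(c));
            if (Character.isUpperCase(c)) {
                enabled.add(category);
            } else {
                enabled.remove(category);
            }
        }
        return enabled;
    }

    static Category byLetter(char upper) {
        for (Category category : Category.values()) {
            if (category.letter == upper) {
                return category;
            }
        }
        throw new IllegalArgumentException("unknown flag: " + upper);
    }

    public static void main(String[] args) {
        // prints: [DOCUMENTS, WORDS]
        System.out.println(parse("DlWp"));
    }
}
```

With "DlWp" only documents and words remain enabled, which matches the output of the example above.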

See also: Expand

-- LeoGalambos - 15 Mar 2004
