Package | Description |
---|---|
org.egothor.core |
This package concentrates the core data objects and interfaces.
|
org.egothor.core.memory |
This package contains an implementation of barrels in memory.
|
org.egothor.dir |
This package defines objects of distributed IR layer.
|
org.egothor.distributor |
This package defines objects which can distribute tasks and requests
to a group of objects.
|
org.egothor.duplicity.algorithm |
This package contains top-level classes that implement the duplicity checking algorithm.
|
org.egothor.duplicity.visualization |
This package contains classes implementing the visualization of the duplicities found in a document by the duplicity checking algorithm.
|
org.egothor.dynamizer |
This package defines a common dynamization and its implementation
for our index data structures.
|
org.egothor.html |
This package contains a specific implementation of core objects
for HTML with "home" and "content" support.
|
org.egothor.merger |
This package defines a machinery that merges many BarrelReaders.
|
org.egothor.parser | |
org.egothor.parser.filter |
This package defines objects that filter tokens.
|
org.egothor.parser.plain |
This package defines JavaCC parser for a plain text.
|
org.egothor.repository | |
org.egothor.robot.index | |
org.egothor.slotter |
This package is responsible for management of arrays of DynStruct
objects.
|
org.egothor.store |
This package contains an implementation of barrels on a hard disk.
|
org.egothor.text |
This package contains support of some textual routines/processes.
|
org.egothor.util | |
org.egothor.util.sort |
Modifier and Type | Interface and Description |
---|---|
interface |
IMetaReader
This interface declares, that an object can act as a sequence with a
close operation and it also offers a method that returns
uid of the item that is read currently.
|
interface |
SequenceWithClose<T>
Sequence allows you to iterate a sequence of not-null objects.
|
interface |
SequenceWithRestart<T>
Adds an ability to restart the sequence and read it again and again.
|
Modifier and Type | Class and Description |
---|---|
class |
Filter
This object works as a tokenizer filter.
|
class |
Iter2Seq<T>
The Iter2Seq class exists to wrap an Iterator inside a Sequence.
|
Modifier and Type | Field and Description |
---|---|
Sequence<Token> |
Filter.prev
The Tokenizer used by Filter.
|
Modifier and Type | Method and Description |
---|---|
Sequence<? extends IListMetadata> |
Barrel.expand(String expr)
Return the enumeration of all IListMetadata objects which match the
expression.
|
Sequence<Token> |
Filter.getPrevTokenizer()
Return the tokenizer this filter reads from.
|
Sequence<Hit> |
QueryResponse.getResult()
Return an enumeration of
Hit s. |
Sequence<Token> |
DocumentData.words(boolean readlinx,
boolean readilinx,
boolean lowercase,
boolean phonetics,
HTMLField.Diacritics diacritics,
boolean paragraphs,
boolean paragraphsKeepPunctuation,
String encoding) |
Modifier and Type | Method and Description |
---|---|
void |
Filter.setPrevTokenizer(Sequence<Token> prev)
Set the tokenizer this filter reads from.
|
Constructor and Description |
---|
Filter(Sequence<Token> prev)
Constructor for the Filter object.
|
QueryResponse(int offset,
long wouldBe,
long positives,
Sequence<Hit> e,
Token[] queryTokens,
int enum_len,
Query adaptedQuery)
Constructor for the QueryResponse object.
|
Modifier and Type | Method and Description |
---|---|
Sequence<? extends IListMetadata> |
MemoryBarrel.expand(String expr)
Return an Enumeration which will contain inverted list metadata for any
inverted list constructed for a term containing the given regular
expression.
|
Sequence<IListMetadata> |
Document.expand(String expr)
Not implemented.
|
Sequence<Token> |
FTField.filteredWords()
Return an enumeration of the terms in the field filtered by filters.
|
Sequence<Token> |
FTField.words()
Return an enumeration of the terms in the field.
|
Modifier and Type | Method and Description |
---|---|
abstract Sequence<Barrel> |
Group.elements()
Return the elements contained in this object.
|
Sequence<Barrel> |
Tanker.elements()
Return the Barrels in this Tanker.
|
Sequence<Barrel> |
TankerImplSecure.elements()
Deprecated.
|
Sequence<Barrel> |
TankerImplSecure.elementsSecure()
Gets elements of this tanker, done in multithreaded safe way.
|
Sequence<? extends IListMetadata> |
Group.expand(String expr)
Return the enumeration of all IListMetadata objects which match the
expression.
|
Sequence<? extends IListMetadata> |
TankerImplSecure.expand(String expr) |
Modifier and Type | Method and Description |
---|---|
void |
Distributor.forEach(Sequence<T> to,
Task<T> task)
Put the task to all acting (inner) structures.
|
void |
MTE.forEach(Sequence<T> to,
Task<T> task)
Put the task to all acting (inner) structures.
|
void |
STE.forEach(Sequence<T> to,
Task<T> task)
Iterate through the Enumeration and give the element the task.
|
void |
Distributor.iterate(Sequence<T> to,
Request<T> task)
Iterate through the Enumeration (in Egothor's case the active
(inner) barrels) in a predefined and constant order and try
to give them the request until one of them fulfills it.
|
void |
STE.iterate(Sequence<T> to,
Request<T> task)
Iterate through the Enumeration and give the element the task, if
not already completed..
|
Modifier and Type | Method and Description |
---|---|
void |
PermutatedMinsFiller.computeDocumentMins(DocumentPermutatedMins result,
Sequence<Token> terms,
long documentUID,
int documentDBRevision)
Computes the permutated mins values for given sequence of tokens of a document
and fills it into the result under the identificator documentID.
|
Modifier and Type | Method and Description |
---|---|
static List<List<Token>> |
DocumentDuplicities.getDocumentUnits(Sequence<Token> words)
Takes the sequence of document words and depending on the
Constants.CHECK_DUPLICITY_LEVEL splits it
to the appropriate text units - documents, paragraphs or sentences. |
Modifier and Type | Method and Description |
---|---|
Sequence<Barrel> |
Dynamizer.elements()
Set of DynStruct elements.
|
Modifier and Type | Method and Description |
---|---|
Sequence<Token> |
HTMLField.words() |
Modifier and Type | Method and Description |
---|---|
void |
HTMLField.setAppendix(Sequence<Token> appendix) |
Modifier and Type | Class and Description |
---|---|
class |
ConcatSeq<T>
The ConcatSeq object reads an input set of sequences one by one and
this way it produces this final sequence.
|
Modifier and Type | Field and Description |
---|---|
Sequence<T>[] |
ConcatSeq.enums
The Sequences that will be concatenated.
|
Modifier and Type | Method and Description |
---|---|
void |
ConcatSeq.add(Sequence<T> e)
Add the given Sequence to this object if the capacity has not been
reached..
|
void |
ConcatSeq.kickOff(Sequence<T> e)
The input Sequence
e is exhausted, and it is just
after this concatenator has removed/unlinked it from inner
structures. |
Modifier and Type | Class and Description |
---|---|
class |
Strings2Tokens
Transforms an array of strings into the sequence of tokens.
|
Modifier and Type | Class and Description |
---|---|
class |
DupWithoutDiacritics
This filter transforms all (Latin) words to non-diacritical (ASCII), but still keeps the original tokens.
|
class |
Grammer
This class is really grammer - it produces N-grams.
|
class |
LowerCase
This filter transforms all words to lower case.
|
class |
ParagraphFilter
Filter sets the sentence, paragraph and sentenceInParagraph fields
in the Token class, just like the
ParagraphPunctFilter . |
class |
ParagraphPunctFilter
Filter sets the sentence, paragraph and sentenceInParagraph fields
in the Token class.
|
class |
Phonetics |
class |
PunctFilter |
class |
RemoveDiacritics
This filter transforms all (Latin) words to non-diacritical (ASCII).
|
class |
Stemmer
The Stemmer object is a filter which transforms all words to their
respective stems.
|
class |
StopFilter
This abstract class should be extended by any class wishing to ignore
certain tokens while processing all tokens.
|
class |
WordNGrammer
This class produces N-grams of words.
|
Constructor and Description |
---|
DupWithoutDiacritics(Sequence<Token> prev)
Constructor for the Diacritics object.
|
Grammer(Sequence<Token> arg0)
Constructor for the Grammer object.
|
LowerCase(Sequence<Token> prev)
Constructor for the LowerCase object.
|
LowerCase(Sequence<Token> prev,
Locale locale)
Construct a LowerCase object using the given localization setting.
|
ParagraphFilter(Sequence<Token> prev)
Constructor for the ParagraphPunctFilter object
|
ParagraphPunctFilter(Sequence<Token> prev)
Constructor for the ParagraphPunctFilter object
|
Phonetics(Sequence<Token> arg0)
Constructor for the Phonetics object.
|
PunctFilter(Sequence<Token> arg0)
Constructor for the PunctFilter object
|
RemoveDiacritics(Sequence<Token> prev)
Constructor for the Diacritics object.
|
Stemmer(Sequence<Token> prev,
Trie stemmer)
Construct a Stem object using the given stemmer table.
|
StopFilter(Sequence<Token> arg0)
Constructor for the StopFilter object.
|
WordNGrammer(Sequence<Token> prev)
Constructor for the WordNGrammer object
|
Modifier and Type | Class and Description |
---|---|
class |
Plain
The Plain object is an extension of Tokenizer.
|
Modifier and Type | Interface and Description |
---|---|
static interface |
DataRepository.TupleSequence |
Modifier and Type | Class and Description |
---|---|
class |
DocumentsFastReader
DocumentsFastReader.java
Created on August 15, 2005, 2:23 AM
|
class |
TermsFastReader
TermsFastReader.java
Created on August 15, 2005, 2:13 AM
|
Modifier and Type | Method and Description |
---|---|
Sequence<Barrel> |
SlotMap.elements() |
Sequence<? extends IListMetadata> |
BarrelShadow.expand(String expr)
Return the enumeration of all IListMetadata objects which match the
expression.
|
Modifier and Type | Method and Description |
---|---|
Sequence<? extends IListMetadata> |
Terms.elements(String prefix)
Return an Enumeration of elements containing the given prefix.
|
Sequence<? extends IListMetadata> |
ThickBarrel.expand(String expr)
Description of the Method
|
Modifier and Type | Class and Description |
---|---|
class |
Generator
This class generates Tokenizer-s (documents) which reflect the Zipf's law.
|
Modifier and Type | Method and Description |
---|---|
String |
Snipper.filter(Sequence<Token> tokens)
Description of the Method
|
Modifier and Type | Class and Description |
---|---|
class |
MergeSequence<T> |
Modifier and Type | Class and Description |
---|---|
class |
PermutedSequence<T> |
Constructor and Description |
---|
PermutedSequence(Permutation perm,
Sequence<T> orig) |
Copyright © 2016 Egothor. All Rights Reserved.