Package | Description |
---|---|
org.egothor.core |
This package concentrates the core data objects and interfaces.
|
org.egothor.duplicity.algorithm |
This package contains top-level classes that implement the duplicity checking algorithm.
|
org.egothor.duplicity.datastructure |
This package contains datastructures needed in the duplicity checking algorithm.
|
org.egothor.duplicity.file |
This package contains implementation of the files needed the duplicity checking algorithm.
|
org.egothor.duplicity.visualization |
This package contains classes implementing the visualization of the duplicities found in a document by the duplicity checking algorithm.
|
Modifier and Type | Method and Description |
---|---|
void |
DocumentData.computeMins(PermutatedMinsFiller permutatedMinsFiller) |
Sequence<Token> |
DocumentData.words(boolean readlinx,
boolean readilinx,
boolean lowercase,
boolean phonetics,
HTMLField.Diacritics diacritics,
boolean paragraphs,
boolean paragraphsKeepPunctuation,
String encoding) |
Modifier and Type | Method and Description |
---|---|
Set<DocumentUnitID> |
DuplicityChecker.append(BarrelReader br,
boolean omitDuplicates,
boolean visualizeDuplicities,
boolean printDuplicitiesToCsv) |
void |
PermutatedMinsFiller.computeDocumentMins(DocumentPermutatedMins result,
Sequence<Token> terms,
long documentUID,
int documentDBRevision)
Computes the permutated mins values for given sequence of tokens of a document
and fills it into the result under the identificator documentID.
|
Modifier and Type | Method and Description |
---|---|
void |
DocumentPermutatedMins.commit()
Computes the number of text units in the document.
|
Modifier and Type | Method and Description |
---|---|
Map<DocumentUnitID,Double> |
JaccardCoeficientsFile.markDuplicates(List<DocumentData> docs) |
Constructor and Description |
---|
JaccardCoeficientsFile(String location) |
Modifier and Type | Method and Description |
---|---|
void |
DocumentDuplicities.createReport(String dirname,
boolean producePDF,
boolean produceHTML,
double coef)
Create duplicity checking report files for this document in given directory
in given formats.
|
Copyright © 2016 Egothor. All Rights Reserved.