public class DuplicityChecker extends Object implements Status
The method createDuplicityCheckingFilesForBarrel(org.egothor.core.BarrelReader, java.lang.String) computes the duplicity checking files for a single barrel. The method
mergeDuplicityCheckingFiles(java.lang.String, java.lang.String, java.util.Set<org.egothor.duplicity.datastructure.DocumentUnitID>) merges the duplicity checking files of two barrels.
The duplicity checking files are laid out on the filesystem as follows:
dupliticity (directory)
|--> temp (directory) - contains temporary files for classes SimilarUnitPairsTempFile and AllSimilarUnitPairsFile
|--> fdpiX files - correspond to PermutatedMinsFile
|--> fhpiX files - correspond to SimilarUnitPairsFile - these files are no longer computed
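The layout above can be inspected with a small helper. The following is a minimal sketch and not part of Egothor: the class `DuplicityLayoutSketch`, its `Kind` enum, and both methods are hypothetical, and it assumes that the `X` in `fdpiX` and `fhpiX` is a decimal index, which this page does not state.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

/** Hypothetical helper (not an Egothor class): classifies entries of the main directory. */
final class DuplicityLayoutSketch {

    /** The kinds of entries named in the directory layout. */
    enum Kind { TEMP_DIR, FDPI_FILE, FHPI_FILE, OTHER }

    /** Classifies one entry by name. Assumption: X in fdpiX/fhpiX is a decimal number. */
    static Kind classify(Path entry) {
        String name = entry.getFileName().toString();
        if (Files.isDirectory(entry) && name.equals("temp")) {
            return Kind.TEMP_DIR;
        }
        if (name.matches("fdpi\\d+")) {
            return Kind.FDPI_FILE;
        }
        if (name.matches("fhpi\\d+")) {
            return Kind.FHPI_FILE;
        }
        return Kind.OTHER;
    }

    /** Returns the sorted names of all entries in mainDir that have the given kind. */
    static List<String> list(Path mainDir, Kind kind) throws IOException {
        List<String> names = new ArrayList<>();
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(mainDir)) {
            for (Path entry : stream) {
                if (classify(entry) == kind) {
                    names.add(entry.getFileName().toString());
                }
            }
        }
        Collections.sort(names);
        return names;
    }
}
```

In practice the `mainDir` argument would presumably be built from the string returned by getDuplicityCheckingMainDir(), for example with `java.nio.file.Paths.get(...)`.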
This class has a state consisting of the seed of the PermutatedMinsFiller object
and an empty flag.

| Constructor and Description |
|---|
| DuplicityChecker() |

| Modifier and Type | Method and Description |
|---|---|
| Set<DocumentUnitID> | append(BarrelReader br, boolean omitDuplicates, boolean visualizeDuplicities, boolean printDuplicitiesToCsv) |
| void | close() |
| void | commit() |
| void | destroy() |
| String | getDuplicityCheckingMainDir() |
| String | getDuplicityCheckingReportDir() |
| String | getDuplicityCheckingTempDir() |
| Properties | getState(Properties p) - Return the state of this object. |
| void | initialize(String indexMainDir, TankerImplSecure tanker) |
| boolean | isEmpty() |
| void | removeDoc(long uid, int docDBRevision) |
| void | setState(Properties state) - Set this object with the given properties. |

public boolean isEmpty()
public String getDuplicityCheckingMainDir()
public String getDuplicityCheckingReportDir()
public String getDuplicityCheckingTempDir()
public void initialize(String indexMainDir, TankerImplSecure tanker) throws IOException
Parameters: indexMainDir, tanker
Throws: IOException
public void commit()
public void close()
public Properties getState(Properties p)
public void setState(Properties state)
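The getState/setState pair suggests that the state (the seed and the empty flag) can be saved to and restored from a Properties object. The sketch below illustrates that round trip with a hypothetical stand-in class, not the real DuplicityChecker. The property keys, the `long` type of the seed, and the `markNonEmpty` and `seed` helpers are assumptions made for illustration only.

```java
import java.util.Properties;

/** Hypothetical stand-in (not the Egothor class): mirrors only the getState/setState shape. */
final class CheckerStateSketch {
    // Assumed key names; the keys the real class uses are not documented on this page.
    static final String SEED_KEY = "duplicity.seed";
    static final String EMPTY_KEY = "duplicity.empty";

    private long seed;            // assumption: the seed fits in a long
    private boolean empty = true; // a fresh checker is assumed to start empty

    CheckerStateSketch(long seed) {
        this.seed = seed;
    }

    /** Writes the state into p and returns the same object, like getState(Properties p). */
    Properties getState(Properties p) {
        p.setProperty(SEED_KEY, Long.toString(seed));
        p.setProperty(EMPTY_KEY, Boolean.toString(empty));
        return p;
    }

    /** Restores the state from the given properties; missing keys keep the current values. */
    void setState(Properties state) {
        seed = Long.parseLong(state.getProperty(SEED_KEY, Long.toString(seed)));
        empty = Boolean.parseBoolean(state.getProperty(EMPTY_KEY, Boolean.toString(empty)));
    }

    /** Hypothetical helper standing in for whatever makes the real checker non-empty. */
    void markNonEmpty() {
        empty = false;
    }

    long seed() {
        return seed;
    }

    boolean isEmpty() {
        return empty;
    }
}
```

A second instance created with a different seed and passed the saved Properties via setState would then report the first instance's seed and empty flag.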
public Set<DocumentUnitID> append(BarrelReader br, boolean omitDuplicates, boolean visualizeDuplicities, boolean printDuplicitiesToCsv) throws DuplicityCheckingException
Parameters: br, omitDuplicates, visualizeDuplicities, printDuplicitiesToCsv
Throws: DuplicityCheckingException
public void destroy()
public void removeDoc(long uid, int docDBRevision)
Parameters: uid, docDBRevision
Copyright © 2016 Egothor. All Rights Reserved.