| Class | Description |
|---|---|
| AbbreviationsBlocker |
Produces candidate
|
| BasicTeacher |
Train a StringDistanceLearner.
|
| Blocker |
Produces candidate pairs from a MatchData structure, and provides
access to those candidate pairs.
|
| Blocker.Pair |
Holds a pair of instances, with mutable distance between them.
|
| ClusterNGramBlocker |
TokenBlocker for clustering based on NGram co-occurence.
|
| ClusterTokenBlocker |
TokenBlocker for clustering.
|
| ExtractAbbreviations |
Extracts abbreviation pairs (<short-form, long-form>) from text using an 'abbreviation distance metric' which evaluates
the probability of a short-form string being an abbreviation/acronym of another long-form string.
|
| MatchData |
Holds data for evaluating a distance metric.
|
| MatchData.Instance |
A single item (aka record, string, etc) to match against others.
|
| MatchData.MatchIterator |
Iterates over all stored StringWrappers
|
| MatchDataTeacher |
Train a StringDistanceLearner using MatchData and a Blocker.
|
| MatchExpt |
Perform a matching experiment using a data file, distance function
and blocker.
|
| MatchExptScript |
Perform a series of match experiments, specified by a script in an input file.
|
| NGramBlocker |
Finds all pairs that share a not-too-common character n-gram.
|
| NullBlocker |
Inefficient exhausitive blocking algorithm.
|
| SourcedMatchData |
Holds data for evaluating a distance metric.
|
| SourcedMatchData.Instance |
A single item (aka record, string, etc) to match against
others.
|
| SourcedMatchData.MatchIterator |
Iterates over all stored StringWrappers
|
| SpecialMatchExpt |
Perform a matching experiment using a vocabulary stats file, data
file, distance function and blocker.
|
| TokenBlocker |
Finds all pairs that share a not-too-common token.
|
| Exception | Description |
|---|---|
| MatchData.InputFormatException |
Signals an incorrectly formatted MatchData file.
|
| SourcedMatchData.InputFormatException |
Signals an incorrectly formatted MatchData file.
|
Copyright © 2016. All rights reserved.