- a - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
-
- AbbreviationAlignment - Class in com.wcohen.ss
-
Abbreviation distance metric which evaluates the probability of a short-form string being an abbreviation/acronym
of another long-form string.
- AbbreviationAlignment() - Constructor for class com.wcohen.ss.AbbreviationAlignment
-
- AbbreviationAlignmentContainer<T extends Enum<T>,S extends Enum<S>> - Class in com.wcohen.ss.abbvGapsHmm
-
- AbbreviationsBlocker - Class in com.wcohen.ss.expt
-
Produces candidate pairs that share not-too-common character tokens.
- AbbreviationsBlocker(Tokenizer, double) - Constructor for class com.wcohen.ss.expt.AbbreviationsBlocker
-
- AbbreviationsBlocker() - Constructor for class com.wcohen.ss.expt.AbbreviationsBlocker
-
- AbbvGapsHMM - Class in com.wcohen.ss.abbvGapsHmm
-
- AbbvGapsHMM() - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- AbbvGapsHMM(String) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- AbbvGapsHMM(String, boolean) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- AbbvGapsHMM.Emissions - Enum in com.wcohen.ss.abbvGapsHmm
-
- AbbvGapsHMM.States - Enum in com.wcohen.ss.abbvGapsHmm
-
- AbbvGapsHMM.Transitions - Enum in com.wcohen.ss.abbvGapsHmm
-
- AbbvGapsHmmBackwardsEvaluator - Class in com.wcohen.ss.abbvGapsHmm
-
- AbbvGapsHmmBackwardsEvaluator(AbbvGapsHMM) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
-
- AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam - Class in com.wcohen.ss.abbvGapsHmm
-
- AbbvGapsHmmBackwardsViterbiEvaluator - Class in com.wcohen.ss.abbvGapsHmm
-
- AbbvGapsHmmBackwardsViterbiEvaluator(AbbvGapsHMM) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
-
- AbbvGapsHMMEvaluator - Class in com.wcohen.ss.abbvGapsHmm
-
- AbbvGapsHMMEvaluator(AbbvGapsHMM) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
-
- AbbvGapsHMMEvaluator.EvalParam - Class in com.wcohen.ss.abbvGapsHmm
-
- AbbvGapsHmmExpectationEvaluator - Class in com.wcohen.ss.abbvGapsHmm
-
- AbbvGapsHmmExpectationEvaluator(AbbvGapsHMM) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
-
- AbbvGapsHmmForwardEvaluator - Class in com.wcohen.ss.abbvGapsHmm
-
- AbbvGapsHmmForwardEvaluator(AbbvGapsHMM) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
-
- AbbvGapsHmmForwardEvaluator.ForwardEvalParam - Class in com.wcohen.ss.abbvGapsHmm
-
- AbstractSourcedStatisticalTokenDistance - Class in com.wcohen.ss
-
Abstract token distance metric that uses frequency statistics.
- AbstractSourcedStatisticalTokenDistance(SourcedTokenizer) - Constructor for class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
-
- AbstractSourcedStatisticalTokenDistance() - Constructor for class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
-
- AbstractSourcedTokenizedStringDistance - Class in com.wcohen.ss
-
Abstract distance metric for tokenized strings.
- AbstractSourcedTokenizedStringDistance(Tokenizer) - Constructor for class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
-
- AbstractSourcedTokenizedStringDistance() - Constructor for class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
-
- AbstractStatisticalTokenDistance - Class in com.wcohen.ss
-
Abstract token distance metric that uses frequency statistics.
- AbstractStatisticalTokenDistance(Tokenizer) - Constructor for class com.wcohen.ss.AbstractStatisticalTokenDistance
-
- AbstractStatisticalTokenDistance() - Constructor for class com.wcohen.ss.AbstractStatisticalTokenDistance
-
- AbstractStringDistance - Class in com.wcohen.ss
-
Abstract class which implements StringDistanceLearner as well as StringDistance.
- AbstractStringDistance() - Constructor for class com.wcohen.ss.AbstractStringDistance
-
- AbstractTokenizedStringDistance - Class in com.wcohen.ss
-
Abstract distance metric for tokenized strings.
- AbstractTokenizedStringDistance(Tokenizer) - Constructor for class com.wcohen.ss.AbstractTokenizedStringDistance
-
- AbstractTokenizedStringDistance() - Constructor for class com.wcohen.ss.AbstractTokenizedStringDistance
-
- Acronym - Class in com.wcohen.ss.abbvGapsHmm
-
- Acronym(String, String) - Constructor for class com.wcohen.ss.abbvGapsHmm.Acronym
-
- Acronym(String, String, Integer) - Constructor for class com.wcohen.ss.abbvGapsHmm.Acronym
-
- Acronym.AcronymFrequencyComparator - Class in com.wcohen.ss.abbvGapsHmm
-
- Acronym.AcronymShortFormComparator - Class in com.wcohen.ss.abbvGapsHmm
-
- AcronymFrequencyComparator() - Constructor for class com.wcohen.ss.abbvGapsHmm.Acronym.AcronymFrequencyComparator
-
- acronymsArrayToMap(Collection<Acronym>) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- AcronymShortFormComparator() - Constructor for class com.wcohen.ss.abbvGapsHmm.Acronym.AcronymShortFormComparator
-
- add(int, int, int, double) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
-
- addAbbreviationPairs(Map<String, Acronym>) - Method in class com.wcohen.ss.expt.ExtractAbbreviations
-
- addCandidatePair(List<Acronym>, String, String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- addExample(DistanceInstance) - Method in class com.wcohen.ss.AbstractStringDistance
-
Implements StringDistanceLearner api by accepting new
DistanceInstance labels.
- addExample(DistanceInstance) - Method in interface com.wcohen.ss.api.StringDistanceLearner
-
Accept the answer to the last query.
- addExample(DistanceInstance) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
Pass new labels to the sublearners.
- addInstance(String, String, String) - Method in class com.wcohen.ss.expt.MatchData
-
Add a single instance, with given src and id, to the datafile
- addInstance(String, String, String) - Method in class com.wcohen.ss.expt.SourcedMatchData
-
Add a single instance, with given src and id, to the datafile
- advanceEvalRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
-
- advanceEvalRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
-
Advance within evaluation range
- advanceEvalRange() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
-
- AffineGap - Class in com.wcohen.ss
-
Affine-gap string distance, following Durbin et al.
- AffineGap() - Constructor for class com.wcohen.ss.AffineGap
-
- AffineGap(CharMatchScore, double, double, double) - Constructor for class com.wcohen.ss.AffineGap
-
- AffineGap.MatrixTrio - Class in com.wcohen.ss
-
- AffineGap.MatrixTrio.InsertSMatrix - Class in com.wcohen.ss
-
- AffineGap.MatrixTrio.InsertTMatrix - Class in com.wcohen.ss
-
- AlignmentPredictionModel - Class in com.wcohen.ss.abbvGapsHmm
-
- AlignmentPredictionModel() - Constructor for class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- andToSymbolMatch(char) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
-
- ApproxMemoMatrix - Class in com.wcohen.ss
-
Variant of MemoMatrix that only stores values near the diagonal,
for better efficiency.
- ApproxNeedlemanWunsch - Class in com.wcohen.ss
-
Needleman-Wunsch string distance, following Durbin et al.
- ApproxNeedlemanWunsch() - Constructor for class com.wcohen.ss.ApproxNeedlemanWunsch
-
- ApproxNeedlemanWunsch(CharMatchScore, double) - Constructor for class com.wcohen.ss.ApproxNeedlemanWunsch
-
- asBagOfSourcedTokens(SourcedStringWrapper) - Method in class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
-
- asBagOfTokens(StringWrapper) - Method in class com.wcohen.ss.AbstractTokenizedStringDistance
-
- asMultiDistanceInstance(DistanceInstance) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
- asMultiDistanceInstanceList(DistanceInstanceIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
- asMultiStringWrapper(StringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
- asMultiStringWrapper(StringWrapper) - Method in class com.wcohen.ss.MultiStringDistance
-
Lazily prepare a string.
- asMultiStringWrapperList(StringWrapperIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
- asUnitVector(SourcedStringWrapper) - Method in class com.wcohen.ss.SourcedTFIDF
-
- asUnitVector(StringWrapper) - Method in class com.wcohen.ss.TagLink
-
- asUnitVector(StringWrapper) - Method in class com.wcohen.ss.TFIDF
-
- at(int, int, int) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
-
- AveragedStringDistanceLearner - Class in com.wcohen.ss
-
Abstract StringDistanceLearner class which averages results of a number of
inner distance metrics, learned by a number of inner distance learners.
- AveragedStringDistanceLearner() - Constructor for class com.wcohen.ss.AveragedStringDistanceLearner
-
- AveragedStringDistanceLearner(StringDistanceLearner[], String) - Constructor for class com.wcohen.ss.AveragedStringDistanceLearner
-
- averagePrecision() - Method in class com.wcohen.ss.expt.MatchExpt
-
non-interpolated average precision
- averagePrecision() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
-
non-interpolated average precision
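As a rough illustration of what non-interpolated average precision measures, the sketch below computes it over a ranked list of candidate pairs. The class name `AveragePrecisionSketch`, the boolean-array input, and the `totalCorrect` parameter are assumptions made for this example; they are not taken from MatchExpt or SpecialMatchExpt.

```java
final class AveragePrecisionSketch {
    /**
     * ranked[i] is true when the pair at rank i (best first) is a correct match.
     * totalCorrect is the number of correct pairs that exist overall, including
     * any that never appear in the ranking (an assumption of this sketch).
     */
    static double averagePrecision(boolean[] ranked, int totalCorrect) {
        if (totalCorrect <= 0) {
            return 0.0;
        }
        double sumPrecision = 0.0;
        int correctSoFar = 0;
        for (int i = 0; i < ranked.length; i++) {
            if (ranked[i]) {
                correctSoFar++;
                // Precision at this rank: correct pairs seen so far / pairs seen so far.
                sumPrecision += (double) correctSoFar / (i + 1);
            }
        }
        // Average the precision values over all correct pairs.
        return sumPrecision / totalCorrect;
    }

    public static void main(String[] args) {
        boolean[] ranked = {true, false, true, false};
        // Computes (1/1 + 2/3) / 2.
        System.out.println(averagePrecision(ranked, 2));
    }
}
```

Precision is sampled only at ranks where a correct pair occurs, which is what distinguishes the non-interpolated variant from the interpolated one.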
- b - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
-
- backgroundProb(Token) - Method in class com.wcohen.ss.JensenShannonDistance
-
Probability of token in the background language model
- BackwardEvalParam(String) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
-
- backwardEvaluate(Acronym, List<Double>, List<Double>) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
-
- backwardViterbiEvaluate(Acronym, List<Double>, List<Double>) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
-
- BasicDistanceInstanceIterator - Class in com.wcohen.ss
-
A simple DistanceInstanceIterator implementation.
- BasicDistanceInstanceIterator(Iterator) - Constructor for class com.wcohen.ss.BasicDistanceInstanceIterator
-
- BasicSourcedStringWrapperIterator - Class in com.wcohen.ss
-
A simple StringWrapperIterator implementation.
- BasicSourcedStringWrapperIterator(Iterator) - Constructor for class com.wcohen.ss.BasicSourcedStringWrapperIterator
-
- BasicSourcedToken - Class in com.wcohen.ss.tokens
-
An interned version of a string, with provenance information.
- BasicStringWrapper - Class in com.wcohen.ss
-
An extendible (non-final) class that implements some of the
functionality of a string.
- BasicStringWrapper(String) - Constructor for class com.wcohen.ss.BasicStringWrapper
-
- BasicStringWrapperIterator - Class in com.wcohen.ss
-
A simple StringWrapperIterator implementation.
- BasicStringWrapperIterator(Iterator) - Constructor for class com.wcohen.ss.BasicStringWrapperIterator
-
- BasicTeacher - Class in com.wcohen.ss.expt
-
Train a StringDistanceLearner.
- BasicTeacher(Blocker, MatchData) - Constructor for class com.wcohen.ss.expt.BasicTeacher
-
Create a teacher from a blocker and a dataset.
- BasicTeacher(StringWrapperIterator, DistanceInstanceIterator, DistanceInstanceIterator) - Constructor for class com.wcohen.ss.expt.BasicTeacher
-
Create a teacher using specific values for the various iterators.
- BasicToken - Class in com.wcohen.ss.tokens
-
An interned version of a string.
- block(MatchData) - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
-
- block(MatchData) - Method in class com.wcohen.ss.expt.Blocker
-
Load matchdata and prepare it for production of candidate pairs.
- block(MatchData) - Method in class com.wcohen.ss.expt.NullBlocker
-
- block(MatchData) - Method in class com.wcohen.ss.expt.TokenBlocker
-
- Blocker - Class in com.wcohen.ss.expt
-
Produces candidate pairs from a MatchData structure, and provides
access to those candidate pairs.
- Blocker() - Constructor for class com.wcohen.ss.expt.Blocker
-
- blocker(String) - Method in class com.wcohen.ss.expt.MatchExptScript
-
Load a blocker.
- blocker(String, String, String) - Method in class com.wcohen.ss.expt.MatchExptScript
-
Load a blocker, with optional boolean value
- Blocker.Pair - Class in com.wcohen.ss.expt
-
Holds a pair of instances, with mutable distance between them.
- BLOCKER_PACKAGE - Static variable in class com.wcohen.ss.expt.MatchExpt
-
- BLOCKER_PACKAGE - Static variable in class com.wcohen.ss.expt.MatchExptScript
-
- BLOCKER_PACKAGE - Static variable in class com.wcohen.ss.expt.SpecialMatchExpt
-
- blockerRecall() - Method in class com.wcohen.ss.expt.MatchExpt
-
performance of the blocker
- blockerRecall() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
-
performance of the blocker
- build(String) - Static method in class com.wcohen.ss.DistanceLearnerFactory
-
Generate a StringDistance from a class name, or a sequence of classnames
separated by slashes.
- build(String[]) - Static method in class com.wcohen.ss.DistanceLearnerFactory
-
Generate a StringDistance from a sequence of classnames.
- buildArray(String) - Static method in class com.wcohen.ss.DistanceLearnerFactory
-
Generate a StringDistanceArray given a sequence of classnames
separated by slashes.
- Candidates(int, int, double) - Constructor for class com.wcohen.ss.TagLink.Candidates
-
Candidates constructor.
- cellFormat - Variable in class com.wcohen.ss.MemoMatrix
-
- CharacterTokenizer - Class in com.wcohen.ss.tokens
-
Character tokenizer implementation.
- CharacterTokenizer(boolean, boolean) - Constructor for class com.wcohen.ss.tokens.CharacterTokenizer
-
- charAt(int) - Method in interface com.wcohen.ss.api.StringWrapper
-
Return the i-th char of the wrapped string
- charAt(int) - Method in class com.wcohen.ss.BasicStringWrapper
-
- charEqualIgnoreCase(char, char) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
-
- CharMatchScore - Class in com.wcohen.ss
-
Abstract distance between characters.
- CharMatchScore() - Constructor for class com.wcohen.ss.CharMatchScore
-
- checkTrainingHasHappened(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
-
- checkTrainingHasHappened(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbstractStatisticalTokenDistance
-
- chunkLongForm(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- clear(String) - Method in class com.wcohen.ss.expt.MatchExptScript
-
Clear datasets, blockers, or learners.
- clusterMode - Variable in class com.wcohen.ss.expt.Blocker
-
- ClusterNGramBlocker - Class in com.wcohen.ss.expt
-
TokenBlocker for clustering based on NGram co-occurrence.
- ClusterNGramBlocker() - Constructor for class com.wcohen.ss.expt.ClusterNGramBlocker
-
- ClusterTokenBlocker - Class in com.wcohen.ss.expt
-
TokenBlocker for clustering.
- ClusterTokenBlocker() - Constructor for class com.wcohen.ss.expt.ClusterTokenBlocker
-
- ClusterTokenBlocker(Tokenizer, double) - Constructor for class com.wcohen.ss.expt.ClusterTokenBlocker
-
- collectionSize - Variable in class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
-
- collectionSize - Variable in class com.wcohen.ss.AbstractStatisticalTokenDistance
-
- com.wcohen.ss - package com.wcohen.ss
-
This package contains a collection of approximate string comparators, plus code for
performing controlled experiments with them.
- com.wcohen.ss.abbvGapsHmm - package com.wcohen.ss.abbvGapsHmm
-
- com.wcohen.ss.api - package com.wcohen.ss.api
-
- com.wcohen.ss.expt - package com.wcohen.ss.expt
-
- com.wcohen.ss.lookup - package com.wcohen.ss.lookup
-
- com.wcohen.ss.tokens - package com.wcohen.ss.tokens
-
- CombinedStringDistance(StringDistance[], MultiStringWrapper) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
-
- CombinedStringDistanceLearner - Class in com.wcohen.ss
-
Abstract StringDistanceLearner class which combines results of a number of
inner distance metrics, learned by a number of inner distance learners.
- CombinedStringDistanceLearner() - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner
-
- CombinedStringDistanceLearner(StringDistanceLearner[], String) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner
-
- CombinedStringDistanceLearner.CombinedStringDistance - Class in com.wcohen.ss
-
Abstract class for combining innerDistances.
- CombinedStringDistanceLearner.JthDistanceInstanceIterator - Class in com.wcohen.ss
-
Iterate over the j-th field of the MultiStringWrappers in a DistanceInstance of MultiStringWrappers.
- CombinedStringDistanceLearner.JthStringWrapperValueIterator - Class in com.wcohen.ss
-
Iterate over the j-th field of MultiStringWrapper
- CombinedStringDistanceLearner.MyDistanceInstance - Class in com.wcohen.ss
-
- CombinedStringDistanceLearner.MyMultiDistanceInstance - Class in com.wcohen.ss
-
- comboAddExample(DistanceInstance) - Method in class com.wcohen.ss.AveragedStringDistanceLearner
-
- comboAddExample(DistanceInstance) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
Pass a labeled example to the score-combination learner.
- comboHasNextQuery() - Method in class com.wcohen.ss.AveragedStringDistanceLearner
-
- comboHasNextQuery() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
Poll the routine that learns to combine inner distance scores to see if it
wants to make more queries.
- comboNextQuery() - Method in class com.wcohen.ss.AveragedStringDistanceLearner
-
- comboNextQuery() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
Get the next query from the score-combination learner.
- comboSetDistanceInstancePool(Iterator) - Method in class com.wcohen.ss.AveragedStringDistanceLearner
-
- comboSetDistanceInstancePool(Iterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
Set up a pool of (possibly unlabeled) instance distances, for the learner
to make queries from.
- comboSetStringWrapperPool(Iterator) - Method in class com.wcohen.ss.AveragedStringDistanceLearner
-
- comboSetStringWrapperPool(Iterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
Pass an iterator over unlabeled string wrappers to the score-combination learner,
just in case that's useful.
- compare(Acronym, Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.Acronym.AcronymFrequencyComparator
-
- compare(Acronym, Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.Acronym.AcronymShortFormComparator
-
- compareTo(Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.Acronym
-
- compareTo(Object) - Method in class com.wcohen.ss.expt.Blocker.Pair
-
- compareTo(Object) - Method in class com.wcohen.ss.tokens.BasicSourcedToken
-
- compareTo(Object) - Method in class com.wcohen.ss.tokens.BasicToken
-
- compute(int, int) - Method in class com.wcohen.ss.AffineGap.MatrixTrio
-
- compute(int, int) - Method in class com.wcohen.ss.AffineGap.MatrixTrio.InsertSMatrix
-
- compute(int, int) - Method in class com.wcohen.ss.AffineGap.MatrixTrio.InsertTMatrix
-
- compute() - Method in class com.wcohen.ss.expt.MatchExptScript
-
Compute learners.
- correct - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
-
- countCorrectPairs(MatchData) - Method in class com.wcohen.ss.expt.Blocker
-
Compute the number of correct pairs between src1 and src2, where src2 > src1.
- dataset(String) - Method in class com.wcohen.ss.expt.MatchExptScript
-
Load a dataset.
- DEFAULT_SOURCED_TOKENIZER - Static variable in class com.wcohen.ss.tokens.SimpleSourcedTokenizer
-
- DEFAULT_TOKENIZER - Static variable in class com.wcohen.ss.tokens.CharacterTokenizer
-
- DEFAULT_TOKENIZER - Static variable in class com.wcohen.ss.tokens.NGramTokenizer
-
- DEFAULT_TOKENIZER - Static variable in class com.wcohen.ss.tokens.SimpleTokenizer
-
- delim - Variable in class com.wcohen.ss.CombinedStringDistanceLearner
-
- dimension1() - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
-
- dimension2() - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
-
- dimension3() - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
-
- DirichletJS - Class in com.wcohen.ss
-
Jensen-Shannon distance of two unigram language models, smoothed
using Dirichlet prior.
- DirichletJS(Tokenizer, double) - Constructor for class com.wcohen.ss.DirichletJS
-
- DirichletJS() - Constructor for class com.wcohen.ss.DirichletJS
-
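To make the DirichletJS description more concrete, the following is a minimal sketch of a Jensen-Shannon divergence between two unigram language models, each smoothed with a Dirichlet prior toward a background model. Everything here is an assumption for illustration: the class `DirichletJsSketch`, the explicit `background` map, and the parameter `mu` are hypothetical, and the score the library actually returns may be defined or scaled differently.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class DirichletJsSketch {
    /** Count how often each token occurs. */
    static Map<String, Integer> counts(List<String> tokens) {
        Map<String, Integer> c = new HashMap<>();
        for (String tok : tokens) {
            c.merge(tok, 1, Integer::sum);
        }
        return c;
    }

    /** Dirichlet-smoothed estimate: (count + mu * background) / (length + mu). */
    static double smoothed(String tok, Map<String, Integer> counts, int length,
                           Map<String, Double> background, double mu) {
        return (counts.getOrDefault(tok, 0) + mu * background.get(tok)) / (length + mu);
    }

    /**
     * Jensen-Shannon divergence (natural log) of the two smoothed models.
     * Assumes background gives a positive probability to every token in s and t,
     * that those probabilities sum to 1, and that mu > 0.
     */
    static double jensenShannon(List<String> s, List<String> t,
                                Map<String, Double> background, double mu) {
        Map<String, Integer> cs = counts(s);
        Map<String, Integer> ct = counts(t);
        double js = 0.0;
        for (String tok : background.keySet()) {
            double p = smoothed(tok, cs, s.size(), background, mu);
            double q = smoothed(tok, ct, t.size(), background, mu);
            double m = 0.5 * (p + q); // mixture of the two models
            js += 0.5 * p * Math.log(p / m) + 0.5 * q * Math.log(q / m);
        }
        return js;
    }
}
```

Under these assumptions the value is 0 for identical token bags and is bounded above by ln 2, so smaller values indicate more similar strings.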
- displayResults(boolean, PrintStream) - Method in class com.wcohen.ss.expt.MatchExpt
-
Show results in a simple format.
- displayResults(boolean, PrintStream) - Method in class com.wcohen.ss.expt.SpecialMatchExpt
-
Show results in a simple format.
- DIST_01 - Static variable in class com.wcohen.ss.CharMatchScore
-
Scores match as 0, mismatch as -1.
- DIST_21 - Static variable in class com.wcohen.ss.CharMatchScore
-
Scores match as +2, mismatch as -1.
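The two scoring schemes above can be sketched as simple character-level functions and plugged into a global alignment. This is only an illustration under stated assumptions: the interface `CharScore`, the class `CharMatchSketch`, the `align` method, and the way the gap cost is subtracted are hypothetical and are not the signatures of CharMatchScore or NeedlemanWunsch.

```java
final class CharMatchSketch {
    /** Hypothetical stand-in for a character scoring function. */
    interface CharScore {
        double score(char a, char b);
    }

    /** Match scores 0, mismatch scores -1. */
    static final CharScore DIST_01 = (a, b) -> a == b ? 0.0 : -1.0;

    /** Match scores +2, mismatch scores -1. */
    static final CharScore DIST_21 = (a, b) -> a == b ? 2.0 : -1.0;

    /** Global alignment score of s and t; each gap character costs gapCost. */
    static double align(String s, String t, CharScore cs, double gapCost) {
        double[][] m = new double[s.length() + 1][t.length() + 1];
        // Aligning a prefix against the empty string costs one gap per character.
        for (int i = 1; i <= s.length(); i++) {
            m[i][0] = -i * gapCost;
        }
        for (int j = 1; j <= t.length(); j++) {
            m[0][j] = -j * gapCost;
        }
        for (int i = 1; i <= s.length(); i++) {
            for (int j = 1; j <= t.length(); j++) {
                double diag = m[i - 1][j - 1] + cs.score(s.charAt(i - 1), t.charAt(j - 1));
                double up = m[i - 1][j] - gapCost;   // gap in t
                double left = m[i][j - 1] - gapCost; // gap in s
                m[i][j] = Math.max(diag, Math.max(up, left));
            }
        }
        return m[s.length()][t.length()];
    }
}
```

In this sketch, DIST_01 with a gap cost of 1 yields the negated Levenshtein distance, while DIST_21 rewards matches so that longer shared runs score higher.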
- distance - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
-
- distance(String) - Method in class com.wcohen.ss.expt.MatchExptScript
-
Load a distance learner.
- DISTANCE_PACKAGE - Static variable in class com.wcohen.ss.expt.MatchExpt
-
- DISTANCE_PACKAGE - Static variable in class com.wcohen.ss.expt.MatchExptScript
-
- DISTANCE_PACKAGE - Static variable in class com.wcohen.ss.expt.SpecialMatchExpt
-
- distanceExamplePool() - Method in class com.wcohen.ss.api.StringDistanceTeacher
-
A pool of labeled pairs of strings over which distances will be computed,
to be used for supervised learning.
- distanceExamplePool() - Method in class com.wcohen.ss.expt.BasicTeacher
-
- distanceExamplePool() - Method in class com.wcohen.ss.expt.MatchDataTeacher
-
- DistanceInstance - Interface in com.wcohen.ss.api
-
An 'instance' for a StringDistance, analogous to an 'instance' for
a classification learner.
- DistanceInstanceIterator - Interface in com.wcohen.ss.api
-
An iterator over DistanceInstance objects.
- distanceInstancePool() - Method in class com.wcohen.ss.api.StringDistanceTeacher
-
A pool of unlabeled pairs of strings over which distances will be computed,
to be used for active or semi-supervised learning.
- distanceInstancePool() - Method in class com.wcohen.ss.expt.BasicTeacher
-
- distanceInstancePool() - Method in class com.wcohen.ss.expt.MatchDataTeacher
-
- DistanceLearnerFactory - Class in com.wcohen.ss
-
Creates distance metric learners from string descriptions.
- DistanceLearnerFactory() - Constructor for class com.wcohen.ss.DistanceLearnerFactory
-
- documentFrequency - Variable in class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
-
- documentFrequency - Variable in class com.wcohen.ss.AbstractStatisticalTokenDistance
-
- doMain(StringDistance, String[]) - Static method in class com.wcohen.ss.AbstractStringDistance
-
Default main routine for testing
- doMain(StringDistance, String[]) - Static method in class com.wcohen.ss.MultiStringDistance
-
Default main routine for testing
- doScore(MultiStringWrapper, MultiStringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
-
Produce a score, assuming ms and mt are the correct sizes, and fully prepared.
- dumpResults(PrintStream) - Method in class com.wcohen.ss.expt.MatchExpt
-
Show results in an easily machine-readable format.
- dumpResults(PrintStream) - Method in class com.wcohen.ss.expt.SpecialMatchExpt
-
Show results in an easily machine-readable format.
- echo(String) - Method in class com.wcohen.ss.expt.MatchExptScript
-
Turn echoing of commands on/off.
- eq(Matrix3D) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
-
- equals(Object) - Method in class com.wcohen.ss.abbvGapsHmm.Acronym
-
- equals(Object) - Method in class com.wcohen.ss.tokens.BasicSourcedToken
-
- equals(Object) - Method in class com.wcohen.ss.tokens.BasicToken
-
- EvalParam(String) - Constructor for class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
-
- evaluate(Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
-
This function enumerates the possible transitions and emissions in a hard-coded way, in order to save running time.
- expectationEvaluate(Acronym, List<Double>, List<Double>, List<Double>, List<Double>, Matrix3D, Matrix3D) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
-
- expectationStep(Acronym, String) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- explainCombination(MultiStringWrapper, MultiStringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
-
Explain how the primitive scores were combined.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbbreviationAlignment
-
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbstractStringDistance
-
This method needs to be implemented by subclasses.
- explainScore(String, String) - Method in class com.wcohen.ss.AbstractStringDistance
-
Scores are explained by converting Strings to StringWrappers
with the prepare function.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AffineGap
-
- explainScore(StringWrapper, StringWrapper) - Method in interface com.wcohen.ss.api.StringDistance
-
Explain how the distance was computed.
- explainScore(String, String) - Method in interface com.wcohen.ss.api.StringDistance
-
Explain how the distance was computed.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.ApproxNeedlemanWunsch
-
- explainScore(String, String) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
-
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
-
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Jaccard
-
Explain how the distance was computed.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Jaro
-
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.JensenShannonDistance
-
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Level2
-
Explain how the distance was computed.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Mixture
-
Explain how the distance was computed.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.MongeElkan
-
Version in which the distance is possibly scaled to [0,1].
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.MultiStringDistance
-
- explainScore(String, String) - Method in class com.wcohen.ss.MultiStringDistance
-
Scores are explained by converting Strings to StringWrappers
with the prepare function.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.NeedlemanWunsch
-
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SmithWaterman
-
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SoftTFIDF
-
Explain how the distance was computed.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
-
Explain how the distance was computed.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SourcedSoftTFIDF
-
Explain how the distance was computed.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SourcedTFIDF
-
Explain how the distance was computed.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TagLink
-
explainScore gives a brief explanation of how the string metric was
computed.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TFIDF
-
Explain how the distance was computed.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TokenFelligiSunter
-
Explain how the distance was computed.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.tokens.TagLinkToken
-
explainScore returns an explanation of how the string distance was
computed.
- explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.WinklerRescorer
-
- explainScoreCombination(double[]) - Method in class com.wcohen.ss.MultiStringAvgDistance
-
Explain how to combine the scores for each primitive distance
function on each field.
- explainScoreCombination(double[]) - Method in class com.wcohen.ss.MultiStringDistance
-
Explain how to combine the scores for each primitive distance
function on each field.
- ExtractAbbreviations - Class in com.wcohen.ss.expt
-
Extracts abbreviation pairs (<short-form, long-form>) from text using an 'abbreviation distance metric' which evaluates
the probability of a short-form string being an abbreviation/acronym of another long-form string.
- ExtractAbbreviations(String, String, String, String) - Constructor for class com.wcohen.ss.expt.ExtractAbbreviations
-
- ExtractAbbreviations.Stats - Class in com.wcohen.ss.expt
-
- extractCandidatePairs(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
Candidates Extraction
- extractHeadNounPattern_2Parts(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- extractHeadNounPattern_3Parts(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- extractPatternAcronyms(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- extractSingleAcronyms(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- extractTrailingNounPattern_2Parts(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- extractTrailingNounPattern_3Parts(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- get(int) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
-
- get(int) - Method in class com.wcohen.ss.MultiStringWrapper
-
Return the i-th field.
- getA() - Method in interface com.wcohen.ss.api.DistanceInstance
-
- getA() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
-
- getA(int) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
-
- getA() - Method in class com.wcohen.ss.expt.Blocker.Pair
-
- getAcronym() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
-
- getAcronymScore() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
-
- getAlignedChar(int, boolean) - Method in class com.wcohen.ss.ApproxNeedlemanWunsch
-
Find a character in the first string, s, that can be aligned
with the i-th character in the second string, t.
- getB() - Method in interface com.wcohen.ss.api.DistanceInstance
-
- getB() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
-
- getB(int) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
-
- getB() - Method in class com.wcohen.ss.expt.Blocker.Pair
-
- getCollectionSize() - Method in class com.wcohen.ss.SourcedTFIDF
-
- getCollectionSize() - Method in class com.wcohen.ss.TFIDF
-
- getCurrentChars(String, int, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
-
- getCurrentChars(String, int, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
-
Get characters from the string going backwards or forwards (depending on the implementing class),
according to the specified length.
- getCurrentChars(String, int, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
-
- getCurrentPartialWordLen() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
-
- getCurrentPartialWordLen() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
-
- getCurrentPartialWordLen() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
-
- getCurrentPartialWordMatchPosition() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
-
- getCurrentPartialWordMatchPosition() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
-
- getCurrentPartialWordMatchPosition() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
-
- getCurrentWord(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
-
- getCurrentWord(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
-
Get the current long form word going backwards or forwards (depending on the implementing class).
- getCurrentWord(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
-
- getDF(String) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- getDistance() - Method in class com.wcohen.ss.AbstractStringDistance
-
Implements the StringDistanceLearner api by returning a StringDistance.
- getDistance() - Method in interface com.wcohen.ss.api.DistanceInstance
-
- getDistance() - Method in interface com.wcohen.ss.api.StringDistanceLearner
-
Return the learned distance.
- getDistance() - Method in class com.wcohen.ss.AveragedStringDistanceLearner
-
- getDistance() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
Get the final string distance, which will be based on the distances learned by the
inner learners, as well as the combination scheme learned by comboSetAnswer, comboTrain,
and so on.
- getDistance() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
-
- getDistance() - Method in class com.wcohen.ss.expt.Blocker.Pair
-
- getDistance(int) - Method in class com.wcohen.ss.MultiStringAvgDistance
-
- getDistance(int) - Method in class com.wcohen.ss.MultiStringDistance
-
Get the distance used for the i-th pair of fields
- getDistanceLearnerIndex(int) - Method in class com.wcohen.ss.MultiStringWrapper
-
Return the index of the learner that will process internal field i
- getDocumentFrequency(Token) - Method in class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
-
- getDocumentFrequency(Token) - Method in class com.wcohen.ss.AbstractStatisticalTokenDistance
-
- getDocumentFrequency(Token) - Method in class com.wcohen.ss.SourcedTFIDF
-
Get the document frequency of the token.
- getDocumentFrequency(Token) - Method in class com.wcohen.ss.TFIDF
-
Get the document frequency of the token.
- getEmissionCounters() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
-
- getEmmisionParams() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- getEmmisions() - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- getEndedWord(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
-
If pos is ending a word in str: returns this word.
- getEvalMatrix() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
-
- getEvalMatrixSize() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
-
- getEvalStringPos() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
-
- getFieldIndex(int) - Method in class com.wcohen.ss.MultiStringWrapper
-
Return the index of the field of the original string associated with internal field i
- getFilename() - Method in class com.wcohen.ss.expt.MatchData
-
- getFilename() - Method in class com.wcohen.ss.expt.SourcedMatchData
-
- getId() - Method in interface com.wcohen.ss.api.IdentifiedStringWrapper
-
- getId() - Method in class com.wcohen.ss.expt.MatchData.Instance
-
- getId() - Method in class com.wcohen.ss.expt.SourcedMatchData.Instance
-
- getIndex() - Method in interface com.wcohen.ss.api.Token
-
- getIndex() - Method in class com.wcohen.ss.tokens.BasicToken
-
- getInnerDistances() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
Get an array of trained inner distances.
- getInstance(String, int) - Method in class com.wcohen.ss.expt.MatchData
-
Get the j-th record for the named source.
- getInstance(String, int) - Method in class com.wcohen.ss.expt.SourcedMatchData
-
Get the j-th record for the named source.
- getIterator() - Method in class com.wcohen.ss.expt.MatchData
-
- getIterator() - Method in class com.wcohen.ss.expt.SourcedMatchData
-
- getLAlign() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
-
- getLambda() - Method in class com.wcohen.ss.JelinekMercerJS
-
- getLearnedDistance() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
-
- getLegalStringPos(int, String) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsViterbiEvaluator
-
- getLookupTime() - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
Get the time used in performing the lookup
- getMatched() - Method in class com.wcohen.ss.tokens.TagLinkToken
-
getMatched returns the number of matched characters.
- getMaxFraction() - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
-
- getMaxFraction() - Method in class com.wcohen.ss.expt.TokenBlocker
-
- getMaxInvertedIndexSize() - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
- getMaxNGramSize() - Method in class com.wcohen.ss.expt.NGramBlocker
-
- getMinNGramSize() - Method in class com.wcohen.ss.expt.NGramBlocker
-
- getNewStateVal(double, double) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- getNumDeletionsInAcronym() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
-
- getPair(int) - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
-
- getPair(int) - Method in class com.wcohen.ss.expt.Blocker
-
Get the i-th candidate pair, as produced from most recently block()-ed data
- getPair(int) - Method in class com.wcohen.ss.expt.NullBlocker
-
- getPair(int) - Method in class com.wcohen.ss.expt.TokenBlocker
-
- getPartialEndedWord(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
-
If pos is ending a word in str: returns this word.
- getPartialStartedWord(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
-
If pos is starting a new word in str, returns this word.
- getPartialWordParam(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
-
- getPartialWordParam(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
-
- getPartialWordParam(String, int) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
-
- getProbability() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
-
- getPseudoCount() - Method in class com.wcohen.ss.DirichletJS
-
- getRangeEnd() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
-
- getRangeStart() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
-
- getResult(int) - Method in interface com.wcohen.ss.lookup.FastLookup
-
Get the i'th string found by the last lookup
- getResult(int) - Method in class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
-
- getResult(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
Get the i'th string found by the last lookup
- getSAlign() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
-
- getScore(int) - Method in interface com.wcohen.ss.lookup.FastLookup
-
Get the score of the i'th string found by the last lookup
- getScore(int) - Method in class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
-
- getScore(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
Get the score of the i'th string found by the last lookup
- getScore() - Method in class com.wcohen.ss.TagLink.Candidates
-
getScore returns the score or distance between strings T and U.
- getSource() - Method in interface com.wcohen.ss.api.SourcedStringWrapper
-
- getSource() - Method in interface com.wcohen.ss.api.SourcedToken
-
- getSource(int) - Method in class com.wcohen.ss.expt.MatchData
-
Get string identifier for i-th source
- getSource() - Method in class com.wcohen.ss.expt.MatchData.Instance
-
- getSource(int) - Method in class com.wcohen.ss.expt.SourcedMatchData
-
Get string identifier for i-th source
- getSource() - Method in class com.wcohen.ss.expt.SourcedMatchData.Instance
-
- getSource() - Method in class com.wcohen.ss.tokens.BasicSourcedToken
-
- getStartedWord(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
-
If pos is starting a new word in str, returns this word.
- getStrByEmission(String, T) - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
-
- getTeacher() - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Return a teacher that can 'train' a distance metric
from the information in the dictionary.
- getTokenMatchThreshold() - Method in class com.wcohen.ss.SoftTFIDF
-
- getTokenMatchThreshold() - Method in class com.wcohen.ss.SourcedSoftTFIDF
-
- getTokens() - Method in class com.wcohen.ss.SourcedTFIDF
-
Access the tokens of the last prepare()-ed string.
- getTokens() - Method in class com.wcohen.ss.TFIDF
-
Access the tokens of the last prepare()-ed string.
- getTPos() - Method in class com.wcohen.ss.TagLink.Candidates
-
getTPos returns the position of string T.
- getTr() - Method in class com.wcohen.ss.tokens.TagLinkToken
-
getTr returns the constant value Gamma.
- getTransitionCounters() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmExpectationEvaluator
-
- getTransitionParams() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- getTransitions() - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- getUPos() - Method in class com.wcohen.ss.TagLink.Candidates
-
getUPos returns the position of string U.
- getValue() - Method in interface com.wcohen.ss.api.Token
-
- getValue(int) - Method in interface com.wcohen.ss.lookup.FastLookup
-
Get the value of the i'th string found by the last lookup
- getValue(int) - Method in class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
-
- getValue(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
Get the value of the i'th string found by the last lookup
- getValue() - Method in class com.wcohen.ss.tokens.BasicToken
-
- getVocabularySize() - Method in class com.wcohen.ss.TFIDF
-
- getWeight(Token) - Method in class com.wcohen.ss.SourcedTFIDF
-
Access the weight of a token in the vector created for the last prepare()-ed string.
- getWeight(Token) - Method in class com.wcohen.ss.TFIDF
-
Access the weight of a token in the vector created for the last prepare()-ed string.
- getWindowSize(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
- graphPrecisionRecall(PrintStream) - Method in class com.wcohen.ss.expt.MatchExpt
-
Graph interpolated precision vs recall
- graphPrecisionRecall(PrintStream) - Method in class com.wcohen.ss.expt.SpecialMatchExpt
-
Graph interpolated precision vs recall
- labelInstance(DistanceInstance) - Method in class com.wcohen.ss.api.StringDistanceTeacher
-
Label an instance queried by the learner.
- labelInstance(DistanceInstance) - Method in class com.wcohen.ss.expt.BasicTeacher
-
- labelInstance(DistanceInstance) - Method in class com.wcohen.ss.expt.MatchDataTeacher
-
- learner(String) - Method in class com.wcohen.ss.expt.MatchExptScript
-
Load a learner.
- length() - Method in interface com.wcohen.ss.api.StringWrapper
-
Return the length of the wrapped string
- length() - Method in class com.wcohen.ss.BasicStringWrapper
-
- Level2 - Class in com.wcohen.ss
-
Generic version of Monge & Elkan's "level 2" recursive field
matching.
- Level2(Tokenizer, StringDistance) - Constructor for class com.wcohen.ss.Level2
-
- Level2Jaro - Class in com.wcohen.ss
-
"Level 2" recursive field matching algorithm, based on Jaro
distance.
- Level2Jaro() - Constructor for class com.wcohen.ss.Level2Jaro
-
- Level2JaroWinkler - Class in com.wcohen.ss
-
"Level 2" recursive field matching algorithm, based on Jaro-Winkler
distance.
- Level2JaroWinkler() - Constructor for class com.wcohen.ss.Level2JaroWinkler
-
- Level2Levenstein - Class in com.wcohen.ss
-
"Level 2" recursive field matching algorithm using Levenshtein
distance.
- Level2Levenstein() - Constructor for class com.wcohen.ss.Level2Levenstein
-
- Level2MongeElkan - Class in com.wcohen.ss
-
Monge & Elkan's "level 2" recursive field matching algorithm.
- Level2MongeElkan() - Constructor for class com.wcohen.ss.Level2MongeElkan
-
- Levenstein - Class in com.wcohen.ss
-
Levenshtein string distance.
- Levenstein() - Constructor for class com.wcohen.ss.Levenstein
-
- load(File) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Insert all lines in a file as items mapping to themselves.
- load(File, boolean) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Insert all lines in a file as items mapping to themselves.
- loadAliases(File) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Load a file of identifiers, each of which has multiple
aliases.
- loadAliases(File) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
Load a file of identifiers, each of which has multiple
aliases.
- loadLabels(String) - Static method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- loadModelParams() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- loadPredictor() - Method in class com.wcohen.ss.AbbreviationAlignment
-
- loadPredictor() - Method in class com.wcohen.ss.expt.ExtractAbbreviations
-
- loadTrainingCorpus(String) - Static method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- lookup(double, String) - Method in interface com.wcohen.ss.lookup.FastLookup
-
Lookup items similar to 'toFind', and return the number of
items found.
- lookup(double, String) - Method in class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
-
- lookup(String, String) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Lookup a string in the dictionary.
- lookup(String, StringWrapper) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Lookup a prepared string in the dictionary.
- lookup(String) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Lookup a string in the dictionary.
- lookup(StringWrapper) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Lookup a prepared string in the dictionary.
- lookup(double, String) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
Lookup items SoftTFIDF-similar to the 'toFind' argument, and
return the number of items found.
- lookupDistance(String, String) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Return the distance to the best match.
- lookupDistance(String, StringWrapper) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Return the distance to the best match.
- lookupDistance(String) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Return the distance to the best match.
- lookupDistance(StringWrapper) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Return the distance to the best match.
- lookupTime - Variable in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
- m - Variable in class com.wcohen.ss.AffineGap.MatrixTrio
-
- main(String[]) - Static method in class com.wcohen.ss.AbbreviationAlignment
-
Evaluates the probability of the short-form string (string1) being an abbreviation/acronym
of the long-form string (string2).
Usage: AbbreviationAlignment short_form_string long_form_string [train_data_dir]
- main(String[]) - Static method in class com.wcohen.ss.AffineGap
-
- main(String[]) - Static method in class com.wcohen.ss.ApproxNeedlemanWunsch
-
- main(String[]) - Static method in class com.wcohen.ss.DirichletJS
-
- main(String[]) - Static method in class com.wcohen.ss.DistanceLearnerFactory
-
Test routine.
- main(String[]) - Static method in class com.wcohen.ss.expt.ExtractAbbreviations
-
Extracts abbreviation pairs from text.
Usage: ExtractAbbreviations input experiment_name [gold-file] [train-dir]
- main(String[]) - Static method in class com.wcohen.ss.expt.MatchData
-
- main(String[]) - Static method in class com.wcohen.ss.expt.MatchExpt
-
Command-line interface.
- main(String[]) - Static method in class com.wcohen.ss.expt.MatchExptScript
-
- main(String[]) - Static method in class com.wcohen.ss.expt.SourcedMatchData
-
- main(String[]) - Static method in class com.wcohen.ss.expt.SpecialMatchExpt
-
Command-line interface.
- main(String[]) - Static method in class com.wcohen.ss.Jaccard
-
- main(String[]) - Static method in class com.wcohen.ss.Jaro
-
- main(String[]) - Static method in class com.wcohen.ss.JaroTFIDF
-
- main(String[]) - Static method in class com.wcohen.ss.JaroWinkler
-
- main(String[]) - Static method in class com.wcohen.ss.JaroWinklerTFIDF
-
- main(String[]) - Static method in class com.wcohen.ss.JelinekMercerJS
-
- main(String[]) - Static method in class com.wcohen.ss.Level2
-
- main(String[]) - Static method in class com.wcohen.ss.Level2Jaro
-
- main(String[]) - Static method in class com.wcohen.ss.Level2JaroWinkler
-
- main(String[]) - Static method in class com.wcohen.ss.Level2Levenstein
-
- main(String[]) - Static method in class com.wcohen.ss.Level2MongeElkan
-
- main(String[]) - Static method in class com.wcohen.ss.Levenstein
-
- main(String[]) - Static method in class com.wcohen.ss.lookup.SoftDictionary
-
Simple main for testing.
- main(String[]) - Static method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
Simple main for testing and experimentation
- main(String[]) - Static method in class com.wcohen.ss.Mixture
-
- main(String[]) - Static method in class com.wcohen.ss.MongeElkan
-
- main(String[]) - Static method in class com.wcohen.ss.MongeElkanTFIDF
-
- main(String[]) - Static method in class com.wcohen.ss.MultiStringAvgDistance
-
- main(String[]) - Static method in class com.wcohen.ss.MultiStringWrapper
-
- main(String[]) - Static method in class com.wcohen.ss.NeedlemanWunsch
-
- main(String[]) - Static method in class com.wcohen.ss.ScaledLevenstein
-
- main(String[]) - Static method in class com.wcohen.ss.SmithWaterman
-
- main(String[]) - Static method in class com.wcohen.ss.SoftTokenFelligiSunter
-
- main(String[]) - Static method in class com.wcohen.ss.SourcedTFIDF
-
- main(String[]) - Static method in class com.wcohen.ss.TFIDF
-
- main(String[]) - Static method in class com.wcohen.ss.TokenFelligiSunter
-
- main(String[]) - Static method in class com.wcohen.ss.tokens.CharacterTokenizer
-
Test routine
- main(String[]) - Static method in class com.wcohen.ss.tokens.NGramTokenizer
-
Test routine
- main(String[]) - Static method in class com.wcohen.ss.tokens.SimpleSourcedTokenizer
-
Test routine
- main(String[]) - Static method in class com.wcohen.ss.tokens.SimpleTokenizer
-
Test routine
- main(String[]) - Static method in class com.wcohen.ss.UnsmoothedJS
-
- MatchData - Class in com.wcohen.ss.expt
-
Holds data for evaluating a distance metric.
- MatchData(String) - Constructor for class com.wcohen.ss.expt.MatchData
-
Read match data from a file.
- MatchData() - Constructor for class com.wcohen.ss.expt.MatchData
-
- MatchData.InputFormatException - Exception in com.wcohen.ss.expt
-
Signals an incorrectly formatted MatchData file.
- MatchData.Instance - Class in com.wcohen.ss.expt
-
A single item (aka record, string, etc) to match against others.
- MatchData.MatchIterator - Class in com.wcohen.ss.expt
-
Iterates over all stored StringWrappers
- MatchDataTeacher - Class in com.wcohen.ss.expt
-
Train a StringDistanceLearner using MatchData and a Blocker.
- MatchDataTeacher(MatchData, Blocker) - Constructor for class com.wcohen.ss.expt.MatchDataTeacher
-
- MatchExpt - Class in com.wcohen.ss.expt
-
Perform a matching experiment using a data file, distance function
and blocker.
- MatchExpt(MatchData, StringDistanceLearner, Blocker) - Constructor for class com.wcohen.ss.expt.MatchExpt
-
- MatchExpt(MatchData, StringDistanceLearner) - Constructor for class com.wcohen.ss.expt.MatchExpt
-
- MatchExptScript - Class in com.wcohen.ss.expt
-
Perform a series of match experiments, specified by a script in an input file.
- MatchExptScript() - Constructor for class com.wcohen.ss.expt.MatchExptScript
-
- MatchIterator(MatchData) - Constructor for class com.wcohen.ss.expt.MatchData.MatchIterator
-
- MatchIterator(SourcedMatchData) - Constructor for class com.wcohen.ss.expt.SourcedMatchData.MatchIterator
-
- matchScore(char, char) - Method in class com.wcohen.ss.CharMatchScore
-
- Matrix3D - Class in com.wcohen.ss.abbvGapsHmm
-
M-by-N-by-K matrix implementation.
- Matrix3D(int, int, int) - Constructor for class com.wcohen.ss.abbvGapsHmm.Matrix3D
-
- Matrix3D(double[][][]) - Constructor for class com.wcohen.ss.abbvGapsHmm.Matrix3D
-
- MatrixTrio(StringWrapper, StringWrapper) - Constructor for class com.wcohen.ss.AffineGap.MatrixTrio
-
- max3(double, double, double) - Static method in class com.wcohen.ss.MemoMatrix
-
Return max of three numbers.
- max4(double, double, double, double) - Static method in class com.wcohen.ss.MemoMatrix
-
Return max of four numbers.
- maxF1() - Method in class com.wcohen.ss.expt.MatchExpt
-
max F1 for any threshold
- maxF1() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
-
max F1 for any threshold
- maximizationStep() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
Returns the total change in model parameter values.
- maximizationStepForEmissions() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- maximizationStepForTransitions() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- maxTokenIndex() - Method in interface com.wcohen.ss.api.Tokenizer
-
Return the highest index of any interned token
- maxTokenIndex() - Method in class com.wcohen.ss.tokens.CharacterTokenizer
-
- maxTokenIndex() - Method in class com.wcohen.ss.tokens.NGramTokenizer
-
- maxTokenIndex() - Method in class com.wcohen.ss.tokens.SimpleTokenizer
-
- MemoMatrix - Class in com.wcohen.ss
-
A matrix of doubles, defined recursively by the compute(i,j)
method, that will not be recomputed more than necessary.
- minus(Matrix3D) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
-
- Mixture - Class in com.wcohen.ss
-
Mixture-based distance metric.
- Mixture(Tokenizer) - Constructor for class com.wcohen.ss.Mixture
-
- Mixture() - Constructor for class com.wcohen.ss.Mixture
-
- mkdir(String) - Method in class com.wcohen.ss.expt.ExtractAbbreviations
-
- MongeElkan - Class in com.wcohen.ss
-
The match method proposed by Monge and Elkan.
- MongeElkan() - Constructor for class com.wcohen.ss.MongeElkan
-
- MongeElkanTFIDF - Class in com.wcohen.ss
-
Soft TFIDF-based distance metric, extended to use "soft" token-matching
with the MongeElkan distance metric.
- MongeElkanTFIDF() - Constructor for class com.wcohen.ss.MongeElkanTFIDF
-
- mul(int, int, int, double) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
-
- MultiStringAvgDistance - Class in com.wcohen.ss
-
StringDistance defined over Strings that are broken into fields,
with distance defined as the average distance between any field.
- MultiStringAvgDistance(StringDistance, String) - Constructor for class com.wcohen.ss.MultiStringAvgDistance
-
- MultiStringDistance - Class in com.wcohen.ss
-
Abstract class StringDistance defined over Strings that are broken
into fields.
- MultiStringDistance(String) - Constructor for class com.wcohen.ss.MultiStringDistance
-
- MultiStringWrapper - Class in com.wcohen.ss
-
A StringWrapper that stores a version of the string
that has been either (a) split into a number of distinct fields,
or (b) duplicated k times, so that k different StringDistances
can preprocess it, or (c) both of the above.
- MultiStringWrapper(String, String) - Constructor for class com.wcohen.ss.MultiStringWrapper
-
Create a MultiStringWrapper by splitting s into
fields based on the given delimiter.
- MultiStringWrapper(String, int) - Constructor for class com.wcohen.ss.MultiStringWrapper
-
Create a MultiStringWrapper by making k copies of s.
- MultiStringWrapper(String, int, String) - Constructor for class com.wcohen.ss.MultiStringWrapper
-
Create a MultiStringWrapper by making k copies of each field of s.
- MyDistanceInstance(StringWrapper, StringWrapper, boolean, double) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
-
- MyMultiDistanceInstance(StringWrapper, StringWrapper, boolean, double) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
-
- NeedlemanWunsch - Class in com.wcohen.ss
-
Needleman-Wunsch string distance, following Durbin et al.
- NeedlemanWunsch() - Constructor for class com.wcohen.ss.NeedlemanWunsch
-
- NeedlemanWunsch(CharMatchScore, double) - Constructor for class com.wcohen.ss.NeedlemanWunsch
-
- next() - Method in interface com.wcohen.ss.api.DistanceInstanceIterator
-
- next() - Method in interface com.wcohen.ss.api.StringWrapperIterator
-
- next() - Method in class com.wcohen.ss.BasicDistanceInstanceIterator
-
- next() - Method in class com.wcohen.ss.BasicSourcedStringWrapperIterator
-
- next() - Method in class com.wcohen.ss.BasicStringWrapperIterator
-
- next() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthDistanceInstanceIterator
-
- next() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthStringWrapperValueIterator
-
- next() - Method in class com.wcohen.ss.expt.MatchData.MatchIterator
-
Returns the next StringWrapper as an object.
- next() - Method in class com.wcohen.ss.expt.SourcedMatchData.MatchIterator
-
Returns the next StringWrapper as an object.
- nextDistanceInstance() - Method in interface com.wcohen.ss.api.DistanceInstanceIterator
-
- nextDistanceInstance() - Method in class com.wcohen.ss.BasicDistanceInstanceIterator
-
- nextDistanceInstance() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthDistanceInstanceIterator
-
- nextQuery() - Method in class com.wcohen.ss.AbstractStringDistance
-
Implements StringDistanceLearner api by querying for
DistanceInstance labels.
- nextQuery() - Method in interface com.wcohen.ss.api.StringDistanceLearner
-
Returns a DistanceInstance for which the learner would like a
label.
- nextQuery() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
Get a next query from one of the sublearners
- nextSourcedStringWrapper() - Method in interface com.wcohen.ss.api.SourcedStringWrapperIterator
-
- nextSourcedStringWrapper() - Method in class com.wcohen.ss.BasicSourcedStringWrapperIterator
-
- nextSourcedStringWrapper() - Method in class com.wcohen.ss.expt.MatchData.MatchIterator
-
Return the next StringWrapper.
- nextSourcedStringWrapper() - Method in class com.wcohen.ss.expt.SourcedMatchData.MatchIterator
-
Return the next StringWrapper.
- nextStringWrapper() - Method in interface com.wcohen.ss.api.StringWrapperIterator
-
- nextStringWrapper() - Method in class com.wcohen.ss.BasicSourcedStringWrapperIterator
-
- nextStringWrapper() - Method in class com.wcohen.ss.BasicStringWrapperIterator
-
- nextStringWrapper() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthStringWrapperValueIterator
-
- nextStringWrapper() - Method in class com.wcohen.ss.expt.MatchData.MatchIterator
-
Return the next StringWrapper.
- nextStringWrapper() - Method in class com.wcohen.ss.expt.SourcedMatchData.MatchIterator
-
Return the next StringWrapper.
- NGramBlocker - Class in com.wcohen.ss.expt
-
Finds all pairs that share a not-too-common character n-gram.
- NGramBlocker() - Constructor for class com.wcohen.ss.expt.NGramBlocker
-
- NGramTokenizer - Class in com.wcohen.ss.tokens
-
Wraps another tokenizer, and computes all n-grams of
characters from each token produced by the inner tokenizer.
- NGramTokenizer(int, int, boolean, Tokenizer) - Constructor for class com.wcohen.ss.tokens.NGramTokenizer
-
- NullBlocker - Class in com.wcohen.ss.expt
-
Inefficient exhaustive blocking algorithm.
- NullBlocker() - Constructor for class com.wcohen.ss.expt.NullBlocker
-
- numCorrectPairs() - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
-
- numCorrectPairs() - Method in class com.wcohen.ss.expt.Blocker
-
Return total number of correct pairs in the dataset.
- numCorrectPairs() - Method in class com.wcohen.ss.expt.NullBlocker
-
- numCorrectPairs() - Method in class com.wcohen.ss.expt.TokenBlocker
-
- numInstances(String) - Method in class com.wcohen.ss.expt.MatchData
-
Number of records for source with given string id
- numInstances(String) - Method in class com.wcohen.ss.expt.SourcedMatchData
-
Number of records for source with given string id
- numSources() - Method in class com.wcohen.ss.expt.MatchData
-
Number of sources in data set
- numSources() - Method in class com.wcohen.ss.expt.SourcedMatchData
-
Number of sources in data set
- Pair(MatchData.Instance, MatchData.Instance, boolean) - Constructor for class com.wcohen.ss.expt.Blocker.Pair
-
- pairsPerSecond() - Method in class com.wcohen.ss.expt.MatchExpt
-
Return total time to process data, divided by the number of pairs
- pairsPerSecond() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
-
Return total time to process data, divided by the number of pairs
- parseCandidate(String, String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- plus(Matrix3D) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
-
- posIsAtWord(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
-
- posIsAtWordEnd(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
-
- posIsAtWordStart(String, int) - Static method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
-
- precision - Variable in class com.wcohen.ss.expt.ExtractAbbreviations.Stats
-
- precisionRecall() - Method in class com.wcohen.ss.expt.MatchExptScript
-
Show interpolated 11-pt precision curves for each blocker/distance/dataset
- predict(String, String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- predict(Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- predict(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- predictAlignment(Acronym) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- predictAndTest(List<String>, List<Map<String, String>>) - Method in class com.wcohen.ss.expt.ExtractAbbreviations
-
- predictAndTest(int, List<String>, List<Map<String, String>>, BufferedWriter) - Method in class com.wcohen.ss.expt.ExtractAbbreviations
-
- prepare(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
-
- prepare(String) - Method in class com.wcohen.ss.AbstractStringDistance
-
Default way to preprocess a string for distance computation.
- prepare(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractStringDistance
-
Implements StringDistanceLearner api by providing a way to prep a
StringWrapperIterator for training.
- prepare(DistanceInstanceIterator) - Method in class com.wcohen.ss.AbstractStringDistance
-
Implements StringDistanceLearner api by providing a way to prep a
DistanceInstanceIterator for training.
- prepare(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractTokenizedStringDistance
-
- prepare(String) - Method in interface com.wcohen.ss.api.StringDistance
-
Preprocess a string for distance computation
- prepare(StringWrapperIterator) - Method in interface com.wcohen.ss.api.StringDistanceLearner
-
Preprocess a StringWrapperIterator for unsupervised training.
- prepare(DistanceInstanceIterator) - Method in interface com.wcohen.ss.api.StringDistanceLearner
-
Preprocess a DistanceInstanceIterator for supervised training.
- prepare(String) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
-
- prepare(StringWrapperIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
Prepare data for the sublearners.
- prepare(DistanceInstanceIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
Prepare data for the learners.
- prepare(String) - Method in class com.wcohen.ss.Jaccard
-
Preprocess a string by finding tokens.
- prepare(String) - Method in class com.wcohen.ss.Jaro
-
- prepare(String) - Method in class com.wcohen.ss.JensenShannonDistance
-
Preprocess a string by finding tokens and giving them weights W
such that W is the smoothed probability of the token appearing
in the document.
- prepare(String) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Prepare a string for quicker lookup.
- prepare(String) - Method in class com.wcohen.ss.MultiStringDistance
-
Prepare a string.
- prepare(StringDistance[]) - Method in class com.wcohen.ss.MultiStringWrapper
-
Prepare each field with the appropriate distance
- prepare(String) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
-
Preprocess a string by finding tokens
- prepare(String) - Method in class com.wcohen.ss.SourcedTFIDF
-
Preprocess a string by finding tokens and giving them TFIDF weights
- prepare(String) - Method in class com.wcohen.ss.TagLink
-
Preprocess a string by finding tokens and giving them TFIDF weights
- prepare(String) - Method in class com.wcohen.ss.TFIDF
-
Preprocess a string by finding tokens and giving them TFIDF weights
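As an illustration of what "finding tokens and giving them TFIDF weights" can look like, the sketch below trains document frequencies on a small corpus and turns a string into a unit-length weight vector. It is not the library's code: `TfidfSketch` and its methods are hypothetical, and the weighting formula `log(tf + 1) * log(N / df)` is one common TFIDF variant assumed here for the example.

```java
import java.util.HashMap;
import java.util.Map;

final class TfidfSketch {
    private final Map<String, Integer> docFreq = new HashMap<>();
    private int collectionSize = 0;

    /** Count, for each token, how many training strings contain it. */
    void train(String... corpus) {
        for (String doc : corpus) {
            collectionSize++;
            for (String tok : termFreq(doc).keySet()) {
                docFreq.merge(tok, 1, Integer::sum);
            }
        }
    }

    /** Lowercase, whitespace-split term counts (an assumed tokenizer). */
    static Map<String, Integer> termFreq(String s) {
        Map<String, Integer> tf = new HashMap<>();
        for (String tok : s.trim().toLowerCase().split("\\s+")) {
            if (!tok.isEmpty()) tf.merge(tok, 1, Integer::sum);
        }
        return tf;
    }

    /** Weight tokens by log(tf + 1) * log(N / df), then scale to unit length. Call train first. */
    Map<String, Double> prepare(String s) {
        Map<String, Double> w = new HashMap<>();
        double norm = 0.0;
        for (Map.Entry<String, Integer> e : termFreq(s).entrySet()) {
            int df = docFreq.getOrDefault(e.getKey(), 1); // unseen tokens are treated as df = 1
            double weight = Math.log(e.getValue() + 1.0) * Math.log((double) collectionSize / df);
            w.put(e.getKey(), weight);
            norm += weight * weight;
        }
        norm = Math.sqrt(norm);
        if (norm > 0) {
            for (Map.Entry<String, Double> e : w.entrySet()) e.setValue(e.getValue() / norm);
        }
        return w;
    }

    /** Dot product of two unit vectors, i.e. their cosine similarity. */
    static double score(Map<String, Double> a, Map<String, Double> b) {
        double sum = 0.0;
        for (Map.Entry<String, Double> e : a.entrySet()) {
            sum += e.getValue() * b.getOrDefault(e.getKey(), 0.0);
        }
        return sum;
    }
}
```

Because the vectors are normalized during preparation, scoring a pair reduces to a sparse dot product.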
- prepare(String) - Method in class com.wcohen.ss.TokenFelligiSunter
-
Preprocess a string by finding tokens and giving them appropriate weights
- prepare(String) - Method in class com.wcohen.ss.WinklerRescorer
-
- PrintfFormat - Class in com.wcohen.ss
-
PrintfFormat allows the formatting of an array of
objects embedded within a string.
- PrintfFormat(String) - Constructor for class com.wcohen.ss.PrintfFormat
-
Constructs an array of control specifications
possibly preceded, separated, or followed by
ordinary strings.
- PrintfFormat(Locale, String) - Constructor for class com.wcohen.ss.PrintfFormat
-
Constructs an array of control specifications
possibly preceded, separated, or followed by
ordinary strings.
- printNegativeValues - Variable in class com.wcohen.ss.MemoMatrix
-
- prototype - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
-
- put(String, String, Object) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Insert a string into the dictionary.
- put(String, Object) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Insert a string into the dictionary.
- put(String, StringWrapper, Object) - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Insert a prepared string into the dictionary.
- put(String, Object) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
Insert a string into the dictionary, and associate it with the
given value.
- s - Variable in class com.wcohen.ss.MemoMatrix
-
- sameId(MatchData.Instance) - Method in class com.wcohen.ss.expt.MatchData.Instance
-
- sameId(SourcedMatchData.Instance) - Method in class com.wcohen.ss.expt.SourcedMatchData.Instance
-
- sAt(int) - Method in class com.wcohen.ss.MemoMatrix
-
Get i-th char of s, indexing s from 1..n
- save(String) - Method in class com.wcohen.ss.expt.MatchExptScript
-
Save current experimental data to a file
- saveAs(File) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
- saveModelParams() - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- ScaledLevenstein - Class in com.wcohen.ss
-
Levenstein string distance.
- ScaledLevenstein() - Constructor for class com.wcohen.ss.ScaledLevenstein
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbbreviationAlignment
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbstractStringDistance
-
This method needs to be implemented by subclasses.
- score(String, String) - Method in class com.wcohen.ss.AbstractStringDistance
-
Strings are scored by converting them to StringWrappers with the
prepare function.
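The delegation described here (a string overload that prepares both arguments and hands them to the wrapper overload supplied by subclasses) can be sketched as follows. The types below are stand-ins written for this example, not the SecondString classes: `SketchDistance`, `Prepared`, and `ExactMatchSketch` are hypothetical names.

```java
abstract class SketchDistance {
    /** Holds a string plus whatever a metric might want to cache about it. */
    static class Prepared {
        final String raw;
        Prepared(String raw) { this.raw = raw; }
    }

    /** Default preprocessing: just wrap the string. */
    Prepared prepare(String s) { return new Prepared(s); }

    /** The real work; each concrete metric supplies this. */
    abstract double score(Prepared s, Prepared t);

    /** Convenience overload: prepare both arguments, then delegate. */
    final double score(String s, String t) {
        return score(prepare(s), prepare(t));
    }
}

/** Toy metric: 1.0 for case-insensitive equality, otherwise 0.0. */
final class ExactMatchSketch extends SketchDistance {
    @Override
    double score(Prepared s, Prepared t) {
        return s.raw.equalsIgnoreCase(t.raw) ? 1.0 : 0.0;
    }
}
```

With this shape, a caller that compares one string against many can call `prepare` once and reuse the result, while casual callers use the plain string overload.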
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AffineGap
-
- score(StringWrapper, StringWrapper) - Method in interface com.wcohen.ss.api.StringDistance
-
Find the distance between s and t.
- score(String, String) - Method in interface com.wcohen.ss.api.StringDistance
-
Find the distance between s and t
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.ApproxNeedlemanWunsch
-
- score(String, String) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Jaccard
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Jaro
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.JensenShannonDistance
-
Jensen-Shannon distance between distributions.
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Level2
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Mixture
-
Distance is argmax_lambda prod_{w in s} lambda Pr(w|t) * (1-lambda) Pr(w|background).
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.MongeElkan
-
Version of distance which is possibly scaled to [0,1].
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.MultiStringDistance
-
- score(String, String) - Method in class com.wcohen.ss.MultiStringDistance
-
Strings are scored by converting them to StringWrappers with the
prepare function.
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.NeedlemanWunsch
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.ScaledLevenstein
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SmithWaterman
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SoftTFIDF
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SourcedSoftTFIDF
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SourcedTFIDF
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TagLink
-
getStringMetric computes the similarity between a pair of strings T and U.
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TFIDF
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TokenFelligiSunter
-
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.tokens.TagLinkToken
-
score returns a string distance value between 0 and 1 for a pair
of tokens.
- score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.WinklerRescorer
-
- scoreCombination(double[]) - Method in class com.wcohen.ss.MultiStringAvgDistance
-
Combine the scores for each primitive distance function on each field.
- scoreCombination(double[]) - Method in class com.wcohen.ss.MultiStringDistance
-
Combine the scores for each primitive distance function on each field.
- SEPARATOR - Static variable in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- SEPARATOR - Static variable in class com.wcohen.ss.expt.ExtractAbbreviations
-
- set(int, int, int, double) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
-
- set(int, StringWrapper) - Method in class com.wcohen.ss.MultiStringWrapper
-
Set the i-th field.
- setClusterMode(boolean) - Method in class com.wcohen.ss.expt.Blocker
-
In clusterMode, consider pairings between instances from the same
source.
- setClusterMode(Boolean) - Method in class com.wcohen.ss.expt.Blocker
-
- setCollectionSize(int) - Method in class com.wcohen.ss.SourcedTFIDF
-
Set the size of the collection that this TFIDF measure was
trained on to some value.
- setCollectionSize(int) - Method in class com.wcohen.ss.TFIDF
-
Set the size of the collection that this TFIDF measure was
trained on to some value.
- setDistance(double) - Method in interface com.wcohen.ss.api.DistanceInstance
-
- setDistance(double) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
-
- setDistance(double) - Method in class com.wcohen.ss.expt.Blocker.Pair
-
- setDistanceInstancePool(DistanceInstanceIterator) - Method in class com.wcohen.ss.AbstractStringDistance
-
Implements StringDistanceLearner api by providing a way to
accept a pool of unlabeled DistanceInstances.
- setDistanceInstancePool(DistanceInstanceIterator) - Method in interface com.wcohen.ss.api.StringDistanceLearner
-
Accept a set of unlabeled DistanceInstances, to use in making
distance instance queries.
- setDistanceInstancePool(DistanceInstanceIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
Pass the training data along to the inner learners.
- setDocumentFrequency(Token, int) - Method in class com.wcohen.ss.SourcedTFIDF
-
Set the document frequency of the token to some value.
- setDocumentFrequency(Token, int) - Method in class com.wcohen.ss.TFIDF
-
Set the document frequency of the token to some value.
- setIgnoreCase(boolean) - Method in class com.wcohen.ss.tokens.CharacterTokenizer
-
- setIgnoreCase(boolean) - Method in class com.wcohen.ss.tokens.SimpleTokenizer
-
- setIgnorePunctuation(boolean) - Method in class com.wcohen.ss.tokens.CharacterTokenizer
-
- setIgnorePunctuation(boolean) - Method in class com.wcohen.ss.tokens.SimpleTokenizer
-
- setLambda(double) - Method in class com.wcohen.ss.JelinekMercerJS
-
- setLambda(Double) - Method in class com.wcohen.ss.JelinekMercerJS
-
- setMaxFraction(double) - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
-
- setMaxFraction(double) - Method in class com.wcohen.ss.expt.TokenBlocker
-
- setMaxInvertedIndexSize(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
Set the maximum size of an inverted index that will be
followed.
- setMaxNGramSize(int) - Method in class com.wcohen.ss.expt.NGramBlocker
-
- setMinNGramSize(int) - Method in class com.wcohen.ss.expt.NGramBlocker
-
- setMismatchFactor(double) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
-
- setMismatchFactor(Double) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
-
- setMismatchFactor(double) - Method in class com.wcohen.ss.TokenFelligiSunter
-
- setMismatchFactor(Double) - Method in class com.wcohen.ss.TokenFelligiSunter
-
- setModelParamsFile(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- setModelParamsFile() - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- setParamFile(String) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- setPartialWord(String, boolean) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator.BackwardEvalParam
-
- setPartialWord(String, boolean) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator.EvalParam
-
- setPartialWord(String, boolean) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator.ForwardEvalParam
-
- setPseudoCount(double) - Method in class com.wcohen.ss.DirichletJS
-
- setPseudoCount(Double) - Method in class com.wcohen.ss.DirichletJS
-
- setScaling(boolean) - Method in class com.wcohen.ss.MongeElkan
-
If scaling is true, then distances are scaled to 0-1
- setScaling(Double) - Method in class com.wcohen.ss.MongeElkan
-
For interfacing with reflection in MatchExptScript.
- setStartingParams(List<Double>, List<Double>) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- setStartingParams(List<Double>, List<Double>) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- setStringWrapperPool(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
-
- setStringWrapperPool(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractStringDistance
-
Implements the StringDistanceLearner api, by providing a way to
accumulate statistics for a set of related strings.
- setStringWrapperPool(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractTokenizedStringDistance
-
- setStringWrapperPool(StringWrapperIterator) - Method in interface com.wcohen.ss.api.StringDistanceLearner
-
Unsupervised learning method that observes strings for which
distance will be computed.
- setStringWrapperPool(StringWrapperIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
-
Pass the training data along to the inner learners.
- setTfIdfData(String) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- setTfIdfData(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- setTokenCount(int) - Method in class com.wcohen.ss.TFIDF
-
- setTokenMatchThreshold(double) - Method in class com.wcohen.ss.SoftTFIDF
-
- setTokenMatchThreshold(Double) - Method in class com.wcohen.ss.SoftTFIDF
-
- setTokenMatchThreshold(double) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
-
- setTokenMatchThreshold(Double) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
-
- setTokenMatchThreshold(double) - Method in class com.wcohen.ss.SourcedSoftTFIDF
-
- setTokenMatchThreshold(Double) - Method in class com.wcohen.ss.SourcedSoftTFIDF
-
- setTrainDir(String) - Method in class com.wcohen.ss.AbbreviationAlignment
-
- setTrainDir(String) - Method in class com.wcohen.ss.expt.ExtractAbbreviations
-
- setTrainingDataDir(String) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- setTreshold(double) - Method in class com.wcohen.ss.tokens.TagLinkToken
-
setTreshold sets a new value for the constant Gamma.
- setWidth(int) - Method in class com.wcohen.ss.ApproxNeedlemanWunsch
-
- setWindowSize(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
Set the 'windowSize' used for finding similar tokens.
- show(String) - Method in class com.wcohen.ss.expt.MatchExptScript
-
Show datasets, blockers, or learners.
- SimpleSourcedTokenizer - Class in com.wcohen.ss.tokens
-
Simple implementation of a Tokenizer.
- SimpleSourcedTokenizer(boolean, boolean) - Constructor for class com.wcohen.ss.tokens.SimpleSourcedTokenizer
-
- SimpleTokenizer - Class in com.wcohen.ss.tokens
-
Simple implementation of a Tokenizer.
- SimpleTokenizer(boolean, boolean) - Constructor for class com.wcohen.ss.tokens.SimpleTokenizer
-
- size() - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
-
- size() - Method in class com.wcohen.ss.expt.Blocker
-
Return number of candidate pairs, as produced from most recently block()-ed data
- size() - Method in class com.wcohen.ss.expt.NullBlocker
-
- size() - Method in class com.wcohen.ss.expt.TokenBlocker
-
- size() - Method in class com.wcohen.ss.lookup.SoftDictionary
-
Return the number of entries in the dictionary.
- size() - Method in class com.wcohen.ss.MultiStringWrapper
-
Return number of fields.
- slowLookup(double, String) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
Exactly like lookup, but works by exhaustively checking every stored string.
- SmithWaterman - Class in com.wcohen.ss
-
Smith-Waterman string distance, following Durbin et al.
- SmithWaterman() - Constructor for class com.wcohen.ss.SmithWaterman
-
- SmithWaterman(CharMatchScore, double) - Constructor for class com.wcohen.ss.SmithWaterman
-
- smoothCounter(int, List<Double>, List<Double>) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
Dirichlet smoothing
-------------------
Without a prior:
P(data | theta) = theta(i)^beta(i) = counters(i)
With a Dirichlet prior:
P(data | theta)*p(theta) = theta(i)^(beta(i) + alpha(i)) =
theta(i)^beta(i) + theta(i)^alpha(i)
counters(i) + params(i)^alpha(i)
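For readers unfamiliar with the idea, the usual textbook form of smoothing with a Dirichlet prior adds the prior's pseudo-counts to the observed counts before normalizing. The sketch below shows that form only; it is not taken from `smoothCounter`, whose exact arithmetic may differ, and `DirichletSmoothingSketch` and `smooth` are hypothetical names.

```java
import java.util.ArrayList;
import java.util.List;

final class DirichletSmoothingSketch {
    /** Posterior-mean estimate: (count_i + alpha_i) / (sum(counts) + sum(alphas)). */
    static List<Double> smooth(List<Double> counts, List<Double> alphas) {
        if (counts.size() != alphas.size()) {
            throw new IllegalArgumentException("counts and alphas must have equal length");
        }
        // Total mass after adding pseudo-counts; assumed positive (e.g. all alphas > 0).
        double total = 0.0;
        for (int i = 0; i < counts.size(); i++) {
            total += counts.get(i) + alphas.get(i);
        }
        List<Double> probs = new ArrayList<>(counts.size());
        for (int i = 0; i < counts.size(); i++) {
            probs.add((counts.get(i) + alphas.get(i)) / total);
        }
        return probs;
    }
}
```

With positive pseudo-counts, an outcome that was never observed still receives a nonzero probability, which is the practical point of the prior.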
- smoothedProbability(Token, double, double) - Method in class com.wcohen.ss.DirichletJS
-
smoothed probability of the token
- smoothedProbability(Token, double, double) - Method in class com.wcohen.ss.JelinekMercerJS
-
smoothed probability of the token
- smoothedProbability(Token, double, double) - Method in class com.wcohen.ss.JensenShannonDistance
-
Smoothed probability of the token with frequency freq in a bag with the given totalWeight
- smoothedProbability(Token, double, double) - Method in class com.wcohen.ss.UnsmoothedJS
-
Unsmoothed probability of the token
- SoftDictionary - Class in com.wcohen.ss.lookup
-
Looks up nearly-matching strings in a dictionary, using a string distance.
- SoftDictionary() - Constructor for class com.wcohen.ss.lookup.SoftDictionary
-
- SoftDictionary(StringDistanceLearner) - Constructor for class com.wcohen.ss.lookup.SoftDictionary
-
- SoftDictionary(Tokenizer) - Constructor for class com.wcohen.ss.lookup.SoftDictionary
-
- SoftDictionary(StringDistanceLearner, Tokenizer) - Constructor for class com.wcohen.ss.lookup.SoftDictionary
-
- SoftTFIDF - Class in com.wcohen.ss
-
TFIDF-based distance metric, extended to use "soft" token-matching.
- SoftTFIDF(Tokenizer, StringDistance, double) - Constructor for class com.wcohen.ss.SoftTFIDF
-
- SoftTFIDF(StringDistance, double) - Constructor for class com.wcohen.ss.SoftTFIDF
-
- SoftTFIDF(StringDistance) - Constructor for class com.wcohen.ss.SoftTFIDF
-
- SoftTFIDF() - Constructor for class com.wcohen.ss.SoftTFIDF
-
- SoftTFIDFDictionary - Class in com.wcohen.ss.lookup
-
Looks up nearly-matching strings in a dictionary, using SoftTFIDF
distance.
- SoftTFIDFDictionary() - Constructor for class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
- SoftTFIDFDictionary(Tokenizer) - Constructor for class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
- SoftTFIDFDictionary(Tokenizer, double) - Constructor for class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
- SoftTFIDFDictionary(Tokenizer, double, int, int) - Constructor for class com.wcohen.ss.lookup.SoftTFIDFDictionary
-
Create a new SoftTFIDFDictionary.
- SoftTokenFelligiSunter - Class in com.wcohen.ss
-
Highly simplified model of Fellegi-Sunter's method 1,
applied to tokens.
- SoftTokenFelligiSunter(Tokenizer, StringDistance, double, double) - Constructor for class com.wcohen.ss.SoftTokenFelligiSunter
-
- SoftTokenFelligiSunter() - Constructor for class com.wcohen.ss.SoftTokenFelligiSunter
-
- SourcedMatchData - Class in com.wcohen.ss.expt
-
Holds data for evaluating a distance metric.
- SourcedMatchData(String) - Constructor for class com.wcohen.ss.expt.SourcedMatchData
-
Read match data from a file.
- SourcedMatchData() - Constructor for class com.wcohen.ss.expt.SourcedMatchData
-
- SourcedMatchData.InputFormatException - Exception in com.wcohen.ss.expt
-
Signals an incorrectly formatted MatchData file.
- SourcedMatchData.Instance - Class in com.wcohen.ss.expt
-
A single item (aka record, string, etc) to match against
others.
- SourcedMatchData.MatchIterator - Class in com.wcohen.ss.expt
-
Iterates over all stored StringWrappers
- SourcedSoftTFIDF - Class in com.wcohen.ss
-
TFIDF-based distance metric, extended to use "soft" token-matching.
- SourcedSoftTFIDF(SourcedTokenizer, StringDistance, double) - Constructor for class com.wcohen.ss.SourcedSoftTFIDF
-
- SourcedSoftTFIDF(StringDistance, double) - Constructor for class com.wcohen.ss.SourcedSoftTFIDF
-
- SourcedSoftTFIDF(StringDistance) - Constructor for class com.wcohen.ss.SourcedSoftTFIDF
-
- SourcedSoftTFIDF() - Constructor for class com.wcohen.ss.SourcedSoftTFIDF
-
- SourcedStringWrapper - Interface in com.wcohen.ss.api
-
- SourcedStringWrapperIterator - Interface in com.wcohen.ss.api
-
An iterator over StringWrapper objects.
- SourcedTFIDF - Class in com.wcohen.ss
-
Sourced-based distance metric.
- SourcedTFIDF(SourcedTokenizer) - Constructor for class com.wcohen.ss.SourcedTFIDF
-
- SourcedTFIDF() - Constructor for class com.wcohen.ss.SourcedTFIDF
-
- SourcedTFIDF.UnitVector - Class in com.wcohen.ss
-
Marker class extending BagOfTokens
- SourcedToken - Interface in com.wcohen.ss.api
-
An interned version of a string, with provenance information
- sourcedTokenize(String, String) - Method in interface com.wcohen.ss.api.SourcedTokenizer
-
Return tokenized version of a string, as an array of
SourcedToken objects.
- sourcedTokenize(String, String) - Method in class com.wcohen.ss.tokens.SimpleSourcedTokenizer
-
Return tokenized version of a string.
- SourcedTokenizer - Interface in com.wcohen.ss.api
-
Split a string into tokens, retaining provenance.
- SpecialMatchExpt - Class in com.wcohen.ss.expt
-
Perform a matching experiment using a vocabulary stats file, data
file, distance function and blocker.
- SpecialMatchExpt(MatchData, StringDistanceLearner, Blocker, boolean, String, String, boolean) - Constructor for class com.wcohen.ss.expt.SpecialMatchExpt
-
- sprintf(Object[]) - Method in class com.wcohen.ss.PrintfFormat
-
Format an array of objects.
- sprintf() - Method in class com.wcohen.ss.PrintfFormat
-
Format nothing.
- sprintf(int) - Method in class com.wcohen.ss.PrintfFormat
-
Format an int.
- sprintf(long) - Method in class com.wcohen.ss.PrintfFormat
-
Format a long.
- sprintf(double) - Method in class com.wcohen.ss.PrintfFormat
-
Format a double.
- sprintf(String) - Method in class com.wcohen.ss.PrintfFormat
-
Format a String.
- sprintf(Object) - Method in class com.wcohen.ss.PrintfFormat
-
Format an Object.
- Stats() - Constructor for class com.wcohen.ss.expt.ExtractAbbreviations.Stats
-
- StringDistance - Interface in com.wcohen.ss.api
-
Compute the difference between pairs of strings.
- StringDistanceLearner - Interface in com.wcohen.ss.api
-
Learn a StringDistance.
- StringDistanceTeacher - Class in com.wcohen.ss.api
-
Train a StringDistanceLearner and return the learned
StringDistance, using some unspecified source of information to
train the learner.
- StringDistanceTeacher() - Constructor for class com.wcohen.ss.api.StringDistanceTeacher
-
- StringWrapper - Interface in com.wcohen.ss.api
-
Something that implements some of the functionality of Java's
string class, but which is a non-final class, and hence can also
cache additional information to facilitate later processing.
- stringWrapperIterator() - Method in class com.wcohen.ss.api.StringDistanceTeacher
-
Strings over which distances will be computed.
- StringWrapperIterator - Interface in com.wcohen.ss.api
-
An iterator over StringWrapper objects.
- stringWrapperIterator() - Method in class com.wcohen.ss.expt.BasicTeacher
-
- stringWrapperIterator() - Method in class com.wcohen.ss.expt.MatchDataTeacher
-
- t - Variable in class com.wcohen.ss.MemoMatrix
-
- table(String) - Method in class com.wcohen.ss.expt.MatchExptScript
-
Show a table of some expt-wide numeric measurement.
- TagLink - Class in com.wcohen.ss
-
- TagLink() - Constructor for class com.wcohen.ss.TagLink
-
TagLink default constructor.
- TagLink(AbstractStringDistance) - Constructor for class com.wcohen.ss.TagLink
-
TagLink constructor requires a character based string metric.
- TagLink(Tokenizer, AbstractStringDistance) - Constructor for class com.wcohen.ss.TagLink
-
TagLink constructor requires a tokenizer and a tokenDistance metric
- TagLink(String[]) - Constructor for class com.wcohen.ss.TagLink
-
TagLink constructor requires dataset string array in order to compute the IDF
weights.
- TagLink(String[], AbstractStringDistance) - Constructor for class com.wcohen.ss.TagLink
-
TagLink constructor requires dataset string array in order to compute the IDF
weights and a tokenDistance metric.
- TagLink.Candidates - Class in com.wcohen.ss
-
- TagLink.UnitVector - Class in com.wcohen.ss
-
Marker class extending BagOfTokens
- TagLinkToken - Class in com.wcohen.ss.tokens
-
- TagLinkToken() - Constructor for class com.wcohen.ss.tokens.TagLinkToken
-
TagLinkToken default constructor.
- TagLinkToken(double) - Constructor for class com.wcohen.ss.tokens.TagLinkToken
-
TagLinkToken constructor.
- tAt(int) - Method in class com.wcohen.ss.MemoMatrix
-
Get i-th char of t, indexing t from 1..n
- TFIDF - Class in com.wcohen.ss
-
TFIDF-based distance metric.
- TFIDF(Tokenizer) - Constructor for class com.wcohen.ss.TFIDF
-
- TFIDF() - Constructor for class com.wcohen.ss.TFIDF
-
- TFIDF.UnitVector - Class in com.wcohen.ss
-
Marker class extending BagOfTokens
- time() - Method in class com.wcohen.ss.expt.MatchExpt
-
Return total time to process data.
- time() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
-
Return total time to process data.
- times(double) - Method in class com.wcohen.ss.abbvGapsHmm.Matrix3D
-
- TN - Variable in class com.wcohen.ss.expt.ExtractAbbreviations.Stats
-
- Token - Interface in com.wcohen.ss.api
-
An interned version of a string.
- TokenBlocker - Class in com.wcohen.ss.expt
-
Finds all pairs that share a not-too-common token.
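One plausible way to find pairs that share a not-too-common token is to build an inverted index and skip tokens whose posting lists are too long. The sketch below illustrates that technique under stated assumptions: `TokenBlockerSketch` and `candidatePairs` are hypothetical names, the tokenizer is lowercase/whitespace, and reading "not-too-common" as "appears in at most `maxFraction` of the inputs" is a guess at the intent, not the documented behavior.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

final class TokenBlockerSketch {
    /** Returns index pairs "i,j" (i < j) of strings sharing a token found in at most maxFraction of the inputs. */
    static Set<String> candidatePairs(List<String> items, double maxFraction) {
        // Inverted index: token -> sorted indices of the strings that contain it.
        Map<String, Set<Integer>> index = new HashMap<>();
        for (int i = 0; i < items.size(); i++) {
            for (String tok : items.get(i).trim().toLowerCase().split("\\s+")) {
                if (!tok.isEmpty()) index.computeIfAbsent(tok, k -> new TreeSet<>()).add(i);
            }
        }
        double maxPostings = maxFraction * items.size();
        Set<String> pairs = new TreeSet<>();
        for (Set<Integer> postings : index.values()) {
            if (postings.size() > maxPostings) continue; // too common to be a useful blocking key
            List<Integer> ids = new ArrayList<>(postings);
            for (int a = 0; a < ids.size(); a++) {
                for (int b = a + 1; b < ids.size(); b++) {
                    pairs.add(ids.get(a) + "," + ids.get(b));
                }
            }
        }
        return pairs;
    }
}
```

Skipping very frequent tokens keeps the candidate set far smaller than the full cross product, at the cost of missing pairs that share only common tokens.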
- TokenBlocker(Tokenizer, double) - Constructor for class com.wcohen.ss.expt.TokenBlocker
-
- TokenBlocker() - Constructor for class com.wcohen.ss.expt.TokenBlocker
-
- TokenFelligiSunter - Class in com.wcohen.ss
-
Highly simplified model of Fellegi-Sunter's method 1,
applied to tokens.
- TokenFelligiSunter(Tokenizer, double) - Constructor for class com.wcohen.ss.TokenFelligiSunter
-
- TokenFelligiSunter() - Constructor for class com.wcohen.ss.TokenFelligiSunter
-
- tokenIterator() - Method in class com.wcohen.ss.AbstractStatisticalTokenDistance
-
- tokenIterator() - Method in interface com.wcohen.ss.api.Tokenizer
-
Return an iterator over interned tokens
- tokenIterator() - Method in class com.wcohen.ss.tokens.CharacterTokenizer
-
- tokenIterator() - Method in class com.wcohen.ss.tokens.NGramTokenizer
-
- tokenIterator() - Method in class com.wcohen.ss.tokens.SimpleTokenizer
-
- tokenize(String) - Method in interface com.wcohen.ss.api.Tokenizer
-
Return tokenized version of a string
- tokenize(String) - Method in class com.wcohen.ss.tokens.CharacterTokenizer
-
Return tokenized version of a string.
- tokenize(String) - Method in class com.wcohen.ss.tokens.NGramTokenizer
-
Return tokenized version of a string.
- tokenize(String) - Method in class com.wcohen.ss.tokens.SimpleTokenizer
-
Return tokenized version of a string.
- tokenizer - Variable in class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
-
- tokenizer - Variable in class com.wcohen.ss.AbstractTokenizedStringDistance
-
- Tokenizer - Interface in com.wcohen.ss.api
-
Split a string into tokens.
- tokenizer - Variable in class com.wcohen.ss.expt.AbbreviationsBlocker
-
- tokenizer - Variable in class com.wcohen.ss.expt.TokenBlocker
-
- toString() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
-
- toString() - Method in class com.wcohen.ss.abbvGapsHmm.Acronym
-
- toString() - Method in class com.wcohen.ss.ApproxMemoMatrix
-
Print the matrix, for debugging and/or explanation.
- toString() - Method in class com.wcohen.ss.BasicStringWrapper
-
- toString() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
-
- toString() - Method in class com.wcohen.ss.DirichletJS
-
- toString() - Method in class com.wcohen.ss.expt.AbbreviationsBlocker
-
- toString() - Method in class com.wcohen.ss.expt.Blocker.Pair
-
- toString() - Method in class com.wcohen.ss.expt.ClusterNGramBlocker
-
- toString() - Method in class com.wcohen.ss.expt.ClusterTokenBlocker
-
- toString() - Method in class com.wcohen.ss.expt.MatchData.Instance
-
- toString() - Method in class com.wcohen.ss.expt.MatchData
-
- toString() - Method in class com.wcohen.ss.expt.MatchExpt
-
- toString() - Method in class com.wcohen.ss.expt.NGramBlocker
-
- toString() - Method in class com.wcohen.ss.expt.NullBlocker
-
- toString() - Method in class com.wcohen.ss.expt.SourcedMatchData.Instance
-
- toString() - Method in class com.wcohen.ss.expt.SourcedMatchData
-
- toString() - Method in class com.wcohen.ss.expt.SpecialMatchExpt
-
- toString() - Method in class com.wcohen.ss.expt.TokenBlocker
-
- toString() - Method in class com.wcohen.ss.Jaccard
-
- toString() - Method in class com.wcohen.ss.Jaro
-
- toString() - Method in class com.wcohen.ss.JaroTFIDF
-
- toString() - Method in class com.wcohen.ss.JaroWinklerTFIDF
-
- toString() - Method in class com.wcohen.ss.JelinekMercerJS
-
- toString() - Method in class com.wcohen.ss.Level2
-
- toString() - Method in class com.wcohen.ss.Level2Jaro
-
- toString() - Method in class com.wcohen.ss.Level2JaroWinkler
-
- toString() - Method in class com.wcohen.ss.Level2Levenstein
-
- toString() - Method in class com.wcohen.ss.Level2MongeElkan
-
- toString() - Method in class com.wcohen.ss.Levenstein
-
- toString() - Method in class com.wcohen.ss.MemoMatrix
-
Print the matrix, for debugging and/or explanation.
- toString() - Method in class com.wcohen.ss.Mixture
-
- toString() - Method in class com.wcohen.ss.MongeElkan
-
- toString() - Method in class com.wcohen.ss.MongeElkanTFIDF
-
- toString() - Method in class com.wcohen.ss.MultiStringWrapper
-
- toString() - Method in class com.wcohen.ss.ScaledLevenstein
-
- toString() - Method in class com.wcohen.ss.SmithWaterman
-
- toString() - Method in class com.wcohen.ss.SoftTFIDF
-
- toString() - Method in class com.wcohen.ss.SoftTokenFelligiSunter
-
- toString() - Method in class com.wcohen.ss.SourcedSoftTFIDF
-
- toString() - Method in class com.wcohen.ss.SourcedTFIDF
-
- toString() - Method in class com.wcohen.ss.TagLink
-
toString returns the name and parameters of this string metric
- toString() - Method in class com.wcohen.ss.TFIDF
-
- toString() - Method in class com.wcohen.ss.TokenFelligiSunter
-
- toString() - Method in class com.wcohen.ss.tokens.BasicSourcedToken
-
- toString() - Method in class com.wcohen.ss.tokens.BasicToken
-
- toString() - Method in class com.wcohen.ss.tokens.CharacterTokenizer
-
- toString() - Method in class com.wcohen.ss.tokens.SimpleTokenizer
-
- toString() - Method in class com.wcohen.ss.tokens.TagLinkToken
-
toString returns the name of the string metric.
- toString() - Method in class com.wcohen.ss.UnsmoothedJS
-
- toString() - Method in class com.wcohen.ss.WinklerRescorer
-
- toStringNoStates() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
-
- toStringWithEmissions() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
-
- toStringWithFrequency() - Method in class com.wcohen.ss.abbvGapsHmm.Acronym
-
- toStringWithStates() - Method in class com.wcohen.ss.abbvGapsHmm.AbbreviationAlignmentContainer
-
- totalTokenCount - Variable in class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
-
- totalTokenCount - Variable in class com.wcohen.ss.AbstractStatisticalTokenDistance
-
- TP - Variable in class com.wcohen.ss.expt.ExtractAbbreviations.Stats
-
- train(List<List<Acronym>>, List<Map<String, String>>) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- train(List<List<Acronym>>, List<Map<String, String>>, boolean) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- train(List<String>, List<Integer>, List<Map<String, String>>) - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- train(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractSourcedStatisticalTokenDistance
-
Accumulate statistics on how often each token value occurs
- train(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractSourcedTokenizedStringDistance
-
- train(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractStatisticalTokenDistance
-
Accumulate statistics on how often each token value occurs
- train(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractTokenizedStringDistance
-
- train(StringDistanceLearner) - Method in class com.wcohen.ss.api.StringDistanceTeacher
-
- train(StringWrapperIterator) - Method in class com.wcohen.ss.JensenShannonDistance
-
Accumulate statistics on how often each token occurs.
- train(StringWrapperIterator) - Method in class com.wcohen.ss.Level2
-
Training is a null operation.
- trainCorpus(List<List<Acronym>>, List<Map<String, String>>) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMM
-
- trainIfNeeded() - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- trainOnAll() - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- trainOnCandidates() - Method in class com.wcohen.ss.abbvGapsHmm.AlignmentPredictionModel
-
- transitionIsLegal(int, int, AbbvGapsHMM.States, int, int, AbbvGapsHMM.States, AbbvGapsHMM.Transitions, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmBackwardsEvaluator
-
- transitionIsLegal(int, int, AbbvGapsHMM.States, int, int, AbbvGapsHMM.States, AbbvGapsHMM.Transitions, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHMMEvaluator
-
- transitionIsLegal(int, int, AbbvGapsHMM.States, int, int, AbbvGapsHMM.States, AbbvGapsHMM.Transitions, AbbvGapsHMM.Emissions) - Method in class com.wcohen.ss.abbvGapsHmm.AbbvGapsHmmForwardEvaluator
-