A

a - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
AbstractStatisticalTokenDistance - Class in com.wcohen.ss
Abstract token distance metric that uses frequency statistics.
AbstractStatisticalTokenDistance(Tokenizer) - Constructor for class com.wcohen.ss.AbstractStatisticalTokenDistance
 
AbstractStatisticalTokenDistance() - Constructor for class com.wcohen.ss.AbstractStatisticalTokenDistance
 
AbstractStringDistance - Class in com.wcohen.ss
Abstract class which implements StringDistanceLearner as well as StringDistance.
AbstractStringDistance() - Constructor for class com.wcohen.ss.AbstractStringDistance
 
AbstractTokenizedStringDistance - Class in com.wcohen.ss
Abstract distance metric for tokenized strings.
AbstractTokenizedStringDistance(Tokenizer) - Constructor for class com.wcohen.ss.AbstractTokenizedStringDistance
 
AbstractTokenizedStringDistance() - Constructor for class com.wcohen.ss.AbstractTokenizedStringDistance
 
AdaptiveStringDistanceLearner - Class in com.wcohen.ss
Abstract StringDistanceLearner class which averages results of a number of inner distance metrics, learned by a number of inner distance learners.
AdaptiveStringDistanceLearner() - Constructor for class com.wcohen.ss.AdaptiveStringDistanceLearner
 
AdaptiveStringDistanceLearner(BinaryClassifierLearner) - Constructor for class com.wcohen.ss.AdaptiveStringDistanceLearner
 
AdaptiveStringDistanceLearner(StringDistanceLearner[], String, BinaryClassifierLearner) - Constructor for class com.wcohen.ss.AdaptiveStringDistanceLearner
 
addExample(DistanceInstance) - Method in class com.wcohen.ss.AbstractStringDistance
Implements StringDistanceLearner api by accepting new DistanceInstance labels.
addExample(DistanceInstance) - Method in interface com.wcohen.ss.api.StringDistanceLearner
Accept the answer to the last query.
addExample(DistanceInstance) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Pass new labels to the sublearners.
addInstance(String, String, String) - Method in class com.wcohen.ss.expt.MatchData
Add a single instance, with given src and id, to the datafile
AffineGap - Class in com.wcohen.ss
Affine-gap string distance, following Durbin et al.
AffineGap() - Constructor for class com.wcohen.ss.AffineGap
 
AffineGap(CharMatchScore, double, double, double) - Constructor for class com.wcohen.ss.AffineGap
 
AffineGap.MatrixTrio - Class in com.wcohen.ss
 
AffineGap.MatrixTrio(StringWrapper, StringWrapper) - Constructor for class com.wcohen.ss.AffineGap.MatrixTrio
 
AffineGap.MatrixTrio.InsertSMatrix - Class in com.wcohen.ss
 
AffineGap.MatrixTrio.InsertSMatrix(StringWrapper, StringWrapper) - Constructor for class com.wcohen.ss.AffineGap.MatrixTrio.InsertSMatrix
 
AffineGap.MatrixTrio.InsertTMatrix - Class in com.wcohen.ss
 
AffineGap.MatrixTrio.InsertTMatrix(StringWrapper, StringWrapper) - Constructor for class com.wcohen.ss.AffineGap.MatrixTrio.InsertTMatrix
 
ApproxMemoMatrix - Class in com.wcohen.ss
Variant of MemoMatrix that only stores values near the diagonal, for better efficiency.
ApproxNeedlemanWunsch - Class in com.wcohen.ss
Needleman-Wunsch string distance, following Durbin et al.
ApproxNeedlemanWunsch() - Constructor for class com.wcohen.ss.ApproxNeedlemanWunsch
 
ApproxNeedlemanWunsch(CharMatchScore, double) - Constructor for class com.wcohen.ss.ApproxNeedlemanWunsch
 
asBagOfTokens(StringWrapper) - Method in class com.wcohen.ss.AbstractTokenizedStringDistance
 
asMultiDistanceInstance(DistanceInstance) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
 
asMultiDistanceInstanceList(DistanceInstanceIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
 
asMultiStringWrapper(StringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
 
asMultiStringWrapper(StringWrapper) - Method in class com.wcohen.ss.MultiStringDistance
Lazily prepare a string.
asMultiStringWrapperList(StringWrapperIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
 
asUnitVector(StringWrapper) - Method in class com.wcohen.ss.TagLink
 
asUnitVector(StringWrapper) - Method in class com.wcohen.ss.TFIDF
 
AveragedStringDistanceLearner - Class in com.wcohen.ss
Abstract StringDistanceLearner class which averages results of a number of inner distance metrics, learned by a number of inner distance learners.
AveragedStringDistanceLearner() - Constructor for class com.wcohen.ss.AveragedStringDistanceLearner
 
AveragedStringDistanceLearner(StringDistanceLearner[], String) - Constructor for class com.wcohen.ss.AveragedStringDistanceLearner
 
averagePrecision() - Method in class com.wcohen.ss.expt.MatchExpt
non-interpolated average precision
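
The non-interpolated average precision reported by averagePrecision() can be pictured with a small standalone sketch. It is not taken from MatchExpt: the class name AveragePrecisionSketch, the boolean-array input, and the choice to divide by the number of correct pairs present in the ranking are all assumptions made for illustration.

```java
public class AveragePrecisionSketch {

    /**
     * Hypothetical helper, not part of com.wcohen.ss.
     * ranked[i] is true when the i-th candidate pair, ordered from most
     * to least similar, is a correct match.
     */
    public static double averagePrecision(boolean[] ranked) {
        int correctSoFar = 0;
        double sumOfPrecisions = 0.0;
        for (int i = 0; i < ranked.length; i++) {
            if (ranked[i]) {
                correctSoFar++;
                // Precision measured at the rank of each correct pair.
                sumOfPrecisions += (double) correctSoFar / (i + 1);
            }
        }
        // Assumption: normalize by the correct pairs seen in the ranking.
        return correctSoFar == 0 ? 0.0 : sumOfPrecisions / correctSoFar;
    }

    public static void main(String[] args) {
        boolean[] ranked = {true, false, true, false};
        System.out.println(averagePrecision(ranked));
    }
}
```

An implementation that also counts correct pairs missed by the blocker would divide by that larger total instead, which lowers the value whenever the ranking is incomplete.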

B

b - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
backgroundProb(Token) - Method in class com.wcohen.ss.JensenShannonDistance
Probability of token in the background language model
BasicDistanceInstanceIterator - Class in com.wcohen.ss
A simple DistanceInstanceIterator implementation.
BasicDistanceInstanceIterator(Iterator<DistanceInstance>) - Constructor for class com.wcohen.ss.BasicDistanceInstanceIterator
 
BasicStringWrapper - Class in com.wcohen.ss
An extensible (non-final) class that implements some of the functionality of a string.
BasicStringWrapper(String) - Constructor for class com.wcohen.ss.BasicStringWrapper
 
BasicStringWrapperIterator - Class in com.wcohen.ss
A simple StringWrapperIterator implementation.
BasicStringWrapperIterator(Iterator) - Constructor for class com.wcohen.ss.BasicStringWrapperIterator
 
BasicTeacher - Class in com.wcohen.ss.expt
Train a StringDistanceLearner.
BasicTeacher(Blocker, MatchData) - Constructor for class com.wcohen.ss.expt.BasicTeacher
Create a teacher from a blocker and a dataset.
BasicTeacher(StringWrapperIterator, DistanceInstanceIterator, DistanceInstanceIterator) - Constructor for class com.wcohen.ss.expt.BasicTeacher
Create a teacher using specific values for the various iterators.
BasicToken - Class in com.wcohen.ss.tokens
An interned version of a string.
block(MatchData) - Method in class com.wcohen.ss.expt.Blocker
Load matchdata and prepare it for production of candidate pairs.
block(MatchData) - Method in class com.wcohen.ss.expt.NullBlocker
 
block(MatchData) - Method in class com.wcohen.ss.expt.TokenBlocker
 
Blocker - Class in com.wcohen.ss.expt
Produces candidate pairs from a MatchData structure, and provides access to those candidate pairs.
Blocker() - Constructor for class com.wcohen.ss.expt.Blocker
 
blocker(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Load a blocker.
blocker(String, String, String) - Method in class com.wcohen.ss.expt.MatchExptScript
Load a blocker, with optional boolean value
Blocker.Pair - Class in com.wcohen.ss.expt
Holds a pair of instances, with mutable distance between them.
Blocker.Pair(MatchData.Instance, MatchData.Instance, boolean) - Constructor for class com.wcohen.ss.expt.Blocker.Pair
 
BLOCKER_PACKAGE - Static variable in class com.wcohen.ss.expt.MatchExpt
 
BLOCKER_PACKAGE - Static variable in class com.wcohen.ss.expt.MatchExptScript
 
blockerName - Variable in class com.wcohen.ss.expt.MatchExpt
 
blockerRecall() - Method in class com.wcohen.ss.expt.MatchExpt
performance of the blocker
blockingTime - Variable in class com.wcohen.ss.expt.MatchExpt
 
build(String) - Static method in class com.wcohen.ss.DistanceLearnerFactory
Generate a StringDistance from a class name, or a sequence of class names separated by slashes.
build(String[]) - Static method in class com.wcohen.ss.DistanceLearnerFactory
Generate a StringDistance from a sequence of class names.
buildArray(String) - Static method in class com.wcohen.ss.DistanceLearnerFactory
Generate a StringDistanceArray given a sequence of class names separated by slashes.

C

cellFormat - Variable in class com.wcohen.ss.MemoMatrix
 
charAt(int) - Method in interface com.wcohen.ss.api.StringWrapper
Return the i-th char of the wrapped string
charAt(int) - Method in class com.wcohen.ss.BasicStringWrapper
 
CharMatchScore - Class in com.wcohen.ss
Abstract distance between characters.
CharMatchScore() - Constructor for class com.wcohen.ss.CharMatchScore
 
checkTrainingHasHappened(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbstractStatisticalTokenDistance
 
clear(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Clear datasets, blockers, or learners.
clusterMode - Variable in class com.wcohen.ss.expt.Blocker
 
ClusterNGramBlocker - Class in com.wcohen.ss.expt
TokenBlocker for clustering based on NGram co-occurrence.
ClusterNGramBlocker() - Constructor for class com.wcohen.ss.expt.ClusterNGramBlocker
 
ClusterTokenBlocker - Class in com.wcohen.ss.expt
TokenBlocker for clustering.
ClusterTokenBlocker() - Constructor for class com.wcohen.ss.expt.ClusterTokenBlocker
 
ClusterTokenBlocker(Tokenizer, double) - Constructor for class com.wcohen.ss.expt.ClusterTokenBlocker
 
collectionSize - Variable in class com.wcohen.ss.AbstractStatisticalTokenDistance
 
com.wcohen.ss - package com.wcohen.ss
This package contains a collection of approximate string comparators, plus code for performing controlled experiments with them.
com.wcohen.ss.api - package com.wcohen.ss.api
 
com.wcohen.ss.demo - package com.wcohen.ss.demo
 
com.wcohen.ss.expt - package com.wcohen.ss.expt
 
com.wcohen.ss.lookup - package com.wcohen.ss.lookup
 
com.wcohen.ss.tokens - package com.wcohen.ss.tokens
 
CombinedStringDistanceLearner - Class in com.wcohen.ss
Abstract StringDistanceLearner class which combines results of a number of inner distance metrics, learned by a number of inner distance learners.
CombinedStringDistanceLearner() - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner
 
CombinedStringDistanceLearner(StringDistanceLearner[], String) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner
 
CombinedStringDistanceLearner.CombinedStringDistance - Class in com.wcohen.ss
Abstract class for combining innerDistances.
CombinedStringDistanceLearner.CombinedStringDistance(StringDistance[], MultiStringWrapper) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
CombinedStringDistanceLearner.JthDistanceInstanceIterator - Class in com.wcohen.ss
Iterate over the j-th field of the MultiStringWrappers in a DistanceInstance of MultiStringWrappers
CombinedStringDistanceLearner.JthDistanceInstanceIterator(int, Iterator<DistanceInstance>) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner.JthDistanceInstanceIterator
 
CombinedStringDistanceLearner.JthStringWrapperValueIterator - Class in com.wcohen.ss
Iterate over the j-th field of MultiStringWrapper
CombinedStringDistanceLearner.JthStringWrapperValueIterator(int, Iterator) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner.JthStringWrapperValueIterator
 
CombinedStringDistanceLearner.MyDistanceInstance - Class in com.wcohen.ss
 
CombinedStringDistanceLearner.MyDistanceInstance(StringWrapper, StringWrapper, boolean, double) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
CombinedStringDistanceLearner.MyMultiDistanceInstance - Class in com.wcohen.ss
 
CombinedStringDistanceLearner.MyMultiDistanceInstance(StringWrapper, StringWrapper, boolean, double) - Constructor for class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
 
comboAddExample(DistanceInstance) - Method in class com.wcohen.ss.AdaptiveStringDistanceLearner
 
comboAddExample(DistanceInstance) - Method in class com.wcohen.ss.AveragedStringDistanceLearner
 
comboAddExample(DistanceInstance) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Pass a labeled example to the score-combination learner.
comboHasNextQuery() - Method in class com.wcohen.ss.AdaptiveStringDistanceLearner
 
comboHasNextQuery() - Method in class com.wcohen.ss.AveragedStringDistanceLearner
 
comboHasNextQuery() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Poll the routine that learns to combine inner distance scores to see if it wants to make more queries.
comboNextQuery() - Method in class com.wcohen.ss.AdaptiveStringDistanceLearner
 
comboNextQuery() - Method in class com.wcohen.ss.AveragedStringDistanceLearner
 
comboNextQuery() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Get the next query from the score-combination learner.
comboSetDistanceInstancePool(Iterator) - Method in class com.wcohen.ss.AdaptiveStringDistanceLearner
 
comboSetDistanceInstancePool(Iterator) - Method in class com.wcohen.ss.AveragedStringDistanceLearner
 
comboSetDistanceInstancePool(Iterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Set up a pool of (possibly unlabeled) instance distances, for the learner to make queries from.
comboSetStringWrapperPool(Iterator) - Method in class com.wcohen.ss.AdaptiveStringDistanceLearner
 
comboSetStringWrapperPool(Iterator) - Method in class com.wcohen.ss.AveragedStringDistanceLearner
 
comboSetStringWrapperPool(Iterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Pass an iterator over unlabeled string wrappers to the score-combination learner, just in case that's useful.
compareTo(Blocker.Pair) - Method in class com.wcohen.ss.expt.Blocker.Pair
 
compareTo(Token) - Method in class com.wcohen.ss.tokens.BasicToken
 
compute(int, int) - Method in class com.wcohen.ss.AffineGap.MatrixTrio
 
compute(int, int) - Method in class com.wcohen.ss.AffineGap.MatrixTrio.InsertSMatrix
 
compute(int, int) - Method in class com.wcohen.ss.AffineGap.MatrixTrio.InsertTMatrix
 
compute() - Method in class com.wcohen.ss.expt.MatchExptScript
Compute learners.
correct - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
countCorrectPairs(MatchData) - Method in class com.wcohen.ss.expt.Blocker
Compute the number of correct pairs between src1 and src2, where src2>src1

D

dataset(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Load a dataset.
DEFAULT_TOKENIZER - Static variable in class com.wcohen.ss.tokens.NGramTokenizer
 
DEFAULT_TOKENIZER - Static variable in class com.wcohen.ss.tokens.SimpleTokenizer
 
delim - Variable in class com.wcohen.ss.CombinedStringDistanceLearner
 
DirichletJS - Class in com.wcohen.ss
Jensen-Shannon distance of two unigram language models, smoothed using Dirichlet prior.
DirichletJS(Tokenizer, double) - Constructor for class com.wcohen.ss.DirichletJS
 
DirichletJS() - Constructor for class com.wcohen.ss.DirichletJS
 
displayResults(boolean, PrintStream) - Method in class com.wcohen.ss.expt.MatchExpt
Show results in a very simple format.
DIST_01 - Static variable in class com.wcohen.ss.CharMatchScore
Scores match as 0, mismatch as -1.
DIST_21 - Static variable in class com.wcohen.ss.CharMatchScore
Scores match as +2, mismatch as -1.
distance - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
distance(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Load a distance learner.
DISTANCE_PACKAGE - Static variable in class com.wcohen.ss.expt.MatchExpt
 
DISTANCE_PACKAGE - Static variable in class com.wcohen.ss.expt.MatchExptScript
 
distanceExamplePool() - Method in class com.wcohen.ss.api.StringDistanceTeacher
A pool of labeled pairs of strings over which distances will be computed, to be used for supervised learning.
distanceExamplePool() - Method in class com.wcohen.ss.expt.BasicTeacher
 
distanceExamplePool() - Method in class com.wcohen.ss.expt.MatchDataTeacher
 
DistanceInstance - Interface in com.wcohen.ss.api
An 'instance' for a StringDistance, analogous to an 'instance' for a classification learner.
DistanceInstanceIterator - Interface in com.wcohen.ss.api
An iterator over DistanceInstance objects.
distanceInstancePool() - Method in class com.wcohen.ss.api.StringDistanceTeacher
A pool of unlabeled pairs of strings over which distances will be computed, to be used for active or semi-supervised learning.
distanceInstancePool() - Method in class com.wcohen.ss.expt.BasicTeacher
 
distanceInstancePool() - Method in class com.wcohen.ss.expt.MatchDataTeacher
 
DistanceLearnerFactory - Class in com.wcohen.ss
Creates distance metric learners from string descriptions.
DistanceLearnerFactory() - Constructor for class com.wcohen.ss.DistanceLearnerFactory
 
documentFrequency - Variable in class com.wcohen.ss.AbstractStatisticalTokenDistance
 
doMain(StringDistance, String[]) - Static method in class com.wcohen.ss.AbstractStringDistance
Default main routine for testing
doMain(StringDistance, String[]) - Static method in class com.wcohen.ss.MultiStringDistance
Default main routine for testing
doScore(MultiStringWrapper, MultiStringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
Produce a score, assuming ms and mt are the correct sizes, and fully prepared.

E

echo(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Turn echoing of commands on/off.
explainCombination(MultiStringWrapper, MultiStringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
Explain how the primitive scores were combined.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbstractStringDistance
This method needs to be implemented by subclasses.
explainScore(String, String) - Method in class com.wcohen.ss.AbstractStringDistance
Scores are explained by converting Strings to StringWrappers with the prepare function.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AffineGap
 
explainScore(StringWrapper, StringWrapper) - Method in interface com.wcohen.ss.api.StringDistance
Explain how the distance was computed.
explainScore(String, String) - Method in interface com.wcohen.ss.api.StringDistance
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.ApproxNeedlemanWunsch
 
explainScore(String, String) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Jaccard
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Jaro
 
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.JensenShannonDistance
 
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Level2
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Mixture
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.MongeElkan
Version where the distance is possibly scaled to [0,1].
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.MultiStringDistance
 
explainScore(String, String) - Method in class com.wcohen.ss.MultiStringDistance
Scores are explained by converting Strings to StringWrappers with the prepare function.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.NeedlemanWunsch
 
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SmithWaterman
 
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SoftTFIDF
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TagLink
explainStringMetric gives a brief explanation of how the stringMetric was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TFIDF
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TokenFelligiSunter
Explain how the distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.tokens.TagLinkToken
explainScore returns an explanation of how the string distance was computed.
explainScore(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.WinklerRescorer
 
explainScoreCombination(double[]) - Method in class com.wcohen.ss.MultiStringAvgDistance
Explain how to combine the scores for each primitive distance function on each field.
explainScoreCombination(double[]) - Method in class com.wcohen.ss.MultiStringDistance
Explain how to combine the scores for each primitive distance function on each field.

F

FastLookup - Interface in com.wcohen.ss.lookup
Interface for SoftTFIDFDictionary and the rescoring variant of it.
fileName - Variable in class com.wcohen.ss.expt.MatchExpt
 
freeze() - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Make it impossible to add new values, but possible to perform lookups.

G

get(int) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
 
get(int) - Method in class com.wcohen.ss.MultiStringWrapper
Return the i-th field.
getA() - Method in interface com.wcohen.ss.api.DistanceInstance
 
getA() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
getA(int) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
 
getA() - Method in class com.wcohen.ss.expt.Blocker.Pair
 
getAlignedChar(int, boolean) - Method in class com.wcohen.ss.ApproxNeedlemanWunsch
Find a character in the first string, s, that can be aligned with the i-th character in the second string, t.
getB() - Method in interface com.wcohen.ss.api.DistanceInstance
 
getB() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
getB(int) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
 
getB() - Method in class com.wcohen.ss.expt.Blocker.Pair
 
getCollectionSize() - Method in class com.wcohen.ss.TFIDF
 
getDistance() - Method in class com.wcohen.ss.AbstractStringDistance
Implements the StringDistanceLearner api by returning a StringDistance.
getDistance() - Method in class com.wcohen.ss.AdaptiveStringDistanceLearner
 
getDistance() - Method in interface com.wcohen.ss.api.DistanceInstance
 
getDistance() - Method in interface com.wcohen.ss.api.StringDistanceLearner
Return the learned distance.
getDistance() - Method in class com.wcohen.ss.AveragedStringDistanceLearner
 
getDistance() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Get the final string distance, which will be based on the distances learned by the inner learners, as well as the combination scheme learned by comboSetAnswer, comboTrain, etc.
getDistance() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
getDistance() - Method in class com.wcohen.ss.expt.Blocker.Pair
 
getDistance(int) - Method in class com.wcohen.ss.MultiStringAvgDistance
 
getDistance(int) - Method in class com.wcohen.ss.MultiStringDistance
Get the distance used for the i-th pair of fields
getDistanceLearnerIndex(int) - Method in class com.wcohen.ss.MultiStringWrapper
Return the index of the learner that will process internal field i
getDocumentFrequency(Token) - Method in class com.wcohen.ss.AbstractStatisticalTokenDistance
 
getDocumentFrequency(Token) - Method in class com.wcohen.ss.TFIDF
Get the document frequency of the token.
getFieldIndex(int) - Method in class com.wcohen.ss.MultiStringWrapper
Return the field i of the original string associated with internal field i
getFilename() - Method in class com.wcohen.ss.expt.MatchData
 
getId() - Method in class com.wcohen.ss.expt.MatchData.Instance
 
getIndex() - Method in interface com.wcohen.ss.api.Token
 
getIndex() - Method in class com.wcohen.ss.tokens.BasicToken
 
getInnerDistances() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Get an array of trained inner distances.
getInstance(String, int) - Method in class com.wcohen.ss.expt.MatchData
Get the j-th record for the named source.
getIterator() - Method in class com.wcohen.ss.expt.MatchData
 
getLambda() - Method in class com.wcohen.ss.JelinekMercerJS
 
getLookupTime() - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Get the time used in performing the lookup
getMatched() - Method in class com.wcohen.ss.tokens.TagLinkToken
getMatched returns the number of matched characters.
getMaxFraction() - Method in class com.wcohen.ss.expt.TokenBlocker
 
getMaxInvertedIndexSize() - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
getMaxNGramSize() - Method in class com.wcohen.ss.expt.NGramBlocker
 
getMinNGramSize() - Method in class com.wcohen.ss.expt.NGramBlocker
 
getPair(int) - Method in class com.wcohen.ss.expt.Blocker
Get the i-th candidate pair, as produced from most recently block()-ed data
getPair(int) - Method in class com.wcohen.ss.expt.NullBlocker
 
getPair(int) - Method in class com.wcohen.ss.expt.TokenBlocker
 
getPairs() - Method in class com.wcohen.ss.expt.MatchExpt
 
getPseudoCount() - Method in class com.wcohen.ss.DirichletJS
 
getResult(int) - Method in interface com.wcohen.ss.lookup.FastLookup
Get the i'th string found by the last lookup
getResult(int) - Method in class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
 
getResult(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Get the i'th string found by the last lookup
getScore(int) - Method in interface com.wcohen.ss.lookup.FastLookup
Get the score of the i'th string found by the last lookup
getScore(int) - Method in class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
 
getScore(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Get the score of the i'th string found by the last lookup
getScore() - Method in class com.wcohen.ss.TagLink.Candidates
getScore returns the score or distance between strings T and U.
getSource(int) - Method in class com.wcohen.ss.expt.MatchData
Get string identifier for i-th source
getSource() - Method in class com.wcohen.ss.expt.MatchData.Instance
 
getSource() - Method in class com.wcohen.ss.expt.MatchData.MatchIterator
Return the source of the last StringWrapper.
getTeacher() - Method in class com.wcohen.ss.lookup.SoftDictionary
Return a teacher that can 'train' a distance metric from the information in the dictionary.
getTokenMatchThreshold() - Method in class com.wcohen.ss.SoftTFIDF
 
getTokens() - Method in class com.wcohen.ss.TFIDF
Access the tokens of the last prepare()-ed string.
getTPos() - Method in class com.wcohen.ss.TagLink.Candidates
getTPos returns the position of string T.
getTr() - Method in class com.wcohen.ss.tokens.TagLinkToken
getTr returns the constant value Gamma.
getUPos() - Method in class com.wcohen.ss.TagLink.Candidates
getUPos returns the position of string U.
getValue() - Method in interface com.wcohen.ss.api.Token
 
getValue(int) - Method in interface com.wcohen.ss.lookup.FastLookup
Get the value of the i'th string found by the last lookup
getValue(int) - Method in class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
 
getValue(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Get the value of the i'th string found by the last lookup
getValue() - Method in class com.wcohen.ss.tokens.BasicToken
 
getWeight(Token) - Method in class com.wcohen.ss.TFIDF
Access the weight of a token in the vector created for the last prepare()-ed string.
getWindowSize(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
graphPrecisionRecall(PrintStream) - Method in class com.wcohen.ss.expt.MatchExpt
Graph interpolated precision vs recall
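
Interpolated precision, as graphed here and returned by interpolated11PointPrecision(), is commonly defined as the best precision achieved at any recall at or above a given level. The following is a minimal sketch of that common definition at the eleven recall levels 0.0, 0.1, ..., 1.0. It is not the MatchExpt code: the class name InterpolatedPrecisionSketch, the method signature, and the totalCorrect parameter are assumptions made for illustration.

```java
public class InterpolatedPrecisionSketch {

    /**
     * Hypothetical helper, not part of com.wcohen.ss.
     * ranked[i] is true when the i-th candidate pair, ordered from most
     * to least similar, is a correct match; totalCorrect is the number
     * of correct pairs that recall is measured against.
     */
    public static double[] interpolated11Point(boolean[] ranked, int totalCorrect) {
        if (totalCorrect <= 0) {
            throw new IllegalArgumentException("totalCorrect must be positive");
        }
        double[] best = new double[11];
        int correctSoFar = 0;
        for (int i = 0; i < ranked.length; i++) {
            if (!ranked[i]) {
                continue; // precision only peaks at ranks holding a correct pair
            }
            correctSoFar++;
            double precision = (double) correctSoFar / (i + 1);
            for (int level = 0; level <= 10; level++) {
                // Integer form of: recall >= level / 10.0
                boolean reached = correctSoFar * 10 >= level * totalCorrect;
                if (reached && precision > best[level]) {
                    best[level] = precision;
                }
            }
        }
        return best;
    }

    public static void main(String[] args) {
        double[] p = interpolated11Point(new boolean[] {true, false, true, false}, 2);
        for (int level = 0; level <= 10; level++) {
            System.out.println("recall " + (level / 10.0) + " precision " + p[level]);
        }
    }
}
```

Recall levels that the ranking never reaches keep a precision of 0 in this sketch.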

H

hasAnswers() - Method in class com.wcohen.ss.api.StringDistanceTeacher
Return true if this teacher can answer more queries.
hasAnswers() - Method in class com.wcohen.ss.expt.BasicTeacher
 
hasAnswers() - Method in class com.wcohen.ss.expt.MatchDataTeacher
 
hashCode() - Method in class com.wcohen.ss.BasicStringWrapper
 
hashCode() - Method in class com.wcohen.ss.tokens.BasicToken
 
hasNext() - Method in interface com.wcohen.ss.api.DistanceInstanceIterator
 
hasNext() - Method in interface com.wcohen.ss.api.StringWrapperIterator
 
hasNext() - Method in class com.wcohen.ss.BasicDistanceInstanceIterator
 
hasNext() - Method in class com.wcohen.ss.BasicStringWrapperIterator
 
hasNext() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthDistanceInstanceIterator
 
hasNext() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthStringWrapperValueIterator
 
hasNext() - Method in class com.wcohen.ss.expt.MatchData.MatchIterator
 
hasNextQuery() - Method in class com.wcohen.ss.AbstractStringDistance
Implements StringDistanceLearner api by informing a teacher if the learner has DistanceInstance queries.
hasNextQuery() - Method in interface com.wcohen.ss.api.StringDistanceLearner
Returns true if the learner has more queries to answer.
hasNextQuery() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
See if someone has a query

I

INCREASING_DISTANCE - Static variable in interface com.wcohen.ss.api.DistanceInstance
 
innerDistances - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
innerDistanceString() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
Helper for 'toString()' which produces a description of the distances being combined.
innerLearners - Variable in class com.wcohen.ss.CombinedStringDistanceLearner
 
intern(String) - Method in interface com.wcohen.ss.api.Tokenizer
Convert a given string into a token.
intern(String) - Method in class com.wcohen.ss.tokens.NGramTokenizer
 
intern(String) - Method in class com.wcohen.ss.tokens.SimpleTokenizer
 
interpolated11PointPrecision() - Method in class com.wcohen.ss.expt.MatchExpt
Return an array of interpolated precision at various different recall levels.
interpolated11PointRecallLevels() - Static method in class com.wcohen.ss.expt.MatchExpt
Return recall levels associated with the precision levels returned by interpolated11PointPrecision.
is - Variable in class com.wcohen.ss.AffineGap.MatrixTrio
 
isCorrect() - Method in interface com.wcohen.ss.api.DistanceInstance
 
isCorrect() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
isCorrect() - Method in class com.wcohen.ss.expt.Blocker.Pair
 
isLegalMultiStringWrapperSize(int) - Method in class com.wcohen.ss.MultiStringDistance
Check if a string has a valid number of fields.
it - Variable in class com.wcohen.ss.AffineGap.MatrixTrio
 

J

Jaccard - Class in com.wcohen.ss
Jaccard distance implementation.
Jaccard(Tokenizer) - Constructor for class com.wcohen.ss.Jaccard
 
Jaccard() - Constructor for class com.wcohen.ss.Jaccard
 
Jaro - Class in com.wcohen.ss
Jaro distance metric.
Jaro() - Constructor for class com.wcohen.ss.Jaro
 
JaroWinkler - Class in com.wcohen.ss
Jaro distance metric, as extended by Winkler.
JaroWinkler() - Constructor for class com.wcohen.ss.JaroWinkler
 
JaroWinklerTFIDF - Class in com.wcohen.ss
Soft TFIDF-based distance metric, extended to use "soft" token-matching with the JaroWinkler distance metric.
JaroWinklerTFIDF() - Constructor for class com.wcohen.ss.JaroWinklerTFIDF
 
JelinekMercerJS - Class in com.wcohen.ss
Jensen-Shannon distance of two unigram language models, smoothed using Jelinek-Mercer mixture model.
JelinekMercerJS(Tokenizer, double) - Constructor for class com.wcohen.ss.JelinekMercerJS
 
JelinekMercerJS() - Constructor for class com.wcohen.ss.JelinekMercerJS
 
JensenShannonDistance - Class in com.wcohen.ss
Distance metrics based on Jensen-Shannon distance of two smoothed unigram language models.
JensenShannonDistance(Tokenizer) - Constructor for class com.wcohen.ss.JensenShannonDistance
 
JensenShannonDistance() - Constructor for class com.wcohen.ss.JensenShannonDistance
 

L

labelInstance(DistanceInstance) - Method in class com.wcohen.ss.api.StringDistanceTeacher
Label an instance queried by the learner.
labelInstance(DistanceInstance) - Method in class com.wcohen.ss.expt.BasicTeacher
 
labelInstance(DistanceInstance) - Method in class com.wcohen.ss.expt.MatchDataTeacher
 
learner(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Load a learner.
learnerName - Variable in class com.wcohen.ss.expt.MatchExpt
 
learningTime - Variable in class com.wcohen.ss.expt.MatchExpt
 
length() - Method in interface com.wcohen.ss.api.StringWrapper
Return the length of the wrapped string
length() - Method in class com.wcohen.ss.BasicStringWrapper
 
Level2 - Class in com.wcohen.ss
Generic version of Monge & Elkan's "level 2" recursive field matching.
Level2(Tokenizer, StringDistance) - Constructor for class com.wcohen.ss.Level2
 
Level2Jaro - Class in com.wcohen.ss
"Level 2" recursive field matching algorithm, based on Jaro distance.
Level2Jaro() - Constructor for class com.wcohen.ss.Level2Jaro
 
Level2JaroWinkler - Class in com.wcohen.ss
"Level 2" recursive field matching algorithm, based on Jaro-Winkler distance.
Level2JaroWinkler() - Constructor for class com.wcohen.ss.Level2JaroWinkler
 
Level2Levenstein - Class in com.wcohen.ss
"Level 2" recursive field matching algorithm using Levenshtein distance.
Level2Levenstein() - Constructor for class com.wcohen.ss.Level2Levenstein
 
Level2MongeElkan - Class in com.wcohen.ss
Monge & Elkan's "level 2" recursive field matching algorithm.
Level2MongeElkan() - Constructor for class com.wcohen.ss.Level2MongeElkan
 
Levenstein - Class in com.wcohen.ss
Levenshtein string distance.
Levenstein() - Constructor for class com.wcohen.ss.Levenstein
 
load(File) - Method in class com.wcohen.ss.lookup.SoftDictionary
Insert all lines in a file as items mapping to themselves.
load(File, boolean) - Method in class com.wcohen.ss.lookup.SoftDictionary
Insert all lines in a file as items mapping to themselves.
loadAliases(File) - Method in class com.wcohen.ss.lookup.SoftDictionary
Load a file of identifiers, each of which has multiple aliases.
loadAliases(File) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Load a file of identifiers, each of which has multiple aliases.
lookup(double, String) - Method in interface com.wcohen.ss.lookup.FastLookup
Lookup items similar to 'toFind', and return the number of items found.
lookup(double, String) - Method in class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
 
lookup(String, String) - Method in class com.wcohen.ss.lookup.SoftDictionary
Lookup a string in the dictionary.
lookup(String, StringWrapper) - Method in class com.wcohen.ss.lookup.SoftDictionary
Lookup a prepared string in the dictionary.
lookup(String) - Method in class com.wcohen.ss.lookup.SoftDictionary
Lookup a string in the dictionary.
lookup(StringWrapper) - Method in class com.wcohen.ss.lookup.SoftDictionary
Lookup a prepared string in the dictionary.
lookup(double, String) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Lookup items SoftTFIDF-similar to the 'toFind' argument, and return the number of items found.
lookupDistance(String, String) - Method in class com.wcohen.ss.lookup.SoftDictionary
Return the distance to the best match.
lookupDistance(String, StringWrapper) - Method in class com.wcohen.ss.lookup.SoftDictionary
Return the distance to the best match.
lookupDistance(String) - Method in class com.wcohen.ss.lookup.SoftDictionary
Return the distance to the best match.
lookupDistance(StringWrapper) - Method in class com.wcohen.ss.lookup.SoftDictionary
Return the distance to the best match.
lookupTime - Variable in class com.wcohen.ss.lookup.SoftTFIDFDictionary
 

M

m - Variable in class com.wcohen.ss.AffineGap.MatrixTrio
 
main(String[]) - Static method in class com.wcohen.ss.AffineGap
 
main(String[]) - Static method in class com.wcohen.ss.ApproxNeedlemanWunsch
 
main(String[]) - Static method in class com.wcohen.ss.demo.SampleSoftTFIDFUsage
 
main(String[]) - Static method in class com.wcohen.ss.DirichletJS
 
main(String[]) - Static method in class com.wcohen.ss.DistanceLearnerFactory
Test routine.
main(String[]) - Static method in class com.wcohen.ss.expt.MatchData
 
main(String[]) - Static method in class com.wcohen.ss.expt.MatchExpt
Command-line interface.
main(String[]) - Static method in class com.wcohen.ss.expt.MatchExptScript
 
main(String[]) - Static method in class com.wcohen.ss.Jaccard
 
main(String[]) - Static method in class com.wcohen.ss.Jaro
 
main(String[]) - Static method in class com.wcohen.ss.JaroWinkler
 
main(String[]) - Static method in class com.wcohen.ss.JaroWinklerTFIDF
 
main(String[]) - Static method in class com.wcohen.ss.JelinekMercerJS
 
main(String[]) - Static method in class com.wcohen.ss.Level2
 
main(String[]) - Static method in class com.wcohen.ss.Level2Jaro
 
main(String[]) - Static method in class com.wcohen.ss.Level2JaroWinkler
 
main(String[]) - Static method in class com.wcohen.ss.Level2Levenstein
 
main(String[]) - Static method in class com.wcohen.ss.Level2MongeElkan
 
main(String[]) - Static method in class com.wcohen.ss.Levenstein
 
main(String[]) - Static method in class com.wcohen.ss.lookup.SoftDictionary
Simple main for testing.
main(String[]) - Static method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Simple main for testing and experimentation
main(String[]) - Static method in class com.wcohen.ss.Mixture
 
main(String[]) - Static method in class com.wcohen.ss.MongeElkan
 
main(String[]) - Static method in class com.wcohen.ss.MultiStringAvgDistance
 
main(String[]) - Static method in class com.wcohen.ss.MultiStringWrapper
 
main(String[]) - Static method in class com.wcohen.ss.NeedlemanWunsch
 
main(String[]) - Static method in class com.wcohen.ss.ScaledLevenstein
 
main(String[]) - Static method in class com.wcohen.ss.SmithWaterman
 
main(String[]) - Static method in class com.wcohen.ss.SoftTokenFelligiSunter
 
main(String[]) - Static method in class com.wcohen.ss.TFIDF
 
main(String[]) - Static method in class com.wcohen.ss.TokenFelligiSunter
 
main(String[]) - Static method in class com.wcohen.ss.tokens.NGramTokenizer
Test routine
main(String[]) - Static method in class com.wcohen.ss.tokens.SimpleTokenizer
Test routine
main(String[]) - Static method in class com.wcohen.ss.UnsmoothedJS
 
main(String[]) - Static method in class com.wcohen.ss.WizardUI
The entry point to the wizard.
MatchData - Class in com.wcohen.ss.expt
Holds data for evaluating a distance metric.
MatchData(String) - Constructor for class com.wcohen.ss.expt.MatchData
Read match data from a file.
MatchData() - Constructor for class com.wcohen.ss.expt.MatchData
 
MatchData.InputFormatException - Exception in com.wcohen.ss.expt
Signals an incorrectly formatted MatchData file.
MatchData.InputFormatException(String, int, String) - Constructor for exception com.wcohen.ss.expt.MatchData.InputFormatException
 
MatchData.Instance - Class in com.wcohen.ss.expt
A single item (aka record, string, etc) to match against others.
MatchData.Instance(String, String, String) - Constructor for class com.wcohen.ss.expt.MatchData.Instance
 
MatchData.MatchIterator - Class in com.wcohen.ss.expt
Iterates over all stored StringWrappers
MatchData.MatchIterator(MatchData) - Constructor for class com.wcohen.ss.expt.MatchData.MatchIterator
 
MatchDataTeacher - Class in com.wcohen.ss.expt
Train a StringDistanceLearner using MatchData and a Blocker.
MatchDataTeacher(MatchData, Blocker) - Constructor for class com.wcohen.ss.expt.MatchDataTeacher
 
MatchExpt - Class in com.wcohen.ss.expt
Perform a matching experiment using a data file, distance function and blocker.
MatchExpt(MatchData, StringDistanceLearner, Blocker) - Constructor for class com.wcohen.ss.expt.MatchExpt
 
MatchExpt(MatchData, StringDistanceLearner) - Constructor for class com.wcohen.ss.expt.MatchExpt
 
MatchExptScript - Class in com.wcohen.ss.expt
Perform a series of match experiments, specified by a script in an input file.
MatchExptScript() - Constructor for class com.wcohen.ss.expt.MatchExptScript
 
matchingTime - Variable in class com.wcohen.ss.expt.MatchExpt
 
matchScore(char, char) - Method in class com.wcohen.ss.CharMatchScore
 
max3(double, double, double) - Static method in class com.wcohen.ss.MemoMatrix
Return max of three numbers.
max4(double, double, double, double) - Static method in class com.wcohen.ss.MemoMatrix
Return max of four numbers.
maxF1() - Method in class com.wcohen.ss.expt.MatchExpt
Maximum F1 over all thresholds.
maxTokenIndex() - Method in interface com.wcohen.ss.api.Tokenizer
Return the highest index of any interned token.
maxTokenIndex() - Method in class com.wcohen.ss.tokens.NGramTokenizer
 
maxTokenIndex() - Method in class com.wcohen.ss.tokens.SimpleTokenizer
 
MemoMatrix - Class in com.wcohen.ss
A matrix of doubles, defined recursively by the compute(i,j) method, that will not be recomputed more than necessary.
Mixture - Class in com.wcohen.ss
Mixture-based distance metric.
Mixture(Tokenizer) - Constructor for class com.wcohen.ss.Mixture
 
Mixture() - Constructor for class com.wcohen.ss.Mixture
 
MongeElkan - Class in com.wcohen.ss
The match method proposed by Monge and Elkan.
MongeElkan() - Constructor for class com.wcohen.ss.MongeElkan
 
MultiStringAvgDistance - Class in com.wcohen.ss
StringDistance defined over Strings that are broken into fields, with distance defined as the average distance between any field.
MultiStringAvgDistance(StringDistance, String) - Constructor for class com.wcohen.ss.MultiStringAvgDistance
 
MultiStringDistance - Class in com.wcohen.ss
Abstract class StringDistance defined over Strings that are broken into fields.
MultiStringDistance(String) - Constructor for class com.wcohen.ss.MultiStringDistance
 
MultiStringWrapper - Class in com.wcohen.ss
A StringWrapper that stores a version of the string that has been either (a) split into a number of distinct fields, (b) duplicated k times, so that k different StringDistances can preprocess it, or (c) both of the above.
MultiStringWrapper(String, String) - Constructor for class com.wcohen.ss.MultiStringWrapper
Create a MultiStringWrapper by splitting s into fields based on the given delimiter.
MultiStringWrapper(String, int) - Constructor for class com.wcohen.ss.MultiStringWrapper
Create a MultiStringWrapper by making k copies of s.
MultiStringWrapper(String, int, String) - Constructor for class com.wcohen.ss.MultiStringWrapper
Create a MultiStringWrapper by making k copies of each field of s.
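
The MemoMatrix entry above describes a matrix whose cells are defined recursively by compute(i,j) and are not recomputed more than necessary. The following is a minimal sketch of that idea, not the library's code: the class name MemoMatrixSketch, the min3 helper, and the unit-cost edit-distance recurrence are all assumptions chosen for illustration (the index itself lists max3 and max4, which suggests the real class works with scores to maximize).

```java
// Hypothetical illustration of a memoized, recursively defined matrix.
// Each cell is computed at most once and then served from the cache.
class MemoMatrixSketch {
    private final String s;
    private final String t;
    private final double[][] value;     // cached cell values
    private final boolean[][] computed; // true once a cell has been filled in
    private int computeCalls = 0;       // counts how often compute() actually runs

    MemoMatrixSketch(String s, String t) {
        this.s = s;
        this.t = t;
        this.value = new double[s.length() + 1][t.length() + 1];
        this.computed = new boolean[s.length() + 1][t.length() + 1];
    }

    // Return the minimum of three numbers (assumed helper, mirroring max3).
    static double min3(double a, double b, double c) {
        return Math.min(a, Math.min(b, c));
    }

    // Look up cell (i,j), computing it only on first access.
    double get(int i, int j) {
        if (!computed[i][j]) {
            value[i][j] = compute(i, j);
            computed[i][j] = true;
        }
        return value[i][j];
    }

    // Recursive definition of a cell: unit-cost edit distance between
    // the first i characters of s and the first j characters of t.
    private double compute(int i, int j) {
        computeCalls++;
        if (i == 0) return j;
        if (j == 0) return i;
        double substitution = (s.charAt(i - 1) == t.charAt(j - 1)) ? 0.0 : 1.0;
        return min3(get(i - 1, j - 1) + substitution,
                    get(i - 1, j) + 1.0,
                    get(i, j - 1) + 1.0);
    }

    int computeCalls() {
        return computeCalls;
    }
}
```

Calling new MemoMatrixSketch("kitten", "sitting").get(6, 7) fills each of the 7 x 8 cells once; repeated calls to get reuse the cached values.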

N

NeedlemanWunsch - Class in com.wcohen.ss
Needleman-Wunsch string distance, following Durbin et al.
NeedlemanWunsch() - Constructor for class com.wcohen.ss.NeedlemanWunsch
 
NeedlemanWunsch(CharMatchScore, double) - Constructor for class com.wcohen.ss.NeedlemanWunsch
 
next() - Method in interface com.wcohen.ss.api.DistanceInstanceIterator
 
next() - Method in interface com.wcohen.ss.api.StringWrapperIterator
 
next() - Method in class com.wcohen.ss.BasicDistanceInstanceIterator
 
next() - Method in class com.wcohen.ss.BasicStringWrapperIterator
 
next() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthDistanceInstanceIterator
 
next() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthStringWrapperValueIterator
 
next() - Method in class com.wcohen.ss.expt.MatchData.MatchIterator
Returns the next StringWrapper as an object.
nextDistanceInstance() - Method in interface com.wcohen.ss.api.DistanceInstanceIterator
 
nextDistanceInstance() - Method in class com.wcohen.ss.BasicDistanceInstanceIterator
 
nextDistanceInstance() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthDistanceInstanceIterator
 
nextQuery() - Method in class com.wcohen.ss.AbstractStringDistance
Implements StringDistanceLearner api by querying for DistanceInstance labels.
nextQuery() - Method in interface com.wcohen.ss.api.StringDistanceLearner
Returns a DistanceInstance for which the learner would like a label.
nextQuery() - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Get the next query from one of the sublearners.
nextStringWrapper() - Method in interface com.wcohen.ss.api.StringWrapperIterator
 
nextStringWrapper() - Method in class com.wcohen.ss.BasicStringWrapperIterator
 
nextStringWrapper() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthStringWrapperValueIterator
 
nextStringWrapper() - Method in class com.wcohen.ss.expt.MatchData.MatchIterator
Return the next StringWrapper.
NGramBlocker - Class in com.wcohen.ss.expt
Finds all pairs that share a not-too-common character n-gram.
NGramBlocker() - Constructor for class com.wcohen.ss.expt.NGramBlocker
 
NGramTokenizer - Class in com.wcohen.ss.tokens
Wraps another tokenizer, and computes all n-grams of characters from each token produced by the inner tokenizer.
NGramTokenizer(int, int, boolean, Tokenizer) - Constructor for class com.wcohen.ss.tokens.NGramTokenizer
 
NullBlocker - Class in com.wcohen.ss.expt
Inefficient exhaustive blocking algorithm.
NullBlocker() - Constructor for class com.wcohen.ss.expt.NullBlocker
 
numCorrectPairs() - Method in class com.wcohen.ss.expt.Blocker
Return total number of correct pairs in the dataset.
numCorrectPairs - Variable in class com.wcohen.ss.expt.MatchExpt
 
numCorrectPairs() - Method in class com.wcohen.ss.expt.NullBlocker
 
numCorrectPairs() - Method in class com.wcohen.ss.expt.TokenBlocker
 
numInstances(String) - Method in class com.wcohen.ss.expt.MatchData
Number of records for source with given string id
numSources() - Method in class com.wcohen.ss.expt.MatchData
Number of sources in data set
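
The NGramTokenizer entry above describes computing character n-grams from each token of an inner tokenizer. A rough sketch of that step follows, under stated assumptions: the class NGramSketch and its methods are hypothetical, whitespace splitting stands in for the inner tokenizer, and any boundary markers or token interning the real class may perform are left out.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of character n-gram extraction per token.
class NGramSketch {
    // All character n-grams of one token, for every n in [minN, maxN].
    static List<String> ngrams(String token, int minN, int maxN) {
        List<String> out = new ArrayList<>();
        for (int n = minN; n <= maxN; n++) {
            for (int i = 0; i + n <= token.length(); i++) {
                out.add(token.substring(i, i + n));
            }
        }
        return out;
    }

    // Split text on whitespace (stand-in for an inner tokenizer),
    // then collect the n-grams of each resulting token.
    static List<String> tokenizeToNGrams(String text, int minN, int maxN) {
        List<String> out = new ArrayList<>();
        for (String token : text.trim().split("\\s+")) {
            if (!token.isEmpty()) {
                out.addAll(ngrams(token, minN, maxN));
            }
        }
        return out;
    }
}
```

Because n-grams are taken within each token, no n-gram spans a token boundary; tokens shorter than minN contribute nothing in this sketch.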

O

outOfRange(int, int) - Method in class com.wcohen.ss.ApproxMemoMatrix
 

P

pairs - Variable in class com.wcohen.ss.expt.MatchExpt
 
pairsPerSecond() - Method in class com.wcohen.ss.expt.MatchExpt
Return total time to process data, divided by the number of pairs
precisionRecall() - Method in class com.wcohen.ss.expt.MatchExptScript
Show interpolated 11-pt precision curves for each blocker/distance/dataset
prepare(String) - Method in class com.wcohen.ss.AbstractStringDistance
Default way to preprocess a string for distance computation.
prepare(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractStringDistance
Implements StringDistanceLearner api by providing a way to prep a StringWrapperIterator for training.
prepare(DistanceInstanceIterator) - Method in class com.wcohen.ss.AbstractStringDistance
Implements StringDistanceLearner api by providing a way to prep a DistanceInstanceIterator for training.
prepare(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractTokenizedStringDistance
 
prepare(String) - Method in interface com.wcohen.ss.api.StringDistance
Preprocess a string for distance computation
prepare(StringWrapperIterator) - Method in interface com.wcohen.ss.api.StringDistanceLearner
Preprocess a StringWrapperIterator for unsupervised training.
prepare(DistanceInstanceIterator) - Method in interface com.wcohen.ss.api.StringDistanceLearner
Preprocess a DistanceInstanceIterator for supervised training.
prepare(String) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
prepare(StringWrapperIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Prepare data for the sublearners.
prepare(DistanceInstanceIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Prepare data for the learners.
prepare(String) - Method in class com.wcohen.ss.Jaccard
Preprocess a string by finding tokens.
prepare(String) - Method in class com.wcohen.ss.Jaro
 
prepare(String) - Method in class com.wcohen.ss.JensenShannonDistance
Preprocess a string by finding tokens and giving them weights W such that W is the smoothed probability of the token appearing in the document.
prepare(String) - Method in class com.wcohen.ss.lookup.SoftDictionary
Prepare a string for quicker lookup.
prepare(String) - Method in class com.wcohen.ss.MultiStringDistance
Prepare a string.
prepare(StringDistance[]) - Method in class com.wcohen.ss.MultiStringWrapper
Prepare each field with the appropriate distance
prepare(String) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
Preprocess a string by finding tokens
prepare(String) - Method in class com.wcohen.ss.TagLink
Preprocess a string by finding tokens and giving them TFIDF weights
prepare(String) - Method in class com.wcohen.ss.TFIDF
Preprocess a string by finding tokens and giving them TFIDF weights
prepare(String) - Method in class com.wcohen.ss.TokenFelligiSunter
Preprocess a string by finding tokens and giving them appropriate weights
prepare(String) - Method in class com.wcohen.ss.WinklerRescorer
 
PrintfFormat - Class in com.wcohen.ss
PrintfFormat allows the formatting of an array of objects embedded within a string.
PrintfFormat(String) - Constructor for class com.wcohen.ss.PrintfFormat
Constructs an array of control specifications possibly preceded, separated, or followed by ordinary strings.
PrintfFormat(Locale, String) - Constructor for class com.wcohen.ss.PrintfFormat
Constructs an array of control specifications possibly preceded, separated, or followed by ordinary strings.
printNegativeValues - Variable in class com.wcohen.ss.MemoMatrix
 
prototype - Variable in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
put(String, String, Object) - Method in class com.wcohen.ss.lookup.SoftDictionary
Insert a string into the dictionary.
put(String, Object) - Method in class com.wcohen.ss.lookup.SoftDictionary
Insert a string into the dictionary.
put(String, StringWrapper, Object) - Method in class com.wcohen.ss.lookup.SoftDictionary
Insert a prepared string into the dictionary.
put(String, Object) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Insert a string into the dictionary, and associate it with the given value.
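
Several prepare entries above describe preprocessing a string (for example, by finding tokens) before distances are computed. The sketch below illustrates that prepare-then-score pattern with a token-set Jaccard similarity. It is an assumption-laden illustration rather than the library's implementation: PreparedJaccardSketch and its nested Prepared type are hypothetical stand-ins, and lowercasing plus whitespace splitting replaces a real tokenizer.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical illustration of the prepare-then-score pattern.
class PreparedJaccardSketch {
    // Holds the original string together with its cached token set.
    static final class Prepared {
        final String original;
        final Set<String> tokens;

        Prepared(String original, Set<String> tokens) {
            this.original = original;
            this.tokens = tokens;
        }
    }

    // Preprocess a string once by finding its tokens.
    static Prepared prepare(String s) {
        Set<String> tokens = new HashSet<>();
        for (String tok : s.toLowerCase().trim().split("\\s+")) {
            if (!tok.isEmpty()) {
                tokens.add(tok);
            }
        }
        return new Prepared(s, tokens);
    }

    // Jaccard similarity of two prepared strings: |A ∩ B| / |A ∪ B|.
    static double score(Prepared a, Prepared b) {
        if (a.tokens.isEmpty() && b.tokens.isEmpty()) {
            return 1.0; // convention chosen for this sketch
        }
        Set<String> intersection = new HashSet<>(a.tokens);
        intersection.retainAll(b.tokens);
        int union = a.tokens.size() + b.tokens.size() - intersection.size();
        return (double) intersection.size() / union;
    }

    // Convenience overload: prepare both strings, then score them.
    static double score(String s, String t) {
        return score(prepare(s), prepare(t));
    }
}
```

When one string is compared against many others, preparing it once and reusing the Prepared object avoids repeated tokenization.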

R

refreeze() - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
remove() - Method in class com.wcohen.ss.BasicDistanceInstanceIterator
 
remove() - Method in class com.wcohen.ss.BasicStringWrapperIterator
 
remove() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthDistanceInstanceIterator
 
remove() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.JthStringWrapperValueIterator
 
remove() - Method in class com.wcohen.ss.expt.MatchData.MatchIterator
Not implemented.
RescoringSoftTFIDFDictionary - Class in com.wcohen.ss.lookup
Wrapper around a SoftTFIDFDictionary that allows you to 'rescore' the result using an arbitrary StringDistance.
RescoringSoftTFIDFDictionary(FastLookup, double, StringDistance) - Constructor for class com.wcohen.ss.lookup.RescoringSoftTFIDFDictionary
 
restore(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Restore experimental data previously 'saved' to a file.
restore(File) - Static method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
runScript(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Load commands from a file and execute them.

S

s - Variable in class com.wcohen.ss.MemoMatrix
 
sameId(MatchData.Instance) - Method in class com.wcohen.ss.expt.MatchData.Instance
 
SampleSoftTFIDFUsage - Class in com.wcohen.ss.demo
 
SampleSoftTFIDFUsage() - Constructor for class com.wcohen.ss.demo.SampleSoftTFIDFUsage
 
sAt(int) - Method in class com.wcohen.ss.MemoMatrix
Get i-th char of s, indexing s from 1..n
save(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Save current experimental data to a file
saveAs(File) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
ScaledLevenstein - Class in com.wcohen.ss
Levenshtein string distance.
ScaledLevenstein() - Constructor for class com.wcohen.ss.ScaledLevenstein
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AbstractStringDistance
This method needs to be implemented by subclasses.
score(String, String) - Method in class com.wcohen.ss.AbstractStringDistance
Strings are scored by converting them to StringWrappers with the prepare function.
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.AffineGap
 
score(StringWrapper, StringWrapper) - Method in interface com.wcohen.ss.api.StringDistance
Find the distance between s and t.
score(String, String) - Method in interface com.wcohen.ss.api.StringDistance
Find the distance between s and t
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.ApproxNeedlemanWunsch
 
score(String, String) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.CombinedStringDistance
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Jaccard
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Jaro
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.JensenShannonDistance
Jensen-Shannon distance between distributions.
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Level2
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.Mixture
Distance is argmax_lambda prod_{w in s} [lambda Pr(w|t) + (1-lambda) Pr(w|background)].
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.MongeElkan
Version of distance which is possibly scaled to [0,1].
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.MultiStringDistance
 
score(String, String) - Method in class com.wcohen.ss.MultiStringDistance
Strings are scored by converting them to StringWrappers with the prepare function.
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.NeedlemanWunsch
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.ScaledLevenstein
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SmithWaterman
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SoftTFIDF
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TagLink
getStringMetric computes the similarity between a pair of strings T and U.
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TFIDF
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.TokenFelligiSunter
 
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.tokens.TagLinkToken
score returns a string distance value between 0 and 1 for a pair of tokens.
score(StringWrapper, StringWrapper) - Method in class com.wcohen.ss.WinklerRescorer
 
scoreCombination(double[]) - Method in class com.wcohen.ss.MultiStringAvgDistance
Combine the scores for each primitive distance function on each field.
scoreCombination(double[]) - Method in class com.wcohen.ss.MultiStringDistance
Combine the scores for each primitive distance function on each field.
set(int, StringWrapper) - Method in class com.wcohen.ss.MultiStringWrapper
Set the i-th field.
setClusterMode(boolean) - Method in class com.wcohen.ss.expt.Blocker
In clusterMode, consider pairings between instances from the same source.
setClusterMode(Boolean) - Method in class com.wcohen.ss.expt.Blocker
 
setCollectionSize(int) - Method in class com.wcohen.ss.TFIDF
Set the size of the collection that this TFIDF measure was trained on to some value.
setDistance(double) - Method in interface com.wcohen.ss.api.DistanceInstance
 
setDistance(double) - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyDistanceInstance
 
setDistance(double) - Method in class com.wcohen.ss.expt.Blocker.Pair
 
setDistanceInstancePool(DistanceInstanceIterator) - Method in class com.wcohen.ss.AbstractStringDistance
Implements StringDistanceLearner api by providing a way to accept a pool of unlabeled DistanceInstance's.
setDistanceInstancePool(DistanceInstanceIterator) - Method in interface com.wcohen.ss.api.StringDistanceLearner
Accept a set of unlabeled DistanceInstances, to use in making distance instance queries.
setDistanceInstancePool(DistanceInstanceIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Pass the training data along to the inner learners.
setDocumentFrequency(Token, int) - Method in class com.wcohen.ss.TFIDF
Set the document frequency of the token to some value.
setIgnoreCase(boolean) - Method in class com.wcohen.ss.tokens.SimpleTokenizer
 
setIgnorePunctuation(boolean) - Method in class com.wcohen.ss.tokens.SimpleTokenizer
 
setLambda(double) - Method in class com.wcohen.ss.JelinekMercerJS
 
setLambda(Double) - Method in class com.wcohen.ss.JelinekMercerJS
 
setMaxFraction(double) - Method in class com.wcohen.ss.expt.TokenBlocker
 
setMaxInvertedIndexSize(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Set the maximum size of an inverted index that will be followed.
setMaxNGramSize(int) - Method in class com.wcohen.ss.expt.NGramBlocker
 
setMinNGramSize(int) - Method in class com.wcohen.ss.expt.NGramBlocker
 
setMismatchFactor(double) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
 
setMismatchFactor(Double) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
 
setMismatchFactor(double) - Method in class com.wcohen.ss.TokenFelligiSunter
 
setMismatchFactor(Double) - Method in class com.wcohen.ss.TokenFelligiSunter
 
setPseudoCount(double) - Method in class com.wcohen.ss.DirichletJS
 
setPseudoCount(Double) - Method in class com.wcohen.ss.DirichletJS
 
setScaling(boolean) - Method in class com.wcohen.ss.MongeElkan
If scaling is true, then distances are scaled to 0-1
setScaling(Double) - Method in class com.wcohen.ss.MongeElkan
For interfacing with reflection in MatchExptScript.
setStringWrapperPool(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractStringDistance
Implements the StringDistanceLearner api, by providing a way to accumulate statistics for a set of related strings.
setStringWrapperPool(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractTokenizedStringDistance
 
setStringWrapperPool(StringWrapperIterator) - Method in interface com.wcohen.ss.api.StringDistanceLearner
Unsupervised learning method that observes strings for which distance will be computed.
setStringWrapperPool(StringWrapperIterator) - Method in class com.wcohen.ss.CombinedStringDistanceLearner
Pass the training data along to the inner learners.
setTokenMatchThreshold(double) - Method in class com.wcohen.ss.SoftTFIDF
 
setTokenMatchThreshold(Double) - Method in class com.wcohen.ss.SoftTFIDF
 
setTokenMatchThreshold(double) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
 
setTokenMatchThreshold(Double) - Method in class com.wcohen.ss.SoftTokenFelligiSunter
 
setTreshold(double) - Method in class com.wcohen.ss.tokens.TagLinkToken
setTreshold sets a new value for the constant Gamma.
setWidth(int) - Method in class com.wcohen.ss.ApproxNeedlemanWunsch
 
setWindowSize(int) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Set the 'windowSize' used for finding similar tokens.
show(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Show datasets, blockers, or learners.
SimpleTokenizer - Class in com.wcohen.ss.tokens
Simple implementation of a Tokenizer.
SimpleTokenizer(boolean, boolean) - Constructor for class com.wcohen.ss.tokens.SimpleTokenizer
 
size() - Method in class com.wcohen.ss.expt.Blocker
Return number of candidate pairs, as produced from most recently block()-ed data
size() - Method in class com.wcohen.ss.expt.NullBlocker
 
size() - Method in class com.wcohen.ss.expt.TokenBlocker
 
size() - Method in class com.wcohen.ss.lookup.SoftDictionary
Return the number of entries in the dictionary.
size() - Method in class com.wcohen.ss.MultiStringWrapper
Return number of fields.
slowLookup(double, String) - Method in class com.wcohen.ss.lookup.SoftTFIDFDictionary
Exactly like lookup, but works by exhaustively checking every stored string.
SmithWaterman - Class in com.wcohen.ss
Smith-Waterman string distance, following Durbin et al.
SmithWaterman() - Constructor for class com.wcohen.ss.SmithWaterman
 
SmithWaterman(CharMatchScore, double) - Constructor for class com.wcohen.ss.SmithWaterman
 
smoothedProbability(Token, double, double) - Method in class com.wcohen.ss.DirichletJS
smoothed probability of the token
smoothedProbability(Token, double, double) - Method in class com.wcohen.ss.JelinekMercerJS
smoothed probability of the token
smoothedProbability(Token, double, double) - Method in class com.wcohen.ss.JensenShannonDistance
Smoothed probability of the token with frequency freq in a bag with the given totalWeight
smoothedProbability(Token, double, double) - Method in class com.wcohen.ss.UnsmoothedJS
Unsmoothed probability of the token
SoftDictionary - Class in com.wcohen.ss.lookup
Looks up nearly-matching strings in a dictionary, using a string distance.
SoftDictionary() - Constructor for class com.wcohen.ss.lookup.SoftDictionary
 
SoftDictionary(StringDistanceLearner) - Constructor for class com.wcohen.ss.lookup.SoftDictionary
 
SoftDictionary(Tokenizer) - Constructor for class com.wcohen.ss.lookup.SoftDictionary
 
SoftDictionary(StringDistanceLearner, Tokenizer) - Constructor for class com.wcohen.ss.lookup.SoftDictionary
 
SoftTFIDF - Class in com.wcohen.ss
TFIDF-based distance metric, extended to use "soft" token-matching.
SoftTFIDF(Tokenizer, StringDistance, double) - Constructor for class com.wcohen.ss.SoftTFIDF
 
SoftTFIDF(StringDistance, double) - Constructor for class com.wcohen.ss.SoftTFIDF
 
SoftTFIDF(StringDistance) - Constructor for class com.wcohen.ss.SoftTFIDF
 
SoftTFIDFDictionary - Class in com.wcohen.ss.lookup
Looks up nearly-matching strings in a dictionary, using SoftTFIDF distance.
SoftTFIDFDictionary() - Constructor for class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
SoftTFIDFDictionary(Tokenizer) - Constructor for class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
SoftTFIDFDictionary(Tokenizer, double) - Constructor for class com.wcohen.ss.lookup.SoftTFIDFDictionary
 
SoftTFIDFDictionary(Tokenizer, double, int, int) - Constructor for class com.wcohen.ss.lookup.SoftTFIDFDictionary
Create a new SoftTFIDFDictionary.
SoftTokenFelligiSunter - Class in com.wcohen.ss
Highly simplified model of Fellegi-Sunter's method 1, applied to tokens.
SoftTokenFelligiSunter(Tokenizer, StringDistance, double, double) - Constructor for class com.wcohen.ss.SoftTokenFelligiSunter
 
SoftTokenFelligiSunter() - Constructor for class com.wcohen.ss.SoftTokenFelligiSunter
 
sortingTime - Variable in class com.wcohen.ss.expt.MatchExpt
 
sprintf(Object[]) - Method in class com.wcohen.ss.PrintfFormat
Format an array of objects.
sprintf() - Method in class com.wcohen.ss.PrintfFormat
Format nothing.
sprintf(int) - Method in class com.wcohen.ss.PrintfFormat
Format an int.
sprintf(long) - Method in class com.wcohen.ss.PrintfFormat
Format a long.
sprintf(double) - Method in class com.wcohen.ss.PrintfFormat
Format a double.
sprintf(String) - Method in class com.wcohen.ss.PrintfFormat
Format a String.
sprintf(Object) - Method in class com.wcohen.ss.PrintfFormat
Format an Object.
StringDistance - Interface in com.wcohen.ss.api
Compute the difference between pairs of strings.
StringDistanceLearner - Interface in com.wcohen.ss.api
Learn a StringDistance.
StringDistanceTeacher - Class in com.wcohen.ss.api
Train a StringDistanceLearner and return the learned StringDistance, using some unspecified source of information to train the learner.
StringDistanceTeacher() - Constructor for class com.wcohen.ss.api.StringDistanceTeacher
 
StringWrapper - Interface in com.wcohen.ss.api
Something that implements some of the functionality of Java's string class, but which is a non-final class, and hence can also cache additional information to facilitate later processing.
stringWrapperIterator() - Method in class com.wcohen.ss.api.StringDistanceTeacher
Strings over which distances will be computed.
StringWrapperIterator - Interface in com.wcohen.ss.api
An iterator over StringWrapper objects.
stringWrapperIterator() - Method in class com.wcohen.ss.expt.BasicTeacher
 
stringWrapperIterator() - Method in class com.wcohen.ss.expt.MatchDataTeacher
 

T

t - Variable in class com.wcohen.ss.MemoMatrix
 
table(String) - Method in class com.wcohen.ss.expt.MatchExptScript
Show a table of some experiment-wide numeric measurement.
TagLink - Class in com.wcohen.ss
 
TagLink() - Constructor for class com.wcohen.ss.TagLink
TagLink default constructor.
TagLink(AbstractStringDistance) - Constructor for class com.wcohen.ss.TagLink
TagLink constructor requires a character-based string metric.
TagLink(Tokenizer, AbstractStringDistance) - Constructor for class com.wcohen.ss.TagLink
TagLink constructor requires a tokenizer and a tokenDistance metric.
TagLink(String[]) - Constructor for class com.wcohen.ss.TagLink
TagLink constructor requires dataset string array in order to compute the IDF weights.
TagLink(String[], AbstractStringDistance) - Constructor for class com.wcohen.ss.TagLink
TagLink constructor requires dataset string array in order to compute the IDF weights and a tokenDistance metric.
TagLink.Candidates - Class in com.wcohen.ss
 
TagLink.Candidates(int, int, double) - Constructor for class com.wcohen.ss.TagLink.Candidates
Candidates constructor.
TagLink.UnitVector - Class in com.wcohen.ss
Marker class extending BagOfTokens
TagLink.UnitVector(String, Token[]) - Constructor for class com.wcohen.ss.TagLink.UnitVector
 
TagLink.UnitVector(BagOfTokens) - Constructor for class com.wcohen.ss.TagLink.UnitVector
 
TagLinkToken - Class in com.wcohen.ss.tokens
 
TagLinkToken() - Constructor for class com.wcohen.ss.tokens.TagLinkToken
TagLinkToken default constructor.
TagLinkToken(double) - Constructor for class com.wcohen.ss.tokens.TagLinkToken
TagLinkToken constructor.
tAt(int) - Method in class com.wcohen.ss.MemoMatrix
Get i-th char of t, indexing t from 1.
TFIDF - Class in com.wcohen.ss
TFIDF-based distance metric.
TFIDF(Tokenizer) - Constructor for class com.wcohen.ss.TFIDF
 
TFIDF() - Constructor for class com.wcohen.ss.TFIDF
 
TFIDF.UnitVector - Class in com.wcohen.ss
Marker class extending BagOfTokens
TFIDF.UnitVector(String, Token[]) - Constructor for class com.wcohen.ss.TFIDF.UnitVector
 
TFIDF.UnitVector(BagOfTokens) - Constructor for class com.wcohen.ss.TFIDF.UnitVector
 
time() - Method in class com.wcohen.ss.expt.MatchExpt
Return total time to process data.
toEvaluation() - Method in class com.wcohen.ss.expt.MatchExpt
 
toGUI() - Method in class com.wcohen.ss.expt.MatchExpt
Construct a viewer for the results.
Token - Interface in com.wcohen.ss.api
An interned version of a string.
TokenBlocker - Class in com.wcohen.ss.expt
Finds all pairs that share a not-too-common token.
TokenBlocker(Tokenizer, double) - Constructor for class com.wcohen.ss.expt.TokenBlocker
 
TokenBlocker() - Constructor for class com.wcohen.ss.expt.TokenBlocker
 
TokenFelligiSunter - Class in com.wcohen.ss
Highly simplified model of Fellegi-Sunter's method 1, applied to tokens.
TokenFelligiSunter(Tokenizer, double) - Constructor for class com.wcohen.ss.TokenFelligiSunter
 
TokenFelligiSunter() - Constructor for class com.wcohen.ss.TokenFelligiSunter
 
tokenIterator() - Method in interface com.wcohen.ss.api.Tokenizer
Return an iterator over interned tokens.
tokenIterator() - Method in class com.wcohen.ss.tokens.NGramTokenizer
 
tokenIterator() - Method in class com.wcohen.ss.tokens.SimpleTokenizer
 
tokenize(String) - Method in interface com.wcohen.ss.api.Tokenizer
Return tokenized version of a string.
tokenize(String) - Method in class com.wcohen.ss.tokens.NGramTokenizer
Return tokenized version of a string.
tokenize(String) - Method in class com.wcohen.ss.tokens.SimpleTokenizer
Return tokenized version of a string.
tokenizer - Variable in class com.wcohen.ss.AbstractTokenizedStringDistance
 
Tokenizer - Interface in com.wcohen.ss.api
Split a string into tokens.
tokenizer - Variable in class com.wcohen.ss.expt.TokenBlocker
 
toString() - Method in class com.wcohen.ss.ApproxMemoMatrix
Print the matrix, for debugging and/or explanation.
toString() - Method in class com.wcohen.ss.BasicStringWrapper
 
toString() - Method in class com.wcohen.ss.CombinedStringDistanceLearner.MyMultiDistanceInstance
 
toString() - Method in class com.wcohen.ss.DirichletJS
 
toString() - Method in class com.wcohen.ss.expt.Blocker.Pair
 
toString() - Method in class com.wcohen.ss.expt.ClusterNGramBlocker
 
toString() - Method in class com.wcohen.ss.expt.ClusterTokenBlocker
 
toString() - Method in class com.wcohen.ss.expt.MatchData.Instance
 
toString() - Method in class com.wcohen.ss.expt.MatchData
 
toString() - Method in class com.wcohen.ss.expt.MatchExpt
 
toString() - Method in class com.wcohen.ss.expt.NGramBlocker
 
toString() - Method in class com.wcohen.ss.expt.NullBlocker
 
toString() - Method in class com.wcohen.ss.expt.TokenBlocker
 
toString() - Method in class com.wcohen.ss.Jaccard
 
toString() - Method in class com.wcohen.ss.Jaro
 
toString() - Method in class com.wcohen.ss.JaroWinklerTFIDF
 
toString() - Method in class com.wcohen.ss.JelinekMercerJS
 
toString() - Method in class com.wcohen.ss.Level2
 
toString() - Method in class com.wcohen.ss.Level2Jaro
 
toString() - Method in class com.wcohen.ss.Level2JaroWinkler
 
toString() - Method in class com.wcohen.ss.Level2Levenstein
 
toString() - Method in class com.wcohen.ss.Level2MongeElkan
 
toString() - Method in class com.wcohen.ss.Levenstein
 
toString() - Method in class com.wcohen.ss.MemoMatrix
Print the matrix, for debugging and/or explanation.
toString() - Method in class com.wcohen.ss.Mixture
 
toString() - Method in class com.wcohen.ss.MongeElkan
 
toString() - Method in class com.wcohen.ss.MultiStringWrapper
 
toString() - Method in class com.wcohen.ss.ScaledLevenstein
 
toString() - Method in class com.wcohen.ss.SmithWaterman
 
toString() - Method in class com.wcohen.ss.SoftTFIDF
 
toString() - Method in class com.wcohen.ss.SoftTokenFelligiSunter
 
toString() - Method in class com.wcohen.ss.TagLink
toString returns the name and parameters of this string metric.
toString() - Method in class com.wcohen.ss.TFIDF
 
toString() - Method in class com.wcohen.ss.TokenFelligiSunter
 
toString() - Method in class com.wcohen.ss.tokens.BasicToken
 
toString() - Method in class com.wcohen.ss.tokens.SimpleTokenizer
 
toString() - Method in class com.wcohen.ss.tokens.TagLinkToken
toString returns the name of the string metric.
toString() - Method in class com.wcohen.ss.UnsmoothedJS
 
toString() - Method in class com.wcohen.ss.WinklerRescorer
 
totalTokenCount - Variable in class com.wcohen.ss.AbstractStatisticalTokenDistance
 
train(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractStatisticalTokenDistance
Accumulate statistics on how often each token value occurs.
train(StringWrapperIterator) - Method in class com.wcohen.ss.AbstractTokenizedStringDistance
 
train(StringDistanceLearner) - Method in class com.wcohen.ss.api.StringDistanceTeacher
 
train(StringWrapperIterator) - Method in class com.wcohen.ss.JensenShannonDistance
Accumulate statistics on how often each token occurs.
train(StringWrapperIterator) - Method in class com.wcohen.ss.Level2
Training is a null operation.

U

UnsmoothedJS - Class in com.wcohen.ss
Jensen-Shannon distance of two unsmoothed unigram language models.
UnsmoothedJS() - Constructor for class com.wcohen.ss.UnsmoothedJS
 
unwrap() - Method in interface com.wcohen.ss.api.StringWrapper
Return the string that is wrapped.
unwrap() - Method in class com.wcohen.ss.BasicStringWrapper
 

W

WinklerRescorer - Class in com.wcohen.ss
Winkler's reweighting scheme for distance metrics.
WinklerRescorer(StringDistance) - Constructor for class com.wcohen.ss.WinklerRescorer
Rescore the innerDistance's scores, to account for the subjectively greater importance of the first few characters.
WizardUI - Class in com.wcohen.ss
Top-level GUI interface.
WizardUI() - Constructor for class com.wcohen.ss.WizardUI
 
