com.wcohen.ss
Class Jaccard

java.lang.Object
  extended by com.wcohen.ss.AbstractStringDistance
      extended by com.wcohen.ss.Jaccard
All Implemented Interfaces:
StringDistance, StringDistanceLearner

public class Jaccard
extends AbstractStringDistance

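Jaccard score over token sets. For two token sets S and T, the score is |S ∩ T| / |S ∪ T|: the size of their intersection divided by the size of their union. It ranges from 0 (no shared tokens) to 1 (identical sets). Although the class is described as a distance, this ratio is the Jaccard similarity coefficient, so higher scores mean more similar strings; the Jaccard distance in the strict sense is one minus this value.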
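The intersection-over-union ratio can be illustrated with a minimal, self-contained sketch. It does not use this library: the class JaccardSketch, the whitespace-and-lowercase tokenization, and the choice to return 0 when both strings have no tokens are all assumptions made for illustration, and the library's Tokenizer may split strings differently.

```java
import java.util.HashSet;
import java.util.Set;

public class JaccardSketch {
    // Hypothetical tokenizer: lowercase, then split on whitespace.
    // The library's own Tokenizer may behave differently.
    public static Set<String> tokens(String s) {
        Set<String> out = new HashSet<>();
        for (String tok : s.trim().toLowerCase().split("\\s+")) {
            if (!tok.isEmpty()) {
                out.add(tok);
            }
        }
        return out;
    }

    // Intersection size divided by union size of the two token sets.
    public static double jaccard(String s, String t) {
        Set<String> a = tokens(s);
        Set<String> b = tokens(t);
        if (a.isEmpty() && b.isEmpty()) {
            return 0.0; // assumed convention for two empty sets
        }
        Set<String> common = new HashSet<>(a);
        common.retainAll(b); // common = a ∩ b
        int unionSize = a.size() + b.size() - common.size();
        return (double) common.size() / unionSize;
    }

    public static void main(String[] args) {
        // Token sets {william, w., cohen} and {william, cohen}:
        // 2 shared tokens out of 3 distinct tokens.
        System.out.println(jaccard("William W. Cohen", "William Cohen"));
    }
}
```

With this tokenization, "William W. Cohen" and "William Cohen" share two of three distinct tokens, so the ratio is 2/3.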


Constructor Summary
Jaccard()
           
Jaccard(Tokenizer tokenizer)
           
 
Method Summary
 java.lang.String explainScore(StringWrapper s, StringWrapper t)
          Explain how the distance was computed.
static void main(java.lang.String[] argv)
           
 StringWrapper prepare(java.lang.String s)
          Preprocess a string by finding tokens.
 double score(StringWrapper s, StringWrapper t)
          Score two prepared strings using the intersection-over-union ratio described for this class.
 java.lang.String toString()
           
 
Methods inherited from class com.wcohen.ss.AbstractStringDistance
addExample, doMain, explainScore, getDistance, hasNextQuery, nextQuery, prepare, prepare, score, setDistanceInstancePool, setStringWrapperPool
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 

Constructor Detail

Jaccard

public Jaccard(Tokenizer tokenizer)

Jaccard

public Jaccard()

Method Detail

score

public double score(StringWrapper s,
                    StringWrapper t)
Score two prepared strings using the intersection-over-union ratio described for this class.
Description copied from class: AbstractStringDistance
This method needs to be implemented by subclasses.

Specified by:
score in interface StringDistance
Specified by:
score in class AbstractStringDistance

prepare

public StringWrapper prepare(java.lang.String s)
Preprocess a string by finding tokens.

Specified by:
prepare in interface StringDistance
Overrides:
prepare in class AbstractStringDistance

explainScore

public java.lang.String explainScore(StringWrapper s,
                                     StringWrapper t)
Explain how the distance was computed. In the output, the tokens in S and T are listed, and the common tokens are marked with an asterisk.

Specified by:
explainScore in interface StringDistance
Specified by:
explainScore in class AbstractStringDistance
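The kind of output described for explainScore can be sketched as follows. This is not the library's code, and its exact output layout is not documented here: the class ExplainSketch, the "S:" and "T:" labels, the placement of the asterisk before each common token, and the whitespace tokenization are assumptions chosen for illustration.

```java
import java.util.LinkedHashSet;
import java.util.Set;

public class ExplainSketch {
    // Hypothetical tokenizer: split on whitespace, keeping first-seen order.
    public static Set<String> tokens(String s) {
        Set<String> out = new LinkedHashSet<>();
        for (String tok : s.trim().split("\\s+")) {
            if (!tok.isEmpty()) {
                out.add(tok);
            }
        }
        return out;
    }

    // List the tokens of one string, prefixing those also found in the other with '*'.
    public static String mark(Set<String> own, Set<String> other) {
        StringBuilder sb = new StringBuilder();
        for (String tok : own) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            if (other.contains(tok)) {
                sb.append('*');
            }
            sb.append(tok);
        }
        return sb.toString();
    }

    // Assumed layout: one line per string, common tokens starred.
    public static String explain(String s, String t) {
        Set<String> a = tokens(s);
        Set<String> b = tokens(t);
        return "S: " + mark(a, b) + "\nT: " + mark(b, a);
    }

    public static void main(String[] args) {
        System.out.println(explain("a b", "b c"));
    }
}
```

For the inputs "a b" and "b c", only the token b is shared, so it is the only token starred on each line.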

toString

public java.lang.String toString()
Overrides:
toString in class java.lang.Object

main

public static void main(java.lang.String[] argv)