Tom Brøndsted: Document Classifier


Version 2.0, April 2008 (new: agglomerative clustering)


Options: inverse document frequency weighting; English stemming (Porter).
Term unit: word, word pair, or word triplet.

(Patience! Calculation can take 10-40 sec.)


An experimental document classifier based on the vector space model and agglomerative clustering. Input is a list of links to the documents to be analyzed. Output is a distance matrix showing how similar the documents are to one another and how they cluster.
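
The pipeline described above can be sketched in a few lines of Python. This is a minimal illustration, not the classifier's actual implementation. The page does not say which distance measure, IDF formula, or linkage rule the tool uses, so cosine distance, log(N/df) weighting, and average linkage are assumptions here. All function names are hypothetical. Documents are passed in as plain strings rather than fetched from links, and Porter stemming is left out.

```python
import math
import re
from collections import Counter


def terms(text, n=1):
    """Lowercase the text and join each run of n consecutive words into one term.

    n=1 gives words, n=2 word pairs, n=3 word triplets.
    """
    words = re.findall(r"[a-z]+", text.lower())
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def vectors(docs, n=1, use_idf=True):
    """Build one sparse term-weight vector (a dict) per document."""
    counts = [Counter(terms(d, n)) for d in docs]
    if not use_idf:
        return [dict(c) for c in counts]
    # Document frequency: in how many documents each term occurs.
    df = Counter(t for c in counts for t in c)
    # Assumed weighting: term frequency times log(N / df).
    return [{t: tf * math.log(len(docs) / df[t]) for t, tf in c.items()}
            for c in counts]


def cosine_distance(a, b):
    """1 minus cosine similarity; 1.0 if either vector has zero length."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm = (math.sqrt(sum(w * w for w in a.values()))
            * math.sqrt(sum(w * w for w in b.values())))
    return 1.0 - dot / norm if norm else 1.0


def distance_matrix(vecs):
    """Pairwise distances between all documents, with a zero diagonal."""
    return [[0.0 if i == j else cosine_distance(a, b)
             for j, b in enumerate(vecs)]
            for i, a in enumerate(vecs)]


def agglomerate(dist):
    """Average-linkage agglomerative clustering (assumed linkage rule).

    Returns the merge history as (cluster_a, cluster_b, distance) tuples,
    where each cluster is a tuple of document indices.
    """
    clusters = [(i,) for i in range(len(dist))]
    merges = []
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                pairs = len(clusters[i]) * len(clusters[j])
                d = sum(dist[p][q]
                        for p in clusters[i] for q in clusters[j]) / pairs
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((clusters[i], clusters[j], d))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return merges


docs = [
    "the cat sat on the mat",
    "the cat sat on a mat",
    "stock markets fell sharply today",
]
dist = distance_matrix(vectors(docs, n=1, use_idf=True))
merges = agglomerate(dist)
```

In this toy run the two "cat" sentences share most of their terms, so they are merged first, and the unrelated third document joins last. Passing n=2 or n=3 to vectors switches the term unit to word pairs or word triplets.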