Tom Brøndsted: N-gram Analysis


If you are new to this tool, you should go to the help page.


You can enter a URI in the Address field below, and have the page analyzed:


Options: n = 1, 2, or 3; pre-clustering: none, or English stemming (Porter).


If you want to upload a file from your computer and have it analyzed, you can use this form:


Options: n = 1, 2, or 3; pre-clustering: none, or English stemming (Porter).

This tool generates graph-based diagrams depicting the n-gram coverage of textual corpora (where n = 1, 2, or 3). If the corpus is representative of the domain, the total number of unique observations (distinct n-grams) will stabilize as more text is processed, so the curve flattens out.
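The idea of a coverage curve can be illustrated with a minimal Python sketch. This is not the tool's own implementation: the function names (tokenize, ngrams, coverage_curve) and the simple lowercase regex tokenizer are assumptions made for illustration, and Porter stemming is left out. The sketch records how many distinct n-grams have been seen after each n-gram read from the text.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into word tokens.

    Assumption: a simplistic regex tokenizer, used only for illustration.
    """
    return re.findall(r"[a-z0-9']+", text.lower())


def ngrams(tokens, n):
    """Yield successive n-grams from the token list as tuples."""
    for i in range(len(tokens) - n + 1):
        yield tuple(tokens[i:i + n])


def coverage_curve(tokens, n):
    """Return the cumulative number of distinct n-grams after each n-gram seen."""
    seen = set()
    curve = []
    for gram in ngrams(tokens, n):
        seen.add(gram)
        curve.append(len(seen))
    return curve


if __name__ == "__main__":
    text = "the cat sat on the mat and the cat sat on the rug"
    tokens = tokenize(text)
    for n in (1, 2, 3):
        # One list per n; a flattening list suggests the text covers its n-grams.
        print(n, coverage_curve(tokens, n))
```

Plotting the returned list against its index gives a curve of the kind described above: it rises steeply at first and levels off once most n-grams in the text have already been observed.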