Home
Word Frequencies
Virtual Manuscript
Cutter
Word Count Comparisons
Lemmatised Beowulf
Anglo Saxon Tools
These tools allow you to browse a selection of statistics we have gathered from the
Univeristy of Toronto Dictionary of Old English Corpus
.
Text index sorted by
Count
or by
Text Name
We have counted up every word in the Poetry and Prose portions of the DoE corpus. You can see the counts quickly here.
Word Stats
View the mean, minimum, median, maximum and standard deviation of each word across the entire corpus. You get an spreadsheet of every word.
Word Frequencies
Using out extensive repository of counts, you can find counts of individual words in any text, manuscript, genre or even in the entire corpus. These stats are too, free to download.
Virtual Manuscript
In a similar tool to Word Frequencies, you select a range of texts or manuscripts to build an archive of word counts for only the texts you are interested in.
Word Count Comparisons
This tool allows users to quickly scan through an Anglo-Saxon text to find uncommon words inspired by an Excel trick by Mike Drout on his
blog
. Coded by
Scott Kleinnman
at California State University, Northridge.
Lemmatisation of
Beowulf
Another tool coded by Scott Kleinnman uses custom lemmatising techniques to automatically lemmatise
Beowulf
and hopefully more texts in the future.
Web Text Tools
In out effort to provide a comprehensive and easy to use suite of text processing tools, we are developing web-based tools to easily perform text analysis based on word counts. We are developing an easy to use pipeline, but, for now, we can show a sample of what we will provide.
Cutter
Chop texts into small pieces.
You can download the raw source of this website as a ZIP (ask us). Provided
README
s explain each tool in depth.