linguistic corpora i've developed / improved upon, code i've written. if you use these in research, please cite me!
| kernelPhil | February 2021 |
|---|
| kernelPhil is an R package that contains kernel smoothing functions and related tools specialised for historical dialectology |
| Annotated DN online | November 2017 |
|---|
| a version of the Diplomatarium Norvegicum that is: (1) searchable with regexp; (2) tagged for gender, title/rank and name of first signatory, and number of signatories; (3) tagged for date; (4) (largely) localised |
| XML Fornrit for gendered speech in Old Icelandic | June 2013 |
|---|
| xml version of the Fornrit corpus from the Mörkuð íslensk málheild with direct speech tagged for the gender of the speaker |