Corpus

Corpora, Collections, and Datasets

There has been some discussion of what constitutes a collection among language documentation practitioners. Johnson (Citation: Johnson, 2004, p. 142) Johnson, H. (2004). Language documentation and archiving, or how to build a better corpus.

Metadata Dynamics for Linguistic and Sociolinguistic Corpora

This is an explanation of some diagrams I made at the LSA’s 2012 Satellite Workshop for Sociolinguistic Archival Preparation. They represent metadata cross cutting a corpus.