Corpora

Bridging Corpora: Creating Learner Pathways Across Texts

The Bridge, a linked data application supporting curriculum development is presented. It was developed with Latin in mind, but has been extended to Greek as well. It quickly helps instructors and students find new vocabulary words in newly assigned texts, based on texts they have already encountered in their curriculum.

Bridging Corpora: Creating Learner Pathways Across Texts

LexR: a Preliminary Readability Metric for Latin

Looking for comprehensibility measures for latin texts.

Comparing Relative Typing Difficulties Across Languages Using Corpora

A presentation of a methodology for using parallel corpora to compare typing experiences.

Language, Script, Orthography, and Text-input: Rating Text-input Difficulty Across Languages

A presentation of a looking at various factors for establishing a meaningful cross-language evaluation of text input tools.

Peer-Review and Complex Language Resources

How should peer-review be different for different kinds of resources? What constitutes a complex language resource?

The Role of the Language Community in Peer-Review process for Collections

When and where does community review and peer-review create mutal benefit?

Corpora, Collections, and Datasets

There has been some discussion of what constitutes a collection among language documentation practitioners. Johnson (Citation: Johnson, 2004, p. 142) Johnson, H. (2004). Language documentation and archiving, or how to build a better corpus.

Metadata Dynamics for Linguistic and Sociolinguistic Corpora

This is an explanation of some diagrams I made at the LSA’s 2012 Satellite Workshop for Sociolinguistic Archival Preparation. They represent metadata cross cutting a corpus.