Peer-Review and Complex Language Resources


Thieberger et al. (2016) argue in the Australian academic context that language documentation outputs should count as scholarly outputs for the purposes of tenure and promotion. We wholeheartedly agree that creating complex language resources should be given appropriate weight in hiring, tenure, and promotion discussions, and we outline three areas in which the assessment of archival language collections for these purposes can be further clarified and streamlined. First, we argue that there is an important difference between an archive and a repository. Namely, repositories have commitments to provide faithful copies of deposits. In contrast archives, grow, wean, and curate collections. These differences are significant for the assessment of collections and should serve to set expectations for depositors, re-users, and reviewers. We think that a clear delineation of services at preservation institutions can make the distinction between an archive and a repository clearer — mitigating expectations about engagement possibilities with the resource. Second, we agree with Sullivant (2020) that there is a distinction between a corpus (annotated or otherwise) and a collection, with the latter corresponding to the typical output of the language documentation process. Conflating the concepts of a collection and a corpus is unhelpful because they are different kinds of aggregates (Tillet 2009; O’Neill et al. 2015). The differences correspond to how the evolution and review processes of resources can be managed. Finally, comparing a complex language resource with a monograph or scholarly article can in fact devalue archival language collections by implying equivalences between the amount of time and effort required to produce drastically different outputs. Our methods of classifying complex resources further assist in finding appropriate peer-reviewers and help them objectively consider the resources in their reviews. Clarity around these three issues will make it easier for organizations and individuals to develop and adopt assessment procedures/criteria.

5 Oct, 2022 11:30
Berlin-Brandenburgische Akademie der Wissenschaften, Berlin, Germany


Thieberger, Margetts, Morey & Musgrave (2016)
, , & (). Assessing Annotated Corpora as Research Output. Australian Journal of Linguistics, 36(1). 1–21.
O’Neill, Žumer & Mixter (2015)
, & (). FRBR Aggregates: Their Types and Frequency in Library Collections. Library Resources & Technical Services, 59(3). 120–129.
Tillett (2009)
(). Definition of Aggregates as Works: Tillett Proposal. Retrieved from
Sullivant (2020)
(). Archival description for language documentation collections. Language Documentation & Conservation, 14. 520–578. Retrieved from
Content Mediums:
Hugh Paterson III
Hugh Paterson III
Collaborative Scholar

My research interests include typological patterns in articulatory phonetics; User Experience design in language tools; and graph theory applied to language and linguistic resource discovery.

Richard Griscom
Richard Griscom
Field Linguist

My research interests include field linguistics, grammatical description, and language documentation.

Lauren B. Collister
Lauren B. Collister
Director, Office of Scholarly Communication and Publishing

My research interests include Linguistics, Scholarly Publishing, Muisc.