There has been some discussion among language documentation practitioners of what constitutes a collection; see Johnson (2004, p. 142), "Language documentation and archiving, or how to build a better corpus."
Working in services design and web data handling, I see many different policies and data handling practices. However, should there be a special set of practices for language data? I am exploring this question in a GitHub repository.