OLAC Record
oai:scholarspace.manoa.hawaii.edu:10125/26114

Metadata
Title:Reusing manuscript vocabularies, an example from Western Australia
Bibliographic Citation:Thieberger, Nick; 2013-03-02; Kaipuleohone University of Hawai'i Digital Language Archive; http://hdl.handle.net/10125/26114.
Contributor (speaker):Thieberger, Nick
Creator:Thieberger, Nick
Date (W3CDTF):2013-03-02
Description:In this paper I will address a general problem of working with manuscript vocabularies in a way that permits them to be used in various ways, including in language revitalisation work. I will illustrate with the example of a collection of papers made by Daisy Bates, an ethnographer who collected many vocabularies of Australian Indigenous languages in the early 1900s. She printed and distributed 500 copies of a questionnaire with 1,838 prompts, mainly for vocabulary, but including some sentence examples. This questionnaire was then filled in by a variety of observers, with different approaches to writing the local language. Some 200 completed questionnaires were then typed so the collection now includes 4,800 pages of typescripts and 8,600 pages of completed manuscript questionnaires representing 123 individual speakers in as many locations. Copies of this paper material are held in two state libraries and in the National Library of Australia. Individual vocabularies from this collection have been used in various ways in the sixty years that they have been available, including in language surveys, comparative work, and in Native Title cases. Typically a vocabulary was typed and analysed according to the needs of the particular project, and no copy of the digitised version was kept. In this project the typescripts have been retyped as structured text using XML, specifically using the Text Encoding Initiative (TEI) guidelines for text markup. Because of the problem of reading the diverse range of handwriting it is useful to have both the textual version and the image of the manuscript available, and to allow users to annotate the resulting material. Structured text will allow searching, sorting, retrieving and recombining items in the collection in new ways, for example, providing a geographic representation of the wordlist, a comparative list of all terms, and downloadable versions of wordlists for reuse in classrooms, among others. I will assess the usefulness of the TEI in structuring the vocabularies and illustrate the current state of the project.
Identifier (URI):http://hdl.handle.net/10125/26114
Language:English
Language (ISO639):eng
Rights:Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported
Table Of Contents:26114.mp3
26114.pdf

OLAC Info

Archive:  Language Documentation and Conservation
Description:  http://www.language-archives.org/archive/ldc.scholarspace.manoa.hawaii.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:scholarspace.manoa.hawaii.edu:10125/26114
DateStamp:  2017-05-11
GetRecord:  OAI-PMH request for simple DC format
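
The GetRecord entries in this record refer to OAI-PMH requests. The following is a minimal sketch of how such request URLs could be assembled for this record's OaiIdentifier. The base URL is a hypothetical placeholder, since the record does not state the repository endpoint. `oai_dc` is the standard OAI-PMH prefix for simple Dublin Core, and `olac` is assumed here to be the prefix for the OLAC format.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the record does not give the OAI-PMH base URL.
BASE_URL = "https://example.org/oai"

# Identifier taken from the OaiIdentifier field of this record.
OAI_ID = "oai:scholarspace.manoa.hawaii.edu:10125/26114"


def get_record_url(identifier, metadata_prefix, base_url=BASE_URL):
    """Build an OAI-PMH GetRecord request URL.

    GetRecord takes two required arguments besides the verb:
    `identifier` and `metadataPrefix`. urlencode percent-encodes
    the ':' and '/' characters in the identifier.
    """
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Simple DC format (standard prefix) and OLAC format (assumed prefix).
dc_url = get_record_url(OAI_ID, "oai_dc")
olac_url = get_record_url(OAI_ID, "olac")

print(dc_url)
print(olac_url)
```

Replacing `BASE_URL` with the repository's actual endpoint would yield requests that return the record as XML in the chosen metadata format.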

Search Info

Citation: Thieberger, Nick. 2013. Language Documentation and Conservation.
Terms: area_Europe country_GB iso639_eng


http://www.language-archives.org/item.php/oai:scholarspace.manoa.hawaii.edu:10125/26114
Up-to-date as of: Fri May 24 9:50:14 EDT 2019