OLAC Record

Title:Arabic Treebank: Part 4 v 1.0 (MPG Annotation)
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Maamouri, Mohamed, et al. Arabic Treebank: Part 4 v 1.0 (MPG Annotation) LDC2005T30. Web Download. Philadelphia: Linguistic Data Consortium, 2005
Contributor:Maamouri, Mohamed
Bies, Ann
Buckwalter, Tim
Jin, Hubert
Mekki, Wigdan
Date (W3CDTF):2005
Date Issued (W3CDTF):2005-10-15
Description:*Introduction* This file contains documentation on the Arabic Treebank: Part 4 v 1.0 (MPG Annotation), Linguistic Data Consortium (LDC) catalog number LDC2005T30 and ISBN 1-58563-343-7. The goal of the Arabic Treebank project is to support the development of data-driven approaches to natural language processing (NLP), human language technologies, automatic content extraction (topic extraction and/or grammar extraction), cross-lingual information retrieval, information detection, and other forms of linguistic research on Modern Standard Arabic in general. The LDC was sponsored to develop an Arabic POS and Treebank of 1,000,000 words. This corpus is the fourth part of that project. In this release, we provide annotation on part of speech (POS), gloss, and word segmentation. *Samples* To view an example of this corpus, please review this sample POS file.
ISBN: 1-58563-343-7
ISLRN: 165-794-218-631-9
Language:Standard Arabic
Language (ISO639):arb
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2005T30
Rights Holder:Portions © 2004 Assabah Press Group, © 2005 Trustees of the University of Pennsylvania.
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2005T30
DateStamp:  2019-12-12
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Maamouri, Mohamed; Bies, Ann; Buckwalter, Tim; Jin, Hubert; Mekki, Wigdan. 2005. Linguistic Data Consortium.
Terms: area_Asia country_SA dcmi_Text iso639_arb olac_primary_text

Up-to-date as of: Mon Jun 15 13:47:40 EDT 2020