OLAC Record oai:www.ldc.upenn.edu:LDC2003T13 |
Metadata | ||
Title: | Message Understanding Conference (MUC) 6 | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Chinchor, Nancy, and Beth Sundheim. Message Understanding Conference (MUC) 6 LDC2003T13. Web Download. Philadelphia: Linguistic Data Consortium, 2003 | |
Contributor: | Chinchor, Nancy | |
Sundheim, Beth | ||
Date (W3CDTF): | 2003 | |
Date Issued (W3CDTF): | 2003-08-22 | |
Description: | *Introduction* Message Understanding Conference (MUC) 6 was produced by the Linguistic Data Consortium (LDC) and contains 318 annotated Wall Street Journal (WSJ) articles, as well as the scoring software and the corresponding documentation used in the MUC6 evaluation. In the 1990s, the MUC evaluations funded the development of metrics and statistical algorithms to support government evaluations of emerging information extraction technologies. Additional information from NIST can be found at http://www.itl.nist.gov/iaui/894.02/related_projects/muc. *Data* In addition to the 318 WSJ articles in the main directory, this corpus also contains 460 articles in concatenated files for dry run and formal testing and training, answer keys, and scorer configuration files. Both the Message Understanding Conference (MUC) 6 Additional News Text (LDC96T10) and the MUC 6 corpus are necessary in order to replicate the evaluation. All the materials are published as received from the corpus creators, without any quality control being done at the LDC (the only difference is that the files have been uncompressed). *Samples* Please view this text sample. *Updates* August 20th, 2003: What was formerly published as MUC VI Text Collection (LDC1996T10) was renamed as MUC 6 Additional News Text, because LDC96T10 consists only of additional training materials. | |
Extent: | Corpus size: 10240 KB | |
Identifier: | LDC2003T13 | |
https://catalog.ldc.upenn.edu/LDC2003T13 | ||
ISBN: 1-58563-239-2 | ||
ISLRN: 402-267-910-068-8 | ||
DOI: 10.35111/wbcc-y063 | ||
Language: | English | |
Language (ISO639): | eng | |
License: | LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC2003T13 | |
Rights Holder: | Portions © 1986-1994 Dow Jones & Company, Inc. RESTRICTED RIGHTS LEGEND: INFORMATION FROM THE WALL STREET JOURNAL AND/OR THE DOW JONES NEWS SERVICE CONTAINED HEREIN IS THE PROPERTY OF DOW JONES & COMPANY, INC. AND IS PROTECTED BY COPYRIGHT. USE, DUPLICATION OR DISCLOSURE BY YOU IS SUBJECT TO THE RESTRICTIONS SET FORTH IN THE USER AGREEMENT DELIVERED TO YOU BY THE LINGUISTIC DATA CONSORTIUM OF THE UNIVERSITY OF PENNSYLVANIA. COPYRIGHT 1986-1994 DOW JONES & COMPANY, INC. ALL RIGHTS RESERVED. | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2003T13 | |
DateStamp: | 2024-09-09 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Chinchor, Nancy; Sundheim, Beth. 2003. Linguistic Data Consortium. | |
Terms: | area_Europe country_GB dcmi_Text iso639_eng olac_primary_text |