OLAC Record
oai:www.ldc.upenn.edu:LDC2003T13

Metadata
Title:Message Understanding Conference (MUC) 6
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Chinchor, Nancy, and Beth Sundheim. Message Understanding Conference (MUC) 6 LDC2003T13. Web Download. Philadelphia: Linguistic Data Consortium, 2003
Contributor:Chinchor, Nancy
Sundheim, Beth
Date (W3CDTF):2003
Date Issued (W3CDTF):2003-08-22
Description:*Introduction* Message Understanding Conference (MUC) 6 was produced by the Linguistic Data Consortium (LDC) and contains 318 annotated Wall Street Journal (WSJ) articles, as well as the scoring software and the corresponding documentation used in the MUC6 evaluation. In the 1990s, the MUC evaluations funded the development of metrics and statistical algorithms to support government evaluations of emerging information extraction technologies. Additional information from NIST can be found at http://www.itl.nist.gov/iaui/894.02/related_projects/muc.
*Data* In addition to the 318 WSJ articles in the main directory, this corpus contains 460 articles in concatenated files for dry run and formal testing and training, answer keys, and scorer configuration files. Both the Message Understanding Conference (MUC) 6 Additional News Text (LDC96T10) and the MUC 6 corpus are needed to replicate the evaluation. All materials are published as received from the corpus creators, without quality control by the LDC; the only difference is that the files have been uncompressed.
*Updates* August 20th, 2003: What was formerly published as MUC VI Text Collection (LDC1996T10) was renamed MUC 6 Additional News Text, because LDC96T10 consists only of additional training materials.
Extent:Corpus size: 10240 KB
Identifier:LDC2003T13
https://catalog.ldc.upenn.edu/LDC2003T13
ISBN: 1-58563-239-2
ISLRN: 402-267-910-068-8
DOI: 10.35111/wbcc-y063
Language:English
Language (ISO639):eng
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2003T13
Rights Holder:Portions © 1986-1994 Dow Jones & Company, Inc.

RESTRICTED RIGHTS LEGEND: INFORMATION FROM THE WALL STREET JOURNAL AND/OR THE DOW JONES NEWS SERVICE CONTAINED HEREIN IS THE PROPERTY OF DOW JONES & COMPANY, INC. AND IS PROTECTED BY COPYRIGHT. USE, DUPLICATION OR DISCLOSURE BY YOU IS SUBJECT TO THE RESTRICTIONS SET FORTH IN THE USER AGREEMENT DELIVERED TO YOU BY THE LINGUISTIC DATA CONSORTIUM OF THE UNIVERSITY OF PENNSYLVANIA. COPYRIGHT 1986-1994 DOW JONES & COMPANY, INC. ALL RIGHTS RESERVED.
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2003T13
DateStamp:  2024-09-09
GetRecord:  OAI-PMH request for simple DC format
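
The GetRecord entries above correspond to OAI-PMH requests for this identifier. As a minimal sketch, assuming a placeholder base URL (the real repository endpoint is not given in this record, so BASE_URL and the helper get_record_url are hypothetical), such a request URL could be assembled in Python as follows. The verb, identifier, and metadataPrefix parameters come from the OAI-PMH protocol; oai_dc is the protocol's prefix for simple Dublin Core, and olac is assumed here to be the prefix for the OLAC format.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: replace with the repository's actual OAI-PMH base URL.
BASE_URL = "https://example.org/oai"


def get_record_url(identifier: str,
                   metadata_prefix: str = "oai_dc",
                   base_url: str = BASE_URL) -> str:
    """Build an OAI-PMH GetRecord request URL for a single record."""
    params = {
        "verb": "GetRecord",                # OAI-PMH verb for fetching one record
        "identifier": identifier,           # the OaiIdentifier listed in this record
        "metadataPrefix": metadata_prefix,  # "oai_dc" for simple DC; "olac" assumed for OLAC
    }
    # urlencode percent-encodes reserved characters such as ":" in the identifier.
    return f"{base_url}?{urlencode(params)}"


url = get_record_url("oai:www.ldc.upenn.edu:LDC2003T13")
print(url)
```

Passing metadata_prefix="olac" instead would, under the same assumptions, request the OLAC-format version of the record.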

Search Info

Citation: Chinchor, Nancy; Sundheim, Beth. 2003. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Text iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2003T13
Up-to-date as of: Fri Dec 6 7:46:49 EST 2024