OLAC Record oai:www.ldc.upenn.edu:LDC95T21 |
Metadata | ||
Title: | North American News Text Corpus | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Graff, David. North American News Text Corpus LDC95T21. Web Download. Philadelphia: Linguistic Data Consortium, 1995 | |
Contributor: | Graff, David | |
Date (W3CDTF): | 1995 | |
Description: | North American News Text Corpus is composed of English newswire text formatted using TIPSTER-style SGML markup from the following sources: Los Angeles Times/Washington Post Service 05/94-08/97 - 52 million words New York Times News 07/94-12/96 - 173 million words Reuters News Service 04/94-12/96 - 85 million words Wall Street Journal 07/94-12/96 - 40 million words The New York Times and the L. A. Times/Washington Post services also include a range of other newspaper sources in their syndicated newswires. The Los Angeles Times/Washington Post material includes the following sources (in lesser amounts) in addition to the two predominant sources: * Newsday * The Baltimore Sun * The Hartford Courant The New York Times material contains the following sources in lesser amounts, but New York Times articles predominate: * Bloomberg Business News * The Boston Globe * Los Angeles Daily News * Fort Worth Star-Telegram * Newsweek * Cox News Service * The Arizona Republic * Seattle Post-Intelligencer * San Francisco Examiner * Houston Chronicle * San Francisco Chronicle * Economist Newspaper Ltd. * Hearst Newspapers These newswire services also include small numbers of articles from a larger set of miscellaneous sources. The ones listed above appear with some frequency on a daily basis. *Additional Licensing Instructions* This 'members-only' corpus is available to current LDC members who can request the data at the listed reduced-license fee. | |
Identifier: | LDC95T21 | |
https://catalog.ldc.upenn.edu/LDC95T21 | ||
ISBN: 1-58563-053-5 | ||
ISLRN: 667-148-284-023-7 | ||
DOI: 10.35111/56ty-0638 | ||
Language: | English | |
Language (ISO639): | eng | |
License: | North American News Text Agreement: https://catalog.ldc.upenn.edu/license/north-american-news-text-license-agreement.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC95T21 | |
Rights Holder: | Portions © 1994-1996 Dow Jones & Company, Inc., © 1994-1997 Los Angeles Times-Washington Post News Service, Inc., © 1994-1996 New York Times, © 1994-1996 Reuters America, Inc., © 1995-1997 Trustees of the University of Pennsylvania. | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC95T21 | |
DateStamp: | 2020-11-30 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Graff, David. 1995. Linguistic Data Consortium. | |
Terms: | area_Europe country_GB dcmi_Text iso639_eng olac_primary_text |