OLAC Record: TRECVID 2006 Keyframes

OLAC Record
oai:www.ldc.upenn.edu:LDC2010V02

Metadata

Title: TRECVID 2006 Keyframes

Access Rights: Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining

Bibliographic Citation: Over, Paul, Georges Quenot, and Kevin Walker. TRECVID 2006 Keyframes LDC2010V02. Web Download. Philadelphia: Linguistic Data Consortium, 2010

Contributor: Over, Paul

Quenot, Georges

Walker, Kevin

Date (W3CDTF): 2010

Date Issued (W3CDTF): 2010-08-18

Description: *Introduction* TRECVID 2006 Keyframes was developed as a collaborative effort among researchers at the Linguistic Data Consortium (LDC), NIST, LIMSI-CNRS, and Dublin City University. TREC Video Retrieval Evaluation (TRECVID) was sponsored by the National Institute of Standards and Technology (NIST) to promote progress in content-based retrieval from digital video via open, metrics-based evaluation. The keyframes in this release were extracted for use in the NIST TRECVID 2006 Evaluation. TRECVID was a laboratory-style evaluation that attempted to model real world situations or significant component tasks involved in such situations. In 2006 TRECVID completed a 2-year cycle on English, Arabic, and Chinese news video. The evaluation consisted of three system tasks and associated tests: * shot boundary determination * high-level feature extraction * search (interactive, manually-assisted, and/or fully automatic) The 2006 evaluation also included a rushes exploitation exploratory task, but the material associated with that task is not included in this release. For a detailed description of the TRECVID Evaluation Tasks, please refer to the NIST TRECVID 2006 Evaluation Description. *Data* The video stills that compose this corpus were drawn from approximately 158.6 hours of English, Arabic, and Chinese language broadcast programming data collected by LDC from NBC ("NBC Nightly News"), CNN ("Live From..", "Anderson Cooper 360"), MSNBC ("MSNBC News live"), New Tang Dynsaty TV ("Economic Frontier", "Focus Interactive"), Phoenix TV ("Good Morning China"), Lebanese Broadcasting Corp. ("Naharkum Saiid", "News on LBC"), Alhurra TV ("Alhurra News") and China Central TV ("CCTV_News"). Shots are fundamental units of video, useful for higher-level processing. To create the master list of shots, the video was segmented. The results of this pass are called subshots. Because the master shot reference is designed for use in manual assessment, a second pass over the segmentation was made to create the master shots of at least 2 seconds in length. These master shots were the ones used in submitting results for the feature and search tasks. In the second pass, starting at the beginning of each file, the subshots were aggregated, if necessary, until the currrent shot was at least 2 seconds in duration, at which point the aggregation began anew with the next subshot. The keyframes were selected by going to the middle frame of the shot boundary, then parsing left and right of that frame to locate the nearest I-Frame. This then became the keyframe and was extracted. Keyframes are provided at both the subshot (NRKF) and master shot (RKF) levels. In a small number of cases (all of them subshots) there was no I-Frame within the subshot boundaries. When this occurred, the middle frame was selected. The emphasis in the common shot boundary reference is on the shots, not the transitions. The shots are contiguous. There are no gaps between them. They do not overlap. The media time format is based on the Gregorian day time (ISO 8601) norm. Fractions are defined by counting pre-specified fractions of a second. In our case, the frame rate will likely be 29.97. One fraction of a second is thus specified as "PT1001N30000F". The video id has the format of "XXX" and shot id "shotXXX_YYY". The "XXX" is the sequence number of video onto which the video file name is mapped this will be listed in the "collection.xml" file. The "YYY" is the sequence number of the shot. Keyframes are identified as by a suffix "_RKF" for the main keyframe (one per shot) or "_NKRF" for additional keyframes derived from subshots that were merged so that shots have a minimum duration of 2 seconds. *Sample* Samples of data available in this corpus: Keyframe (video still) Shots metadata (mp7 markup) *Updates* No updates are available at this time.

Extent: Corpus size: 2824678 KB

Identifier: LDC2010V02

https://catalog.ldc.upenn.edu/LDC2010V02

ISBN: 1-58563-554-5

ISLRN: 347-638-481-141-4

DOI: 10.35111/nqx3-yy54

Language: English

Mandarin Chinese

Arabic

Chinese

Language (ISO639): eng

cmn

ara

zho

License: LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf

Medium: Distribution: Web Download

Publisher: Linguistic Data Consortium

Publisher (URI): https://www.ldc.upenn.edu

Relation (URI): https://catalog.ldc.upenn.edu/docs/LDC2010V02

Rights Holder: Portions © 2005 Cable News Network, LP, LLLP, © 2005 China Central TV, © 2005 National Broadcasting Company, Inc., © 2005 New Tang Dynasty TV, © 2005 PAC, Ltd., © 2005 Phoenix TV, © 2005, 2006, 2010 Trustees of the University of Pennsylvania.

Type (DCMI): MovingImage

Type (OLAC): primary_text

OLAC Info

Archive: The LDC Corpus Catalog

Description: http://www.language-archives.org/archive/www.ldc.upenn.edu

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:www.ldc.upenn.edu:LDC2010V02

DateStamp: 2022-12-05

GetRecord: OAI-PMH request for simple DC format

Search Info
Citation: Over, Paul; Quenot, Georges; Walker, Kevin. 2010. Linguistic Data Consortium.
Terms: area_Asia area_Europe country_CN country_GB dcmi_MovingImage iso639_ara iso639_cmn iso639_eng iso639_zho olac_primary_text

http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2010V02
Up-to-date as of: Thu Sep 18 0:59:44 EDT 2025

Metadata
Title:		TRECVID 2006 Keyframes
Access Rights:		Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:		Over, Paul, Georges Quenot, and Kevin Walker. TRECVID 2006 Keyframes LDC2010V02. Web Download. Philadelphia: Linguistic Data Consortium, 2010
Contributor:		Over, Paul
		Quenot, Georges
		Walker, Kevin
Date (W3CDTF):		2010
Date Issued (W3CDTF):		2010-08-18
Description:		Introduction TRECVID 2006 Keyframes was developed as a collaborative effort among researchers at the Linguistic Data Consortium (LDC), NIST, LIMSI-CNRS, and Dublin City University. TREC Video Retrieval Evaluation (TRECVID) was sponsored by the National Institute of Standards and Technology (NIST) to promote progress in content-based retrieval from digital video via open, metrics-based evaluation. The keyframes in this release were extracted for use in the NIST TRECVID 2006 Evaluation. TRECVID was a laboratory-style evaluation that attempted to model real world situations or significant component tasks involved in such situations. In 2006 TRECVID completed a 2-year cycle on English, Arabic, and Chinese news video. The evaluation consisted of three system tasks and associated tests: * shot boundary determination * high-level feature extraction * search (interactive, manually-assisted, and/or fully automatic) The 2006 evaluation also included a rushes exploitation exploratory task, but the material associated with that task is not included in this release. For a detailed description of the TRECVID Evaluation Tasks, please refer to the NIST TRECVID 2006 Evaluation Description. Data The video stills that compose this corpus were drawn from approximately 158.6 hours of English, Arabic, and Chinese language broadcast programming data collected by LDC from NBC ("NBC Nightly News"), CNN ("Live From..", "Anderson Cooper 360"), MSNBC ("MSNBC News live"), New Tang Dynsaty TV ("Economic Frontier", "Focus Interactive"), Phoenix TV ("Good Morning China"), Lebanese Broadcasting Corp. ("Naharkum Saiid", "News on LBC"), Alhurra TV ("Alhurra News") and China Central TV ("CCTV_News"). Shots are fundamental units of video, useful for higher-level processing. To create the master list of shots, the video was segmented. The results of this pass are called subshots. Because the master shot reference is designed for use in manual assessment, a second pass over the segmentation was made to create the master shots of at least 2 seconds in length. These master shots were the ones used in submitting results for the feature and search tasks. In the second pass, starting at the beginning of each file, the subshots were aggregated, if necessary, until the currrent shot was at least 2 seconds in duration, at which point the aggregation began anew with the next subshot. The keyframes were selected by going to the middle frame of the shot boundary, then parsing left and right of that frame to locate the nearest I-Frame. This then became the keyframe and was extracted. Keyframes are provided at both the subshot (NRKF) and master shot (RKF) levels. In a small number of cases (all of them subshots) there was no I-Frame within the subshot boundaries. When this occurred, the middle frame was selected. The emphasis in the common shot boundary reference is on the shots, not the transitions. The shots are contiguous. There are no gaps between them. They do not overlap. The media time format is based on the Gregorian day time (ISO 8601) norm. Fractions are defined by counting pre-specified fractions of a second. In our case, the frame rate will likely be 29.97. One fraction of a second is thus specified as "PT1001N30000F". The video id has the format of "XXX" and shot id "shotXXX_YYY". The "XXX" is the sequence number of video onto which the video file name is mapped this will be listed in the "collection.xml" file. The "YYY" is the sequence number of the shot. Keyframes are identified as by a suffix "_RKF" for the main keyframe (one per shot) or "_NKRF" for additional keyframes derived from subshots that were merged so that shots have a minimum duration of 2 seconds. Sample Samples of data available in this corpus: Keyframe (video still) Shots metadata (mp7 markup) Updates No updates are available at this time.
Extent:		Corpus size: 2824678 KB
Identifier:		LDC2010V02
		https://catalog.ldc.upenn.edu/LDC2010V02
		ISBN: 1-58563-554-5
		ISLRN: 347-638-481-141-4
		DOI: 10.35111/nqx3-yy54
Language:		English
		Mandarin Chinese
		Arabic
		Chinese
Language (ISO639):		eng
		cmn
		ara
		zho
License:		LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:		Distribution: Web Download
Publisher:		Linguistic Data Consortium
Publisher (URI):		https://www.ldc.upenn.edu
Relation (URI):		https://catalog.ldc.upenn.edu/docs/LDC2010V02
Rights Holder:		Portions © 2005 Cable News Network, LP, LLLP, © 2005 China Central TV, © 2005 National Broadcasting Company, Inc., © 2005 New Tang Dynasty TV, © 2005 PAC, Ltd., © 2005 Phoenix TV, © 2005, 2006, 2010 Trustees of the University of Pennsylvania.
Type (DCMI):		MovingImage
Type (OLAC):		primary_text
OLAC Info
Archive:		The LDC Corpus Catalog
Description:		http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:		OAI-PMH request for OLAC format
GetRecord:		Pre-generated XML file
OAI Info
OaiIdentifier:		oai:www.ldc.upenn.edu:LDC2010V02
DateStamp:		2022-12-05
GetRecord:		OAI-PMH request for simple DC format
Search Info
Citation:		Over, Paul; Quenot, Georges; Walker, Kevin. 2010. Linguistic Data Consortium.
Terms:		area_Asia area_Europe country_CN country_GB dcmi_MovingImage iso639_ara iso639_cmn iso639_eng iso639_zho olac_primary_text