OLAC Record
oai:lindat.mff.cuni.cz:11858/00-097C-0000-0001-48FD-B

Metadata
Title:Victor
Bibliographic Citation:http://hdl.handle.net/11858/00-097C-0000-0001-48FD-B
Creator:Marek, Michal
Date (W3CDTF):2011-06-28T09:40:25Z
Date Available:2009-11-02T09:48:39Z
Description:Victor is a web page cleaning tool. It is aimed at removing menu, ads, footers, headers, etc. from HTML web pages, so that only main web page content remains. Victor is based on a conditional random fields algorithm.
Identifier (URI):http://hdl.handle.net/11858/00-097C-0000-0001-48FD-B
Language:No linguistic content
Language (ISO639):zxx
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Rights:GNU General Public License, version 2
http://www.gnu.org/licenses/gpl-2.0.html
Subject:html cleaning
Type:toolService
Type (DCMI):Software

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11858/00-097C-0000-0001-48FD-B
DateStamp:  2020-02-19
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Marek, Michal. 2011. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: dcmi_Software iso639_zxx


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11858/00-097C-0000-0001-48FD-B
Up-to-date as of: Mon Sep 21 23:44:31 EDT 2020