OLAC Record oai:lindat.mff.cuni.cz:11858/00-097C-0000-0001-48FD-B |
Metadata | ||
Title: | Victor | |
Bibliographic Citation: | http://hdl.handle.net/11858/00-097C-0000-0001-48FD-B | |
Creator: | Marek, Michal | |
Date (W3CDTF): | 2011-06-28T09:40:25Z | |
Date Available: | 2009-11-02T09:48:39Z | |
Description: | Victor is a web page cleaning tool. It is aimed at removing menu, ads, footers, headers, etc. from HTML web pages, so that only main web page content remains. Victor is based on a conditional random fields algorithm. | |
Identifier (URI): | http://hdl.handle.net/11858/00-097C-0000-0001-48FD-B | |
Language: | No linguistic content | |
Language (ISO639): | zxx | |
Publisher: | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) | |
Rights: | GNU General Public License, version 2 | |
http://www.gnu.org/licenses/gpl-2.0.html | ||
Subject: | html cleaning | |
Type: | toolService | |
Type (DCMI): | Software | |
OLAC Info |
||
Archive: | LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University | |
Description: | http://www.language-archives.org/archive/lindat.mff.cuni.cz | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:lindat.mff.cuni.cz:11858/00-097C-0000-0001-48FD-B | |
DateStamp: | 2021-06-29 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Marek, Michal. 2011. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL). | |
Terms: | dcmi_Software iso639_zxx |