OLAC Record
oai:www.clarin.si:11356/1115

Metadata
Title:Opinion corpus of Slovene web commentaries KKS 1.001
Bibliographic Citation:http://hdl.handle.net/11356/1115
Creator:Kadunc, Klemen
Robnik-Šikonja, Marko
Date (W3CDTF):2017-05-28T08:41:45Z
Date Available:2017-05-28T08:41:45Z
Description:The corpus of web commentaries with sentiment categorizations was developed as a part of BSc Thesis (Kadunc, 2016) and served for evaluation of the Slovene Sentiment Lexicon KSS http://hdl.handle.net/11356/1097. It contains web commentaries about different topics (business, politics, sport, and other) from 4 Slovene web portals (RtvSlo, 24ur, Finance, Reporter). The corpus is in XML format and available in two forms: - original corpus, containing 4,777 commentaries, 898 positive, 3,291 negative and 588 neutral commentaries. - balanced corpus, a subset of the original corpus, containing 1,740 commentaries, 580 of each type of sentiment (positive, negative and neutral). References: Klemen Kadunc (2016). Določanje sentimenta slovenskim spletnim komentarjem s pomočjo strojnega učenja. Diplomsko delo. Univerza v Ljubljani, Fakulteta za računalništvo in informatiko (in Slovene). http://eprints.fri.uni-lj.si/3317/ Klemen Kadunc, Marko Robnik-Šikonja (2016). Analiza mnenj s pomočjo strojnega učenja in slovenskega leksikona sentimenta. Conference on Language Technologies & Digital Humanities, Ljubljana (in Slovene). http://www.sdjt.si/wp/dogodki/konference/jtdh-2016/zbornik/
Identifier (URI):http://hdl.handle.net/11356/1115
Language:Slovenian
Language (ISO639):slv
Publisher:Faculty of Computer and Information Science, University of Ljubljana
Rights:Creative Commons - Attribution 4.0 International (CC BY 4.0)
https://creativecommons.org/licenses/by/4.0/
Subject:web commentaries
opinion corpus
sentiment analysis
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  Slovenian language resource repository CLARIN.SI
Description:  http://www.language-archives.org/archive/clarin.si
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.clarin.si:11356/1115
DateStamp:  2017-05-28
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Kadunc, Klemen; Robnik-Šikonja, Marko. 2017. Faculty of Computer and Information Science, University of Ljubljana.
Terms: area_Europe country_SI dcmi_Text iso639_slv olac_primary_text


http://www.language-archives.org/item.php/oai:www.clarin.si:11356/1115
Up-to-date as of: Tue Aug 20 10:27:08 EDT 2019