OLAC Record oai:lindat.mff.cuni.cz:11372/LRT-1181 |
Metadata | ||
Title: | COWAL - combined word aligner | |
Bibliographic Citation: | http://hdl.handle.net/11372/LRT-1181 | |
Contributor: | Tufiş, Dan | |
Ion, Radu | ||
Ceauşu, Alexandru | ||
Ştefănescu, Dan | ||
Date (W3CDTF): | 2014-07-30T21:27:46Z | |
Date Available: | 2014-07-30T21:27:46Z | |
Description: | COWAL is a wrapper of two stand-alone word aligners [[http://www.clarin.eu/tools/yawa-yet-another-word-aligner|YAWA]] and [[http://www.clarin.eu/tools/meba-word-aligner|MEBA]]. COWAL merges the alignments produced by each stand-alone aligner and then uses a trained SVM classifier to prune the unlikely alignment links. The classifier is based on the [[http://www.csie.ntu.edu.tw/~cjlin/papers/quadworkset.pdf|LIBSVM kit]], used with the default parameters (C-SVC classification and radial basis kernel function). The classifier was trained with positive and negative hand-validated examples of word alignment links. With the current F-measure of 83.98%, COWAL won the first place in the lexical alignment competition held with the occasion of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL’05) workshop on “Building and Using Parallel Texts: Data Driven Machine Translation and Beyond”, Ann Arbor, USA. More detailed descriptions are available in [[http://www.racai.ro/~tufis/papers|the following papers]]: -- Dan Tufiş (2007). Exploiting Aligned Parallel Corpora in Multilingual Studies and Applications. In Toru Ishida, Susan R. Fussell, and Piek T.J.M. Vossen (eds.), Intercultural Collaboration. First International Workshop (IWIC 2007), volume 4568 of Lecture Notes in Computer Science, pp. 103-117. Springer-Verlag, August 2007. ISBN 978-3-540-73999-9. -- -- Dan Tufiş, Radu Ion, Alexandru Ceauşu, and Dan Ştefănescu (2006). Improved Lexical Alignment by Combining Multiple Reified Alignments. In Toru Ishida, Susan R. Fussell, and Piek T.J.M. Vossen (eds.), Proceedings of the 11th Conference EACL2006, pp. 153-160, Trento, Italy, April 2006. Association for Computational Linguistics. ISBN 1-9324-32-61-2. -- Dan Tufiş, Radu Ion, Alexandru Ceauşu, and Dan Ştefănescu (2005). Combined Aligners. In Proceedings of the ACL Workshop on Building and Using Parallel Texts: Data-Driven Machine Translation and Beyond, pp. 107-110, Ann Arbor, USA, June 2005. Association for Computational Linguistics. ISBN 978-973-703-208-9. | |
Identifier (URI): | http://hdl.handle.net/11372/LRT-1181 | |
Language: | English | |
Romanian | ||
Language (ISO639): | eng | |
ron | ||
Publisher: | Research Institute for Artificial Intelligence, Romanian Academy of Sciences | |
Type: | toolService | |
Type (DCMI): | Software | |
OLAC Info |
||
Archive: | LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University | |
Description: | http://www.language-archives.org/archive/lindat.mff.cuni.cz | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:lindat.mff.cuni.cz:11372/LRT-1181 | |
DateStamp: | 2016-04-06 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Tufiş, Dan; Ion, Radu; Ceauşu, Alexandru; Ştefănescu, Dan. 2014. Research Institute for Artificial Intelligence, Romanian Academy of Sciences. | |
Terms: | area_Europe country_GB country_RO dcmi_Software iso639_eng iso639_ron |