- Status: Completed
- Type: Specific
- Work Package: WP3
- Research coordinator: Agata Savary (Université de Tours)
- Coordinator in CLARIAH: Maarten van Gompel (Radboud)
- Participating Institutes: Many institutes
- End-users: Many annotators
- Developers: Maarten van Gompel (Radboud)
- Interest Groups: Text
- Task IDs: T062 (FLAT), T108 (FoLiA)
PARSEME (PARSing and Multi-word Expressions) is an interdisciplinary scientific network devoted to the role of multi-word expressions (MWEs) in parsing.
As part of this project, multi-word expressions were annotated in corpora for a wide variety of languages, and the network organized several editions of a shared task.
The project needed an annotation environment capable of annotating verbal multi-word expressions in a wide variety of languages (including those with right-to-left scripts), and opted for FLAT. The tool was improved significantly for the PARSEME project, most notably in the v0.6 release. This release featured more user and permission management functions, an automatic conversion step from the PARSEME TSV format to FoLiA, and many fixes.
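To give a rough idea of what a conversion from a tab-separated annotation format involves, the following is a minimal sketch of its first stage only: reading token lines and grouping tokens into multi-word expressions. It is not the converter used in FLAT and it does not produce FoLiA. The four-column layout (token rank, surface form, no-space flag, MWE code such as `1:LVC` or `1`), the `;` separator for multiple codes, and the function name `group_mwes` are all assumptions made for illustration.

```python
def group_mwes(tsv_text):
    """Group tokens of one sentence into MWEs.

    Assumed (hypothetical) layout per line: rank, form, no-space flag, MWE code.
    An MWE code is "_" (no MWE), "ID:CATEGORY" (first token), or "ID" (later tokens).
    """
    forms = {}       # MWE id -> list of surface forms, in sentence order
    categories = {}  # MWE id -> category label, taken from the first token
    for line in tsv_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment lines
        _rank, form, _nsp, mwe = line.split("\t")
        if mwe == "_":
            continue  # token is not part of any MWE
        for code in mwe.split(";"):  # a token may belong to several MWEs
            mwe_id, _, category = code.partition(":")
            forms.setdefault(mwe_id, []).append(form)
            if category:
                categories[mwe_id] = category
    return {i: (categories.get(i), toks) for i, toks in forms.items()}


sample = "1\tShe\t_\t_\n2\ttook\t_\t1:LVC\n3\ta\t_\t_\n4\twalk\t_\t1\n"
print(group_mwes(sample))  # {'1': ('LVC', ['took', 'walk'])}
```

A real conversion would then have to write each group out as an annotation over the corresponding word elements in a FoLiA document, which this sketch leaves out.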
- FLAT
- FoLiA
Related use cases:
- Data format for linguistically-annotated corpora (WP3, FoLiA)
Links:
Publications:
- Agata Savary, Marie Candito, Verginica Barbu Mititelu, Eduard Bejček, Fabienne Cap, Slavomír Čéplö, Silvio Ricardo Cordeiro, Gülşen Eryiğit, Voula Giouli, Maarten van Gompel, et al. (2018). PARSEME multilingual corpus of verbal multiword expressions. In: Multiword expressions at length and in depth: Extended papers from the MWE 2017 workshop. Language Science Press, Berlin. DOI 10.5281/zenodo.1471591
- Ramisch, Carlos; Cordeiro, Silvio Ricardo; Savary, Agata; et al., 2018, Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University, http://hdl.handle.net/11372/LRT-2842.
- Ramisch, Carlos; Guillaume, Bruno; Savary, Agata; et al., 2020, Annotated corpora and tools of the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University, http://hdl.handle.net/11234/1-3367.