Skip to content

Latest commit

 

History

History
57 lines (37 loc) · 2.92 KB

parseme.md

File metadata and controls

57 lines (37 loc) · 2.92 KB

PARSEME: Annotation of verbal multi-word expressions

Metadata

  • Status: Completed
  • Type: Specific
  • Work Package: WP3
  • Research coordinator: Agata Savary (Université de Tours)
  • Coordinator in CLARIAH: Maarten van Gompel (Radboud)
  • Participating Institutes: Many institutes
  • End-users: Many annotators
  • Developers: Maarten van Gompel (Radboud)
  • Interest Groups: Text
  • Task IDs: T062 (FLAT), T108 (FoLiA)

Description

What is the research about?

PARSEME (PARSing and Multi-word Expressions), is an interdisciplinary scientific network devoted to the role of multi-word expressions (MWEs) in parsing.

As part of this project, multi-word expressions were annotated on corpora for a wide variety of languages. They organized several shared task iterations.

What is needed to do the research?

Tools

An annotation environment was needed capable of annotating verbal multi-word expressions for a wide variety of language (including those with right-to-left scripts). The PARSEME project opted for FLAT. The tool was improved significantly for the PARSEME project, most notably the v0.6 release. This release featured more user/permission management functons, the implementation of an automatic conversion step from their TSV format to FoLiA, and many fixes.

What software and services are involved?

  • FLAT
  • FoLiA

References

Related use cases:

Links:

Publications:

  • Agata Savary , Marie Candito , Verginica Barbu Mititelu , Eduard Bejček , Fabienne Cap , Slavomír Čéplö , Silvio Ricardo Cordeiro , Gülşen Eryiğit , Voula Giouli , Maarten Gompel , al. (2018) — PARSEME multilingual corpus of verbal multiword expressions — In: Multiword expressions at length and in depth: Extended papers from the mwe 2017 workshop,. Language Science Press. Berlin. DOI 10.5281/zenodo.1471591
  • Ramisch, Carlos; Cordeiro, Silvio Ricardo; Savary, Agata; et al., 2018, Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University, http://hdl.handle.net/11372/LRT-2842.
  • Ramisch, Carlos; Guillaume, Bruno; Savary, Agata; et al., 2020, Annotated corpora and tools of the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University, http://hdl.handle.net/11234/1-3367.