alud

alud is a Go package for deriving Universal Dependencies from Dutch sentences parsed with Alpino.

Overview

  • Alpino POS-tags, lemmatizes, and morphologically analyses any Dutch sentence it is given. It also reliably analyses the sentence syntactically, yielding a fully annotated syntactic tree with both constituents and explicitly labelled syntactic relations.

  • alud derives Universal Dependencies from Alpino's output (XML in the alpino_ds format) and writes them in the CoNLL-U format. Alternatively, the dependencies can be embedded in the alpino_ds file, making them available for XPath queries.

  • alud can also insert externally supplied Universal Dependencies into an alpino_ds file.

  • alud was based on a translation of an XQuery script, lassy2ud.
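
To make the input side of this pipeline more concrete, the sketch below reads a small alpino_ds document with Go's standard encoding/xml package and lists its word-bearing leaf nodes. It does not use alud itself. The sample XML is a hand-written, heavily simplified illustration (real Alpino output carries many more attributes), and the type and function names (Node, AlpinoDS, leaves, parseLeaves) are hypothetical choices made for this example only.

```go
package main

import (
	"encoding/xml"
	"fmt"
	"sort"
)

// Node mirrors a small subset of the attributes found on an alpino_ds <node>.
// Attributes absent from the XML are left at their zero value.
type Node struct {
	Begin int    `xml:"begin,attr"`
	End   int    `xml:"end,attr"`
	Rel   string `xml:"rel,attr"`
	Cat   string `xml:"cat,attr"`
	Pt    string `xml:"pt,attr"`
	Word  string `xml:"word,attr"`
	Lemma string `xml:"lemma,attr"`
	Nodes []Node `xml:"node"`
}

// AlpinoDS is the document root: one top node plus the sentence text.
type AlpinoDS struct {
	XMLName  xml.Name `xml:"alpino_ds"`
	Root     Node     `xml:"node"`
	Sentence string   `xml:"sentence"`
}

// leaves collects every node that carries a word, sorted by start position.
func leaves(root Node) []Node {
	var out []Node
	var walk func(Node)
	walk = func(n Node) {
		if n.Word != "" {
			out = append(out, n)
		}
		for _, child := range n.Nodes {
			walk(child)
		}
	}
	walk(root)
	sort.Slice(out, func(i, j int) bool { return out[i].Begin < out[j].Begin })
	return out
}

// parseLeaves decodes an alpino_ds document and returns its leaf nodes.
func parseLeaves(data []byte) ([]Node, error) {
	var doc AlpinoDS
	if err := xml.Unmarshal(data, &doc); err != nil {
		return nil, err
	}
	return leaves(doc.Root), nil
}

// sample is a simplified, hand-written alpino_ds document (an assumption,
// not actual Alpino output).
const sample = `<alpino_ds version="1.3">
  <node begin="0" end="3" id="0" rel="top" cat="top">
    <node begin="0" end="3" id="1" rel="--" cat="smain">
      <node begin="0" end="2" id="2" rel="su" cat="np">
        <node begin="0" end="1" id="3" rel="det" pt="lid" word="De" lemma="de"/>
        <node begin="1" end="2" id="4" rel="hd" pt="n" word="hond" lemma="hond"/>
      </node>
      <node begin="2" end="3" id="5" rel="hd" pt="ww" word="blaft" lemma="blaffen"/>
    </node>
  </node>
  <sentence>De hond blaft</sentence>
</alpino_ds>`

func main() {
	words, err := parseLeaves([]byte(sample))
	if err != nil {
		fmt.Println("parse error:", err)
		return
	}
	for _, w := range words {
		// Print position, word form, lemma, Alpino POS tag, and relation.
		fmt.Printf("%d\t%s\t%s\t%s\t%s\n", w.End, w.Word, w.Lemma, w.Pt, w.Rel)
	}
}
```

The leaf nodes carry Alpino's own tags and relation labels (lid, n, ww; det, hd, su); mapping these to Universal Dependencies labels is the work that alud performs and is not attempted here.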

Universal Dependencies

Universal Dependencies (UD) is a programme that aims at cross-linguistically consistent tagging and dependency parsing. UD is an open community effort with over 500 contributors producing over 200 treebanks in over 100 languages. If you’re new to UD, you should start by reading the first part of the Short Introduction and then browsing the annotation guidelines on the UD website.

Learn

alud is available as a Go package. Up-to-date documentation of the package can be found here.

alud's GitHub page provides extra information, including up-to-date installation instructions.

Mentions

  • Alpino: alud is developed to convert Alpino's output into Universal Dependencies.
  • Universal Dependencies: the annotation framework that alud targets; see the Universal Dependencies section above.
  • CoNLL-U: alud can write its output in CoNLL-U, a widely used format for dependency parses.
  • CoNLL-U visualization: the main developer of alud also wrote a webservice for visualizing CoNLL-U files.
  • Go: alud is a package for the Go programming language.

Publications

Webpages

Credits

alud was developed by the Computational Linguistics group of the Faculty of Arts, University of Groningen.