MoL-2011-18: van Cranenburgh, Andreas (2011) Discontinuous Data-Oriented Parsing through Mild Context-Sensitivity. [Report]
Preview |
Text (Full Text)
MoL-2011-18.text.pdf Download (798kB) | Preview |
Text (Abstract)
MoL-2011-18.abstract.txt Download (1kB) |
Abstract
It has long been argued that incorporating a notion of discontinuity
in phrase-structure is desirable, given phenomena such as
topicalization and extraposition, and particular features of languages
such as cross-serial dependencies in Dutch and the German
Mittelfeld. Up until recently this was mainly a theoretical topic, but
advances in parsing technology have made treebank parsing with
discontinuous constituents possible, with favorable results.
We improve on this by applying Data-Oriented Parsing (DOP) to a mildly
context-sensitive grammar formalism which allows for discontinuous
trees. Decisions during parsing are conditioned on all possible
fragments, resulting in improved performance. Despite the fact that
both DOP and discontinuity present formidable challenges in terms of
computational complexity, the model is reasonably efficient. Our
results emulate and surpass the state of the art in discontinuous
parsing.
Item Type: | Report |
---|---|
Report Nr: | MoL-2011-18 |
Series Name: | Master of Logic Thesis (MoL) Series |
Year: | 2011 |
Date Deposited: | 12 Oct 2016 14:38 |
Last Modified: | 12 Oct 2016 14:38 |
URI: | https://eprints.illc.uva.nl/id/eprint/863 |
Actions (login required)
View Item |