Jakub Waszczuk, Rafael Ehren, Regina Stodden, Laura Kallmeyer
We propose to tackle the problem of verbal multiword expression (VMWE) identification using a neural graph parsing-based approach. Our solution involves encoding VMWE annotations as labellings of dependency trees and, subsequently, applying a neural network to model the probabilities of different labellings. This strategy can be particularly effective when applied to discontinuous VMWEs and, thanks to dense, pretrained word vector representations, VMWEs unseen during training. Evaluation of our approach on three PARSEME datasets (German, French, and Polish) shows that it allows to achieve performance on par with the previous state-of- the-art (Al Saied et al., 2018).
© 2001-2024 Fundación Dialnet · Todos los derechos reservados