MASC-NEWS: an automatic annotation of the MASC corpus with both named entities and word senses.

We use BabelNet 2.0.1, a multilingual semantic network which integrates both lexicographic and encyclopedic knowledge, as our sense/entity inventory together with its semantic structure, to perform an automatic annotation, with both named entities and word senses, of the MASC 3.0 corpus, a large English corpus covering a wide range of genres of written and spoken text.

Data

References

Andrea Moro and Roberto Navigli and Francesco M. Tucci and Rebecca J. Passonneau. Annotating the MASC Corpus with BabelNet. Proc. of the International Conference on Language Resources and Evaluation (LREC 2014), Reykjavik, Iceland, May 26-31, 2014
Andrea Moro and Alessandro Raganato and Roberto Navigli. Entity Linking meets Word Sense Disambiguation: A Unified Approach. Transactions of the Association for Computational Linguistics (TACL), 2014


Last update: 14 April 2014 by Andrea Moro