SEW (Semantically Enriched Wikipedia) is a sense-annotated corpus, automatically built from Wikipedia, in which the overall number of linked mentions has been more than tripled solely by exploiting the hyperlink structure of Wikipedia pages and categories, along with the wide-coverage sense inventory of BabelNet. As a result SEW constitutes both a large-scale Wikipedia-based semantic network and a sense-tagged dataset with more than 200 million annotations of over 4 million different concepts and named entities.
Alessandro Raganato, Claudio Delli Bovi and Roberto Navigli.
Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia. Proceedings of 25th International Joint Conference on Artificial Intelligence (IJCAI-16), pages 2894–2900, New York City, New York, USA, 9-15 July 2016.