
LSTMEmbed

Learning Word and Sense Representations from a Large Semantically Annotated Corpus with Long Short-Term Memories

https://github.com/iiacobac/LSTMEmbed

Setup

Download a word-based embeddings file (e.g., from word2vec or GloVe) or a sense-based embeddings file (e.g., from SensEmbed), and place it in data/
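
As a minimal sketch of this step (not part of the repository), the following Python 2.7 snippet fetches the GloVe 6B vectors and unpacks them into data/. The URL and file names are assumptions; any compatible word2vec, GloVe, or SensEmbed file works the same way.

    # Hypothetical helper: download a pretrained embeddings file into data/.
    import os
    import urllib
    import zipfile

    GLOVE_URL = "http://nlp.stanford.edu/data/glove.6B.zip"  # assumed download location

    if not os.path.isdir("data"):
        os.makedirs("data")

    archive = os.path.join("data", "glove.6B.zip")
    urllib.urlretrieve(GLOVE_URL, archive)   # Python 2.7; use urllib.request on Python 3

    with zipfile.ZipFile(archive) as zf:
        zf.extractall("data")                # e.g. data/glove.6B.300d.txt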

Training

python train_word_embeddings.sh
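
The training script learns order-aware representations with a bidirectional LSTM over pretrained embeddings (see the paper below). The following is only a rough Keras 2 illustration of that kind of architecture, not the repository's actual training code; all sizes (vocab_size, embedding_dim, max_len, hidden units) are made-up placeholders.

    # Illustrative sketch: a bidirectional LSTM over frozen pretrained embeddings.
    import numpy as np
    from keras.models import Sequential
    from keras.layers import Embedding, Bidirectional, LSTM, Dense

    vocab_size = 10000       # placeholder vocabulary size
    embedding_dim = 300      # e.g. dimensionality of the file placed in data/
    max_len = 20             # placeholder context window length

    # Pretrained word/sense vectors would be loaded from data/; random here.
    pretrained = np.random.rand(vocab_size, embedding_dim)

    model = Sequential()
    model.add(Embedding(vocab_size, embedding_dim, weights=[pretrained],
                        input_length=max_len, trainable=False))
    model.add(Bidirectional(LSTM(128)))             # order-aware encoder
    model.add(Dense(vocab_size, activation="softmax"))
    model.compile(optimizer="adam", loss="categorical_crossentropy")
    model.summary()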

Requirements

Python 2.7
Keras 2

Trained Word and Sense Embeddings

Available at http://lcl.uniroma1.it/LSTMEmbed

Reference

Main paper to be cited

@inproceedings{iacobacci-navigli-2019-lstmembed,
    title = "{LSTME}mbed: Learning Word and Sense Representations from a Large Semantically Annotated Corpus with Long Short-Term Memories",
    author = "Iacobacci, Ignacio and Navigli, Roberto",
    booktitle = "Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics",
    month = jul,
    year = "2019",
    address = "Florence, Italy",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/P19-1165",
    pages = "1685--1695",
    abstract = "While word embeddings are now a de facto standard representation of words in most NLP tasks, recently the attention has been shifting towards vector representations which capture the different meanings, i.e., senses, of words. In this paper we explore the capabilities of a bidirectional LSTM model to learn representations of word senses from semantically annotated corpora. We show that the utilization of an architecture that is aware of word order, like an LSTM, enables us to create better representations. We assess our proposed model on various standard benchmarks for evaluating semantic representations, reaching state-of-the-art performance on the SemEval-2014 word-to-sense similarity task. We release the code and the resulting word and sense embeddings at http://lcl.uniroma1.it/LSTMEmbed.",
}

============================================

Support

For more information, bug reports, or fixes, please contact:

Ignacio Iacobacci
iiacobac[at]gmail[dot]com
http://iiacobac.wordpress.com/

Roberto Navigli
navigli[at]di[dot]uniroma1[dot]it
http://wwwusers.di.uniroma1.it/~navigli/

License

LSTMEmbed is an output of the MOUSSE ERC Consolidator Grant No. 726487. The LSTMEmbed authors gratefully acknowledge the support of the NVIDIA Corporation Hardware Grant. LSTMEmbed is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License.
