SyntagNet

This information from corpora-bounces@uib.no

SyntagNet, a resource with 88,000 lexical-semantic combinations, is now out!

We are proud to announce that SyntagNet 1.0 (http://syntagnet.org) is now available for download at http://syntagnet.org/download. Developed at the Sapienza NLP group (http://nlp.uniroma1.it), the multilingual Natural Language Processing group at the Sapienza University of Rome, SyntagNet is a manually-curated large-scale lexical-semantic combination database which associates pairs of concepts with pairs of co-occurring words. The goal of SyntagNet is to capture sense distinctions evoked by syntagmatic relations (e.g. mouse.n.1 and squeak.v.1 vs mouse.n.2 and click.n.4), hence providing information which complements the essentially paradigmatic knowledge shared by currently available Lexical Knowledge Bases such as WordNet. Its main features are:

  • Wide coverage, with 78,000 noun-verb and noun-noun lexical combinations extracted from the English Wikipedia and the British National Corpus.
  • High-quality, fully manual disambiguation for all of the lexical combinations, according to the WordNet 3.0 sense inventory.
  • A resulting Lexical Knowledge Base made up of 88,019 semantic combinations linking 20,626 WordNet 3.0 unique synsets with a relation edge.
  • A user-friendly web interface for looking up terms and their lexical-semantic combinations, with complete linkage to BabelNet 4.0.

And much more! Please check out our EMNLP 2019 paper:

M. Maru, F. Scozzafava, F. Martelli, R. Navigli. SyntagNet: Challenging Supervised Word Sense Disambiguation with Lexical-Semantic Combinations, Proc. of EMNLP-IJCNLP 2019

or http://syntagnet.org for more details!

SyntagRank, a state-of-the-art knowledge-based Word Sense Disambiguation system which uses SyntagNet to perform disambiguation in five languages (English, French, German, Italian and Spanish) is also available from the same website (will be demoed at ACL 2020!).

SyntagNet is an output of the MOUSSE ERC Consolidator Grant No. 726487 and of the ELEXIS project No. 731015 under the European Union’s Horizon 2020 research and innovation programme. Babelscape proudly developed the online interface and API, and provides the infrastructure for maintaining the service. 

The Sapienza NLP group

The Natural Language Toolkit

NLTK is a leading platform for building Python programs to work with human language data. It provides easy-to-use interfaces to over 50 corpora and lexical resources such as WordNet, along with a suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning, wrappers for industrial-strength NLP libraries, and an active discussion forum.

Source: https://www.nltk.org/

Installation: https://www.nltk.org/install.html

Wordcounter

Online editor that can help you to improve word choice and writing style, and, optionally, help you to detect grammar mistakes and plagiarism. To check word count, simply place your cursor into the text box above and start typing. You’ll see the number of characters and words increase or decrease as you type, delete, and edit them. You can also copy and paste text from another program over into the online editor above. The Auto-Save feature will make sure you won’t lose any changes while editing, even if you leave the site and come back later.

URL: https://wordcounter.net