Release of BabelNet 5.2
We are proud to announce the release of a new version of BabelNet https://babelnet.org/, its Virtuoso SPARQL endpoint and its programmatic APIs, both for Java and Python, developed jointly by the Sapienza NLP Group http://nlp.uniroma1.it of Sapienza University of Rome under the supervision of prof. Roberto Navigli https://www.diag.uniroma1.it/navigli/ and Babelscape https://babelscape.com/, a deep-tech multilingual NLP company providing innovative solutions for natural language understanding.
BabelNet -- winner of the prominent paper award 2017 from the Artificial Intelligence Journal and the META prize 2015, and covered in media such as The Guardian https://www.theguardian.com/news/2018/feb/23/oxford-english-dictionary-can-worlds-biggest-dictionary-survive-internet and Time magazine http://wwwusers.di.uniroma1.it/~navigli/img/Redefining_the_modern_dictionary.png -- is today’s most far-reaching multilingual lexical-semantic knowledge graph which, according to need, can be used as an encyclopedic dictionary, or a semantic network or a huge knowledge base/ontology. It has been used by more than 1000 universities and research institutions, enabling multilinguality in several fields of AI and NLP, such as semantic search, Word Sense Disambiguation, Semantic Role Labeling, image tagging and semantically-enhanced multimodality.
BabelNet was created by means of the seamless integration and interlinking of the largest multilingual Web encyclopedia - i.e., Wikipedia - with the most popular computational lexicon of English - i.e., WordNet, and other lexical-semantic resources such as Wikidata, Wiktionary, OmegaWiki, dozens of wordnets (including Open English WordNet), GeoNames, and ImageNet. The BabelNet model is centered around multilingual synsets, i.e., concepts and named entities lexicalized in many languages, and connected with large amounts of semantic relations.
Version 5.2 ships with the following features:
-
21 new languages for a grand total of 520 languages; -
22 million synsets covered; -
Wikipedia and Wikidata updated thanks to BabelNet live (October 2022 dump); -
Wiktionary has been updated and 45k new concepts have been integrated (October 2022 dump); -
Lemma casing updated in 24 languages;
-
Images associated with synsets have been updated; -
Cross-resource mapping updated; - *Domain labels updated*; -
Wikipedia labels are *now *multilingual.
More statistics are available at: babelnet.org/statistics.
Kind regards, The BabelNet Team