Release of BabelNet 5.3
We are proud to announce the release of a new version of BabelNet https://babelnet.org/ and its programmatic *Java and Python API*, developed jointly by the Sapienza NLP Group http://nlp.uniroma1.it of *Sapienza University of Rome* under the supervision of prof. Roberto Navigli https://www.diag.uniroma1.it/navigli/ and Babelscape http://babelscape.com/, *a deep-tech multilingual NLP company* providing innovative solutions for natural language understanding.
BabelNet -- winner of the *prominent paper award 2017* from the Artificial Intelligence Journal and the META prize 2015, and covered in media such as The Guardian https://www.theguardian.com/news/2018/feb/23/oxford-english-dictionary-can-worlds-biggest-dictionary-survive-internet and Time Magazine http://wwwusers.di.uniroma1.it/~navigli/img/Redefining_the_modern_dictionary.png -- is today's *most far-reaching multilingual lexical-semantic knowledge graph* which, according to need, can be used as an *encyclopedic dictionary*, or a *semantic network* or a huge *knowledge base/ontology* e.g. to be integrated into *deep learning solutions*. It has been used by more than *1000 universities and research institutions*, enabling multilinguality in several fields of AI and NLP, such as multilingual semantic search, Word Sense Disambiguation and Entity Linking, Semantic Role Labeling, image tagging and semantically-enhanced multimodality.
BabelNet was created by means of the seamless integration and interlinking of the largest multilingual Web encyclopedia - i.e., Wikipedia - with the most popular computational lexicon of English - i.e., WordNet, and other lexical-semantic resources such as Wikidata, Wiktionary, OmegaWiki, dozens of wordnets (including Open English WordNet), GeoNames, and ImageNet. The BabelNet model is centered around *multilingual synsets*, i.e., concepts and named entities lexicalized in many languages, and connected with large amounts of semantic relations.
*Version 5.3* ships with the following features:
- *80 new languages* for a grand total of *600 languages*; - *23 million synsets* covered; - *Lemma casing updated in 24 languages*; - *Wikipedia and Wikidata updated* thanks to BabelNet live (November 2023 dump); - *Open English WordNet* has been updated to version 2023; - *Images* associated with synsets have been updated; - *Wiktionary* has been *updated* and *20k new concepts* have been integrated (November 2023 dump); - *Significantly improved cross-lingual resource mapping*, ensuring more accurate and contextually relevant lexicalizations and translations; - *General data cleanup* (glosses, senses, Named Entity vs. Concept labels); - *Wikipedia multilingual labels updated.*
More statistics are available at: babelnet.org/statistics.
Kind regards, The BabelNet group