We invite you to participate in SemEval-2023 Task 2: *Multi*lingual *Co* mplex *N*amed *E*ntity *R*ecognition (MultiCoNER) II.
*Task Website:* https://multiconer.github.io/
This task focuses on the *fine-grained* detection of complex entities, such as movie, book, music and product titles, in low context settings (short and uncased text).
The task provides data in 12 language. Here are some examples in different languages where entities are enclosed inside brackets with their type:
- *English: [wes anderson | Artist]*'s film *[the grand budapest hotel | VisualWork]* opened the festival . - *Spanish:* fue superado por el [aon center | Facility] de [los ángeles | HumanSettlement] . - *Ukranian:* назва альбому походить з роману « *[кінець дитинства | WrittenWork]* » англійського письменника* [артура кларка | Artist]* . - *Swedish: [tom hamilton | Artist]* amerikansk musiker basist i *[aerosmith | MusicalGRP]* . - *Portuguese:* também é utilizado para se fazer *[licor | Drink]* e *[vinhos | Drink]*. - *Hindi:* १७९६ में उन्हें *[शाही स्वीडिश विज्ञान अकादमी | Facility]* का सदस्य चुना गया। - *French:* l *[amiral de coligny | Politician]* réussit à s y glisser . - *German:* in *[frühgeborenes | Disease]* führt dies zu *[irds | Symptom]* . - *Bangla [লিটল মিক্স | MusicalGrp]* এ যোগদানের আগে তিনি *[পিৎজা হাট | ORG]* এ ওয়েট্রেস হিসাবে কাজ করেছিলেন। - *Italian*: è conservato nel [rijksmuseum | Facility] di [amsterdam | HumanSettlement] . - *Chinese:* 它的纤维穿过 [锁骨 | AnatomicalStructure] 并沿颈部侧面倾斜向上和内侧. - *Farsi: *مرکزاین استان شهر [ناگویا |HumanSettlement] است
Additionally, a *multilingual NER track* is also offered for multilingual systems that can process all languages.
The task focuses on detecting semantically ambiguous and complex entities in short and low-context settings. Participants are welcome to build NER systems for any number of languages. And we encourage to aim for a bigger challenge of building NER systems for multiple languages.
We have released training data for 12 languages along with a baseline system to start with. Participants can submit their system for one language but are encouraged to aim for a bigger challenge and build multi-lingual NER systems.
*Task Website:* https://multiconer.github.io/ *Mailing List:* multiconer-semeval@googlegroups.com *Slack Workspace:* https://join.slack.com/t/multiconer/shared_invite/zt-vi3g97cx-MpqTvS07XX22S7... *Training Data:* https://multiconer.github.io/dataset *Baseline System:* https://multiconer.github.io/baseline
*Shared task schedule:*
- Evaluation start: mid-January, 2022 - Evaluation end: by January 31, 2023 (latest date; task organizers may choose an earlier date) - System description paper submissions due: February 1, 2023 - Notification to authors: March 1, 2023
*Task organizers*
- Shervin Malmasi (Amazon) - Besnik Fetahu (Amazon) - Sudipta Kar (Amazon)
Please reach out to the organizers at multiconer-semeval-organizers@googlegroups.com, or join the Slack workspace to connect with the other participants and organizers. - Sudipta Kar Applied Scientist Amazon Alexa AI +1 8326437277 http://sudiptakar.info