Hello All,
We are pleased to announce the call for industry day presentations at the
45th European Conference on Information Retrieval (ECIR'23). The Industry
Day of ECIR’23 will be held on Thursday 6th April 2022 in Dublin, Ireland,
immediately after the main conference program. Our goal is to bring
together product developers, information professionals and IR researchers
to promote knowledge sharing and innovation across academia and industry.
This year, our primary focus will be on *differences and challenges in
building dedicated information access systems versus building “search as a
service”*. Applications may cover search or recommendation systems in
various domains (e-commerce, vertical search, etc) and modalities (voice,
music, images, etc). Similarly to 2021, we also encourage presentations of
research works carried out during student internships and showing how
interns contributed impactfully on real-world products with innovative
ideas. Submissions about topics relevant to ECIR in general are encouraged.
- *Deadline for proposal submission*: January 30, 2023, 11:59 pm (AoE)
Please find submission instructions :
http://ecir2023.org/calls/industry.html
*Industry Day Chairs:*
- Nicolas Fiorini (Algolia, France)
- Isabelle Moulinier (Thomson Reuters, US)
Thank you!
Esraa Ali, Ph.D.
DCU
Publicity officer, ECIR 2023
--
*
*Séanadh Ríomhphoist/Email Disclaimer*
*Tá an ríomhphost seo agus aon
chomhad a sheoltar leis faoi rún agus is lena úsáid ag an seolaí agus sin
amháin é. Is féidir tuilleadh a léamh anseo.
<https://sites.google.com/view/seanadh-riomhphoist>*
*This e-mail and any
files transmitted with it are confidential and are intended solely for use
by the addressee. Read more here.
<https://sites.google.com/view/dcu-email-disclaimer>*
*
--
<https://www.facebook.com/DCU/> <https://twitter.com/DCU>
<https://www.linkedin.com/company/dublin-city-university>
<https://www.instagram.com/dublincityuniversity/?hl=en>
<https://www.youtube.com/user/DublinCityUniversity>
Dear colleagues,
we would like to invite you to apply to our residential workshop on
identifying and analysing nostalgia using corpus & discourse methods.
Details:
Bertinoro (FC) 3-5 May 2023, accommodation and meals fully funded by
CoLiTec (Corpora Linguistics and Technology Resaerch Centre at DIT -
Dipartimento di Interpretazione e Traduzione - UniBo):
https://centri.unibo.it/colitec/en/events/two-day-workshop-using-corpus-dis…
.
Deadline for expression of interest: 16 Jan. 2023.
Best,
Anna
We are looking for postdocs to work with us on INDOMITA!
Online hate speech has resulted in actual hate crimes. INDOMITA offers automated assistance to combat online hate speech. However, hatred is complex. The offensiveness of a phrase is determined by social customs and user demographics: “Yo, a**hole!” is acceptable among friends but not strangers. Our user-centric approach takes into account social customs and user demographics to enhance detection of nuanced forms of hate speech. We will use three strategies: - modeling a complex problem with socio-demographic context, - automating counter-argument and responding to aggressive users, and - creating evaluation methods to analyze fairness, performance, and subjectivity. Our research will bridge language gaps and shed light on the relationships between online actors and online hatred. We are seeking candidates to work on NLP, machine learning, and neural networks for representation learning, natural language understanding, and hate speech detection in various languages and modalities.
Successful candidates will work closely with Prof. Dirk Hovy, Prof. Debora Nozza, and the MilaNLP lab.
Your profile:
• a Ph.D. in Computer Science, Computational Linguistics/NLP, Machine Learning, Data Science, or related fields.
• Excellent programming skills in Python. Additional languages (C++, R, etc) a plus.
• Fluency in spoken and written English. Knowledge of Italian is NOT a requirement.
• Knowledge of current neural network models and implementation tools for neural networks (e.g. PyTorch, Tensorflow, Keras, etc.).
• Proven track record with publications in top-tier venues in the field of NLP/Computational Linguistics/ML.
Position Details:
• Starting date: March 1 2023, or any time thereafter
• Duration: 2 years, 1 year extension possible
• Deadline: 23rd January 2023
• Salary: 42k EUR p.a. (median salary Milan: 37k EUR) Applicants from outside Italy may qualify for a researcher taxation scheme
How to apply:
Go to the https://jobmarket.unibocconi.eu/?type=a&urlBack=/wps/wcm/connect/Bocconi/Si… <https://jobmarket.unibocconi.eu/?type=a&urlBack=/wps/wcm/connect/Bocconi/Si…> and search for “INDOMITA”, you will then have to click on “Apply online” for proceeding with the application.
Candidates should attach publications and a cover letter to their application. Online interviews will take place during February 2023.
Please contact dirk.hovy(a)unibocconi.it <mailto:dirk.hovy@unibocconi.it> if you have any question.
(apologies for cross posting)
The HiTZ center (https://hitz.ehu.eus) has received several large
multiyear grants to advance research in all aspects of NLP and Language
Technologies, with a focus on Large Language Models, common sense,
cross-lingual evaluation and low-resource languages, covering all
modalities including text, speech, images and videos. The center will
be offering a number of PhD, post-doctoral and research engineer
positions shortly with relatively flexible starting dates. Please
contact us if interested.
HiTZ is a reference research center on Language Technologies. Its aim is
to promote research, training, technological transfer and innovation in
Artificial Intelligence focused on language and speech. HiTZ is a leader
in research in Spain. It holds, among other, two Spanish Research Awards
in Computer Science, one of the 15 fellows that the main professional
association in the area (Association for Computational Linguistics) has
in Europe, and the award for the beTwo fully-funded 4-year PhD positions
at the HiTZ center, in the Basque Countryst PhD thesis in Artificial
Intelligence in Europe (EurAI 2020). The center is part of the
University of the Basque Country (https://ehu.eus), one of the top 400
universities in the world according to the Shanghai ranking. The center
has offices in San Sebastian and Bilbao (Basque Country, Spain), two
cities which rank at the top of places-to-visit and happiness rankings.
This message is specifically about two fully-funded positions that need
to be filled quickly.
The recipients of these PhD positions will be awarded a fully-funded
four-year scholarship, as well as additional funding for the payment of
doctoral tuition fees and research stays abroad. The PhD theses will be
carried out in the framework of the TRAIN (EXTREMELY LOW-RESOURCED
MACHINE TRANSLATION) and DeepKnowledge (DEEP LANGUAGE MODELS FOR
UNDERSTANDING AND REASONING WITH MULTILINGUAL CONTENT) projects.
The TRAIN project will explore techniques for machine translation
between languages with very scarce resources, including multimodal
translation from Spanish Sign Language (LSE) to written Spanish.
The main research objective of DeepKnowledge consists in advancing the
state-of-the-art towards NLU and NLG by generating and exploiting new
language models by taking into account a multitask and multimodal
objective during the pre-training, as well as exploring novel ways to
exploiting new large language models.
The official deadline for the two FPI pre-doctoral grants is 26 January,
but contact us asap if interested
(https://www.ehu.eus/es/web/ikerketaren-kudeaketa/-/fpi-2022).
If you are interested in these two positions (or any of the future
positions), please send your CV and academic transcript to Gorka Labaka
(gorka.labaka(a)ehu.eus).
--
Gorka Labaka
Dept. of Computer Languages and Systems
HiTZ Basque Center for Language Technologies - Ixa
University of the Basque Country (UPV/EHU)
Tel: (+34) 946 01 44 86
e-mail: gorka.labaka(a)ehu.eus
Dear Corpora subscribers,
I'm pleased to announce the availability of two new corpora of automatic speech recognition transcripts from the YouTube channels of municipalities and other local government entities:
* The Corpus of Australian and New Zealand Spoken English (CoANZSE: https://cc.oulu.fi/~scoats/CoANZSE.html), a 196-million-word corpus of 57k transcripts from 482 YouTube channels, corresponding to 24k hours of video.
* The Corpus of German Speech (CoGS: https://cc.oulu.fi/~scoats/CoGS.html): 51m words, 1.3k channels, 39k transcripts, 7.2k hours of video.
The corpora were created using methods similar to those used to create the Corpus of North American Spoken English (https://cc.oulu.fi/~scoats/CoNASE.html) and the Corpus of British Isles Spoken English (https://cc.oulu.fi/~scoats/CoBISE.html). Transcript metadata includes location and video URL. Because tokens have word timing information, the corpora can serve as starting points for the collection of audio or video data targeting specific utterances.
The corpora are available free of charge for academic/research purposes. Download links are on the web pages.
With kind regards,
Steven Coats
University of Oulu, Finland
Dear colleagues,
Happy New Year!
Just in case you missed the Tweet (from @NARNiHS), the North American Research Network in Historical Sociolinguistics (NARNiHS) 2023 Annual Meeting (Jan 6-7) program is now "live" on our website!
==> https://narnihs.org/?page_id=2291 <==.
The full program booklet, including the full abstracts, has also been posted with the Annual Meeting program.
Note that no registration is necessary for NARNiHS members to attend the fully online NARNiHS 2023. Conference access information will be sent to all NARNiHS members in a separate message as we get closer to the conference dates (06-07 January 2023).
Not a member yet, but interested in joining NARNiHS? Membership is free! For details on how to join NARNiHS, check out:
==> https://narnihs.org/?page_id=2 <==.
We hope to see you at our Annual Meeting!
Israel Sanz (2022 NARNiHS Convenor),
on behalf of the NARNiHS 2022 organizing committee (Joshua Bousquette, Mark Richard Lauersdorf, Israel Sanz, Sandrine Tailleur): https://narnihs.org/?page_id=2160 .
*The Second Ukrainian Natural Language Processing Workshop (UNLP 2023)*
<https://unlp.org.ua/>
The Second UNLP features the first *Shared Task in Grammatical Error
Correction for Ukrainian*.
*Task Description*
In this shared task, your goal is to correct a text in the Ukrainian
language to make it grammatical or both grammatical and fluent. We see this
shared task as an opportunity to facilitate research of GEC for Slavic
languages.
There are two tracks in this shared task: GEC-only and GEC+Fluency. It is
not mandatory to participate in both subtasks, i.e., participating in
either GEC-only or GEC+Fluency is acceptable.
You can find the detailed instructions, train and validation data, and the
evaluation script at https://github.com/asivokon/unlp-2023-shared-task.
*Registration*
Teams that intend to participate should register by filling in this form
<https://forms.gle/46gamdVXhFkBeZeX8>.
*Publication*
Participants in the shared task are invited to submit a paper to the UNLP
2023 <https://unlp.org.ua/call-for-papers/> workshop. Submitting a
paper is *not
mandatory* for participating in the Shared Task. Papers must follow the
workshop submission instructions and will undergo regular peer review.
Their acceptance will not depend on the results obtained in the shared
task, but on the quality of the paper. Accepted papers will appear in the
ACL anthology and will be presented at a session of UNLP 2023 specially
dedicated to the Shared Task.
*Important Dates*
December 23, 2022 — Shared task announcement
February 12, 2023 — Registration deadline
February 13, 2023 — Release of test data to registered participants
February 20, 2023 — Shared Task paper submission
March 10, 2023 — Submission of system responses
March 13, 2023 — Notification of acceptance
March 14, 2023 — Results of the Shared Task announced to participants
March 27, 2023 — Camera-ready Shared Task papers due
May 2 or 6, 2023 — Workshop dates
*Contact*
Email: info(a)unlp.org.ua
Website: https://unlp.org.ua/
Twitter: https://twitter.com/UNLP_workshop
Telegram: https://t.me/UNLP_workshop
3-year PhD position in Computational Models of Analogy Making in Natural Language (IRIT and University of Toulouse, France)
We invite applications for a fully funded PhD position for 3 years at the IRIT laboratory and the University of Toulouse, Paul Sabatier, France, in the context of the recently funded project AT2TA on Analogy Making.
Analogy Making is a remarkable cognitive capability during which similarities and differences between two parallel situations are exploited in order to draw a common "essence" allowing us thus to categorize an object or a particular situation to a preexisting concept or create a new one. Reasoning by analogy allows us thus to transfer our understanding of a previous situation to a new one and appropriately adapt it. It has been argued that analogy making lies at the core of cognition (Hofstadter 2001) and has recently drawn the attention of Deep Learning pioneers (Chollet 2017, LeCun 2022).
In Natural Language Processing analogies usually take the form of a quadruplet a:b :: c:d traditionally expressed as "a is to b as c is to d" (for example, Paris is to France as Berlin is to Germany). Most of the extant work considers a, b, c, d as word embeddings and then relies on geometrical properties of those embedding in a higher dimensional space in order to recognise a quadruplet as an analogy or to generate a d such that a:b :: c:d forms analogy given a, b and c. Despite the importance of analogies, most works in NLP do not consider analogies between sentences and do not concentrate on the underlying latent relations that form the common essence between pairs (a,b) and (c,d). The successful PhD candidate will work on computational models which can identify analogies between sentences or even bigger chunks of text with a particular focus on the identification of common latent relations which are also an essential part for an explainable AI.
The successful candidate should hold a Master's degree in computational linguistics or computer science or cognitive science and has prior
experience in word embedding models or deep learning approaches in general. The candidate should have strong programming skills and expertise in machine learning. The position is affiliated with the IRIT laboratory at Toulouse and there will be frequent interactions with researchers at the Loria laboratory in Nancy in the context of the AT2TA project.
Applications will be considered until the position is filled, but applicants are encouraged to apply as early as possible since applications will be considered at the moment of reception. Applications, in English or French, should include a detailed CV, a letter of motivation and at least two recommendation letters. Applications should be sent to Stergos Afantenos (stergos.afantenos at irit.fr).
More information can be found here: https://cloud.irit.fr/index.php/s/OpwyvCBzadRFKxY
NooJ 2023: Second Call for Papers
Conference URL: https://conference.unizd.hr/noojconference/
*********************************************************************************
The 17th NooJ International Conference 2023
Zadar, Croatia
May, 31st – June, 2nd 2023
*********************************************************************************
The University of Zadar (Department of Classical Philology and Department of Information Sciences), in cooperation with the Centre de Recherches Interdisciplinaires et Transculturelles (C.R.I.T.) from the Université de Franche-Comté (Besançon) and the NooJ association are organizing the 17th NooJ International Conference 2023 to be held from May 31st to June 2nd, 2023 in Zadar (Croatia).
NooJ annual conferences give NooJ users the opportunity to meet and share their experience as developers, researchers and teachers; to present the latest linguistic resources, Digital Humanities experiments and NLP applications developed with NooJ; to offer researchers and graduate students a tutorial to help them parse corpora and build NLP applications with NooJ.
ABOUT NOOJ
********************
NooJ is a linguistic development environment software as well as a corpus processor. NooJ provides linguists with tools to develop dictionaries, Regular Grammars, Context-Free Grammars, Context-Sensitive Grammars, as well as their graphical equivalents, to formalize various linguistic phenomena. NooJ’s multi-layer approach allows linguists to accumulate elementary descriptions across different linguistic levels.
NooJ is used as a corpus processor in the Digital Humanities as it allows researchers in the Social sciences to apply sophisticated queries to large corpora in real time, annotate texts automatically and perform various statistical analyses.
NooJ’s linguistic engine has been integrated into various NLP applications that perform automatic semantic annotation, Named Entities Recognition, Information extraction, Paraphrase Generation, Business Intelligence, Machine Translator, Web Semantics.
NooJ is a free open-source software promoted by the METASHARE European programme. It can run on Windows (C# .NET), macOS, LINUX and UNIX (Java). Its new engine and its source “RA” can be downloaded from GitLab and runs natively on Windows, macOS and LINUX.
TOPICS OF INTEREST
********************
* Linguistic Resources: Typography, Spelling, Syllabification, Phonemic and Prosodic Transcription, Morphology, Lexical Analysis, Local Syntax, Structural Syntax, Transformational Analysis, Paraphrase Generation, Semantic Annotations, Semantic Analysis.
* Digital Humanities: Corpus Linguistics, Discourse Analysis, Literature Studies, Second-Language Teaching, Narrative content analysis, Corpus processing for the Social Sciences.
* Natural Language Processing Applications: Business Intelligence, Text Mining, Text Generation. Language Teaching Software, Automatic Paraphrasing, Machine Translation, etc.
SUBMISSIONS
********************
We invite the submission of abstracts in English until January 15th, 2023. Abstracts should be between 300 and 600 words and submitted via Easy Abstract: http://linguistlist.org/easyabs/nooj2023. The scientific committee will review all proposals and authors will be given notice of acceptance of their papers no later than March 1st, 2023. All papers must be original and cannot simultaneously be presented to another journal or conference.
IMPORTANT DATES
********************
Abstract submission: January 15th, 2023
Notification of acceptance: March 1st, 2023
Camera-ready abstract submission: March 20th, 2023
Early Registration: until April 15th, 2023
Selected papers submission: September 13th, 2023
POST-PROCEEDINGS
********************
A selection of the papers presented at the NooJ 2023 will be published by Springer Verlag in their CCIS Series. Deadline for submission of full camera-ready papers is September 13th, 2023.
Meeting Location:
Zadar, Croatia
Contact Information:
Linda Mijić
nooj2023conf(a)gmail.com
Meeting Dates:
May 31st, 2023 to June 2nd, 2023
Abstract Submission Information:
Abstracts can be submitted from November 11th, 2022 until January 15th, 2023.
NooJ 2023: Second Call for Papers
Conference URL: https://conference.unizd.hr/noojconference/
*********************************************************************************
The 17th NooJ International Conference 2023
Zadar, Croatia
May, 31st – June, 2nd 2023
*********************************************************************************
The University of Zadar (Department of Classical Philology and Department of Information Sciences), in cooperation with the Centre de Recherches Interdisciplinaires et Transculturelles (C.R.I.T.) from the Université de Franche-Comté (Besançon) and the NooJ association are organizing the 17th NooJ International Conference 2023 to be held from May 31st to June 2nd, 2023 in Zadar (Croatia).
NooJ annual conferences give NooJ users the opportunity to meet and share their experience as developers, researchers and teachers; to present the latest linguistic resources, Digital Humanities experiments and NLP applications developed with NooJ; to offer researchers and graduate students a tutorial to help them parse corpora and build NLP applications with NooJ.
ABOUT NOOJ
********************
NooJ is a linguistic development environment software as well as a corpus processor. NooJ provides linguists with tools to develop dictionaries, Regular Grammars, Context-Free Grammars, Context-Sensitive Grammars, as well as their graphical equivalents, to formalize various linguistic phenomena. NooJ’s multi-layer approach allows linguists to accumulate elementary descriptions across different linguistic levels.
NooJ is used as a corpus processor in the Digital Humanities as it allows researchers in the Social sciences to apply sophisticated queries to large corpora in real time, annotate texts automatically and perform various statistical analyses.
NooJ’s linguistic engine has been integrated into various NLP applications that perform automatic semantic annotation, Named Entities Recognition, Information extraction, Paraphrase Generation, Business Intelligence, Machine Translator, Web Semantics.
NooJ is a free open-source software promoted by the METASHARE European programme. It can run on Windows (C# .NET), macOS, LINUX and UNIX (Java). Its new engine and its source “RA” can be downloaded from GitLab and runs natively on Windows, macOS and LINUX.
TOPICS OF INTEREST
********************
* Linguistic Resources: Typography, Spelling, Syllabification, Phonemic and Prosodic Transcription, Morphology, Lexical Analysis, Local Syntax, Structural Syntax, Transformational Analysis, Paraphrase Generation, Semantic Annotations, Semantic Analysis.
* Digital Humanities: Corpus Linguistics, Discourse Analysis, Literature Studies, Second-Language Teaching, Narrative content analysis, Corpus processing for the Social Sciences.
* Natural Language Processing Applications: Business Intelligence, Text Mining, Text Generation. Language Teaching Software, Automatic Paraphrasing, Machine Translation, etc.
SUBMISSIONS
********************
We invite the submission of abstracts in English until January 15th, 2023. Abstracts should be between 300 and 600 words and submitted via Easy Abstract: http://linguistlist.org/easyabs/nooj2023. The scientific committee will review all proposals and authors will be given notice of acceptance of their papers no later than March 1st, 2023. All papers must be original and cannot simultaneously be presented to another journal or conference.
IMPORTANT DATES
********************
Abstract submission: January 15th, 2023
Notification of acceptance: March 1st, 2023
Camera-ready abstract submission: March 20th, 2023
Early Registration: until April 15th, 2023
Selected papers submission: September 13th, 2023
POST-PROCEEDINGS
********************
A selection of the papers presented at the NooJ 2023 will be published by Springer Verlag in their CCIS Series. Deadline for submission of full camera-ready papers is September 13th, 2023.
Meeting Location:
Zadar, Croatia
Contact Information:
Linda Mijić
nooj2023conf(a)gmail.com
Meeting Dates:
May 31st, 2023 to June 2nd, 2023
Abstract Submission Information:
Abstracts can be submitted from November 11th, 2022 until January 15th, 2023.