Dear all,
We are hiring a postdoc at Wageningen Food Safety Research (WFSR). Please
see the details at
https://www.wur.nl/en/vacancy/postdoc-datascience-and-machinelearning.htm.
The data science team at WFSR conducts research and develops tools using
advanced techniques and diverse data types. Here is a short list of our
activities: https://bigdata-wfsr.wur.nl/applications/ In addition to these
application,s explainability of machine learning models, federated
learning, natural language processing, generative artificial intelligence,
remote sensing, pest alarms, etc. are only a tiny subset of the techniques
we apply and research.
In addition to many internal projects, the main research projects we
collaborate with international partners are: HOLiFOOD (
https://holifoodproject.eu), ECO-Ready (https://www.eco-ready.eu), and EFRA
(https://efraproject.eu).
Please reach out with any questions and apply for this and any upcoming
possibilities (traineeships, project proposals, scientific programming and
data science opportunities)!
We are looking forward to hearing from you!
Best wishes,
Ali
[Apologies for crossposting]
*Global WordNet Conference 2025 - GWC2025*
The Global Wordnet Association is delighted to announce the *13th
International Global Wordnet Conference* (GWC2025), to be held in *Pavia
(Italy) from 27 to 31 January, 2025*. The GWC2025 conference will be hosted
by the Department of Humanities, at the University of Pavia.
đź“Ť*Dates*: 27-Jan-2025 - 31-Jan-2025
*Location*: Pavia, Italy
*Meeting Email*: gwc2025pavia(a)unipv.it
*Web Site*: https://unipv-larl.github.io/GWC2025/
🗓️ *Call Deadline*: 07-Oct-2024
We invite submissions of original research contributions addressing, though
not limited to, the topics listed below. *Presentations of new WordNets *will
be assigned to a dedicated panel. Additionally, proposals for tutorials and
demonstrations or panel discussions on *WordNet for ancient languages* are
encouraged.
Conference topics:
- Lexical semantics and meaning representation;
- Architecture of lexical databases;
- Tools and methods for WordNet development;
- Applications of WordNet;
- Standardization, distribution and availability of WordNet and WordNet
tools
See the full call for papers here: https://easychair.org/cfp/gwc2025
#dm4myth Workshop: Call for Papers
*** DEADLINE Extension: October 10th, 2024 ***
Venue: Computational Humanities Research Conference - CHR2024, (Aarhus,
Denmark)
Website: https://dm4myth.github.io/
Organizers: Dr. Franziska Pannach, Dr. Bruno Sartini
Important Dates
*NEW* Paper submission deadline: October 10th, 2024 (23:59 anywhere on
Earth)
Notification of Acceptance: November 1st, 2024
Camera-Ready Deadline: November 15th, 2024
Date: December 3, 2024 (Full day workshop)
Overview
We are excited to announce the First Workshop on Digital Methods for
Mythological Research (#dm4myth) taking place on December 3rd, 2024. The
workshop aims to bring together a network of researchers from various
disciplines and backgrounds who are passionate about mythological
narratives and their study using digital methods. This includes research
efforts such as the automatic or semi-automatic analysis and modeling of
mythological narratives, comparative efforts using digital tools, or the
study and representation of mythological characters.
#dm4myth is co-located with Computational Humanities Research (CHR2024,
Dec. 4th-6th), in Aarhus, Denmark. The workshop will consist of traditional
presentations by the participants, a networking and interest group portion
in the form of table discussions, and the kick-off of the #dm4myth network.
We welcome contributions from various humanities disciplines, such as (but
not limited to) Ancient Near Eastern Studies, Religious Studies, Classical
Studies/Classical Philology, Archaeology and Art History; as well as
technical approaches from Computer Scientists, including Semantic Web
specialists, Computational Linguists, Academic Data Scientists, and others.
Topics of interest include, but are not limited to:
-
Dataset construction, maintenance and development
-
Knowledge representation (ontologies, knowledge graphs, controlled
vocabularies) for the mythological domain
-
Application and development of NLP approaches for the study of
mythological plots and characters
-
Approaches for automatic character disambiguation
-
Visualization of plots and characters
-
Knowledge extraction from narrative source materials
-
Annotation of textual and/or visual narrative material (including
iconographic studies) in the domain
-
Creation, fine-tuning, reuse of (Large) Language Models for the analysis
and narrative modeling of mythological sources
Best Paper Award: The organizers and the programme committee will invite
the authors of the best paper to extend their work into a chapter in an
upcoming volume on Digital Mythological Studies.
Goals of the Workshop
#dm4myth aims to
1.
Facilitate knowledge exchange, including the development of best
practices, between researchers with different backgrounds who are
interested in applying and developing digital tools and methods for the
study of mythological narratives.
2.
Create a network of researchers with shared interests across disciplines
(e.g. Classics, Ancient Near Eastern Studies, Religious Studies, Computer
Science, Computational Linguistics).
3.
Pave the way for future interdisciplinary research on digital
mythological studies, and facilitate networking for researchers from
different backgrounds and disciplines.
Submissions
Papers should be submitted via the openreview
<https://openreview.net/group?id=computational-humanities-research.org/CHR/2…>
platform.
Types of Submissions:
Short Papers: 5-7 pages (excluding references)
Short papers include early-stage project results, work in progress,
negative results and critical reflections/tool criticism.
Long Papers: 7-10 pages (excluding references)
Long papers include full project reports, completed research, theoretical
reflections and original and unpublished results.
Submissions should be anonymised, written in English and must be formatted
according to the
<https://github.com/cohure/CHR2024-website/raw/main/data/chr2024_latex_templ…>Workshop
CEUR template <https://github.com/FPannach/dm4myth/> (instructions: here
<https://discourse.computational-humanities-research.org/t/chr-latex-instruc…>).
The workshop proceedings will be published as an open access version under
a Creative Commons License (CC BY 4.0
<https://creativecommons.org/licenses/by/4.0/deed.en>) in a suitable venue
(to be announced).
Workshop Principles
We welcome participants from all related disciplines and career stages. We
specifically invite colleagues from under-represented communities,
geographical, organizational or linguistic backgrounds and small
disciplines to submit. At least one author should attend the workshop in
person or virtually to present the work.
The content of the submissions should be written by human author(s), i.e.
substantial <https://ceur-ws.org/ACADEMIC-ETHICS.html> contributions to the
submission by artificial intelligence agents are not allowed and will
result in a rejection. The application of AI-assistants is allowed only for
light editing (e.g. spell-checking) of sections that are authored by
humans. In the interest of good scientific practice, the organizers
recommend the publication of data and code repositories under creative
commons (or comparable) licenses.
All submissions under-go peer review from at least two members of the
programme committee. We aim to provide at least one review each by a
domain-expert and a technical expert.
Programme Committee
Franziska Pannach, University of Groningen
Bruno Sartini, Ludwig-Maximilians-Universität München
Saskia Peels-Matthey, University of Groningen
Christian Zgoll, University of Göttingen
Robert Scott Smith, University of New Hampshire
Greta Hawes, Macquarie University
Jonathan Groß, Akademie der Wissenschaften zu Göttingen
Anke Tornow, design digitaler medien (ddm)
Valentina Pasqual, University of Bologna
Sebastian Barzaghi, University of Bologna
Fabio Mariani, Leuphana University of LĂĽneburg
Arianna Graciotti, University of Bologna
Sofia Baroncini, Leibniz Institute of European History
Gianmarco Spinaci, I Tatti, Harvard University Center for Italian
Renaissance Studies
Contact
For questions and comments, please contact the workshop organizers:
Franziska Pannach f[dot]a[dot]pannach[at]rug[dot]nl or Bruno Sartini
b[dot]sartini[at]lmu[dot]de
--
Dr. Franziska Pannach
Assistant Professor
Center for Language and Cognition (CLCG)
University of Groningen
https://www.rug.nl/staff/f.a.pannach/
*** Apologies for cross-postings ***
At the Institute for Computer Science (Prof. Dr. Alexander Mehler),
Department of Computer Science and Mathematics at Goethe University
Frankfurt, a position for a research assistant (m/f/d) (E 13 TV-G-U)
is available at the earliest possible date
research assistant (m/f/d)
(E 13 TV-G-U)
for a period of three years within the ENTAILab project – research
infrastructure and innovation lab. The salary scale is based on the
job characteristics of the collective agreement applicable to Goethe
University (TV-G-U).
The project is part of the priority program (SPP) New Data Spaces for
the Social Sciences, which is funded by the German Research Foundation
(DFG) (see https://www.new-data-spaces.de). The aim of the project is
to establish a research-oriented infrastructure for novel data in
survey research. To this end, a method-oriented innovation laboratory
for novel methods in survey research is to be set up, which will
develop and test methods of machine learning and artificial
intelligence in cooperation with the projects of the SPP. The subject
of the methods to be developed is multimodal data and thus not
primarily or exclusively linguistic research data.
You are expected to collaborate in the project and actively
participate in the workshops and events of the SPP. We are looking for
a highly qualified individual with a keen interest in working in the
field of cutting-edge research infrastructures and in the
team-oriented development and application of innovative,
research-oriented methods in the field of survey research and the
social sciences. With the SPP New Data Spaces for the Social Sciences
and the Text-Technology Lab, in which the position will be embedded,
we offer two research-strong, internationally oriented working
environments in the areas of computational humanities, multimodal
computing, machine learning and artificial intelligence. This also
includes financial resources for conference participation and
individual career development.
Requirements:
· Completed academic university degree (e.g. Master's) in a relevant
subject with a focus on information science
· Very good knowledge of English (C1)
· Proven experience in the field of databases and machine learning or
artificial intelligence methods
· Extensive programming knowledge in Java, Python or similar
· Knowledge of container technologies such as Docker, Kubernetes or similar
· An interest in social science issues is desirable.
Please send your application with the usual documents (cover letter,
CV, copies of certificates) electronically in a combined PDF document
by 08.10.2024 to Prof. Dr. Alexander Mehler: mehler(a)em.uni-frankfurt.de.
NAKBA-NLP 2025
The 1st International Workshop on Nakba Narratives as Language Resources
Part of the COLING-2025 [1] Conference
Abu Dhabi, UAE (Fully Virtual)
January 20, 2025
CALL FOR PAPERS
We invite submissions for Nakba-NLP 2025, a workshop dedicated to the
exploration and preservation of Nakba narratives through the application
of artificial intelligence, natural language processing, and corpus
linguistics. All submitted papers should explain their relevance to the
topic of 'Nakba Narratives as Language Resources'. The organisers
reserve the right to reject any papers that incite hatred, refute
established facts, or undermine the suffering of individuals.
We seek contributions on the following issues of interest:
* Digitisation of oral and written narratives
* Creation and labeling of language corpora and datasets
* Digital archives, metadata, and semantic/content mark-up
* Annotation tools and annotation guidelines
* Document classification, topic modeling, and information retrieval
* Named entity recognition for identifying people, places,
organizations, and events
* Entity linking and relationship extraction
* Event detection and event argument extraction
* Knowledge Graphs and Linked Data
* Vocabularies, dictionaries, and ontologies
* Data visualisation
* Knowledge representation
* Machine translation, summarisation, and paraphrasing
* Natural Language Generation
* Large Language Models
* Sentiment analysis and emotional content extraction
* Discourse analysis (e.g., bias, offensive language, and
misinformation) related to Nakba narratives
* Voice & dialogue-based systems; ASR
* Palestinian dialects (written and spoken)
Participants are invited to use the following archives: Institute for
Palestine Studies [2], The Palestinian Museum [3], Nakba-Archive [4],
POHA [5],Alhaq [6],ICHR [7], as well as Wikipedia and the Wikidata
Knowledge Graph.
SUBMISSION DETAILS
All submitted papers must clearly state and explain their relevance to
the topic of 'Nakba Narratives as Language Resources'. The organisers
reserve the right to reject any papers that incite hatred, refute
established facts, or undermine the suffering of individuals.
Submissions may be of two types:
* Long papers - up to eight (8) pages maximum, presenting substantial,
original, completed, and unpublished work.
* Short papers - up to four (4) pages, describing a small focused
contribution, negative results, system demonstrations, etc.
The workshop supports the COLING anti-harassment policy Policy. [8]
IMPORTANT DATES
* Submission Deadline: 25 November 2024
* Notifications of Acceptance: 5 December 2024
* Camera Ready Deadline: 13 December 2024 (cannot be changed).
Links:
------
[1] https://coling2025.org/
[2] https://www.palestine-studies.org/
[3] https://palmuseum.org/en
[4] https://www.nakba-archive.org/
[5] https://libraries.aub.edu.lb/poha/
[6] https://www.alhaq.org/
[7] https://www.ichr.ps/en
[8] https://coling2022.org/policy
SECOND CALL FOR PAPERS
CALL FOR PAPERS: THE 1ST WORKSHOP ON NLP FOR LANGUAGES USING ARABIC
SCRIPT (ABJADNLP 2025)
Co-located with COLING 2025 Conference, Abu Dhabi, UAE (19-20 January
2025)
Submission URL [1]
AbjadNLP is dedicated to advancing innovation and gaining deeper
insights into Natural Language Processing (NLP) for languages that use
the Arabic script. Our primary focus is on Abjad and Ajami languages
that utilise the Arabic script or its variations. Traditionally
associated with Semitic languages, Abjad scripts represent consonants in
every syllable. In contrast, Ajami scripts denote the alphabetic use of
the Arabic script in various African contexts, representing non-Arabic
languages. We are interested in research on languages that fall under
the Abjad or Ajami categories that use the Arabic script or any
variations of it.
We invite contributions, discussions, and explorations that delve deep
into the unique linguistic structures, resources, challenges, and
untapped potential presented by Abjad and Ajami languages within the
realm of NLP and language resources. Our goal is to create synergies
among researchers by addressing the diverse phenomena and challenges
inherent in these rich linguistic traditions.
The workshop is proud to highlight our connections with the Masakhane
NLP community and collaborations with institutions worldwide, such as
COMSATS on Urdu, and the long-standing UCREL NLP Group at Lancaster
University, whose work encompasses over 20 languages worldwide,
including Abjad and Ajami languages.
Note: We chose the name Abjad for simplicity, but our focus includes
Abjad and other languages that have adopted the Arabic and Perso-Arabic
scripts, as well as Ajami languages. We acknowledge that Sorani Kurdish,
when written in Arabic script, follows an alphabet style rather than an
Abjad style.
TOPICS OF INTEREST:
* Core Technologies: morphological analysis, disambiguation,
tokenisation, POS tagging, named entity detection, chunking, parsing,
semantic role labelling, sentiment analysis, language modelling, etc.
* Applications: machine translation, speech recognition, speech
synthesis, optical character recognition, assistive technologies, social
media, etc.
* Resources and Tools: dictionaries, annotated data, corpora,
orthography descriptions, font technology, glyph rendering, text input
methodologies, spell-checking, speech-to-text solutions, BLARK
descriptions, open access corpora.
* Cultural and Sociolinguistic Considerations: text processing,
transliteration challenges, and solutions, cultural contexts in NLP
applications.
SUBMISSION GUIDELINES:
We follow the COLING 2025 standards for submission format and
guidelines. Submissions should conform to the following types:
* Long papers: Up to eight (8) pages, presenting substantial,
original, completed, and unpublished work.
* Short papers: Up to four (4) pages, describing a small focused
contribution, negative results, system demonstrations, etc.
KEY DATES:
* 1st Call for Papers Announcement: 16 July 2024
* 2nd Call for Papers Announcement: 16 August 2024
* Paper Submission Deadline: 15 November 2024
* Notification of Paper Acceptance: 6 December 2024
* Camera-ready Paper Deadline: 13 December 2024
* Workshop Date: 19 or 20 January 2025
ORGANISING COMMITTEE:
General Chair: Mo El-Haj, Lancaster University
Programme Chairs:
* Hugh Paterson III, Collaborative Scholar
* Saad Ezzini, Lancaster University
* Ignatius Ezeani, Lancaster University
Review Committee:
* Mahum Hayat Khan, University of La Rioja
* Muhammad Sharjeel, COMSATS University Islamabad
Publication Chair: Sina Ahmadi, University of Zurich
Publicity Chairs:
* Cynthia Amol, Maseno University
* Amal Haddad Haddad, University of Granada
* Jaleh Delfani, University of Surrey
Advisory Committee:
* Ruslan Mitkov, Lancaster University
* Paul Rayson, Lancaster University
Links:
------
[1] https://softconf.com/coling2025/AbjadNLP25/
===============
===============
* We apologize if you receive multiple copies *
* For the online version of this Call, visit: https://cikm2024.org/
===============
CIKM 2024: 33rd ACM International Conference on Information and Knowledge Management
Boise, Idaho, USA
October 21–25, 2024
===============
The program of the 33rd ACM International Conference on Information and Knowledge Management (CIKM) has been announced.
--------------------------
Program at a glance
--------------------------
* Monday Oct. 21: Industry day - Tutorials
* Tuesday Oct. 22: Keynote - Applied research, full and resource papers - Short paper and demo posters
* Wednesday Oct. 23: Keynote - Applied research, full and resource papers - Banquet
* Thursday Oct. 24: Keynote - Applied research, full and resource papers - Closing
* Friday Oct. 25: Workshops - AnalytiCup - PhD Symposium
The full program can be accessed here: https://docs.google.com/document/d/1RG32LErl82m841_0Foq1SHRt-vtGoWND-aSWHWA…
NLP positions at the University of Alicante
PhD position in Natural Language Processing at the University of
Alicante
The Natural Language Processing (NLP) group at the University of
Alicante, one of the leading Spanish NLP research groups
(https://gplsi.dlsi.ua.es/grupo/), is advertising a PhD position
(investigador pre-doctoral) in NLP. The successful candidate will be
working a project which will compare the impact of various rule-based,
deep learning and large language models on different NLP applications.
The position is offered for 4 years and will carry a gross monthly
salary of EUR1,347.86. The successful candidate will receive 14
payments of the salary during each year of employment (or pro-rata if
less than full year).
We are seeking candidates with good NLP knowledge and excellent
programming skills.
The successful candidate will be required to start a process of having
his/her degree officially validated by the Spanish Government if they do
not have a Spanish degree or officially validated degree yet.
Prospective candidates can familiarise themselves with the following
calls which also outline the application procedure.
https://ssp.ua.es/es/selecinvest/03-personal-investigador/2024/pi84/convoca…https://euraxess.ec.europa.eu/jobs/275446
Non-extendable deadline for applications: 8 October 2024
For further information, please contact Alicia Picazo
(alicia.picazo(a)ua.es)
Research Assistant Natural Language Processing at the University of
Alicante
The Natural Language Processing (NLP) group at the University of
Alicante, one of the leading Spanish NLP research groups
(https://gplsi.dlsi.ua.es/grupo/), is advertising a research assistant
position (especialista) in NLP. The successful candidate will be working
a project which will compare the impact of various rule-based, deep
learning and large language models on different NLP applications. The
position is offered for 4 years and will carry a gross monthly salary of
EUR1,177.09. The successful candidate will receive 14 payments of the
salary during each academic year of employment (or pro-rata if less than
full year).
We are seeking candidates with NLP knowledge and good programming
skills.
Only candidates who have their degrees from Spanish universities or
whose degrees are already officially validated by the Spanish
Government, are eligible to apply for this position.
Prospective candidates can familiarise themselves with the following
calls which also outline the application procedure.
https://ssp.ua.es/es/selecinvest/04-personal-tecnico/2024/pas47/convocatori…https://euraxess.ec.europa.eu/jobs/275464
Non-extendable deadline for applications: 8 October 2024
For further information, please contact Alicia Picazo
(alicia.picazo(a)ua.es)
Apologies for cross-postings.
Book announcement: Python for Natural Language Processing
By Pierre Nugues
This book is available from Springer https://link.springer.com/book/10.1007/978-3-031-57549-5
and the notebooks from GitHub: https://github.com/pnugues/pnlp
Contents:
An Overview of Language Processing
A Tour of Python
Corpus Processing Tools
Encoding and Annotation Schemes
Python for Numerical Computations
Topics in Information Theory and Machine Learning
Linear and Logistic Regression
Neural Networks
Counting and Indexing Words
Word Sequences
Dense Vector Representations
Words, Parts of Speech, and Morphology
Subword Segmentation
Part-of-Speech and Sequence Annotation
Self-Attention and Transformers
Pretraining an Encoder: The BERT Language Model
Sequence-to-Sequence Architectures: Encoder-Decoders and Decoders