- Event Notification Type: Call for Papers
- Abbreviated Title: [CFP 2nd] EACL 2024
- Location: Hotel Radisson Blu, St. Julians
Sunday, 17 March 2024 to Friday, 22 March 2024
- Country: Malta
- Contact Email:
michael.strube(a)h-its.org
YGRAHAM(a)tcd.ie
m.purver(a)qmul.ac.uk
- Contact:
Michael Strube
Yvette Graham
Matthew Purver
- Website: https://2024.eacl.org/
- Submission Deadline: Sunday, 15 October 2023
============================
* Second Call for Papers: EACL 2024
The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024) invites the submission of long and short papers on substantial, original, and unpublished research on Natural Language Processing. EACL 2024 will be held at the Hotel Radisson Blu, St. Julians, in Malta on 17th-22nd March 2024, with online attendance possible.
Papers must be submitted to EACL 2024 via the ACL Rolling Review (ARR) system. As in recent years, some of the presentations at the conference will be for papers accepted by the Transactions of the ACL (TACL) and Computational Linguistics (CL) journals.
* Important Dates
- Anonymity period begins: Friday, 15 September 2023
- Paper submission deadline (via ARR): Sunday, 15 October 2023
- Author response period: Friday-Tuesday, 8-12 December 2023
- Paper commitment deadline: Sunday, 20 December 2023
- Notification of acceptance: (long & short papers): Monday, 15 January 2024
- Withdrawal deadline (long & short papers): Monday, 22 January 2024
- Camera-ready papers due (long & short papers): Wednesday, 31 January 2024
- Workshops & Tutorials: Sunday; Thu-Fri, 17 & 21-22 March 2024
- Main Conference: Monday-Wed, 18-20 March 2024
All deadlines are 11.59 pm UTC -12h (“anywhere on Earth”).
============================
* Paper Submission Information
* Topics of Interest
EACL 2024 has the goal of a broad technical program. Relevant topics for the conference include, but are not limited to, the following areas (in alphabetical order):
- Computational Social Science and Cultural Analytics
- Dialogue and Interactive Systems
- Discourse and Pragmatics
- Efficient/Low-resource methods in NLP
- Ethics and NLP
- Generation
- Information Retrieval and Text Mining
- Information Extraction
- Interpretability and Model Analysis in NLP
- Language Grounding to Vision, Robotics and Beyond
- Linguistic Theories, Cognitive Modeling and Psycholinguistics
- Machine Learning for NLP
- Machine Translation
- Multilinguality and Language Diversity
- NLP Applications
- Phonology, Morphology, and Word Segmentation
- Question Answering
- Resources and Evaluation
- Semantics: Lexical
- Semantics: Sentence-level Semantics, Textual Inference and other areas
- Sentiment Analysis, Stylistic Analysis and Argument Mining
- Speech and Multimodality
- Summarization
- Syntax: Tagging, Chunking and Parsing
* Long Papers
Long paper submissions must describe substantial, original, completed and unpublished work. Wherever appropriate, concrete evaluation and analysis should be included. Long papers may consist of up to 8 pages of content, plus unlimited pages for references and appendices. Upon acceptance, long papers will be given one additional page of content (i.e. up to 9 pages) in the proceedings so that reviewers’ comments can be taken into account.
* Short Papers
Short paper submissions must describe original and unpublished work. Please note that a short paper is not a shortened long paper. Instead, short papers should have a point that can be made in a few pages. Short papers may consist of up to 4 pages of content, plus unlimited references and appendices. Upon acceptance, short papers will be given one additional page of content (i.e. up to 5 pages) in the proceedings so that reviewers’ comments can be taken into account.
* Findings of the ACL
Papers submitted to EACL 2024, but not selected for the main conference, will also automatically be considered for publication in the Findings of the Association of Computational Linguistics. Acceptance notifications for the main track and Findings will come out simultaneously.
* Presentation Mode
Long and short papers will be presented orally or as posters, as determined by the programme committee based on the nature rather than the quality of the work. While short papers will be distinguished from long papers in the proceedings, there will be no distinction in the proceedings between papers presented orally and as posters. Papers accepted to the Findings of the ACL may present a poster.
* Presentation Requirements
All accepted papers must be presented at the conference—either online or in-person—in order to appear in the proceedings. Authors of papers accepted for presentation at EACL 2024 must notify the program chairs by the withdrawal deadline if they wish to withdraw the paper. At least one author of each accepted paper must register for EACL 2024 by the early registration deadline.
* Paper Submission and Anonymity
Following standard ACL and ARR policy, submitted papers must be prepared for two-way anonymized review, and no deanonymized preprint may be posted in the month prior to submission. Please see the ARR CfP for more detail.
https://aclrollingreview.org/cfp
* Policies on Authorship, Citation and Ethics
EACL 2024 follows the ARR policies on authorship, citation and comparison and ethics - please see the ARR CfP.
* Multiple Submission Policy
EACL 2024 follows the ARR policy on multiple submission: we will not consider any paper that is under review in a journal or another conference at the time of submission, and submitted papers must not be submitted elsewhere during the review period. See the ARR CfP for more detail. Please note that the EACL 2024 submission deadline is currently timed to come after EMNLP 2023 decisions have been announced, and that EACL 2024 acceptance decisions will be announced before the likely submission deadline for ACL 2024, although after that for NAACL 2024.
* Mandatory Discussion of Limitations
We believe that it is also important to discuss the limitations of your work, in addition to its strengths. Following EACL 2023, EACL 2024 requires all papers to have a clear discussion of limitations, in a dedicated section titled “Limitations”. This section will appear at the end of the paper, after the discussion/conclusions section and before the references, and will not count towards the page limit. Papers without a limitations section will be automatically rejected without review. Papers resubmitted from previous ARR review rounds that did not include a limitations section must ensure that such a section is included in the EACL 2024 version.
While we are open to different types of limitations, just mentioning that a set of results have been shown for English only probably does not reflect what we expect. Mentioning that the method works mostly for languages with limited morphology, like English, is a much better alternative. In addition, limitations such as low scalability to long text, the requirement of large GPU resources, or other things that inspire further investigation are welcome.
The School of Informatics, University of Edinburgh, is thrilled to
announce a PhD scholarship funded by Google DeepMind.
The scholarship covers tuition fees (at the Home/International tuition
fee rate), provides an annual stipend of £18,622 per annum (for 3.5
years full time study) and provides a research training and support
grant. The student will be supervised by Dr. Mirella Lapata and will
also benefit from mentoring from DeepMind staff during their period of
study.
Applicants would be expected to work on a topic drawn from the
following research areas:
- multimodal natural language understanding and generation
- long-form and retrieval-augmented text generation
- multilingual generation
Applicants wishing to apply for the scholarship should meet one OR
both of the following criteria:
- are resident of a country and/or region underrepresented in AI;
- identify as women including cis and trans people and non-binary or
gender fluid people who identify in a significant way as women or
female;
- and/or identify as Black or other minority ethnicity.
The successful candidate will have a good honours degree or equivalent
in artificial intelligence, computer science, machine learning, or a
related discipline; or have a breadth of relevant experience in
industry/academia/public sector, etc. They will have strong
programming skills and previous experience in natural language
processing.
If you have further questions, please contact Dr. Mirella Lapata:
mlap(a)inf.ed.ac.uk.
To apply, please follow the instructions at:
http://www.inf.ed.ac.uk/postgraduate/apply.html
As your research area, please select "Informatics: ILCC: Language
Processing, Speech Technology, Information Retrieval, Cognition". On
the application form under "Research Project", please state "DeepMind
Scholarship".
IMPORTANT: After submitting your application through the website,
please email your applicant number to mlap(a)inf.ed.ac.uk.
Application deadline: 24 November 2023 received after
the deadline may be considered, but this cannot be guaranteed].
Call for Papers: North Africans in Machine Learning Affinity Workshop at
NeurIPS 2023!
We are thrilled to announce the Call for Papers for the North Africans in
Machine Learning Affinity Workshop, which will be held at NeurIPS 2023.
This is your chance to share your groundbreaking research, insights, and
discoveries with a vibrant community of peers in the field of Machine
Learning. Whether you're a junior researcher or a seasoned expert, and of
North African origins. if you have a passion for advancing the theory and
applications of ML, we want to hear from you!
Why submit your paper?
- Showcase your work on a prestigious stage.
- Gain valuable feedback from experts in the ML community.
- Connect with like-minded professionals from North African institutions
and beyond.
- Contribute to the collective knowledge of the ML field.
📜 Submission Guidelines:
- Papers related to all aspects of Machine Learning are welcome.
- Submissions from North Africans are encouraged.
- The workshop is open to academia and industry professionals.
🔗 Submission Link and Deadlines:
https://lnkd.in/eSKVv2H5
🏆 Awards and Recognition:
Outstanding contributions will be recognized, and selected papers may have
the opportunity to be featured prominently during the workshop.
Join us in making the North Africans in Machine Learning Affinity Workshop
at NeurIPS 2023 a resounding success! Submit your paper, share your
insights, and be part of this exciting journey in advancing the field of
Machine Learning.
Stay tuned for more updates and mark your calendars for NeurIPS 2023! Let's
shape the future of ML together.
(Apologies for cross-posting)
CFP: SYMPTEMIST Shared Task (BioCreative VIII run with AMIA 2023)
Named entity recognition and linking of symptoms, signs & findings (incl.
multilingual dataset)
https://temu.bsc.es/SYMPTEMIST/ <https://temu.bsc.es/distemist/>
The SYMPTEMIST track focuses on the automatic detection of mentions of
clinical symptoms (NER) and mapping to concept identifiers in clinical case
reports in Spanish (entity linking). Also a multilingual version of the
dataset will be released including versions in English, French, Italian,
Dutch, Portuguese, Romanian and Swedish.
Key information:
-
Web: https://temu.bsc.es/symptemist
-
Data: <https://doi.org/10.5281/zenodo.6408476>
https://zenodo.org/record/8223654
-
Annotation guidelines: https://zenodo.org/record/8246440
-
BioCreative web: https://biocreative.bioinformatics.udel.edu
-
Registration form (Track 2- SYMPTEMIST):
<https://temu.bsc.es/distemist/registration/>
https://docs.google.com/forms/d/e/1FAIpQLScoSNulOoxRju3c8v9Q-CSv-w5jJcXu93G…
Motivation
Systems able to detect and normalize clinical symptom mentions from medical
texts are crucial for almost any healthcare data mining, AI, medical
analytics or predictive application. As opposed to other clinical
information types, such as diagnoses (diseases/procedures), lab test
results or even medications, clinical symptoms can only be recovered
directly from written clinical narratives. Due to the high complexity,
variability and difficulty in generating annotated corpora for clinical
symptoms, only few large manually annotated data collections have been
constructed so far, with certain underlying limitations in terms of a)
entity linking / normalization of the symptom mentions to controlled
vocabularies and b) a lack of attempts to promote the development of
multilingual solutions and b) provide detailed annotation criteria and
guidelines. To address these issues, we have posed the SYMPTEMIST track at
the upcoming BioCreative VIII initiative, which will be run in the context
of the prestigious AMIA 2023 conference, which received over 1400
submissions this year.
Automatic detection of symptoms mentions are key for a range of clinical
use cases and real world applications like:
-
Predictive modeling of diseases
-
Differential diagnosis of complex diseases
-
Rare disease characterization & analysis
-
Selection of appropriate treatment & therapy
-
Study of disease-symptom associations
-
Early detection of disease outbreaks & epidemiological surveillance
-
Extraction of phenotypes
-
Drug repurposing & off label indications
The SYMPTEMIST organizers will also release multilingual resources to
foster the development of multilingual tools and generate systems not only
for Spanish but also for content in English and Romance languages (French,
Portuguese, Italian, Romanian and Catalan) as well as versions in Dutch,
Swedish and Czech.
Inspired by previous initiatives (e.g. n2c2, CLEF or TREC) and shared tasks
(CANTEMIST, PharmaCoNER, or CodiEsp), we are launching the SYMPTEMIST
shared task as part of the BioCreative 2023 evaluation initiative, with the
following three sub-tracks:
-
SYMPTEMIST-entities: automatic detection of mentions of symptoms.
-
SYMPTEMIST-linking: finding mentions of symptoms and normalizing them to
their Snomed-CT concept identifiers.
-
SYMPTEMIST-multilingual: automatic detection of mentions of symptoms in
versions of the corpus generated in English, French, Italian, Portuguese,
Romanian, Catalan, Dutch, Swedish and Czech.
Tentative schedule
-
Annotation Guidelines: August 8th 2023
-
Train Set Subtask 1 (NER): August 8th, 2023
-
Train Set Subtask 2 (Linking): September 10th 2023
-
Train Set Subtask 3 (Multilingual): September 10th 2023
-
SympTEMIST Test Set: September 30th 2023
-
Participants Test Predictions Deadline: October 5th 2023
-
Participants Evaluation Results Release. October 10th 2023
-
Submission of Participant Papers Deadline: October 22nd 2023
-
Notification of Acceptance Participant Papers: October 30 2023
-
Submission of Camera-ready Participant Papers Deadline. November 1st 2023
-
BioCreative VIII workshop @ AMIA 2023: November 11-15, 2023, In New
Orleans, LA.
BioCreative proceedings and AMIA workshop
Teams participating in SYMPTEMIST will be invited to contribute a systems
description paper for the BioCreative 2023 Working Notes proceedings and a
flash presentation of their approach at the BioCreative 2023 session. The
BioCreative VIII workshop will run with AMIA 2023, November 11-15, 2023, In
New Orleans, LA. See:
https://amia.org/education-events/amia-2023-annual-symposium
Workshop Proceedings and Special Issue:
The BioCreative VIII Proceedings will host all the submissions from
participating teams, and it will be freely available by the time of the
workshop. In addition, we are happy to announce that the journal Database
will host the BioCreative VIII special issue for work that has passed their
peer-review process. Invitation to submit will be sent after the workshop.
All BioCreative VIII tracks
Track 1: BioRED (Biomedical Relation Extraction Dataset)
*Track 2: SYMPTEMIST (Symptom TExt Mining Shared Task)
Track 3: Genetic Phenotype Extraction and Normalization from Dysmorphology
Physical Examination Entries
Track 4: Clinical Annotation Tool Track
Main Organizers
-
Martin Krallinger, Barcelona Supercomputing Center, Spain
-
Eulàlia Farré-Maduell, Barcelona Supercomputing Center, Spain
-
Luis Gascó, Barcelona Supercomputing Center, Spain
-
Salvador Lima, Barcelona Supercomputing Center, Spain
-
Jan Rodriguez, Barcelona Supercomputing Center, Spain
=======================================
Martin Krallinger, Dr.
Head of NLP for Biomedical Information Analysis Unit
Barcelona Supercomputing Center (BSC-CNS)
https://www.linkedin.com/in/martin-krallinger-85495920/
=======================================
We are innovating university! Interdisciplinary, international and digital – these are the pillars of the University of Technology Nuremberg. The aim is to combine and interlink engineering science with other topics of society. Besides the outlined interdisciplinary approach, the university will put its emphasis on courses in English, on digital learning as well as future-oriented research. In the medium term, the university is to provide a place for learning and personal development for up to 6,000 students – on a campus combining research, learning, and living. The project is currently one of the most important higher education projects of the Free State of Bavaria (Germany).
The University of Technology Nuremberg is looking to fill, at the earliest possible date, a position as a
Professor (m/f/d) (W3) of Natural Language Processing
at the Department of Engineering.
You represent the subject Natural Language Processing in research and teaching. You will play a key role in establishing the first main research area of the Department, which will focus on "Robotics and Artificial Intelligence". Here you will collaborate, among others, with excellent international researchers from the fields of Robotics, Machine Learning, Data Science and Computer Vision. The interdisciplinary collaboration with humanities scholars and social and natural scientists at the Department of Liberal Arts and Sciences, within the topics of “Human and Artificial Intelligence” and “Rhetoric and Political Communication” is a further goal.
To find the complete advertisement follow this link: https://www.utn.de/en/career/professorships/
CoCo4MT is extended its deadline for paper submission to July 16th!
The Second Workshop on Corpus Generation and Corpus Augmentation for
Machine Translation (CoCo4MT) @MT-SUMMIT XIX
The 19th Machine Translation Summit
Sep 4-8, 2023, Macau SAR, China
https://sites.google.com/view/coco4mt
SCOPE
It is a well-known fact that machine translation systems, especially
those that use deep learning, require massive amounts of data. Several
resources for languages are not available in their human-created format.
Some of the types of resources available are monolingual, multilingual,
translation memories, and lexicons. Those types of resources are
generally created for formal purposes such as parliamentary collections
when parallel and more informal situations when monolingual. The quality
and abundance of resources including corpora used for formal reasons is
generally higher than those used for informal purposes. Additionally,
corpora for low-resource languages, languages with less digital
resources available, tends to be less abundant and of lower quality.
CoCo4MT is a workshop centered around research that focuses on manual
and automatic corpus creation, cleansing, and augmentation techniques
specifically for machine translation. We accept work that covers any
language (including sign language) but we are specifically interested in
those submissions that explicitly report on work with languages with
limited existing resources (low-resource languages). Since techniques
from high-resource languages are generally statistical in nature and
could be used as generic solutions for any language, we welcome
submissions on high-resource languages also.
CoCo4MT aims to encourage research on new and undiscovered techniques.
We hope that the methods presented at this workshop will lead to the
development of high-quality corpora that will in turn lead to
high-performing MT systems and new dataset creation for multiple
corpora. We hope that submissions will provide high-quality corpora that
are available publicly for download and can be used to increase machine
translation performance thus encouraging new dataset creation for
multiple languages that will, in turn, provide a general workshop to
consult for corpora needs in the future. The workshop’s success will be
measured by the following key performance indicators:
- Promotes the ongoing increase in quality of machine translation
systems when measured by standard measurements,
- Provides a meeting place for collaboration from several research areas
to increase the availability of commonly used corpora and new corpora,
- Drives innovation to address the need for higher quality and abundance
of low-resource language data.
Topics of interest include:
- Difficulties with using existing corpora (e.g., political
considerations or domain limitations) and their effects on final MT
systems,
- Strategies for collecting new MT datasets (e.g., via crowdsourcing),
- Data augmentation techniques,
- Data cleansing and denoising techniques,
- Quality control strategies for MT data,
- Exploration of datasets for pretraining or auxiliary tasks for
training MT systems.
SHARED TASK
To encourage research on corpus construction for low-resource machine
translation, we introduce a shared task focused on identifying
high-quality instances that should be translated into a target
low-resource language. Participants are provided access to multi-way
corpora in the high-resource languages of English, Spanish, German,
Korean, and Indonesian, and using these, are required to identify
beneficial instances, that when translated into the low-resource
languages of Cebuano, Gujarati, and Burmese, lead to high-performing MT
systems. More details on data, evaluation and submission can be found on
the website (https://sites.google.com/view/coco4mt/shared-task) or by
emailing coco4mt-shared-task(a)googlegroups.com.
SUBMISSION INFORMATION
CoCo4MT will accept research, review, or position papers. The length of
each paper should be at least four (4) and not exceed ten (10) pages,
plus unlimited pages for references. Submissions should be formatted
according to the official MT Summit 2023 style templates
(https://www.overleaf.com/latex/templates/mt-summit-2023-template/knrrcnxhkq…).
Accepted papers will be published in the MT Summit 2023 proceedings
which are included in the ACL Anthology and will be presented at the
conference either orally or as a poster.
Submissions must be anonymized and should be made to the workshop using
the Softconf conference management system
(https://softconf.com/mtsummit2023/CoCo4MT). Scientific papers that have
been or will be submitted to other venues must be declared as such, and
must be withdrawn from the other venues if accepted and published at
CoCo4MT. The review will be double-blind.
We would like to encourage authors to cite papers written in ANY
language that are related to the topics, as long as both original
bibliographic items and their corresponding English translations are
provided.
Registration will be handled by the main conference. (To be announced)
IMPORTANT DATES
May 18, 2023 - Call for papers released
May 19, 2023 - Shared task release of train, dev and test data
May 25, 2023 - Shared task release of baselines
June 5, 2023 - Second call for papers
June 20, 2023 - Third and final call for papers
July 16, 2023 - Paper submissions due
July 16, 2023 - Shared task deadline to submit results
July 27, 2023 - Notification of acceptance
July 27, 2023 - Shared task system description papers due
August 03, 2023 - Camera-ready due
September 4-5, 2023 - CoCo4MT workshop
CONTACT
CoCo4MT Workshop Organizers:
coco4mt-2023-organizers(a)googlegroups.com
CoCo4MT Shared Task Organizers:
coco4mt-shared-task(a)googlegroups.com
ORGANIZING COMMITTEE (listed alphabetically)
Ananya Ganesh University of Colorado Boulder
Constantine Lignos Brandeis University
John E. Ortega Northeastern University
Jonne Sälevä Brandeis University
Katharina Kann University of Colorado Boulder
Marine Carpuat University of Maryland
Rodolfo Zevallos Universitat Pompeu Fabra
Shabnam Tafreshi University of Maryland
William Chen Carnegie Mellon University
PROGRAM COMMITTEE (listed alphabetically tentative)
Abteen Ebrahimi University of Colorado Boulder
Adelani David Saarland University
Ananya Ganesh University of Colorado Boulder
Alberto Poncelas ADAPT Centre at Dublin City University
Anna Currey Amazon
Amirhossein Tebbifakhr University of Trento
Atul Kr. Ojha National University of Ireland Galway
Ayush Singh Northeastern University
Barrow Haddow University of Edinburgh
Bharathi Raja Chakravarthi National University of Ireland Galway
Beatrice Savoldi University of Trento
Bogdan Babych Heidelberg University
Briakou Eleftheria University of Maryland
Constantine Lignos Brandeis University
Dossou Bonaventure Mila Quebec AI Institute
Duygu Ataman New York University
Eleftheria Briakou University of Maryland
Eleni Metheniti Université Toulosse - Paul Sabatier
Jasper Kyle Catapang University of Birmingham
John E. Ortega Northeastern University
Jonne Sälevä Brandeis University
Kalika Bali Microsoft
Katharina Kann University of Colorado Boulder
Kochiro Watanabe The University of Tokyo
Koel Dutta Chowdhury Saarland University
Liangyou Li Huawei
Manuel Mager University of Stuttgart
Maria Art Antonette Clariño University of the Philippines Los Baños
Marine Carpuat University of Maryland
Mathias Müller University of Zurich
Nathaniel Oco De La Salle University
Niu Xing Amazon
Patrick Simianer Lilt
Rico Sennrich University of Zurich
Rodolfo Zevallos Universitat Pompeu Fabra
Sangjee Dondrub Qinghai Normal University
Santanu Pal Saarland University
Sardana Ivanova University of Helsinki
Shantipriya Parida Silo AI
Shiran Dudy Northeastern University
Surafel Melaku Lakew Amazon
Tommi A Pirinen University of Tromsø
Valentin Malykh Moscow Institute of Physics and Technology
Xing Niu Amazon
Xu Weijia University of Maryland
Dear all,
We are organising a free training event (online and in person): Language Data Analysis for Business and Professional Communication.
It will take place on 22 September 2023 10:00 - 15:30 UK time.
More details and registration: https://www.lancaster.ac.uk/events/language-data-analysis-for-business-and-…
The ESRC Centre for Corpus Approaches to Social Science, Lancaster University offers a practical training workshop focused on computational analysis of language data for businesses and professional organisations and anyone interested in communication in professional contexts. The data includes social media, newspapers, business reports, marketing materials and other data sources.
The workshop will introduce a new software tool #LancsBox X<https://lancsbox.lancs.ac.uk/> developed at Lancaster University, which can analyse and visualise large amounts of language data (millions and billions of words). Practical examples of uses of #LancsBox X (case studies) will be provided.
Best,
Vaclav
Professor Vaclav Brezina
Professor in Corpus Linguistics
Department of Linguistics and English Language
ESRC Centre for Corpus Approaches to Social Science
Faculty of Arts and Social Sciences, Lancaster University
Lancaster, LA1 4YD
Office: County South, room C05
T: +44 (0)1524 510828
[cid:image001.png@01D9DF47.CD8CA940]@vaclavbrezina
[cid:image002.png@01D9DF47.CD8CA940]<http://www.lancaster.ac.uk/arts-and-social-sciences/about-us/people/vaclav-…>
Dear all,
Please find below the call for papers for next year’s Web Science
Conference.
Best regards,
Agnieszka Faleńska
*Call for Papers <https://websci24.org/call-for-papers/>*
ACM WebSci’24 ● 16th ACM Web Science Conference
May 21 – May 24, 2024 ● Stuttgart, Germany
*Reflecting on the Web, AI, and Society*
*https://websci24.webscience.org/* <https://websci24.webscience.org/>
*Important Dates*
- *Thu, November 30, 2023: Paper submission deadline*
- *Wed, January 31, 2024: Notification*
- *Thu, February 29, 2024: Camera-ready versions due*
- *Tue-Fri, May 21 – May 24, 2024: Conference dates*
*All dates are 23:59 **Anywhere on earth time*
<https://time.is/Anywhere_on_Earth>
*About the Web Science Conference*
Web Science is an interdisciplinary field dedicated to understanding the
complex and multiple impacts of the Web on society and vice versa. The
discipline is well situated to address pressing issues of our time by
incorporating various scientific approaches. We welcome quantitative,
qualitative, and mixed methods research, including social sciences and
computer science techniques. In addition, we are interested in work
exploring Web-based data collection and research ethics. We also encourage
studies that combine analyses of Web data and other types of data (e.g.,
from surveys or interviews) and help better understand user behavior online
and offline.
*Possible topics across methodological approaches and digital contexts
include but are not limited to:*
*Understanding the Web*
- Automation and AI in all its manifestations relevant to the Web
- Trends in globalization, fragmentation, and polarization of the Web
- The architecture and philosophy of the Web
- Critical analyses of the Web and Web technologies
Making the Web Inclusive
- Issues of discrimination and fairness
- Intersectionality and design justice in questions of marginalization
and inequality
- Ethical challenges of technologies, data, algorithms, platforms, and
people on the Web
- Safeguarding and governance of the Web, including anonymity, security,
and trust
- Inclusion, literacy and the digital divide
*The Web and Society*
- Social machines, crowd computing and collective intelligence
- Web economics, social entrepreneurship, and innovation
- Legal issues, including rights and accountability for AI actors
- Humanities, arts, and culture on the Web
- Politics and social activism on the Web
- Online education and remote learning
- Health and well-being online
- The role of the Web in the future of (augmented) work
- The Web as a source of news and information, and misinformation
*Doing Web Science*
- Data curation, Web archives and stewardship in Web Science
- Temporal and spatial dimensions of the Web as a repository of
information
- Analysis and modeling of human vs. automatic behavior (e.g., bots)
- Analysis of online social and information networks
- Detecting, preventing and predicting anomalies in Web data (e.g., fake
content, spam)
*2024 Emphasis: Reflecting on the Web, AI, and Society*
In addition to the topics at the heart of Web Science, we also welcome
submissions addressing the interplay between the Web, AI and society. New
advances in AI are revolutionizing the way in which people use the Web and
interact through it. As these technologies develop, it is crucial to
examine their effect on society and the socio-technical environment in
which we find ourselves. We are nearing the crossroads wherein content on
the Web will increasingly be automatically generated, blended with that
created by humans. This creates new potential yet brings new challenges and
exacerbates existing ones in relation to data quality and misinformation.
Additionally, we need to consider the role of the Web as a source of data
for AI, including privacy and copyright concerns, as well as bias and
representativity of resulting systems. The potential impact of new AI tools
on the nature of work may bring a transformation of some careers while
creating whole new ones. This year’s conference especially encourages
contributions documenting different uses of AI in relation to how people
use the Web, and in the ways the Web affects the creation and deployment of
AI tools.
*Format of the submissions*Please upload your submissions via EasyChair:
https://easychair.org/conferences/?conf=acmwebsci24
*There are two submission formats.*
- Full papers should be between 6 and 10 pages (including references,
appendices, etc.). Full papers typically report on mature and completed
projects.
- Short papers should be up to 5 pages (including references,
appendices, etc.). Short papers will primarily report on high-quality
ongoing work not mature enough for a full-length publication.
*All accepted submissions will be assigned an oral presentation (of two
different lengths).*
All papers should adopt the current ACM SIG Conference proceedings template
(acmart.cls). Please submit papers as PDF files using the ACM template,
either in Microsoft Word format (available at
https://www.acm.org/publications/proceedings-templateunder “Word Authors”)
or with the ACM LaTeX template on the Overleaf platform, which is available
at
https://www.overleaf.com/latex/templates/association-for-computing-machiner…
. In particular; please ensure that you are using the two-column version of
the appropriate template.
All contributions will be judged by the Program Committee upon rigorous
peer review standards for quality and fit for the conference by at least
three referees. Additionally, each paper will be assigned to a Senior
Program Committee member to ensure review quality.
WebSci-2024 review is double-blind. Therefore, please anonymize your
submission: do not put the author(s) names or affiliation(s) at the start
of the paper, and do not include funding or other acknowledgments in papers
submitted for review. References to authors’ own prior relevant work should
be included but should not specify that this is the authors’ own work. It
is up to the authors’ discretion how much to further modify the body of the
paper to preserve anonymity. The requirement for anonymity does not extend
outside of the review process, e.g., the authors can decide how widely to
distribute their papers over the Internet. Even in cases where the author’s
identity is known to a reviewer, the double-blind process will serve as a
symbolic reminder of the importance of evaluating the submitted work on its
own merits without regard to the authors’ reputation.
For authors who wish to opt-out of publication proceedings, this option
will be made available upon acceptance. This will encourage the
participation of researchers from the social sciences that prefer to
publish their work as journal articles. All authors of accepted papers
(including those who opt out of proceedings) are expected to present their
work at the conference.
*ACM Policies <https://websci24.org/acm-policy-2/>*
1. “By submitting your article to an ACM Publication, you are hereby
acknowledging that you and your co-authors are subject to all ACM
Publications Policies
<https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.acm.o…>,
including ACM’s new Publications Policy on Research Involving Human
Participants and Subjects
<https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.acm.o…>.
Alleged violations of this policy or any ACM Publications Policy will be
investigated by ACM and may result in a full retraction of your paper, in
addition to other potential penalties, as per ACM Publications Policy.”
2. “Please ensure that you and your co-authors obtain an ORCID ID
<https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Forcid.org…>,
so you can complete the publishing process for your accepted paper. ACM
has been involved in ORCID from the start and we have recently made
a commitment
to collect ORCID IDs from all of our published authors
<https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fauthors.a…>.
The collection process has started and will roll out as a requirement
throughout 2022. We are committed to improve author discoverability,
ensure proper attribution and contribute to ongoing community efforts
around name normalization; your ORCID ID will help in these efforts.”
*Program Committee Chairs:*
*Oshani Seneviratne (Rensselaer Polytechnic Institute)*
*Luca Maria Aiello (IT University of Copenhagen)*
*Yelena Mejova (ISI Foundation)*
For any questions and queries regarding the paper submission, please
contact the chairs at acmwebsci24(a)easychair.org.
Monthly online ILFC Seminar: interactions between formal and computational
linguistics
https://gdr-lift.loria.fr/monthy-online-ilfc-seminar/
GdR LIFT is happy to announce the four forthcoming sessions of the ILFC
seminar on the interactions between formal and computational linguistics:
- 2023/09/13 17:00-18:00 UTC+2: *Anna Ivanova* (Massachusetts Institute
of Technology; 11:00-12:00 UTC-4)
Title: *Dissociating formal and functional linguistic competence in
large language models*
Abstract: *Today’s large language models (LLMs) routinely generate
coherent, grammatical and seemingly meaningful paragraphs of text. This
achievement has led to speculation that LLMs have become “thinking
machines”, capable of performing tasks that require reasoning and/or world
knowledge. In this talk, I will introduce a distinction between formal
competence—knowledge of linguistic rules and patterns—and functional
competence—understanding and using language in the world. This distinction
is grounded in human neuroscience, which shows that formal and functional
competence recruit different cognitive mechanisms. I will show that the
word-in-context prediction objective has allowed LLMs to essentially master
formal linguistic competence; however, LLMs still lag behind at many
aspects of functional linguistic competence, and improvements in this
domain often depend on specialized fine-tuning or coupling with an external
module. In the last part of the talk, I will present a case study
highlighting the difficulties of disentangling formal and functional
competence when it comes to evaluating world knowledge, and show that
similar difficulties are present in neuroscience research. I will conclude
by discussing the value of the formal/functional competence framework for
evaluating and building flexible, humanlike models of language use.*
- 2023/10/18 17:00-18:00 UTC+2: *Alexander Koller* (Saarland University)
Title: [TBA]
Abstract: [TBA]
- 2023/11/15 17:00-18:00 UTC+1: *Raquel Fernández* (University of
Amsterdam)
Title: [TBA]
Abstract: [TBA]
- 2023/12/13 17:00-18:00 UTC+1: *Richard Futrell* (UC Irvine; 8:00-9:00
UTC-8)
Title: [TBA]
Abstract: [TBA]
The seminar is held on Zoom. To attend the seminar and get updates, please
subscribe to our mailing list (we now only rarely communicate through other
mailing lists): https://sympa.inria.fr/sympa/subscribe/seminaire_ilfc
(apologies, German version only as this position requires German language skills)
In der Fakultät für Elektrotechnik, Informatik und Mathematik - Institut für Informatik / Fachgebiet Natural Language Processing - ist zum nächstmöglichen Zeitpunkt eine Stelle als
wissenschaftliche*r Mitarbeiter*in (w/m/d)
(Entgeltgruppe 13 TV-L) im Umfang von 100% der regelmäßigen Arbeitszeit zu besetzen. Es handelt sich um eine Qualifizierungsstelle im Sinne des Wissenschaftszeitvertragsgesetzes (WissZeitVG), die zur Förderung eines Promotionsverfahrens im Bereich Natural Language Processing dient. Die Stelle ist befristet für die Dauer des Promotionsverfahrens, abhängig von der bisher erreichten Qualifizierung, jedoch für einen Zeitraum von i.d.R. 3 Jahren, zu besetzen. Eine Verlängerung zum Abschluss der Promotion ist innerhalb der Befristungsgrenzen des WissZeitVG ggf. möglich.
Aufgabengebiet:
* Eigenverantwortliche Forschung in den Bereichen "Privacy in Natural Language Processing" oder "Legal Natural Language Processing"
* Lehrverpflichtung im Umfang von i.d.R. 4 SWS
* Beteiligung an der Selbstverwaltung
Einstellungsvoraussetzungen:
* Wissenschaftlicher Hochschulabschluss (Master in Computer Science oder einer verwandten Disziplin)
* Solide Kenntnisse im Bereich des maschinellen Lernens und der Verarbeitung natürlicher Sprache
* Starkes Interesse am Thema “Privacy in Natural Language Processing” oder "Legal Natural Language Processing
* Ausgezeichnete analytische und programmiertechnische Fähigkeiten
* Teamfähigkeit
* Fließende Sprachkenntnisse in Englisch und Deutsch
Bewerbungen von Frauen sind ausdrücklich erwünscht und werden gem. LGG bei gleicher Eignung, Befähigung und fachlicher Leistung bevorzugt berücksichtigt, sofern nicht in der Person eines Mitbewerbers liegende Gründe überwiegen. Teilzeitbeschäftigung ist grundsätzlich möglich. Ebenso ist die Bewerbung geeigneter Schwerbehinderter und Gleichgestellter im Sinne des Sozialgesetzbuches Neuntes Buch (SGB IX) erwünscht.
Bewerbungen mit den üblichen Unterlagen werden als PDF-Dokument per E-Mail unter Angabe der Kennziffer 6082 "PhD-NLP" bis zum 24.09.2023 erbeten an: habernal(a)mail.uni-paderborn.de. Als Ansprechpartner steht Prof. Habernal unter der oben genannten E-Mail-Adresse zur Verfügung. Informationen zur Verarbeitung Ihrer personenbezogenen Daten finden Sie unter: https://www.uni-paderborn.de/zv/personaldatenschutz.