2nd CFP: Special Issue of Computational Linguistics: Language Learning, Representation, and Processing in Humans and Machines - Corpora

11 Sep 2023


      Second Call for Papers
Special Issue of Computational Linguistics: Language Learning, Representation, and Processing in Humans and Machines
** Guest Editors **
Marianna Apidianaki (University of Pennsylvania) 
Abdellah Fourtassi (Aix Marseille University)
Sebastian Padó (University of Stuttgart)
** NEW: Abstract submission deadline: November, 10 **
** Paper submission deadline: December, 10 **
Large language models (LLMs) acquire rich world knowledge from the data they are exposed to during training, in a way that appears to parallel how children learn from the language they hear around them. Indeed, since the introduction of these powerful models, there has been a general feeling among researchers in both NLP and cognitive science that a systematic understanding of how these models work and how they use the knowledge they encode, would shed light on the way humans acquire, represent, and process this same knowledge (and vice versa).
Yet, despite the similarities, there are important differences between machines and humans that have prevented a direct translation of insights from the analysis of LLMs to a deeper understanding of human learning. Chief among these differences is that the size of data required to train LLMs far exceeds -- by several orders of magnitude -- the data children need to acquire sophisticated conceptual structures and meanings. Besides, the engineering-driven architectures of LLMs do not appear to have obvious equivalents in children's cognitive apparatus, at least as studied by standard methods in experimental psychology. Finally, children acquire world knowledge not only via exposure to language but also via sensory experience and social interaction.
This edited volume aims to create a forum of exchange and debate between linguists, cognitive scientists and experts in deep learning, NLP and computational linguistics, on the broad topic of learning in humans and machines. Experts from these communities can contribute with empirical and theoretical papers that advance our understanding of this question. Submissions might address the acquisition of different types of linguistic and world knowledge. Additionally, we invite contributions that characterize and address challenges related to the mismatch between humans and LLMs in terms of the size and nature of input data, and the involved learning and processing mechanisms.
Topics include, but are not limited to:
• Grounded learning: comparison of unimodal (e.g., text) vs multimodal (e.g., images and video) learning.
   • Social learning: comparison of input-driven mechanisms vs. interaction-based learning.
   • Exploration of different knowledge types (e.g., procedural / declarative); knowledge integration and inference in LLMs.
   • Methods to characterize and quantify human-like language learning or processing in LLMs. 
   • Interpretability/probing methods addressing the linguistic and world knowledge encoded in LLM representations. 
   • Knowledge enrichment methods aimed at improving the quality and quantity of the knowledge encoded in LLMs.
   • Semantic representation and processing in humans and machines in terms of, e.g., abstractions made, structure of the lexicon, property inheritance and generalization, geometrical approaches to meaning representation, mental associations, and meaning retrieval.    
   • Bilingualism in humans and machines; second language acquisition in children and adults; construction of multi-lingual spaces and cross-lingual correspondences.
   • Exploration of language models that incorporate cognitively plausible mechanisms and reasonably-sized training data.
   • Use of techniques from other disciplines (e.g., neuroscience or computer vision) for analyzing and evaluating LLMs.
   • Open-source tools for analysis, visualization, or explanation.
Submission Instructions
** NEW **  Authors are strongly encouraged to submit a short (max 1 page) abstract of their paper by November 10. 
Abstracts will be sent to the Guest Editors (e-mails below). Minor modifications to the abstract will still be possible until final submission.
Papers should be formatted according to the Computational Linguistics style guidelines: http://cljournal.org/ 
We accept both long and short papers. Long papers are between 30 and 40 journal pages in length; short papers are between 15 and 25 pages in length.
Papers for this special issue will be submitted through the CL electronic submission system, just like regular papers: http://cljournal.org/submissions.html
Authors of special issue papers will need to select “Special Issue on LLPR” under the Journal Section heading in the CL submission system. 
Please note that papers submitted to a special issue undergo the same reviewing process as regular papers.
Timeline
Deadline for abstract submission: November, 10 2023
Deadline for paper submission: December, 10 2023
Notification after 1st round of reviewing: February, 10 2024
Revised versions of the papers: April, 30 2024
Final decisions: June, 10 2024
Final version of the papers: July, 1 2024
Inquiries
All inquiries should be directed to the guest editors of this special issue.
Guest Editors
Marianna Apidianaki
marapi@seas.upenn.edu mailto:marapi@seas.upenn.edu
Abdellah Fourtassi
abdellah.fourtassi@gmail.com mailto:abdellah.fourtassi@gmail.com
Sebastian Padó
pado@ims.uni-stuttgart.de mailto:pado@ims.uni-stuttgart.de
Reviewers
Afra Alishahi, Tilburg University 
Rachel Bawden, INRIA 
Philippe Blache, Aix-Marseille University, CNRS
Idan Blank, UCLA
Gemma Boleda, Universitat Pompeu Fabra
Marie-Catherine de Marneffe, UCLouvain, FNRS, The Ohio State University
Katrin Erk, University of Texas at Austin
Benoit Favre, Aix-Marseille University
Richard Futrell, UC Irvine
Aina Garí Soler, Télécom-Paris
Mario Giulianelli, University of Amsterdam
Gabriel Grand, MIT
Dieuwke Hupkes, META
Anna Ivanova, MIT
Jordan Kodner, Stony Brook University
Andrew Lampinen, DeepMind
Roger Levy, MIT
Tal Linzen, New York University (NYU)
Barbara Plank, LMU Munich
Christopher Potts, Stanford University
Veronica Qing Lyu, University of Pennsylvania
Okko Räsänen, Tampere University
Anna Rogers, IT University of Copenhagen
Thomas Schatz, Aix-Marseille University
Sebastian Schuster, Saarland University
Cory Shain, Stanford University
Jörg Tiedemann, University of Helsinki
Sean Trott, University of California, San Diego
Ivan Vuliç, University of Cambridge
Computational Linguistics is the longest-running flagship journal of the Association for Computational Linguistics. The journal has a high impact factor: 9.3 in 2022 and 7.778 in 2021. Average time to first decision of regular papers and full survey papers (excluding desk rejects) is 34 days for the period January to May 2023, and 47 days for the period January to December 2022.