Dear colleagues,
We are a group of researchers from the Max Planck Institute for Human Development, Humboldt University of Berlin, and the University of Milano-Bicocca, collaborating on a research initiative to build shared infrastructure for psycholinguistic research.
We would like to invite contributions to PsychLing-101, a community-driven repository that collects psycholinguistic datasets in a unified format for both traditional analyses and evaluation of large language models.
Project overview
PsychLing-101 aims to build a curated corpus of psycholinguistic datasets stored in a standardized structure. The goal is to make datasets easier to find, reuse, and compare across studies, and to support more reproducible research in psycholinguistics and cognitive science.
To build a broad and useful resource, we welcome contributions from researchers across the field. We invite datasets of many types, from norming studies and behavioral experiments to eye-tracking and neuroimaging data, including both large-scale collections and smaller curated studies.
Repository and documentation: https://github.com/Data-X01/PsychLing-101
Ways to contribute
Researchers can contribute in several ways: • Submit a dataset following the instructions in the repository README: https://github.com/Data-X01/PsychLing-101/blob/main/README.md • Curate an existing study from the list in the CONTRIBUTING file: https://github.com/Data-X01/PsychLing-101/blob/main/CONTRIBUTING.md • Suggest a dataset or study by opening an issue in the repository
Outcome for contributors
All dataset contributors will be included as co-authors on an overview paper describing the repository and the standardized corpus.
Questions
For questions or assistance, please open a GitHub issue or contact: psychling101@gmail.com
If you think this initiative may be of interest to colleagues or students, we would greatly appreciate you sharing it within your networks.
Best wishes, Taisiia Tikhomirova & Dirk Wulff (on behalf of the PsychLing-101 team)