*** Apologies for cross-posting ***
++ CALL FOR WORKSHOP PROPOSALS ++
****************************************************************************
45th European Conference on Information Retrieval
April 2nd – 6th April 2023 – Dublin, Ireland
Website: https://ecir2023.org/
****************************************************************************
++ Important Dates ++
- Submission deadline: September 19th, 2022
- Acceptance Notification Date: October 9th, 2022
- Workshops day: April 2nd, 2023
++ Overview ++
ECIR 2023 workshops provide a platform for presenting novel ideas and research results in emerging areas in IR in a more focused and interactive way than the conference itself. Workshops can be either a half-day (3.30 hours plus breaks) or a full day (7 hours plus breaks) and are to be onsite. At least one organizer is expected to attend the workshop.
++ List of Topics ++
ECIR 2023 encourages the submission of workshops on the theory, experimentation, and practice of retrieval, representation, management, and usage of textual, visual, audio, and multi-modal information, but proposals aligned with other topics of IR (namely those identified in the general call for papers) are highly welcome as well.
Relevant topics include, but are not limited to:
* User aspects, including information interaction, contextualisation, personalisation, simulation, characterisation, and behaviours.
* System and foundational aspects, including retrieval models and architectures, content analysis and classification, recommendation algorithms, query processing and ranking, efficiency and scalability.
* Machine learning, deep learning, neural IR, natural language processing, as applied to textual, visual, audio, and multi-modal information.
* Applications such as web search, recommender systems, web and social media apps, domain-specific search, enterprise search, novel interfaces to search tools, intelligent search, academic search, and conversational agents.
* Evaluation research, including new measures and novel methods for measuring and evaluating systems, datasets, users, and/or applications.
* Cross-disciplinary workshops, including IR and other domains such as NLP, data science, etc., are also particularly welcome.
++ Submission Guidelines ++
Workshop proposals should contain the following information:
* Title and abstract of the workshop;
* Motivation and relevance to ECIR;
* Workshop goals/objectives and overall vision, coupled with desired outcomes;
* Format and Structure, in particular duration of the workshop (full-day or half-day workshop); mention to the type of papers (e.g., full papers, demo papers, negative papers, etc); type of presentation (e.g., oral; poster, etc); and proceedings (e.g., CEUR; Special Issue, etc); planned activities, the tentative schedule of events etc.; resources needed to deliver the workshop (e.g., poster boards, etc);
* Intended audience, including number of expected participants and how they will be selected/invited;
* List of organisers with a brief bio of each with respect to the content of the workshop;
* Names of potential programme committee members, invited speakers; etc
* Indicate if the workshop is related to or follows on from another workshop; if so, please, identify which conference it was previously held at, the past attendance and outcomes, and why another workshop is needed;
* Any other relevant information to support your proposal.
Workshop proposals should be prepared using Springer proceedings templates to be found on Springer webpage (https://www.springer.com/gp/computer-science/lncs/conference-proceedings-gu…<https://www.springer.com/gp/computer-science/lncs/conference-proceedings-gu…)>), with a maximum length of 8 pages. All proposals should be submitted electronically through the conference submission system (https://easychair.org/conferences/?conf=ecir23) and must be in English. Workshop proposals will be reviewed by the ECIR 2023 workshop committee based on the quality of their proposal, covered topics, relationship to ECIR and likelihood to attract participants. Final decisions will be made by the ECIR workshop co-chairs.
++ Workshop Chairs ++
Ricardo Campos, Polytechnic Institute of Tomar and INESC TEC, Portugal
Gianmaria Silvello, University of Padua, Italy
++ Contacts ++
For further information, please contact the ECIR 2023 Workshop chairs by email to ecir2023-workshop(a)easychair.org<mailto:ecir2023-workshop@easychair.org>
CALL FOR PAPERS
The Northern European Journal of Language Technology (NEJLT) invites
submissions of excellent research papers and letters on language
technology. NEJLT is a global journal that publishes peer-reviewed language
technology and computational linguistics research on all languages, indexed
in the ACL Anthology.
https://www.nejlt.org
What's special about NEJLT?
* Re-use reviews: We welcome revised manuscripts submitted with prior
reviews to a fast-track review process.
* Customised review: Specify a "type" for your paper at submission, that
determines how your paper is reviewed
* "Letter" format submission: Comments, positions, letters, or small
experiments also welcomed as very short articles
* Free to submit, free to publish, free to read
### SUBMISSION TYPES
NEJLT accepts (1) full articles, and (2) letters.
(1) Full articles are to be given a subtype. The types available at NEJLT
are:
* Computationally-aided linguistic analysis
* NLP engineering experiment paper
* Reproduction paper
* Resource paper
* Position paper
* Survey Paper
Other works are welcome - contact the editor.
(2) NEJLT Letters on computational linguistics and natural language
processing should be around 1000 words long, and are given a special,
dedicated review process.
More information about submission types and information for authors is at:
https://www.nejlt.org/authorinfo/
### SCOPE
NEJLT invites manuscripts from anywhere in the world that present excellent
research in the field of language technology and natural language
processing. Work on all languages is welcome.
* Language focus:
* Global; no specific focus. Research on all and any languages is invited.
* Topics of interest: including but not limited to
* Cognitive Modeling and Psycholinguistics
* Computational Social Science and Social Media
* Dialogue and Interactive Systems
* Discourse and Pragmatics
* Ethics and NLP
* Generation of language
* Green NLP
* Information Extraction
* Information Retrieval and Text Mining
* Interpretability and Analysis of Models for NLP
* Language Grounding to Vision, Robotics and Beyond
* Theory and Formalism in NLP (Linguistic and Mathematical)
* Machine Learning for NLP
* Machine Translation
* NLP Applications
* Phonology, Morphology and Word Segmentation
* Question Answering
* Resources and Evaluation
* Semantics: Lexical
* Semantics: Sentence Level
* Semantics: Textual Inference and Other Areas of Semantics
* Sentiment Analysis, Stylistic Analysis, and Argument Mining
* Speech and Multimodality
* Summarization
* Syntax: Tagging, Chunking and Parsing
* Works focusing on Northern European languages are encouraged, with the
same requirements of excellence
The editor-in-chief of NEJLT is appointed by the North European Association
for Language Technology. This geographical connection gives the journal its
name, though the journal itself does not have a Northern European language
focus.
More on NEJLT's scope is at: https://www.nejlt.org/
### REVIEWING
NEJLT is committed to rapid and fair reviewing. NEJLT strives to preserve
anonymity throughout the review process. The journal also invites revised
resubmissions from select events, including the reviews from those events,
including ACL, EMNLP, NAACL, EACL, AACL, and NeurIPS. For details, see:
https://www.nejlt.org/review/
### ABOUT THE JOURNAL
NEJLT publishes in the field of language technology, i.e. Natural Language
Processing, Computational Linguistics, and related topics. Research focused
on any natural language is invited.
NEJLT invites both journal articles and academic letters, and has a
multi-iteration reviewing process, where revisions are a possibility.
The reviewing philosophy of the journal is to minimise reviewing biases,
and also to provide constructive, helpful feedback during the review
process.
NEJLT is a global journal with global focus. The journal’s publisher is
located in Northern Europe, hence its name, and supports the journal
without charge, enabling open access publication with no costs. NEJLT is
indexed by many publication indexing services, and ranked by many national
bibliographic ranking systems.
NEJLT accepts submissions continuously all year round.
More at: https://www.nejlt.org/about/
### ORGANIZATION
* Editor-in-Chief:
* Leon Derczynski, ITU Copenhagen; ld(a)itu.dk
* Editorial board:
* Isabelle Augenstein, University of Copenhagen
* Nikolaos Aletras, University of Sheffield
* Rachel Bawden, INRIA, Paris
* Emily M. Bender, University of Washington
* Miryam de Lhoneux, University of Copenhagen
* Nicoletta Calzolari, Institute for Computational Linguistics, NRC Italy
* Manuel Ciosici, University of Southern California ISI
* Yang Feng, Chinese Academy of Sciences
* Eva Hajičová, Charles University
* Marco Kuhlmann, Linköping University
* Yuji Matsumoto, NAIST/Riken AIP
* Joakim Nivre, Uppsala University
* Ellie Pavlick, Brown University
* Verena Rieser, Heriot Watt University
* Vered Shwartz, Allen Institute for Artificial Intelligence (AI2)
* Thamar Solorio, University of Houston
* Mark Steedman, University of Edinburgh
* Jörg Tiedemann, University of Helsinki
* Bonnie Webber, Universty of Edinburgh
NEJLT's editorial team is detailed at: https://www.nejlt.org/team/
### PUBLICATION AND OPEN ACCESS
The ACL Anthology has accepted future inclusion of articles published in
NEJLT.
NEJLT is full open access. This means that accepted papers may be
downloaded directly from the web and will not be charged for. There are
also no fees for submitting or for publishing. There are no plans to
collect fees at any point in the future at any part of the NEJLT process.
Papers are published under the CC-BY 4.0 license. This means that NEJLT is
an Open Access Gold journal.
The journal is published by Linköping University press. The editor-in-chief
of NEJLT is appointed by the North European Association for Language
Technology. This geographical connection gives the journal its name, though
the journal itself does not have a Northern European language focus.
More details on NEJLT policies at: https://www.nejlt.org/policies/
### CONTACT
Please, see www.nejlt.org for further information. We look forward to
seeing your manuscripts.
> [Apologies for cross-posting]
>
> -> Last Weekend to Submit Your Paper to SIMBig 2022
>
> =================================================================
> LAST WEEKEND of CALL FOR PAPERS - SIMBig 2022
> =================================================================
>
> SIMBig 2022 - 9th International Conference on Information Management and Big Data
> Where: Universidad Nacional Mayor de San Marcos, Lima, PERU
> When: November 16 - 18, 2022
> Website: http://simbig.org/SIMBig2022/ <http://simbig.org/SIMBig2022/>
>
> =================================================================
>
> OVERVIEW
> ----------------------------------
>
> SIMBig 2022 seeks to present new methods of Artificial Intelligence (AI), Data Science, and related fields, for analyzing, managing, and extracting insights and patterns from large volumes of data.
>
>
> KEYNOTE SPEAKERS
> -------------
>
> Leman Akoglu, Carnegie Mellon University, USA
> Jiang Bian, University of Florida, USA
> Rich Caruana, Microsoft, USA
> Dilek Hakkani-Tur, Amazon Alexa AI, USA
> Monica Lam, Stanford University, USA
> Wang-Chiew Tan, Facebook AI, USA
> Andrew Tomkins, Google, USA
> Bin Yu, University of California, Berkeley, USA
>
> IMPORTANT DATES
> -------------
>
> August 05, 2022 August 19, 2022 --> Papers submission deadline
> September 09, 2022 September 17, 2022 ---> Notification of acceptance
> October 07, 2022 --> Camera-ready versions
> November 16 - 18, 2022 --> Conference held in Lima, Peru
>
> PUBLICATION AND TRAVEL AWARDS
> -------------
>
> All accepted papers of SIMBig 2022 (tracks including) will be published with Springer CCIS Series <https://www.springer.com/series/7899>.
>
>
> The best 8-10 papers of SIMBig 2022 (tracks including) will be selected to submit an extension to be published with the Springer SN Computer Science Journal. <https://www.springer.com/journal/42979>
> Thanks to the support of the North American Chapter of the Association for Computational Linguistics (NAACL) <http://naacl.org/>, SIMBig 2022 will offer 4 student travel awards for the best papers.
>
>
>
> TOPICS OF INTEREST
> -------------
>
> SIMBig 2022 has a broad scope. We invite contributions on theory and practice, including but not limited to the following technical areas:
>
> Artificial Intelligence
> Data Science
> Machine Learning
> Natural Language Processing
> Semantic Web
> Healthcare Informatics
> Biomedical Informatics
> Data Privacy and Security
> Information Retrieval
> Ontologies and Knowledge Representation
> Social Networks and Social Web
> Information Visualization
> OLAP and Business intelligence
> Data-driven Software Engineering
>
> SPECIAL TRACKS
> -------------
>
> SIMBig 2022 proposes three special tracks in addition to the main conference:
>
> ANLP <https://simbig.org/SIMBig2022/en/anlp.html> - Applied Natural Language Processing
> DISE <https://simbig.org/SIMBig2022/en/dise.html> - Data-drIven Software Engineering
> SNMAM <https://simbig.org/SIMBig2022/en/snmam.html> - Social Network and Media Analysis and Mining
>
> CONTACT
> -------------
>
> SIMBig 2022 General Chairs
>
> Juan Antonio Lossio-Ventura, National Institutes of Health, USA (juan.lossio(a)nih.gov <mailto:juan.lossio@nih.gov>)
> Hugo Alatrista-Salas, Pontificia Universidad Católica del Perú, Peru (halatrista(a)pucp.pe <mailto:halatrista@pucp.pe>)
>
*ICNLSP 2022: LAST call for papers*
Dear all,
We are delighted to announce that ICNLSP 2022
<https://www.icnlsp.org/2022welcome/>, the 5*th* edition of the
International Conference on Natural Language and Speech Processing, hosted
by DataScientia (University of Trento)
<http://datascientia.disi.unitn.it/events/> for the third time, will be
held online, on 16-17 December 2022.
*Important dates*
*Submission deadline*: *30 August 2022*
*Notification of acceptance*: *31 October 2022*
*Camera-ready paper due*: *20 November 2022*
*Conference dates*: *16, 17 Decemberber 2022*
*Publication*
1- All accepted papers will be published in ACL Anthology, and indexed in
DBLP.
2- Selected papers will be published in Signals and Communication
Technology (Springer) (https://www.springer.com/series/4748), indexed by
Scopus and zbMATH.
*Keynote speakers*
1. *Eric Laporte*, *Gustave Eiffel University*, *France.*
2.* Jan Niehues*, *University of Maastricht*, *Netherlands.*
3. *Ahmed Ali*, *Qatar Computing Research Institute*, *Qatar*.
*Workshop: NSURL 2022*
The workshop on NLP Solutions for Under Resourced Languages NSURL
<http://nsurl.org> will be held with ICNLSP 2022
<https://www.icnlsp.org/2022welcome/>. The workshop aim to be a forum for
solving NLP tasks concerning Arabic and its dialects and also
under-resourced languages as African, Persian, etc.
We look forward to welcome you to ICNLSP 2022
<https://www.icnlsp.org/2022welcome/> that will be an opportunity to get
acquainted with the latest research in the field of natural language and
speech processing, hoping that it will be successful with your active
participation.
*Contact*
icnlsp2022(a)easychair.org
Dear List member,
The Institute of Translation Studies at the University of Innsbruck,
Austria, is looking to appoint a University Assistant (Postdoc) in
Multilingual (specialised) lexicography.
The deadline for application is August 26th.
All details (German and English) and the link for applying are at
https://lfuonline.uibk.ac.at/public/karriereportal.details?asg_id_in=12939
Informal enquiries can be sent to
Laura.Giacomini(a)uibk.ac.at
Best wishes,
Laura Giacomini
The Center for Information and Language Processing (CIS) at LMU Munich
has several fully-funded positions in Natural Language Processing and
Deep Learning available in the groups of Barbara Plank and Hinrich
Schütze.
Application deadline: September 8th, 2022
Details and application: https://www.cis.lmu.de/web/jobs2022.html
The Center for Information and Language Processing (CIS) at LMU Munich
(co-directed by Barbara Plank and Hinrich Schütze) has two open
tenure-track lecturer positions (Akademische/r Rat/Raetin auf
Lebenszeit) in computational linguistics / natural language
processing.
Application deadline: September 30th, 2022
Details and application: https://www.cis.lmu.de/web/arpositions2022.html
Hello! We are a team of researchers from MSR New England and New York. We are seeking participants (aged 18 or older) who have experience in machine learning or are interested in applying machine learning to developing computational models for signed languages for a survey study.
The purpose of this project is to explore how machine learning practitioners can better build machine learning models for sign language computation (e.g., recognition/translation). We want to understand your general motivations in working with machine learning problems and expected challenges when newly working with sign language data and tasks. Please know that sign language knowledge or sign language computation experience is NOT required to participate in this project.
The survey can be found at https://forms.office.com/r/7LPnkdTFLN<https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fforms.off…> along with a consent form for further details. For every submission of the survey, $10 will be donated to LEAD-K (Language Equity and Acquisition for Deaf Kids), up to the first 50 submissions. The survey receives one submission per person.
Once you agree to consent, you will be directed to the survey questions. It will take about 30 minutes to answer the questions, including your experience in machine learning and sign language computation (if any), understanding of sign language culture, and demographics such as your age or education level.
Your responses will be anonymous, unless you choose to provide your name and email address for future contact where you will be invited to participate in a paid study to collaborate with American Sign Language experts. Your name and email address will never be shared outside of the research team.
Please complete the survey by Tuesday, 8/23 and feel free to forward this to other colleagues who may be interested!
Thank you so much for your consideration!
Rie Kamikubo, Danielle Bragg, Alex Lu, Hal Daumé III
In this newsletter:
Fall 2022 LDC Data Scholarship Program
30th Anniversary Highlight: The LDC Gigawords
________________________________
New publication:
HAVIC MED Novel 2 Test - Videos, Metadata and Annotation<https://catalog.ldc.upenn.edu/LDC2022V02>
Fall 2022 LDC Data Scholarship Program
Student applications for the Fall 2022 LDC Data Scholarship program are being accepted now through September 15, 2022. This program provides eligible students with no-cost access to LDC data. Students must complete an application consisting of a data use proposal and letter of support from their advisor. For application requirements and program rules, visit the LDC Data Scholarships page<https://www.ldc.upenn.edu/language-resources/data/data-scholarships>.
30th Anniversary Highlight: The LDC Gigawords
Giga: a combining form meaning "billion," used in the formation of compound words (Source: https://www.dictionary.com/browse/giga-)
LDC's Gigaword corpora are a natural outgrowth of its vast decades-long multi-language newswire collection. Newswire data was originally collected, annotated, and distributed for use in many sponsored projects and was also released through the LDC catalog in tailored data sets. Then came the idea of making LDC's entire newswire collection available by language with a simple, minimal markup to support a broad range of NLP/HLT tasks. The first Arabic<https://catalog.ldc.upenn.edu/LDC2011T11>, Chinese<https://catalog.ldc.upenn.edu/LDC2011T13>, and English<https://catalog.ldc.upenn.edu/LDC2011T07> Gigaword editions were released in 2003; subsequent cumulative releases through fifth editions in 2011 represent LDC's newswire collection spanning 1994-2010 in those languages. French<https://catalog.ldc.upenn.edu/LDC2011T10> and Spanish<https://catalog.ldc.upenn.edu/LDC2011T12> Gigawords were first published in 2006, culminating in the release of third editions in 2011, likewise covering newswire collected by LDC through 2010.
The community has used, and continues to use, these data sets in numerous ways. Automatic text summarization is a favorite, and current work in this area applies deep learning principles (see, e.g., Gao et al. 2020<https://link.springer.com/article/10.1007/s00521-018-3946-7>, English). Gigawords are also useful for text source classification (Huang et al. 2003<https://aclanthology.org/Y08-1042.pdf>, Chinese), information extraction (Lan et al. 2020<https://arxiv.org/pdf/2004.14519.pdf>, Arabic), knowledge extraction and distributional semantics (Napoles et al. 2012<https://aclanthology.org/W12-3018.pdf>, English), and natural language understanding (Ganitkevitch 2013<https://www.cs.jhu.edu/~juri/pdf/proposal-naacl-2013-srw.pdf>, English), among other fields. Recent variations like the annotated<https://catalog.ldc.upenn.edu/LDC2012T21> and concretely annotated<https://catalog.ldc.upenn.edu/LDC2018T20> English Gigawords add syntactic, semantic, and coreference annotations to this billion word text collection.
All Gigaword corpora are available for licensing by Consortium members and non-members. Visit Obtaining Data <https://www.ldc.upenn.edu/language-resources/data/obtaining> for more information.
________________________________
New publication:
HAVIC MED Novel 2 Test - Videos, Metadata and Annotation<https://catalog.ldc.upenn.edu/LDC2022V02> is comprised of 6,200 hours of user-generated videos with annotation and metadata developed by LDC for the 2015 NIST Multimedia Event Detection tasks. The data consists of videos of various events (event videos) and videos completely unrelated to events (background videos). Each event video was manually annotated with judgments describing its event properties and other salient features. Background videos were labeled with topic and genre categories.
HAVIC MED Novel 2 Test -- Videos, Metadata and Annotation is distributed via web download.
2022 Subscription Members will automatically receive copies of this corpus. 2022 Standard Members may request a copy as part of their 16 free membership corpora. This corpus is a members-only release and is not available for non-member licensing. Contact ldc(a)ldc.upenn.edu<mailto:ldc@ldc.upenn.edu> for information about membership.
Membership Coordinator
Linguistic Data Consortium<ldc.upenn.edu>
University of Pennsylvania
T: +1-215-573-1275
E: ldc(a)ldc.upenn.edu<mailto:ldc@ldc.upenn.edu>
M: 3600 Market St. Suite 810
Philadelphia, PA 19104
Il giorno ven 12 ago 2022 alle 14:00 <corpora-request(a)list.elra.info> ha
scritto:
> Send Corpora mailing list submissions to
> corpora(a)list.elra.info
>
> To subscribe or unsubscribe via email, send a message with subject or
> body 'help' to
> corpora-request(a)list.elra.info
>
> You can reach the person managing the list at
> corpora-owner(a)list.elra.info
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Corpora digest..."
>
> Today's Topics:
>
> 1. [CfP] TREC Health Misinformation Track 2022 (Maria Maistro)
> 2. [CfP] ACM TOIS Efficiency in Neural IR (Maria Maistro)
> 3. Call for Badges - ACM SIGIR Artifact Badges Continuous Submission
> (Nicola Ferro)
> 4. Call for proposals: Natural Language Processing (John Benjamin’s)
> (Caro)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Fri, 12 Aug 2022 08:03:18 +0000
> From: Maria Maistro <mm(a)di.ku.dk>
> Subject: [Corpora-List] [CfP] TREC Health Misinformation Track 2022
> To: "corpora(a)list.elra.info" <corpora(a)list.elra.info>
> Message-ID: <86B5F708-9063-456A-B790-888B9639E00F(a)ku.dk>
> Content-Type: multipart/alternative;
> boundary="_000_86B5F7089063456AB790888B9639E00Fkudk_"
>
> Call for Participation - TREC Health Misinformation Track 2022
> https://trec-health-misinfo.github.io
>
> Overview 🧐
> --------------------------
> Web search engines are frequently used to help people make decisions about
> health-related issues. Unfortunately, the web is filled with misinformation
> regarding the efficacy of treatments for health issues. Search users may
> not be able to discern correct from incorrect information, nor credible
> from non-credible sources. As a result of finding misinformation deemed by
> the user to be useful to their decision making task, they can make
> incorrect decisions that waste money and put their health at risk.
>
> The TREC Health Misinformation track fosters research on retrieval methods
> that promote reliable and correct information over misinformation for
> health-related decision making tasks.
>
> Tasks 💼
> --------------------------
> * Ad-hoc Retrieval Task: design a ranking model that promotes credible and
> correct information over incorrect information;
> * Answer Prediction Task: predict the answer to the topic’s stance.
>
> Guidelines 📋 we u guy
> --------------------------
> * Corpus: noclean version of the C4 dataset (
> https://huggingface.co/datasets/allenai/c4);
> * Topics: about consumer health search (people seeking health advice
> online);
> * Runs: runs may be either automatic or manual with the standard TREC run
> format.
>
> Detailed guidelines: https://trec-health-misinfo.github.io
>
> Important Dates 🔥
> --------------------------
> * Runs due from participants: August 28, 2022
> * Evaluation results returned: End of September 2022
> * Notebook paper due: October 2022
> * TREC 2022 Conference: November 14-18, 2022
> * Final paper due: February 2023
>
> Organization 👔
> --------------------------
> * Charles Clarke, University of Waterloo
> * Maria Maistro, University of Copenhagen
> * Mark Smucker, University of Waterloo
>
>
> ———
>
> Maria Maistro, PhD
> Tenure-track Assistant Professor
> Department of Computer Science
> University of Copenhagen
> Universitetsparken 5, 2100 Copenhagen, Denmark
>