December 2023 - Corpora

Final call for papers: CODI, 5th Workshop on Computational Approaches to Discourse
by Amir Zeldes 11 Dec '23

11 Dec '23

CODI, 5th Workshop on Computational Approaches to Discourse: Final Call for Papers 2024-03-21 or 22 - EACL 2024 - Malta ** Submission deadline: December 20th, 2023 - No deadline extension ** Website link: https://sites.google.com/view/codi2024 Aims and scope The last ten years have seen a dramatic improvement in the ability of NLP systems to understand and produce words and sentences. This development has created a renewed interest in discourse phenomena as researchers move towards the processing of long-form text and conversations. There is a surge of activity in discourse parsing, coherence models, text summarization, corpora for discourse level reading comprehension, and discourse related/aided representation learning, to name a few, but the problems in computational approaches to discourse are still substantial. At this juncture, we have organized four Workshops on Computational Approaches to Discourse (CODI) at EMNLP 2020, EMNLP 2021, COLING 2022 and ACL 2023 to bring together discourse experts and upcoming researchers. These workshops have catalyzed work to advance research on discourse level problems and have served as a forum for the discussion of suitable datasets and reliable evaluation methods. The previous workshops on discourse in machine translation (DiscoMT), linking lexical, sentential and discourse semantics (LSDSem), discourse structure in natural language generation (DSNNLG), discourse relation parsing and treebanking (DISRPT) and coreference (CORBON/CRAC), have shown that there is considerable interest and success in bringing together the community working on specific problems in discourse. We believe that the discourse community will also benefit from a general forum where work ranging from corpus development/analysis to computational models, and evaluation is discussed, and desiderata can be drawn for future progress. The 5th CODI workshop is planned as a 1 day event which brings together different subcommunities. It will feature invited talks and regular papers. We also accept papers accepted at other major conferences for non-archival presentation, including Findings papers. Topics of interest We welcome papers on symbolic and probabilistic approaches, corpus development and analysis, as well as machine and deep learning approaches to discourse. We appreciate theoretical contributions as well as practical applications, including demos of systems and tools. The goal of the workshop is to provide a forum for the community of NLP researchers working on all aspects of discourse. Topics of interest include, but are not limited to: - discourse structure - discourse connectives - discourse relations - annotation tools and schemes for discourse phenomena - corpora annotated with discourse phenomena - discourse parsing - cross-lingual discourse processing - cross-domain discourse processing - anaphora and coreference resolution - event coreference - argument mining - coherence modeling - discourse and semantics - discourse in applications such as machine translation, summarization, etc. - evaluation methodology for discourse processing Submissions We solicit three categories of papers: regular (long and short) workshop papers, demos and extended abstracts. Only regular workshop papers and demos will be included in the proceedings as archival publications. Double submission of papers is allowed but will need to be indicated at submission. Regular papers must describe original unpublished research. Long papers may consist of up to 8 pages of content, plus unlimited pages for references. Short papers can be up to 4 pages, plus unlimited pages for references. Demo submissions may describe systems, tools, visualizations, etc., and may consist of up to 4 pages, plus unlimited pages for references. Each submission can contain unlimited pages for Appendices but the paper submissions need to remain fully self-contained, as these supplementary materials are completely optional, and reviewers are not even asked to review them. Extended abstracts can describe work in progress. These may be two pages long (without references). Extended abstracts are non-archival. They will be included in the workshop program and handbook, but will not appear in the workshop proceedings. Paper accepted or rejected at one of the main conferences We also invite presentations of paper accepted at another main conference, a specific deadline and submission process will be communicated later on. They will be included in the workshop program and handbook, but will not appear in the workshop proceedings. We will also consider for publication papers rejected at one of the main conferences (see the direct submission deadline below), authors will have to submit both the paper and the reviews. The submission process will be communicated later on. Submission website All submissions must be anonymous and follow the EACL 2024 formatting instructions described here: � https://aclrollingreview.org/cfp Please submit your workshop papers at https://softconf.com/eacl2024/CODI-2024/ Important dates 2023-12-20: CODI papers due 2024-01-17: Direct submission (papers rejected at a main conference) 2024-01-20: Notification of acceptance 2024-01-30: Camera ready deadline for main conference and CODI 2024-03-17 – 2024-03-22: CODI workshop All deadlines are 11.59 pm UTC -12h ("anywhere on Earth"). Due to the tight schedule, there will be no deadline extension. Invited Speakers - Hannah Rohde, University of Edinburgh - Manfred Stede, Potsdam University Organizers Chloé Braud, CNRS-IRIT Christian Hardmeier, IT University of Copenhagen Chuyuan (Lisa) Li, � University of British Columbia Jessy Li, University of Texas, Austin Sharid Loáiciga, University of Gothenburg Michael Strube, Heidelberg Institute for Theoretical Studies Amir Zeldes, Georgetown University To contact the organizers, please send an email to: codi-workshop(a)googlegroups.com <mailto:codi-workshop@googlegroups.com> �

1 0

Tenure-Track Position in Data Curation & Responsible Data Science (Open Rank) - Information Sciences
by Halil Kilicoglu 11 Dec '23

11 Dec '23

*** Apologies for cross-posting *** Dear colleagues, The School of Information Sciences (iSchool) at the University of Illinois-Champaign is hiring for a tenure-track position in Data Curation & Responsible Data Science (open rank). The application deadline is January 05, 2024. The expected start date is August 16, 2024. For the full job announcement and application link, please see https://illinois.csod.com/ux/ats/careersite/1/home/requisition/8864?c=illin…. Feel free to contact me if you have any questions. Best regards, --Halil *HALIL KILICOGLU* *Associate Professor* School of Information Sciences University of Illinois at Urbana-Champaign halil(a)illinois.edu https://ischool.illinois.edu/people/halil-kilicoglu

1 0

UMRs in Boulder Summer School - First Call for Applications
by kristine.stenzel＠colorado.edu 11 Dec '23

11 Dec '23

June 10-13, 2024, University of Colorado, Boulder Held in conjunction with the UMR Parsing Workshop, June 14, 2024 https://umr4nlp.github.io/web/UMRParsingWorkshop.html Impressive progress has been made in many aspects of natural language processing (NLP) in recent years. Most notably, the achievements of transformer-based large language models such as ChatGPT would seem to obviate the need for any type of semantic representation beyond what can be encoded as contextualized word embeddings of surface text. Advances have been particularly notable in areas where large training data sets exist, and it is advantageous to build an end-to-end training architecture without resorting to intermediate representations. For any truly interactive NLP applications, however, a more complete understanding of the information conveyed by each sentence is needed to advance the state of the art. Here, "understanding'' entails the use of some form of meaning representation. NLP techniques that can accurately capture the required elements of the meaning of each utterance in a formal representation are critical to making progress in these areas and have long been a central goal of the field. As with end-to-end NLP applications, the dominant approach for deriving meaning representations from raw textual data is through the use of machine learning and appropriate training data. This allows the development of systems that can assign appropriate meaning representations to previously unseen text. In this four-day course, instructors from the University of Colorado and Brandeis University will describe the framework of Uniform Meaning Representations (UMRs), a recent cross-lingual, multi-sentence incarnation of Abstract Meaning Representations (AMRs), that addresses these issues and comprises such a transformative representation. Incorporating Named Entity tagging, discourse relations, intra-sentential coreference, negation and modality, and the popular PropBank-style predicate argument structures with semantic role labels into a single directed acyclic graph structure, UMR builds on AMR and keeps the essential characteristics of AMR while making it cross-lingual and extending it to be a document-level representation. It also adds aspect, multi-sentence coreference and temporal relations, and scope. Each day will include lectures and hands-on practice. Topics to be covered June 10-13: 1. The basic structural representation of UMR and its application to multiple languages; 2. How UMR encodes different types of MWE (multi-word expressions), discourse and temporal relations, and TAM (tense-aspect-modality) information in multiple languages, and differences between AMR and UMR; 3. Going from IGT (interlinear glossed text) to UMR graphs semi-automatically; 4. Formal semantic interpretation of UMR incorporating a continuation-based semantics for scope phenomena involving modality, negation, and quantification; 5. Extension to UMR for encoding gesture in multimodal dialogue, Gesture AMR (GAMR), which aligns with speech-based UMR to account for situated grounding in dialogue. The fifth day of the summer school, June 14, will be co-located with a UMR Parsing Workshop, focusing on parsing algorithms that generate AMR and UMR representations over multiple languages. https://umr4nlp.github.io/web/UMRParsingWorkshop.html To apply, please complete this form by Jan. 30, 2024. https://www.colorado.edu/linguistics/umrs-boulder-summer-school-application Other important dates: ● Notification of acceptance: Feb. 20, 2024 ● Confirmation of participation: Mar. 1, 2024 ● Arrival in Boulder June 9, departure June 15, 2024. Participation will be fully funded (reasonable airfare, lodging, and meals). This summer school has been made possible by funding from NSF Collaborative Research: Building a Broad Infrastructure for Uniform Meaning Representations (Award # 2213805), with additional support from the University of Colorado Boulder and the CLEAR Center.

1 0

UMR Parsing Workshop 2024 - First Call
by kristine.stenzel＠colorado.edu 11 Dec '23

11 Dec '23

UMR Parsing Workshop - First Call University of Colorado, Boulder June 14, 2024 This workshop will focus on developing parsers for Uniform Meaning Representations. The goal is to start from raw text from real-world settings that could be in any one of many typologically different languages, even low-resource languages for which there is little or no training data. This can be achieved by exploiting a common semantic annotation standard. This workshop has been made possible by funding for NSF Collaborative Research: Building a Broad Infrastructure for Uniform Meaning Representations (Award # 2213805), which is aimed at developing guidelines and annotation for cross-lingual Uniform Meaning Representations, based on the original Abstract Meaning Representation guidelines for English, but ensuring cross-linguistically consistent annotation and recoverability of the original raw texts. This workshop will overlap with the last day of the Colorado UMR Annotation Summer School. The workshop is open to everyone and will cover the fundamentals of UMR annotation and the differences between AMR and UMR. In addition to the talk from our invited speaker, there will be presentations on recent successful approaches to AMR parsing and how they can be applied to UMR parsing. We welcome submissions from anyone on related topics, such as: ● AMR or UMR parsing for any language ● AMR or UMR generation for any language ● Evaluation metrics for AMR or UMR parsing ● Bootstrapping of AMRs or UMRs from related semantic representations such as Propbanks ● Projections of English AMR onto other languages; ● Challenges of applying AMR annotation to languages other than English; ● Challenges of accurate multi-sentence coreference as a subtask of AMR parsing; ● Any other topic related to the parsing and generation of AMRs or UMRs. Important dates ● Workshop paper submissions due: March 30, 2024 ● Notification of acceptance: April 25, 2024 ● Camera-ready versions due: May 30, 2024 Submissions Submissions should report original and unpublished research on topics of interest to the workshop. Accepted papers are expected to be presented at the workshop and will be published in the workshop proceedings. They should emphasize obtained results rather than intended work and should clearly indicate the state of completion of the reported results. Submission is electronic, using the Workshop submission site in Easy Chair. https://easychair.org/my/conference?conf=umrpw2024 Submissions must adhere to the two-column format of ACL venues, using the Overleaf template taken from ACL 2023. https://www.overleaf.com/latex/templates/acl-2023-proceedings-template/qjdg… Initial submissions should be fully anonymous to ensure double-blind reviewing. Long papers must not exceed eight (8) pages of content; short papers must not exceed four (4) pages of content. References and appendices do not count against these limits. To ensure double-blind reviews, papers must not include the authors’ names and affiliations or self-references that reveal any author’s identity. Papers that do not conform to these requirements will be rejected without review.

1 0

Call for Papers - 11th Inter-Varietal Applied Corpus Studies (IVACS) Biennial Conference - University of Cambridge - 16th & 17th July 2024
by Andrew Caines 11 Dec '23

11 Dec '23

Dear Colleagues, We are delighted to launch the Call for Paper for the *11th Inter-Varietal Applied Corpus Studies (IVACS) Biennial Conference *which will be hosted by the University of Cambridge, U.K., on Tuesday 16th and Wednesday 17th July 2024. Conference website: https://www.ivacs2024.com/ Abstract deadline 20th December, 2023. *Plenary Speakers* We are delighted that the following researchers will be giving plenary talks at the conference: - Dr Brian Clancy <https://www.mic.ul.ie/staff/276-brian-clancy> - Dr Geraldine Mark <https://profiles.cardiff.ac.uk/staff/markg2> Please spread the word! Best wishes, Anne and Andrew *Dr Andrew Caines, Conference Convenor, University of Cambridge* *Prof. Anne O'Keeffe, Inter-Varietal Applied Corpus Studies (IVACS) Network Director* *Call for Papers* *The 11th Inter-Varietal Applied Corpus Studies (IVACS) Biennial Conference* We are particularly interested in papers in but not limited to the following areas: Strand 1 – Corpus Methods and Innovations: Innovations in Corpus Design, Analysis and Annotation Tools; Critical Reflections on Corpus Methods; Advances in Quantitative and Qualitative Approaches to Analysing Corpora; Innovations in Statistics for CL. Strand 2 – Corpus Linguistics, Pragmatics and Discourse: Corpus Approaches to Discourse Analysis, Conversation Analysis, Critical Discourse Analysis; Corpus Pragmatics; CL and Real-World Contexts (e.g. Media Discourse, Classroom Discourse; Workplace Discourse). Strand 3 – Corpus Linguistics and Applied Linguistics: Learner Corpus Research; CL and Second Language Acquisition; Data-Driven Learning; CL for Materials Development; CL and Teacher Education; CL and Lexicography. Strand 4 – Corpus Linguistics, Literature, Texts and Register: CL and Register Studies; Corpus Stylistics; CL and Literary Linguistics; CL and Translation Studies; Forensic Linguistics. Strand 5 – Corpus Linguistics and Speech: CL Speech Technology; CL and Multimodality; Spoken Corpora; Corpus Phonology. Strand 6 – Corpus Linguistics and Sociolinguistics: CL and Language Change; Language Varieties and Variation; CL and Minority Language Studies. Strand 7 – Computational Linguistics and Corpora: The use of Corpora for Computational Linguistics research; Exploration and analyses of Corpora using Computational Linguistic methods; Data collection and annotation for Computational Linguistics. *Abstract Submission and Timeline* Full papers will involve a 20-minute presentation, plus 10 minutes for questions and discussion. Posters can present work in progress or summaries of completed studies, research projects or other innovations. Posters will be printed in portrait A0 size. Abstracts will be 300 words (not including reference list, if any). Note that the deadlines are 11.59 pm UTC -12h <https://www.timeanddate.com/time/zone/timezone/utc-12> (“anywhere on Earth”). Abstract deadline 20th December, 2023 Notification 31st January, 2024 Conference 16th-17th July, 2024 Submission of abstracts: OpenReview <https://openreview.net/group?id=IVACS/2024/Conference> *Seeking Reviewers* Would you have time to help us review the abstracts in January? Maximum 5 per person. Please sign up here <https://forms.gle/BkopQZ12esXMAnv36>

1 1

Third call for papers: The 9th Workshop on Noisy and User-generated Text (WNUT 2024)
by rob van der goot 11 Dec '23

11 Dec '23

The WNUT Workshop will be collocated with EACL 2024 (Malta). The website for the workshop is at: http://noisy-text.github.io/ The WNUT workshop focuses on core NLP tasks (e.g., POS/NER tagging and translation; not computational social science) over user-generated text, such as that found on social media, web forums, online reviews, digital health records, or language learner essays. We seek submissions of long and short papers on original and unpublished work (same format and page limit as EACL main conference). All accepted submissions will be presented as posters. Additionally, selected submissions will be presented orally. There will be best paper awards for both short and long papers. Topics of interest include but are not limited to: * NLP of noisy text, e.g. POS, NER tagging, Parsing * Text normalization and error correction * Paraphrase identification and semantic similarity of short text or noisy text * Extracting user demographics, profiles, and major life events * Machine translation and Multilingual NLP over noisy text * Information extraction from noisy text, global and regional trend detection, and event extraction * Colloquial language, e.g. idiom detection * Domain adaptation to user-generated text * Detecting rumors, contradictory information, sarcasm and humor on social media * Sentiment analysis * Temporal aspects of user-generated content (resolving time expressions, concept drift, etc...) * Representing and mining language variation in user-generated content * Processing of automatically generated data * Robustness to Noise, both Natural and Adversarial [IMPORTANT DATES] * Submission Deadline: December 18, 2023 (anytime on earth; dual-submission allowed) * Acceptance Notification: January 20, 2024 * Camera-Ready Deadline: january 30, 2024 * Workshop Day: March 21/22, 2024 [INVITED SPEAKERS] * Su Lin Blodgett * Jennifer Foster [ORGANIZERS] * Tim Baldwin (University of Melbourne) * Wei Xu (Georgia Institute of Technology) * Alan Ritter (Georgia Institute of Technology) * Rob van der Goot (IT University of Copenhagen) * Max Müller-Eberstein (IT University of Copenhagen) [SUBMISSION] Submissions should conform to the ACL style guidelines. Long and short paper submissions must be anonymized. Please submit your papers via OpenReview: https://openreview.net/group?id=eacl.org/EACL/2024/Workshop/WNUT

1 0

CAiSE'24 Ph.D. Award: First Call for Applications
by Announce 11 Dec '23

11 Dec '23

*** Ph.D. Award: First Call for Applications *** 36th International Conference on Advanced Information Systems Engineering (CAiSE'24) June 3-7, 2024, 5* St. Raphael Resort and Marina, Limassol, Cyprus https://cyprusconferences.org/caise2024/ (*** Submission Deadline: 1st March, 2024 AoE ***) The deadline to apply for the CAiSE 2024 PhD Award is March 1st 2024. The conditions to apply are: • having participated as an author in a previous CAiSE Doctoral Consortium or at a main CAiSE Event: either the main conference, the CAiSE Forum, EMMSAD, or BPMDS; • having successfully defended the PhD thesis in the last two years (i.e., since January 2022). The application must be submitted electronically to the PhD Awards track of CAiSE 2024 via EasyChair <https://easychair.org/conferences/?conf=caise2024> . The application must be a single PDF file containing: • a short cover letter that includes the list of PhD committee members, • a support letter from the thesis advisor, • the candidate's defended PhD thesis, • the candidate’s CV. About the PhD Award The CAiSE PhD Award 2024 is granted annually to an outstanding recent PhD thesis in the field of Information Systems Engineering. The award is co-sponsored by the CAiSE Steering Committee and Springer. It consists of a certificate, free full registration (5 days) to the next two editions of the CAiSE conference, and a book voucher for a free selection worth EUR 500 from Springer’s printed books collection. In addition, the selected thesis will be recommended for publication as a monograph in the LNBIP series published by Springer, provided that Springer’s publication conditions are met. The PhD theses submitted for the award will be reviewed by a standing committee of senior members selected from the CAiSE Advisory Committee, the CAiSE Steering Committee, and the CAiSE Program Committee. Award Chair Professor Andreas L Opdahl, University of Bergen, Norway Key Dates • Submission of application: 1st March, 2024 (AoE) • Notification: 15th April, 2024 Past Recipents • 2023: Anna Bernasconi, PhD from Politecninco Milano (Italy), thesis title “Model, Integrate, Search... Repeat: a Sound Approach to Building Integrated Repositories of Genomic Data” (link to the forthcoming monograph: https://link.springer.com/book/9783031449062) • 2022: Volodymyr Leno, PhD from University of Melbourne (Australia), thesis title “Robotic Process Mining: Accelerating the adoption of Robotic Process Automation” (link to the thesis: https://minerva-access.unimelb.edu.au/bitstream/handle/ 11343/297274/98f9efca-4dd2-eb11-94dc-0050568d0279_manuscript.pdf) • 2021: Orlenys Lopez Pintado, PhD from University of Tartu (Estonia), thesis title “Collaborative Business Process Execution on the Block Chain: the Caterpillar System” (link to the thesis: https://dspace.ut.ee/items/1e09072c-5442-463a-b8c6-0425951cb90b) • 2020: Steven Mertens, PhD from Ghent University (Belgium), thesis title “Enabling process management for loosely framed knowledge-intensive processes” (link to the published monograph: https://www.springer.com/gp/book/9783030661922) • 2019: Giovanni Meroni, PhD from Politecnico di Milano (Italy), thesis title “Artifact-driven business process monitoring” (link to the published monograph: https://www.springer.com/gp/book/9783030324117) • 2018: Wei Wang, PhD from University of Queensland (Australia) thesis title “Integrated Modeling of Business Processes and Business Rules” (link to the published monograph: https://www.springer.com/gp/book/9783030118082) • 2017: Marcela Ruiz, PhD from the Universidad Politécnica de Valencia (Spain), thesis title “TraceME: A Traceability-Based Method for Conceptual Model Evolution” (link to the published monograph: https://www.springer.com/gp/book/9783319897158) • 2016: Le Minh Sang Tran, PhD from University of Trento (Italy), thesis title “Managing the Uncertainty of the Evolution of Requirements Models” (testimony of the 2016 CAiSE PhD Award winner: https://www.youtube.com/watch?v=q-vvlH66lC4)

1 0

1st CFP - Third Workshop on Digital Humanities and Natural Language Processing
by Leonardo Zilio 11 Dec '23

11 Dec '23

* We apologize if you receive multiple copies of this CFP * For the online version of this Call, visit: https://easychair.org/cfp/3rdDHandNLP =============== 3rd DHandNLP Third Workshop on Digital Humanities and Natural Language Processing Co-located with PROPOR 2024 14-15 March 2024, Universidade de Santiago de Compostela, Galizia, Spain Website: https://sites.google.com/view/dhandnlp-propor Submission deadline: 20 January 2024 (23:59 GMT) Submission link: https://easychair.org/conferences/?conf=3rddhandnlp 3rd DHandNLP is a one-day workshop during PROPOR - 14-15 March 2024 *Workshop description* Digital humanities (DH) stand at the intersection of computing and the humanities, involving collaborative transdisciplinary research. While current DH practice already shows an impressive array of new digital tools and methods for the study of the humanities, we believe that natural language processing techniques and experience can significantly enhance the field, while DH can also bring new testbeds and problems for the NLP community. As shown in the previous workshops, there is an increasing set of researchers in the processing of Portuguese who are interested in this active collaboration, and we believe that we should cater for a forum which may join the two communities, DH and NLP, showcasing several different aspects allowed by this cross-fertilization. The 3rdDHandNLP welcomes papers stemming from humanities that deal with language, such as philosophy, history, geography, law, philology, linguistics, or literature, and that can benefit from a digital approach or enhanced with computational linguistics methods or techniques, be it by using large sets of (written or spoken) textual data or by developing applications for an increasingly digital world. We also welcome papers that use “traditional” DH tools or techniques, such as topic modeling, and papers that use standard NLP tools that were already applied in different DH contexts, such as named entity recognition, document clustering and classification, sentiment analysis, dialect/language identification and linked data. *Main workshop topics* - Digital philology, critical editions production and textual criticism - Lexicometrics, lexicology and lexicography - Visualization or sonification of large textual bodies in specific domains - Computational stylometry, authorship attribution and profiling - Distant reading of literature - Construction of historical thesauri Finally, we are especially interested in approaches that deal with historical material, involving not only historical linguistics but historical lexicology, corpus processing and their multilingual analysis. *SUBMISSION GUIDELINES* All papers must be anonymous, original and not simultaneously submitted to another journal or conference. They must strictly adhere to the submission templates of the main conference. We welcome submissions of: - Short papers, consisting of up to 4 pages of content, plus unlimited pages of references - Full papers, consisting of up to 8 pages of content, plus unlimited pages of references Kind regards, Maria José B. Finatto and Leonardo Zilio (on behalf of the organising committee)

1 0

PROPOR24 Deadline extension + last call for demos.
by Iria de Dios Flores 11 Dec '23

11 Dec '23

The PROPOR 2024 demonstration program committee invites submissions for demonstrations. Following the spirit of previous PROPOR editions, the demonstration track aims at bringing together academia and industry, creating a forum where more than written or spoken descriptions of research are available. Thus, demos should allow attendees to try and test them during their presentation in a dedicated session that will provide a more informal and interactive setting. Products, systems, or tools are examples of acceptable demos. Both early-research prototypes and mature systems may also be considered. *Important dates:* Demos Submission: January 10 2024 Notification of acceptance or rejection: February 21 2024 Camera-ready demo paper: February 28 2024 Conference: March 14 and 15 2024 *Topics:* The areas of interest include all topics related to theoretical and applied issues of written and spoken Portuguese and Galician, such as, but not limited to, the same topics as for the conference paper submission: Natural language processing tasks (e.g. parsing, word sense disambiguation, coreference resolution) Natural language processing applications (e.g. question answering, subtitling, summarization, sentiment analysis) Natural language generation Information extraction and information retrieval Speech technologies (e.g. spoken language generation, speech and speaker recognition, spoken language understanding) Speech applications (e.g. spoken language interfaces, dialogue systems, speech-to-speech translation) Resources, standardization and evaluation (e.g. corpora, ontologies, lexicons, grammars) NLP-oriented linguistic description or theoretical analysis Distributional semantics and language modeling Portuguese language varieties and dialect processing (including the language varieties of Angola, Brazil, Cape Verde, East Timor, Galicia, Guinea-Bissau, Macau, Mozambique, Portugal, São Tomé, and Principe) Multilingual studies, methods, applications, and resources including Portuguese/Galician The systems may be of the following kinds: Natural Language Processing systems or system components Application systems using language technology components Software tools for computational linguistics research Software for demonstration or evaluation Development tools *Submissions:* Submissions should consist of a non-anonymous brief description document of up to three pages of content, including references. Developers must outline the main characteristics of their system/product/tool, provide sufficient details to allow its evaluation, and give information on how they plan to demonstrate it. Developers are encouraged to focus their description on the relevance of the computational processing component of Portuguese or Galician in the proposed system. Submissions should be written in English. At submission time, only PDF format is accepted. For the final versions, authors of accepted papers will be given one extra content page to take the reviews into account. Authors of accepted papers will be requested to send the source files for the production of the proceedings. Submissions must be sent via EasyChair ( https://easychair.org/my/conference?conf=propor2024) — please select the track: PROPOR2024 Demo Paper. All submitted papers must conform to the official ACL style guidelines. ACL provides style files for LaTeX and Microsoft Word that meet these requirements. They can be found at: LaTeX styelesheet: https://github.com/acl-org/acl-style-files/tree/master/latex MS Word stylesheet: https://github.com/acl-org/acl-style-files/tree/master/word Publication: Accepted demo papers are expected to be published by ACL as a volume in ACL Anthology (https://aclanthology.org/) as part of the PROPOR 2024 proceedings. They will be available online. To ensure publication, at least one author of each accepted paper must complete an adequate registration for PROPOR 2024 by the early registration deadline. *Presentation format:* Accepted demos will be presented at a designated demo session with an optional accompanying poster. Developers should make sure they could run their demos properly. Thus, it is the authors’ responsibility to provide the necessary technical conditions (i.e. equipment) for the demo at the conference. Note that the local organizers will not provide any hardware or software. Free high-speed Internet access will be available. There will be a best demo award for the best-presented project. Further details on the date, time, and instructions of the demonstration session(s) will be determined and provided at a later date. *Demo chairs:* Marlo Souza (Universidade Federal da Bahia, Brazil) Iria de-Dios-Flores (Universidade de Santiago de Compostela, Spain) -- *Iria de-Dios-Flores (PhD)* *https://sites.google.com/view/iriadediosflores/ <https://sites.google.com/view/iriadediosflores/>*

1 0

1st CfP: Workshop on Reference, Framing, and Perspective
by pia.sommerauer＠live.com 11 Dec '23

11 Dec '23

We are very pleased to share our first call for papers for our workshop on Reference, Framing, and Perspective co-located with LREC-COLING 2024. * Workshop website: https://cltl.github.io/reference-framing-perspective/ * When: Saturday, May 25th, 20204 * Where: Torino, Italy (co-located with LREC-COLING 2024) * Deadline for submissions: February (details tba) * Paper submission link: tba * Deadline for camera-ready papers: beginning of April 2024 (details tba) When something happens in the world, we have access to an unlimited range of ways (from lexical choices to specific syntactic structures) to refer to the same real-world event. Variations in reference may convey radically different perspectives. This process of making reference to something by adopting a specific perspective is also known as framing. Although previous work in is this area is present (see Ali and Hassan (2022)’s survey for an overview), there is a lack of unitary framework and only few targeted datasets (Chen et al., 2019) and tools based on Large Language Models exist (Minnema et al., 2022). In this workshop, we propose to adopt Frame Semantics (Fillmore, 1968, 1985, 2006) as a unifying theoretical framework and analysis method to understand the choices made in linguistic references to events. The semantic frames (expressed by predicates and roles) we choose give rise to our understanding, or framing, of an event. We aim to bring together different research communities interested in lexical and syntactic variation, referential grounding, frame semantics, and perspectives. We believe that there is significant overlap within the goals and interests of these communities, but not the necessary common ground to enable collaborative work. Shared dataset: To facilitate discussion among participants and to make this a real working workshop, we make available a shared corpus. The corpus is composed of news articles reporting on the 2020/2021 Eurovision Song Contest (canceled in 2020 and held in 2021) that took place in Rotterdam (the Netherlands). The news articles have been collected using the structured data-to-text approach (Vossen et al., 2018). At this point, the corpus contains texts in English and Dutch. We are extending it to a range of other European languages. We invite participants to submit short and targeted analyses using the data (extended abstracts to be discussed in a hands-on data session). Participants are also free to use the data in regular contributions. More information about the corpus will be released soon. Regular contributions: We aim to lay the groundwork for such efforts. We invite contributions (regular long papers of 8 pages or short papers of 4 pages) targeting any of the following - non-exhaustive - list of topics: * Theoretical models of framing and perspective * Annotation frameworks for framing and perspectives * Computational models of framing and perspective * Approaches for creating and analyzing referentially grounded datasets (containing different perspectives, written at different points in time, written in different languages) * Approaches for and analyses of texts about contested and divisive events triggering different opinions and perspectives * Analyses of and methods for analyzing (diachronic) lexical variation and framing * Language resources for reference, frames, and perspectives * Approaches and tools to compare claims of sources * Frames as expressions of bias in the representation of social groups * User interface for the visualization of multiple perspectives Extended abstracts: We invite extended abstracts (1,500 words maximum) about small analyses or experiments conducted on our Shared Data. The abstracts will be non-archival and discussed in a dedicated data session. Invited speakers: Maria Antoniak Vered Shwartz Organizers: Pia Sommerauer, Tommaso Caselli, Malvina Nissim, Levi Remijnse, Piek Vossen

1 0

2026

2025

2024

2023

2022

Corpora December 2023