- Corpora - ELRA lists

CAiSE'24: Last Call for Journal First Submissions
by Announce 22 Mar '24

22 Mar '24

*** Last Call for Journal First Submissions *** 36th International Conference on Advanced Information Systems Engineering (CAiSE'24) June 3-7, 2024, 5* St. Raphael Resort and Marina, Limassol, Cyprus https://cyprusconferences.org/caise2024/ (*** Submission Deadline: 31st March, 2024 AoE ***) CAiSE 2024 is organising journal-first sessions as part of the scientific program. The aim of these sessions is to disseminate recent important research contributions and spark discussions between authors and researchers in the CAiSE community. Authors of selected journal articles on CAiSE-related topics will be invited to present their work at the conference. SCOPE For the journal-first sessions, we solicit submissions related to articles that have been accepted for publication by a reputable journal and that meet the following criteria: • The article relates to the topics of the CAiSE conference and the recent call for papers. • The article is an original submission to the journal and not an extension of an earlier conference or workshop paper. • The article is an original research article; review articles or commentaries will not be considered. • The article was accepted for publication by a journal on or after 1 January 2023, the acceptance must have been publicly announced, the article must be available at the publisher’s website (e.g., as "articles in advance" or published on a journal’s website), and the article must be written in English. • The article has not been presented at, and is not under consideration for, journal-first tracks of other conferences. FORMAT Accepted submissions will be presented as part of the CAiSE 2024 scientific programme. SUBMISION Submissions must be done electronically via Easychair (https://easychair.org/my/conference?conf=caise2024) and include: • Title and author information of the article. • The original abstract and keywords. • DOI of the original publication or, alternatively, a link to the publication at the journal’s website. EVALUATION All submissions will be reviewed by the track chairs with the aim to accept all qualifying submissions subject to ability to accommodate them in the program. If needed, priority will be given to submissions according to their topical fit with the scope of the conference, the importance of the contribution, as well as the standing of the respective journal (including, but not limited to, the journal's impact factor and ranking results). ATTENDANCE AND PRESENTATION At least one author of each submission accepted for the journal-first track must register and attend the conference to present the work. The author needs a full registration to present the journal article. As the articles of the journal-first track have been published already, they will not be part of the CAiSE 2024 proceedings. The articles will be listed in the conference program and CAiSE 2024 participants will have access to the respective abstracts and a pointer to the original journal article. IMPORTANT DATES • Submission: 31st March, 2024 (AoE) • Notification of Acceptance: 14th April, 2024 • Author Registration: 17th May, 2024 • Conference Dates: 3rd-7th June, 2024 JOURNAL FIRST CHAIRS • Paolo Giorgini, University of Trento, Italy • Jeffrey Parsons, Memorial University of Newfoundland, Canada

1 0

Preliminary Call for Papers – Special issue of Information Processing & Management on Large Language Models and Data Quality for Knowledge Graphs
by Stefano Marchesin 22 Mar '24

22 Mar '24

Apologies for crossposting. Call for Papers Information Processing & Management (IPM), Elsevier - CiteScore: 14.8 - Impact Factor: 8.6 Guest editors: - Omar Alonso, Applied Science, Amazon, Palo Alto, California, USA. E-mail: omralon(a)amazon.com - Stefano Marchesin, Department of Information Engineering, University of Padua, Padua, Italy. E-mail: stefano.marchesin(a)unipd.it - Gianmaria Silvello, Department of Information Engineering, University of Padua, Padua, Italy. E-mail: gianmaria.silvello(a)unipd.it Special Issue on “Large Language Models and Data Quality for Knowledge Graphs” In recent years, Knowledge Graphs (KGs), encompassing millions of relational facts, have emerged as central assets to support virtual assistants and search and recommendations on the web. Moreover, KGs are increasingly used by large companies and organizations to organize and comprehend their data, with industry-scale KGs fusing data from various sources for downstream applications. Building KGs involves data management and artificial intelligence areas, such as data integration, cleaning, named entity recognition and disambiguation, relation extraction, and active learning. However, the methods used to build these KGs involve automated components that could be better, resulting in KGs with high sparsity and incorporating several inaccuracies and wrong facts. As a result, evaluating the KG quality plays a significant role, as it serves multiple purposes – e.g., gaining insights into the quality of data, triggering the refinement of the KG construction process, and providing valuable information to downstream applications. In this regard, the information in the KG must be correct to ensure an engaging user experience for entity-oriented services like virtual assistants. Despite its importance, there is little research on data quality and evaluation for KGs at scale. In this context, the rise of Large Language Models (LLMs) opens up unprecedented opportunities – and challenges – to advance KG construction and evaluation, providing an intriguing intersection between human and machine capabilities. On the one hand, integrating LLMs within KG construction systems could trigger the development of more context-aware and adaptive AI systems. At the same time, however, LLMs are known to hallucinate and can thus generate mis/disinformation, which can affect the quality of the resulting KG. In this sense, reliability and credibility components are of paramount importance to manage the hallucinations produced by LLMs and avoid polluting the KG. On the other hand, investigating how to combine LLMs and quality evaluation has excellent potential, as shown by promising results from using LLMs to generate relevance judgments in information retrieval. Thus, this special issue promotes novel research on human-machine collaboration for KG construction and evaluation, fostering the intersection between KGs and LLMs. To this end, we encourage submissions related to using LLMs within KG construction systems, evaluating KG quality, and applying quality control systems to empower KG and LLM interactions on both research- and industrial-oriented scenarios. Topics include but are not limited to: - KG construction systems - Use of LLMs for KG generation - Efficient solutions to deploy LLMs on large-scale KGs - Quality control systems for KG construction - KG versioning and active learning - Human-in-the-loop architectures - Efficient KG quality assessment - Quality assessment over temporal and dynamic KGs - Redundancy and completeness issues - Error detection and correction mechanisms - Benchmarks and Evaluation - Domain-specific applications and challenges - Maintenance of industry-scale KGs - LLM validation via reliable/credible KG data Submission guidelines: Authors are invited to submit original and unpublished papers. All submissions will be peer-reviewed and judged on originality, significance, quality, and relevance to the special issue topics of interest. Submitted papers should not have appeared in or be under consideration for another journal. Papers can be submitted from 1 June 2024 to 1 September 2024. The estimated publication date for the special issue is 15 January 2025. Papers submission via IP&M electronic submission system: https://www.editorialmanager.com/IPM Instructions for authors: https://www.sciencedirect.com/journal/information-processing-and-management… To submit your manuscript to the special issue, please choose the article type: "VSI: LLMs and Data Quality for KGs". More info here: https://www.sciencedirect.com/journal/information-processing-and-management… Important dates: - Submissions open: 1 June 2024 - Submissions close: 1 September 2024 - Publication date: 15 January 2025 References: Weikum G., Dong X.L., Razniewski S., et al. (2021) Machine knowledge: creation and curation of comprehensive knowledge bases. Found. Trends Databases, 10, 108–490. Hogan A., Blomqvist E., Cochez M. et al. (2021) Knowledge graphs. ACM Comput. Surv., 54, 71:1–71:37. B. Xue and L. Zou. 2023. Knowledge Graph Quality Management: A Comprehensive Survey. IEEE Trans. Knowl. Data Eng. 35, 5 (2023), 4969 – 4988 G. Faggioli, L. Dietz, C. L. A. Clarke, G. Demartini, M. Hagen, C. Hauff, N. Kando, E. Kanoulas, M. Potthast, B. Stein, and H. Wachsmuth. 2023. Perspectives on Large Language Models for Relevance Judgment. In Proc. of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval, ICTIR 2023, Taipei, Taiwan, 23 July 2023. ACM, 39 – 50. S. MacAvaney and L. Soldaini. 2023. One-Shot Labeling for Automatic Relevance Estimation. In Proc. of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2023, Taipei, Taiwan, July 23-27, 2023. ACM, 2230 – 2235. X. L. Dong. 2023. Generations of Knowledge Graphs: The Crazy Ideas and the Business Impact. Proc. VLDB Endow. 16, 12 (2023), 4130 – 4137. S. Pan, L. Luo, Y. Wang, C. Chen, J. Wang, and X. Wu. 2023. Unifying Large Language Models and Knowledge Graphs: A Roadmap. CoRR abs/2306.08302 (2023). -- Stefano Marchesin, PhD Assistant Professor (RTD/a) Information Management Systems (IMS) Group Department of Information Engineering University of Padua Via Gradenigo 6/a, 35131 Padua, Italy Home page: http://www.dei.unipd.it/~marches1/

1 0

[CfP] Deadline extended: Workshop on Reclaiming the Narrative: Digital Recovery, AI & Mitigating Harm in Social Media @ ICWSM 2024
by Steve Wilson 21 Mar '24

21 Mar '24

Join us for the 1st workshop on “Reclaiming the Narrative: Digital Recovery, AI & Mitigating Harm in Social Media” at ICWSM! If you work in harm reduction, NLP, AI, Social Sciences, recovery, HCI, and adjacent narrative studies, this one’s for you! * When: June 3, 2024. * Format: Hybrid (@Buffalo, NY and Zoom) * [NEW] Submission deadline extended to: April 1 We invite submissions of abstracts (2 pages), as well as Long (8 pages) and Short (4 pages) papers, excluding references and appendices. The Long and Short papers will be included in the ICWSM Workshop proceedings, published by AAAI Press. For more details, please visit https://sites.google.com/view/reclaiming-the-narrative/

1 0

[CfP] SIGIR Workshop on evaluating IR systems with Large Language Models (LLM4EVAL)
by Guglielmo Faggioli 21 Mar '24

21 Mar '24

Overview The first workshop on evaluating IR systems with Large Language Models (LLMs) is accepting submissions that describe original research findings, preliminary research results, proposals for new work, and recent relevant studies already published in high-quality venues. The workshop will have both an in-person and virtual component, and submissions are welcome even for researchers who cannot attend in person, as they will present their work in the virtual component. Topics of interest We welcome both full papers and extended abstract submissions on the following topics, including but not limited to: - LLM-based evaluation metrics for traditional IR and generative IR. - Agreement between human and LLM labels. - Effectiveness and/or efficiency of LLMs to produce robust relevance labels. - Investigating LLM-based relevance estimators for potential systemic biases. - Automated evaluation of text generation systems. - End-to-end evaluation of Retrieval Augmented Generation systems. - Trustworthiness in the world of LLMs evaluation. - Prompt engineering in LLMs evaluation. - Effectiveness and/or efficiency of LLMs as ranking models. - LLMs in specific IR tasks such as personalized search, conversational search, and multimodal retrieval. - Challenges and future directions in LLM-based IR evaluation. Submission guidelines We welcome the following submissions: - Previously unpublished manuscripts will be accepted as extended abstracts and full papers (any length between 1 - 9 pages) with unlimited references, formatted according to the latest ACM SIG proceedings template available at http://www.acm.org/publications/proceedings-template. - Published manuscripts can be submitted in their original format. All submissions should be made through Easychair: https://easychair.org/conferences/?conf=llm4eval All papers will be peer-reviewed (single-blind) by the program committee and judged by their relevance to the workshop, especially to the main themes identified above, and their potential to generate discussion. For already published studies, the paper can be submitted in the original format. These submissions will be reviewed for their relevance to this workshop. All submissions must be in English (PDF format). Please note the workshop will have an in-person (to be held with SIGIR 2024) and virtual component (to be held at a later date on SIGIR VF). During submission, the authors should select their preferred component. All accepted papers will have a poster presentation with a few selected for spotlight talks. Accepted papers may be uploaded to arXiv.org, allowing submission elsewhere as they will be considered non-archival. The workshop’s website will maintain a link to the arXiv versions of the papers. Important Dates - Submission Deadline: April 25th, 2024 (AoE time) - Acceptance Notifications: May 31st, 2024 (AoE time) - Workshop date: July 18, 2024 Website and Contact More details are available at https://llm4eval.github.io/cfp/. For any questions about paper submission, you may contact the workshop organizers at llm4eval(a)easychair.org

1 0

Mapping of American English vocabulary by grade levels
by Flor, Michael 21 Mar '24

21 Mar '24

Dear colleagues, We are happy to announce the availability of the following lexical resource: A graded word list of American English, for 126K words. The publication is: Flor, M., Holtzman, S., Deane, P., & I. Bejar (2024). Mapping of American English vocabulary by grade levels. ITL - International Journal of Applied Linguistics. DOI: https://doi.org/10.1075/itl.22025.flo The resource is available at GitHub: https://github.com/maafiah/VXGL Michael Flor Senior Research Scientist Research Division Educational Testing Service Princeton, NJ, USA mflor(a)ets.org<mailto:mflor@ets.org> ________________________________ This e-mail and any files transmitted with it may contain privileged or confidential information. It is solely for use by the individual for whom it is intended, even if addressed incorrectly. If you received this e-mail in error, please notify the sender; do not disclose, copy, distribute, or take any action in reliance on the contents of this information; and delete it from your system. Any other use of this e-mail is prohibited. Thank you for your compliance. ________________________________

1 0

Job : Poste MCF Section 71, Universite de Lille
by Amel Fraisse 21 Mar '24

21 Mar '24

Cher.e.s, collègues, un poste d’enseignant-chercheur (MCF) en Sciences de l’information et de la communication est ouvert au concours lors de la campagne sychronisée 2024. Le profil de poste "Mutations de l'information et de la communication scientifique » est disponible sur galaxie : https://www.galaxie.enseignementsup-recherche.gouv.fr/ensup/ListesPostesPub… La personne recrutée effectuera ses recherches au sein du laboratoire Geriico. Les enseignements s’effectueront au sein du département INFODOC de l’université de Lille. Bien cordialement, Amel Fraisse. <> Amel Fraisse Maitresse de Conférences Directrice Département INFODOC Université de Lille - Département INFODOC - Laboratoire GERiiCO amel.fraisse(a)univ-lille.fr <mailto:prenom.nom@univ-lille.fr> / https://pro.univ-lille.fr/amel-fraisse/ <http://www.univ-lille.fr/> Domaine Universitaire de Pont de Bois - Villeneuve d'Ascq Bât. 2 - bureau B2.467 T. +33 (0)3 20 41 69 38

1 0

[CFP] 1st Call for Participation - ISWC-LLMs4OL 2024 Challenge: Large Language Models for Ontology Learning
by Jennifer D'Souza 21 Mar '24

21 Mar '24

[apologies if you received multiple copies of this call] Dear colleagues and friends, *We are pleased to release the 1st Call for Participation - LLMs4OL Challenge collocated with The International Semantic Web Conference (ISWC 2024)* *Overview:* LLMs4OL stands for "Large Language Models for Ontology Learning." The LLMs4OL paradigm was first introduced in our research paper ( https://link.springer.com/chapter/10.1007/978-3-031-47240-4_22) published in the ISWC 2023 main conference proceedings. In this context, we aimed to test the readiness of LLMs to address the Ontology Learning (OL) task w.r.t. three main subtasks: 1) Term Typing, 2) Type Taxonomy Discovery, and 3) Non-Taxonomic Relation Extraction. Therein our evaluations included ontolgies from various knowledge domains, i.e. lexicosemantics (WordNet), geography (GeoNames), biomedicine (NCI, MEDICIN, SNOMEDCT), and web content types (schema.org). With the ISWC-LLMs4OL 2024 challenge, we aim to catalyze community-wide engagement in validating and expanding the use of LLMs in OL by releasing our evaluation datasets publicly in the community. This initiative is poised to advance our comprehension of LLMs’ roles within the Semantic Web, encouraging innovation and collaboration in developing scalable and accurate OL methods. More info on the task website: https://sites.google.com/view/llms4ol/ The LLMs4OL Challenge will be divided into two evaluation phases: - Evaluation Phase 1: Few-shot Testing; - Evaluation Phase 2: Zero-shot Testing *Dates* Training datasets available: March 30, 2024 Test data available (Task A): May 27, 2024 Evaluation ends (Task A): June 4, 2024 Test data available (Tasks B & C): June 5, 2024 Evaluation ends (Tasks B & C): June 18, 2024 Participant papers due: June 28, 2024 Notification to authors: July 19, 2024 Camera ready due: July 30, 2024 ISWC 2024, Baltimore, Maryland, USA: 11-15 November 2024 *Task Organizers* Hamed Babaei Giglou (TIB Leibniz Information Centre for Science and Technology - Germany) Jennifer D’Souza (TIB Leibniz Information Centre for Science and Technology - Germany) Sören Auer (TIB Leibniz Information Centre for Science and Technology - Germany) We look forward to having you on board! *Contact:* llms4ol.challenge [at] gmail.com

1 0

NLPAICS’2024: Submission Deadline Extended to 22 April 2024
by t.ranasinghe＠aston.ac.uk 21 Mar '24

21 Mar '24

First International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS’2024) Lancaster University, Lancaster, United Kingdom 29-30 July 2024 https://www.nlpaics.com *** Submission Deadline Extended to 22 April 2024 *** Recent advances in Natural Language Processing (NLP), Deep Learning and Large Language Models (LLMs) have resulted in improved performance of applications. . In particular, there has been a growing interest in employing AI methods in different Cyber Security applications. In today's digital world, Cyber Security has emerged as a heightened priority for both individual users and organisations. As the volume of online information grows exponentially, traditional security approaches often struggle to identify and prevent evolving security threats. The inadequacy of conventional security frameworks highlights the need for innovative solutions that can effectively navigate the complex digital landscape for ensuring robust security. NLP and AI in Cyber Security have vast potential to significantly enhance threat detection and mitigation by fostering the development of advanced security systems for autonomous identification, assessment, and response to security threats in real-time. Recognising this challenge and the capabilities of NLP and AI approaches to fortify Cyber Security systems, the First International Conference on Natural Language Processing (NLP) and Artificial Intelligence (AI) for Cyber Security (NLPAICS’2024) serves as a gathering place for researchers in NLP and AI methods for Cyber Security. We invite contributions that present the latest NLP and AI solutions for mitigating risks in processing digital information. Conference topics The conference invites submissions on a broad range of topics related to the employment of NLP and AI (and in general, language studies and models) for Cyber Security including but not limited to: ## Societal and Human Security and Safety - Content Legitimacy and Quality o Detection and mitigation of hate speech and offensive language o Fake news, deepfakes, misinformation and disinformation o Detection of machine generated language in multimodal context (text, speech and gesture) o Trust and credibility of online information - User Security and Safety o Cyberbullying and identification of internet offenders o Monitoring extremist fora o Suicide prevention o Clickbait and scam detection o Fake profile detection in online social networks - Technical Measures and Solutions o Social engineering identification, phishing detection o NLP for risk assessment o Controlled languages for safe messages o Prevention of malicious use of ai models o Forensic linguistics - Human Factors in Cyber Security ## Speech Technology and Multimodal Investigations for Cyber Security - Voice-based security: Analysis of voice recordings or transcripts for security threats - Detection of machine generated language in multimodal context (text, speech and gesture) - NLP and biometrics in multimodal context ## Data and Software Security - Cryptography - Digital forensics - Malware detection, obfuscation - Models for documentation - NLP for data privacy and leakage prevention (DLP) - Addressing dataset “poisoning” attacks ## Human-Centric Security and Support - Natural language understanding for chatbots: NLP-powered chatbots for user support and security incident reporting - User behaviour analysis: analysing user-generated text data (e.g., chat logs and emails) to detect insider threats or unusual behaviour - Human supervision of technology for Cyber Security ## Anomaly Detection and Threat Intelligence - Text-Based Anomaly Detection o Identification of unusual or suspicious patterns in logs, incident reports or other textual data o Detecting deviations from normal behaviour in system logs or network traffic - Threat Intelligence Analysis o Processing and analysing threat intelligence reports, news, articles and blogs on latest Cyber Security threats o Extracting key information and indicators of compromise (IoCs) from unstructured text ## Systems and Infrastructure Security - Systems Security o Anti-reverse engineering for protecting privacy and anonymity o Identification and mitigation of side-channel attacks o Authentication and access control o Enterprise-level mitigation o NLP for software vulnerability detection - Malware Detection through Code Analysis o Analysing code and scripts for malware o Detection using NLP to identify patterns indicative of malicious code ## Financial Cyber Security - Financial fraud detection - Financial risk detection - Algorithmic trading security - Secure online banking - Risk management in finance - Financial text analytics ## Ethics, Bias, and Legislation in Cyber Security - Ethical and Legal Issues o Digital privacy and identity management o The ethics of NLP and speech technology o Explainability of NLP and speech technology tools o Legislation against malicious use of AI o Regulatory issues - Bias and Security o Bias in Large Language Models (LLMs) o Bias in security related datasets and annotations ## Datasets and resources for Cyber Security Applications ## Specialised Security Applications and Open Topics - Intelligence applications - Emerging and innovative applications in Cyber Security ## Special Theme Track - Future of Cyber Security in the Era of LLMs and Generative AI We are excited to share that NLPAICS 2024 will have a special theme track with the goal of stimulating discussion around Large Language Models (LLMs), Generative AI and ensuring their safety. The latest generation of LLMs, such as CHATGPT, Gemini, LLAMA and open-source alternatives, has showcased remarkable advancements in text and image understanding and generation. However, as we navigate through uncharted territory, it becomes imperative to address the challenges associated with employing these models in everyday tasks, focusing on aspects such as fairness, ethics, and responsibility. The theme track invites studies on how to ensure the safety of LLMs in various tasks and applications and what this means for the future of the field. The possible topics of discussion include (but are not limited to) the following: - Detection of LLM-generated language in multimodal context (text, speech and gesture) - LLMs for forensic linguistics - Bias in LLMs - Safety benchmarks for LLMs - Legislation against malicious use of LLMs - Tools to evaluate safety in LLMs - Methods to enhance the robustness of language models ## Submissions and Publication NLPAICS welcomes high-quality submissions in English, which can take two forms: -Regular long papers: These can be up to eight (8) pages long, presenting substantial, original, completed, and unpublished work. -Short papers: These can be up to four (4) pages long and are suitable for describing small, focused contributions, negative results, system demonstrations, etc. Note that the page limits mentioned above exclude additional pages for references, ethical considerations, conflict-of-interest statements, as well as data and code availability statements. Papers must be anonymised to support double-blind reviewing. Please submit your work as pdf using the following link: https://softconf.com/n/nlpaics2024/ Submission templates can be accessed here: LaTeX Overleaf, LaTeX , MS Office Accepted papers, including both long and short papers, will be published as part of the same e-proceedings to be uploaded on ACL Anthology. ## Important dates -Submissions due: 22 April 2024 -Reviewing process: 29 April-5 June 2024 -Notification of acceptance: 10 June 2024 -Camera-ready due: 1 July 2024 -Conference: 29-30 July 2024 ## Keynote speakers We are delighted to announce our already confirmed keynote speakers Nigel Hardacre (Lancashire Constabulary) Sevil Şen (Hacettepe University) More keynote speakers will be listed in soon. ## Programme Committee Members of the Programme Committee of NLPAICS’2024 are listed here. ## Venue The First International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS’2024) will take place at Lancaster University and is organised by the Lancaster University UCREL NLP research group. ## Organisation - Conference Chair o Ruslan Mitkov (Lancaster University) - Conference Programme Chairs o Cengiz Acartürk (Jagiellonian University) o Matthew Bradbury (Lancaster University) o Mo El-Haj (Lancaster University) o Paul Rayson (Lancaster University) - Sponsorship Chair o Saad Ezzini (Lancaster University) - Publicity Chair o Tharindu Ranasinghe (Aston University) - Publication Chair o Ignatius Ezeani (Lancaster University) - Social Programme Chair o Nouran Khallaf (Lancaster University) ## Registration Conference registration is open on https://nlpaics.com/registration/ Early bird registration closes on 15 April 2024. ## Further information and contact details The conference website is https://nlpaics.com and will be updated on a regular basis. For further information, please email info(a)nlpaics.com Dr Tharindu Ranasinghe Lecturer in Computer Science School of Informatics and Digital Engineering Birmingham, B4 7ET, UK aston.ac.uk

1 0

HIRING: Associate Research Scientists (PhD, Postdoc) in AI and NLP, at UKP Lab and INSAIT
by Niemann, Elisabeth 21 Mar '24

21 Mar '24

The UKP Lab in Darmstadt, Germany, led by Iryna Gurevych and the recently founded INSAIT in Sofia, Bulgaria have several job openings: *** Associate Research Scientists (PostDoc- or PhD-level) in AI and NLP *** Are you an outstanding PhD candidate or Postdoc with a strong profile in Natural Language Processing, LLMs and AI? We have several openings as Associate Research Scientists at UKP Lab (Germany) and INSAIT (Bulgaria)! We highly appreciate demonstrable engagement in open-source projects, communication skills in English and the ability to effectively cooperate with scientists of various interdisciplinary backgrounds. Prior experience with relevant areas of NLP and Machine Learning and strong engineering skills are a plus. More information about the opening and the application process can be found here: https://www.informatik.tu-darmstadt.de/ukp/ukp_home/jobs_ukp/2024_associate… Join our internationally recognized team, enjoy diverse opportunities for professional development, and conduct cutting-edge research! Application deadline: April 15th, 2024. Please submit your application via the following form: https://careers.ukp.informatik.tu-darmstadt.de/ukprecruitment. Please indicate which institute you are applying to. -------------------------------------------------------------------- Prof. Dr. Iryna Gurevych UKP Lab, Technical University Darmstadt, Germany INSAIT, Sofia, Bulgaria

1 0

NeTTT’2024 Submission Deadline Extended to 30 April 2024
by t.ranasinghe＠aston.ac.uk 21 Mar '24

21 Mar '24

International Conference ‘New Trends in Translation and Technology’ (NeTTT’2024) Varna, Bulgaria, 3-6 July 2024 https://nettt-conference.com/ *** Submission Deadline Extended to 30 April 2024 *** # The conference The second edition of the forthcoming International Conference ‘New Trends in Translation and Technology’ (NeTTT’2024) will take place in Varna, Bulgaria, 3-6 July 2024. Continuing the tradition of the first edition of the NeTTT conference and HiT-IT events series, the objective of the conference is (i) to bridge the gap between academia and industry in the field of translation and interpreting by bringing together academics in linguistics, translation studies, machine translation and natural language processing, developers, practitioners, language service providers and vendors who work on or are interested in different aspects of technology for translation and interpreting, and (ii) to be a distinctive event for discussing the latest developments and practices. NeTTT’2024 invites all professionals who would like to learn about the new trends, present the latest work or/and share their experience in the field, and who would like to establish business and research contacts, collaborations and new ventures. The conference will take the form of presentations (peer-reviewed research and user presentations, keynote speeches), and posters; it will also feature panel discussions. The accepted papers will be published as open-access conference e-proceedings. # Conference topics Contributions are invited on any topic related to latest technology and practices in machine translation, translation, subtitling, localisation and interpreting. NeTTT’2024 will feature a Special Theme Track "Future of Translation Technology in the Era of LLMs and Generative AI". The conference topics include but are not limited to: ## CAT tools - Translation Memory (TM) systems - NLP and MT for translation memory systems - Terminology extraction tools - Localisation tools ## Machine Translation - Latest developments in Neural Machine Translation - MT for under-resourced languages - MT with low computing resources - Multimodal MT - Integration of MT in TM systems - Resources for MT ## Technologies for MT deployment - MT evaluation techniques, metrics and evaluation results - Human evaluations of MT output - Evaluating MT in a real-world setting - Quality estimation for MT - Domain adaptation ## Translation Studies - Corpus-based studies applied to translation - Corpora and resources for translation - Translationese - Cognitive effort and eye-tracking experiments in translation ## Interpreting studies - Corpus-based studies applied to interpreting - Corpora and resources for interpreting - Interpretese - Resources for interpreting and interpreting technology applications - Cognitive effort and eye-tracking experiments in interpreting ## Interpreting technology - Machine interpreting - Computer-aided interpreting - NLP for dialogue interpreting - Development of NLP based applications for communication in public service settings (healthcare, education, law, emergency services) ## Emerging Areas in Translation and Interpreting - MT and translation tools for literary texts and creative texts - MT for social media and real-time conversations - Sign language recognition and translation ## Subtitling - NLP and MT for subtitling - Latest technology for subtitling ## User needs - Analysis of translators’ and interpreters’ needs in terms of translation and interpreting technology - User requirements for interpreting and translation tools - Incorporating human knowledge into translation and interpreting technology - What existing translators’ (including subtitlers’) and interpreters’ tools do not offer - User requirements for electronic resources for translators and interpreters - Translation and interpreting workflows in larger organisations and the tools for translation and interpreting employed ## The business of translation and interpreting - Translation workflow and management - Technology adoption by translators and industry - Setting up translation /interpreting / language provider company ## Teaching translation and interpreting - Teaching Machine Translation - Teaching translation technology - Teaching interpreting technology - Latest AI developments in the syllabi of translation and interpreting curricula ## Ethical issues in translation and technology - Bias and fairness in MT - Privacy and security in cloud MT systems - Transparency and explainability of MT systems - Environmental impact on MT systems # Special Theme Track - Future of Translation Technology in the Era of LLMs and Generative AI We are excited to share that NeTTT’2024 will have a special theme with the goal of stimulating discussion around Large Language Models, Generative AI and the Future of Translation and Interpreting Technology. While the new generation of Large Language Models such as CHATGPT and LLAMA showcase remarkable advancements in language generation and understanding, we find ourselves in uncharted territory when it comes to their performance on various Translation and Interpreting Technology tasks with regards to fairness, interpretability, ethics and transparency. The theme track invites studies on how LLMs perform on Translation and Interpreting Technology tasks and applications, and what this means for the future of the field. The possible topics of discussion include (but are not limited to) the following: - Changes in the translators and interpreters’ professions in the new AI era especially as a result of the latest developments in LLMSs and Generative AI - Generative AI and translation - Generative AI and interpreting - Augmenting machine translation systems with generative AI - Domain and terminology adaptation with Large Language Models - Literary translation with Large Language Models - Improving Machine Translation Quality with Contextual Prompts in Large Language Models - Prompt engineering for translation - Generative AI for professional translation - Generative AI for professional interpreting # Keynote speakers We are delighted to announce the NeTTT’2024 keynote speakers - Helena Moniz (University of Lisbon and Unbabel), President of the European Association of Machine Translation - Carla Parra Escartín (RWS Language Weaver) # Tutorial (3 July 2024) - Tharindu Ranasinghe (Aston University), Quality Estimation for Machine Translation # Programme Committee The Programme Committee of NeTTT’2024 is listed https://nettt-conference.com/26844-2/. # Conference Chairs - Ruslan Mitkov (Lancaster University) - Gloria Corpas Pastor (University of Malaga) # Programme Chairs - Constantin Orasan (University of Surrey) - Tharindu Ranasinghe (Aston University) # Sponsorship Chair - Vilelmini Sosoni (Ionian University) # Publication Chair - Maria Kunilovskaya (University of Saarland) # Organising Committee - Organising Committee of NeTTT’2024 is listed https://nettt-conference.com/organisers/ # Submissions and publication NETTT’2024 invites the following types of submissions: User papers – for industry and practitioners. References to related work are optional. Allowed paper length: between 1 and 4 pages. Academic submissions, in three different categories (have to follow formatting requirements, references to related work are required): • (academic) full papers – describing original completed research. Allowed paper length: maximum 12 pages + unlimited references. • (academic) work-in-progress papers/posters – describing work in progress, late breaking research, papers at a more conceptual stage, and other types of papers that do not fit in the ‘full’ papers category. Allowed paper length: maximum 7 pages + unlimited references. • (academic) demo papers – describing working systems. Allowed paper length: maximum 5 pages + unlimited references. In addition to the papers, the authors will be expected to demonstrate the systems at the workshop. The conference will not consider and evaluate abstracts only. Each submission will be reviewed by three members of the Programme Committee. Submission is organised via Softconf START conference management system at https://softconf.com/n/nettt2024. For submitting the papers, we invite the authors to comply with the Springer format, following the templates: • LaTeX: https://resource-cms.springernature.com/springer-cms/rest/v1/content/192386…, • Overleaf: https://www.overleaf.com/latex/templates/springer-lecture-notes-in-computer…, • Word: https://resource-cms.springernature.com/springer-cms/rest/v1/content/192387…. The accepted papers will be published in the conference proceedings and made available online on the conference website. Authors of accepted papers will receive guidelines regarding how to produce camera-ready versions of their papers. The final version of the accepted papers will be published in e-proceedings with assigned ISBN and DOI. All accepted papers will be included in the conference e-proceedings which will be available at the conference website. # Schedule - Submission deadline: 30 April 2024 - Notification: 5 June 2024 - Final version due: 20 June 2024 All deadlines are valid for 23.59 Anywhere on Earth. # Registration Conference registration is open on https://nettt-conference.com/fees-registration/ The promotional early registration fee has been extended to 17 March 2024. # Venue The conference will take place at https://www.chernomorebg.com/en/conference-centre.html, Varna, situated only 200 m away from the fine sandy Black Sea beach. # Further information and contact details The conference website is https://nettt-conference.com and will be updated on a regular basis. For further information, please contact us at nettt2024(a)nettt-conference.com Dr Tharindu Ranasinghe Lecturer in Computer Science School of Informatics and Digital Engineering Birmingham, B4 7ET, UK aston.ac.uk

1 0

2026

2025

2024

2023

2022

Corpora