- Corpora - ELRA lists

Tenure Stream Assistant Professor Positions: Dalhousie University
by Hassan Sajjad 16 Dec '23

16 Dec '23

Full Ad: https://dal.peopleadmin.ca/postings/14872 The Faculty of Computer Science at Dalhousie University ( https://www.cs.dal.ca) invites applications for up to three tenure-stream Assistant Professor positions in any area of Computer Science. We are seeking candidates whose research focuses on one of our five *research clusters* <https://www.dal.ca/faculty/computerscience/research-industry/fcs_research.h…>: 1) Big Data Analytics, Artificial Intelligence & Machine Learning, 2) Human Computer Interaction, Visualization & Graphics, 3) Systems, 4) Algorithms & Bioinformatics and 5) Computer Science Education. Research areas of particular interest include but are not limited to: Computer Vision and Signal Understanding, Qualitative and Design Research in HCI, Natural Language Processing and Artificial Intelligence and Machine Learning. *Application Instructions: *Applications must include a cover letter, curriculum vitae, statements of research and teaching interests, sample publications, and the names and full contact information of three referees. Applications are due by *February 15, 2024*. All application materials should be submitted directly at *https://dal.peopleadmin.ca/postings/14872* <https://dal.peopleadmin.ca/postings/14872>. -- Regards; Hassan Sajjad

1 0

December 2023 Newsletter - LDC
by Penn LDC 15 Dec '23

15 Dec '23

In this newsletter: LDC 2024 membership discounts now available Approaching deadline for Spring 2024 data scholarship applications LDC closed for Winter Break Dec. 25-Jan. 1 New publications: Kasdi-Merbah (University) Emotional Database in Arabic Speech<https://catalog.ldc.upenn.edu/LDC2023S10> TAC-KBP Belief and Sentiment - Comprehensive Training and Evaluation Data 2016-2017<https://catalog.ldc.upenn.edu/LDC2023T13> ________________________________ LDC 2024 membership discounts now available Now through March 1, 2024, current 2023 members receive a 10% discount for renewing their membership, and new or returning organizations receive a 5% discount. Membership remains the most economical way to access current and past LDC releases. Consult Join LDC<https://www.ldc.upenn.edu/members/join-ldc> for details on membership options and benefits. Approaching deadline for Spring 2024 data scholarship applications Attention students: don't miss out on the chance to receive no-cost access to LDC data for your research. Applications for Spring 2024 data scholarships are due January 15, 2024. For more information on requirements and program rules, see LDC Data Scholarships<https://www.ldc.upenn.edu/language-resources/data/data-scholarships>. LDC closed for Winter Break Dec. 25-Jan. 1 LDC will be closed from Monday, December 25, 2023, through Monday, January 1, 2024, in accordance with the University of Pennsylvania Winter Break Policy. Our offices will reopen on Tuesday, January 2, 2024. Requests received by the Membership Office during Winter Break will be processed when the office reopens. ________________________________ New publications: Kasdi-Merbah (University) Emotional Database in Arabic Speech<https://catalog.ldc.upenn.edu/LDC2023S10> was developed by the University of Kasdi Merbah Ouargla<https://www.univ-ouargla.dz/index.php/en/> and contains two hours of Modern Standard Arabic prompted speech from 500 speakers (254 female, 246 male) representing 5,000 utterances. Each speaker read ten sentences, with two sentences each for five different emotions (sadness, fear, anger, happiness, neutral). 2023 members can access this corpus through their LDC accounts. Non-members may license this data for a fee. * TAC-KBP Belief and Sentiment - Comprehensive Training and Evaluation Data 2016-2017<https://catalog.ldc.upenn.edu/LDC2023T13> includes all training and evaluation data developed by LDC for the Belief and Sentiment tracks: source documents (Chinese, English, and Spanish newswire and discussion forums); gold standard entity, relation, and event annotation; and belief and sentiment annotation. The goal of the TAC-KBP Belief and Sentiment track was to provide information about beliefs and sentiments held by entities toward other entities, as well as toward events and relations. The gold standard set of labeled entities, relations, and events was used to create a system for automatically labeling belief and sentiment about each possible target (entity, relation, or event) and for identifying the entity holding the belief or sentiment. 2023 members can access this corpus through their LDC accounts. Non-members may license this data for a fee. To unsubscribe from this newsletter, log in to your LDC account<https://catalog.ldc.upenn.edu/login> and uncheck the box next to "Receive Newsletter" under Account Options or contact LDC for assistance. Membership Coordinator Linguistic Data Consortium<ldc.upenn.edu> University of Pennsylvania T: +1-215-573-1275 E: ldc(a)ldc.upenn.edu<mailto:ldc@ldc.upenn.edu> M: 3600 Market St. Suite 810 Philadelphia, PA 19104

1 0

PhD position: computational approaches to Narrative in Argumentation (1.0 FTE)
by Pianzola, Federico 15 Dec '23

15 Dec '23

Fully-funded 4-year PhD position in the field of Computational Linguistics, focusing on the intricate relationship between narratives and argumentation in persuasive communication. University of Groningen, Netherlands *Deadline: 7 January 11:59pm CET* More info at the link below: https://www.rug.nl/about-ug/work-with-us/job-opportunities/?details=00347-0… -- Kind regards, Federico Pianzola Assistant Professor of Computational Humanities University of Groningen https://federicopianzola.me ERC project; https://golemlab.eu Book:* Digital Social Reading: Sharing Fiction in the 21st Century <https://wip.mitpress.mit.edu/digital-social-reading>* (MIT Press open peer review)

1 0

Edge Hill Corpus Research Group, 11 January 2024
by Costas Gabrielatos 15 Dec '23

15 Dec '23

The next meeting of the Edge Hill corpus Research Group will take place online (via Teams) on Thursday 11 January 2024, 2:00-3:00 pm (UK time). Attendance is free. You can register here: https://store.edgehill.ac.uk/conferences-and-events/conferences/events/edge… Registration closes on Wednesday 10 January, 12 noon (UK time) Topics: Corpus Methodology, Phraseology Speaker: Benet Vincent<https://www.coventry.ac.uk/life-on-campus/staff-directory/arts-and-humaniti…> (Coventry University, UK) Title: Methodological issues and challenges in the use of phrase-frames to investigate phraseology Abstract The importance of gaining a better understanding of phraseology has been recognised for some time now in the area of English for Academic Purposes (EAP). A widespread approach is to extract from a corpus frequently-occurring fixed strings (lexical bundles, or clusters) of potentially useful phrases/multi-word units (see e.g. Gilmore and Millar's 2018). A limitation of this sort of study is the focus on fixed continuous sequences when phrases are well-known to allow a degree of variation (see e.g. Gries, 2008). One proposal to address this limitation is the 'phrase frame' (p-frame), a fixed sequence of items occurring frequently in a corpus with one or two empty slots (Lu, Yoon & Kisselev, 2021). This approach allows researchers to retrieve the most frequent p-frames in a particular corpus, then identify which items typically fill these slots and what meanings / functions might be associated with them. The idea is that the results of such research can help us better understand how members of a specific discourse community typically express themselves, which in turn may inform EAP pedagogy (Lu, Yoon, & Kisselev, 2018). Our project aimed to use a p-frame approach to create a list of pedagogically useful phrases to help novice writers of RA introductions in Health Sciences. A number of studies have used a p-frame approach with similar aims though for different discipline areas, including Fuster-Márquez and Pennock-Speck (2015), Cunningham (2017) and Lu et al., (2018, 2021). However, analysis of these studies indicates that they lack consensus on a number of issues central to p-frame methodology, presenting a challenge for new work in this area. This presentation will provide an overview of the key issues in p-frame research which we have identified and show how we have addressed them. The main aim will be to underline the importance of ensuring that the methods applied by a p-frame study align with the aims of the project. References Cunningham, K. J. (2017). A phraseological exploration of recent mathematics research articles through key phrase frames. Journal of English for Academic Purposes, 25, 71. https://doi.org/10.1016/j.jeap.2016.11.005 Fuster-Márquez, M., & Pennock-Speck, B. (2015). Target frames in British hotel websites. International Journal of English Studies, 15(1), 51-69. https://doi.org/10.6018/ijes/2015/1/213231 Gilmore, A., & Millar, N. (2018). The language of civil engineering research articles: A corpus-based approach. English for Specific Purposes, 51, 1-17. https://doi.org/10.1016/j.esp.2018.02.002 Gries, S. (2008). Phraseology and linguistic theory. In Phraseology: An interdisciplinary perspective, S. Granger & F. Meunier (eds.), 3-26. Lu, X., Yoon, J., & Kisselev, O. (2018). A phrase-frame list for social science research article introductions. Journal of English for Academic Purposes, 36, 76-85. https://doi.org/10.1016/j.jeap.2018.09.004 Lu, X., Yoon, J., & Kisselev, O. (2021). Matching phrase-frames to rhetorical moves in social science research article introductions. English for Specific Purposes, 61, 63-83. https://doi.org/10.1016/j.esp.2020.10.001 ________________________________ Edge Hill University<http://ehu.ac.uk/home/emailfooter> Modern University of the Year, The Times and Sunday Times Good University Guide 2022<http://ehu.ac.uk/tef/emailfooter> University of the Year, Educate North 2021/21 ________________________________ This message is private and confidential. If you have received this message in error, please notify the sender and remove it from your system. Any views or opinions presented are solely those of the author and do not necessarily represent those of Edge Hill or associated companies. Edge Hill University may monitor email traffic data and also the content of email for the purposes of security and business communications during staff absence.<http://ehu.ac.uk/itspolicies/emailfooter>

1 0

Edge Hill Corpus Research Group, 11 January 2024
by Costas Gabrielatos 15 Dec '23

15 Dec '23

The next meeting of the Edge Hill corpus Research Group will take place online (via Teams) on Thursday 11 January 2024, 2:00-3:00 pm (GMT). Attendance is free. You can register here: https://store.edgehill.ac.uk/conferences-and-events/conferences/events/edge… Registration closes on Wednesday 10 January, 12 noon (UK time) Topics: Corpus Methodology, Phraseology Speaker: Benet Vincent<https://www.coventry.ac.uk/life-on-campus/staff-directory/arts-and-humaniti…> (Coventry University, UK) Title: Methodological issues and challenges in the use of phrase-frames to investigate phraseology Abstract The importance of gaining a better understanding of phraseology has been recognised for some time now in the area of English for Academic Purposes (EAP). A widespread approach is to extract from a corpus frequently-occurring fixed strings (lexical bundles, or clusters) of potentially useful phrases/multi-word units (see e.g. Gilmore and Millar's 2018). A limitation of this sort of study is the focus on fixed continuous sequences when phrases are well-known to allow a degree of variation (see e.g. Gries, 2008). One proposal to address this limitation is the 'phrase frame' (p-frame), a fixed sequence of items occurring frequently in a corpus with one or two empty slots (Lu, Yoon & Kisselev, 2021). This approach allows researchers to retrieve the most frequent p-frames in a particular corpus, then identify which items typically fill these slots and what meanings / functions might be associated with them. The idea is that the results of such research can help us better understand how members of a specific discourse community typically express themselves, which in turn may inform EAP pedagogy (Lu, Yoon, & Kisselev, 2018). Our project aimed to use a p-frame approach to create a list of pedagogically useful phrases to help novice writers of RA introductions in Health Sciences. A number of studies have used a p-frame approach with similar aims though for different discipline areas, including Fuster-Márquez and Pennock-Speck (2015), Cunningham (2017) and Lu et al., (2018, 2021). However, analysis of these studies indicates that they lack consensus on a number of issues central to p-frame methodology, presenting a challenge for new work in this area. This presentation will provide an overview of the key issues in p-frame research which we have identified and show how we have addressed them. The main aim will be to underline the importance of ensuring that the methods applied by a p-frame study align with the aims of the project. References Cunningham, K. J. (2017). A phraseological exploration of recent mathematics research articles through key phrase frames. Journal of English for Academic Purposes, 25, 71. https://doi.org/10.1016/j.jeap.2016.11.005 Fuster-Márquez, M., & Pennock-Speck, B. (2015). Target frames in British hotel websites. International Journal of English Studies, 15(1), 51-69. https://doi.org/10.6018/ijes/2015/1/213231 Gilmore, A., & Millar, N. (2018). The language of civil engineering research articles: A corpus-based approach. English for Specific Purposes, 51, 1-17. https://doi.org/10.1016/j.esp.2018.02.002 Gries, S. (2008). Phraseology and linguistic theory. In Phraseology: An interdisciplinary perspective, S. Granger & F. Meunier (eds.), 3-26. Lu, X., Yoon, J., & Kisselev, O. (2018). A phrase-frame list for social science research article introductions. Journal of English for Academic Purposes, 36, 76-85. https://doi.org/10.1016/j.jeap.2018.09.004 Lu, X., Yoon, J., & Kisselev, O. (2021). Matching phrase-frames to rhetorical moves in social science research article introductions. English for Specific Purposes, 61, 63-83. https://doi.org/10.1016/j.esp.2020.10.001 ________________________________ Edge Hill University<http://ehu.ac.uk/home/emailfooter> Modern University of the Year, The Times and Sunday Times Good University Guide 2022<http://ehu.ac.uk/tef/emailfooter> University of the Year, Educate North 2021/21 ________________________________ This message is private and confidential. If you have received this message in error, please notify the sender and remove it from your system. Any views or opinions presented are solely those of the author and do not necessarily represent those of Edge Hill or associated companies. Edge Hill University may monitor email traffic data and also the content of email for the purposes of security and business communications during staff absence.<http://ehu.ac.uk/itspolicies/emailfooter>

1 0

SemEval 2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials
by mael.jullien＠gmail.com 15 Dec '23

15 Dec '23

Call for Participation We're excited to announce "SemEval-2023 Task 7: Multi-Evidence Natural Language Inference for Clinical Trial Data" (NLI4CT). This task aims to enhance the application of Large Language Models (LLMs) in clinical settings, addressing challenges like factual inconsistency, shortcut learning, and performance degradation. The SemEval workshop will be co-located with NAACL 2024 in Mexico City, Mexico on June 16–21, 2024. Background: LLMs are crucial in Natural Language Processing but face limitations in real-world scenarios, especially in medical applications. There's a growing need to analyze over 400,000 Clinical Trial Reports (CTRs) efficiently. NLI offers a solution for large-scale interpretation and retrieval of medical evidence. Task Overview: Task: Textual Entailment in the context of breast cancer CTRs. Determine the inference relation (entailment vs contradiction) between CTR-statement pairs. We provide a training set and a specially designed test set to evaluate the effectiveness of models in clinical NLI settings. Example: Consider this scenario from a Clinical Trial Report (CTR): Eligibility Criteria: - Gender: Female - ERBB2 positive BC - Treatment history: No previous chemotherapy Exclusion Criteria: - BMI > 40 - Smoker Given Statement: "Morbidly obese female patients with ERBB2+ Breast Cancer are not eligible for Trial X." In this case, the statement contradicts the eligibility criteria of the CTR. While the patient's gender and medical condition align with the inclusion criteria, the term "morbidly obese" generally indicates a BMI > 40, which falls under the exclusion criteria. Therefore, the relationship between the CTR and the statement is a contradiction. Research Aims: Investigate the consistency and faithfulness of NLI models in clinical settings. Develop and apply novel evaluation methodologies through interventional and causal analyses. Important Dates: Practice Task: Already underway! Main Task: Starts January 10, 2024. Post-Competition Task: Begins February 1, 2024. Get Involved: Register Here: https://codalab.lisn.upsaclay.fr/competitions/16190 Join Our Slack: https://join.slack.com/t/semeval2024task2/shared_invite/zt-22m7o4u7o-wa_Pnh…

1 0

Gezocht: Computationeel linguïst bij het Instituut voor de Nederlandse Taal (INT)
by vincent＠ccl.kuleuven.be 14 Dec '23

14 Dec '23

Het INT zoekt op korte termijn een computationeel linguïst om bij te dragen aan de deelname van het instituut aan het SSHOC-NL (Social Sciences & Humanities Open Cloud) project, en daarnaast ook op andere wijze bij te dragen aan het uitbouwen van de taalinfrastructuur voor het Nederlands. Binnen SSHOC-NL neem het instituut deel aan de taak Machine Learning and AI for Methodologically Sound Data Enrichment. Deze taak houdt met name in dat onderzoekers worden ondersteund bij het vinden en gebruiken van de juiste (verrijkings- en analyse)tools en dat ze in staat worden gesteld de resultaten van de tools op relevante data te evalueren. De taalinfrastructuur van het INT bestaat uit een kennisbank die de aspecten van de documentatie van modern en historisch Nederlands omvat, van lexicale informatie tot syntax, semantiek en taalvariatie. Het werk hieraan wordt ondersteund door computationele workflows. Jouw werk Je onderzoekt hoe nieuwe methoden en technieken het opbouwen en ontsluiten van de taalinfrastructuur kunnen ondersteunen en draagt bij aan processen gericht op het beheer, onderhoud en de beschikbaarstelling van producten en tools. Je publiceert over je werk, neemt deel aan nationale en internationale samenwerkingsverbanden als CLARIN en CLARIAH, en spreekt op academische en professionele conferenties. Functie-eisen: ● Minimaal postdocniveau, gepromoveerd ● Resultaatgericht en analytisch ● Je schakelt makkelijk tussen tools en technologieën ● Diepgaande kennis van en ervaring met Natural Language Processing en Machine learning, inclusief de meest recente technieken, zoals Transformers en Large Language Models ● Ervaring met het ontwikkelen van efficiënte, goed gestructureerde en gedocumenteerde software Ervaring met deze technologieën geven je een extra streepje voor: ● Python, Huggingface transformers ● Meerdere programmeertalen (m.n. Java/Kotlin, JavaScript/TypeScript) ● PostgreSQL (of andere relationele database) ● Ontwikkelen van webapplicaties / -services Arbeidsvoorwaarden Het gaat om een aanstelling van 32-40 uur per week. De arbeidsvoorwaarden zijn overeenkomstig de cao Onderzoeksinstellingen. Het betreft vooralsnog een tijdelijk dienstverband met uitzicht op een vaste aanstelling. Het INT heeft aantrekkelijke arbeidsvoorwaarden, waaronder veel vrije dagen, een eindejaarsuitkering en aansluiting bij het ABP voor de pensioenvoorziening. Neem voor meer informatie over de vacature contact op met Jesse de Does, jesse.dedoes(a)ivdnt.org. Acquisitie naar aanleiding van deze vacature wordt niet op prijs gesteld. Sollicitaties kunnen vóór 15 januari 2024 per mail verstuurd worden naar: secretariaat(a)ivdnt.org t.a.v. de directeur, mevrouw prof. dr. F. Steurs. Over het INT Het Instituut voor de Nederlandse Taal is dé plek voor iedereen die iets wil weten over het Nederlands door de eeuwen heen. Het is een breed toegankelijk wetenschappelijk instituut dat alle aspecten van de Nederlandse taal bestudeert, waaronder de woordenschat, grammatica en taalvariatie. Het INT verzamelt de nieuwste Nederlandse woorden, actualiseert belangrijke naslagwerken zoals de Algemene Nederlandse Spraakkunst en maakt vaktaal toegankelijk via terminologielijsten. Het instituut is ook betrokken bij diverse nationale en internationale samenwerkingsverbanden. Om al deze taken te ondersteunen, ontwikkelt het INT zelf websites en -applicaties, bijvoorbeeld over vertalen van en naar het Nederlands, terminologie, taaladvies of historische teksten. Het instituut bevindt zich in de sfeervolle Leidse binnenstad, op loopafstand van het station. Deels thuiswerken is mogelijk. Er zijn collega’s van alle leeftijden en de sfeer is informeel. Er is veel ruimte voor eigen inbreng en volop mogelijkheid om ervaring op te doen met verschillende tools en technologieën.

1 0

First call for papers - The 3rd Workshop on Perspectivist Approaches to NLP
by Sara Tonelli 14 Dec '23

14 Dec '23

NLPerspectives: The 3rd Workshop on Perspectivist Approaches to NLP FIRST CALL FOR PAPERS https://nlperspectives.di.unito.it/w/3rd-workshop-on-perspectivist-approach… Until recently, the dominant paradigm in natural language processing (and other areas of artificial intelligence) has been to resolve observed label disagreement into a single “ground truth” or “gold standard” via aggregation, adjudication, or statistical means. However, in recent years, the field has increasingly focused on subjective tasks, such as abuse detection or quality estimation, in which multiple points of view may be equally valid, and a unique ‘ground truth’ label may not exist (Plank, 2022). At the same time, as concerns have been raised about bias and fairness in AI, it has become increasingly apparent that an approach which assumes a single “ground truth” can erase minority voices. Strong perspectivism in NLP (Cabitza et al., 2023) pursues the spirit of recent initiatives such as Data Statements (Bender and Friedman, 2018), extending their scope to the full NLP pipeline, including the aspects related to modelling, evaluation and explanation. In line with the first <https://nlperspectives.di.unito.it/w/w2022/> and second <https://nlperspectives.di.unito.it/w/2nd-workshop-on-perspectivist-approach…> editions, the third NLPerspectives (Perspectivist Approaches to Disagreement in NLP) workshop will explore current and ongoing work on: the collection and labelling of non-aggregated datasets; and approaches to modelling and including these perspectives in NLP pipelines, as well as evaluation and applications of multi-perspective Machine Learning models. We also welcome opinion pieces and literature reviews, e.g., fairness and inclusion in a perspectivist framework. Following our previous workshops, a key outcome of the third edition will be to continue the work begun at https://pdai.info/ to create a repository of perspectivist datasets with non-aggregated labels for use by researchers in perspectivist NLP modelling. Authors are, therefore, invited to share their LRs (data, tools, services, etc.) and provide essential information about resources (i.e., also technologies, standards, evaluation kits, etc.) that have been used for the work or are a result of their research. In addition, authors will be required to adhere to ethical research policies on AI and may include an ethics statement in their papers. The NLPerspectives workshop will be co-located with the 14th edition of LREC-COLING 2024 <https://lrec-coling-2024.org/> in Torino, Italy, in May 20-25, 2024. Submissions The papers should be submitted as a PDF document, conforming to the formatting guidelines provided in the call for papers of LREC-COLING conference: authors-kit <https://lrec-coling-2024.org/authors-kit/> We accept three types of submissions: Regular research papers; Non-archival submissions: like research papers, but will not be included in the proceedings; Research communications: 4-page abstracts summarising relevant research published elsewhere. Research papers (archival or non-archival) may consist of up to 8 pages of content. Research communications may consist of up to 4 pages of content. More details will be up soon. Topics We invite original research papers from a wide range of topics, including but not limited to: Non-aggregated data collection and annotation frameworks Descriptions of corpora collected under the perspectivist paradigm Multi-perspective Modelling and Machine Learning Evaluation of multi-perspective models/ models of disagreement Multi-perspective disagreement as applied to NLP evaluation Fairness and inclusive modelling Perspectivist approaches for social good Applications of multi-perspective modelling Computing with (dis)agreement Perspectivist Natural Language Generation Foundational aspects of perspectivism Opinion pieces and reviews on perspectivist approaches to NLP Submissions are open to all, and are to be submitted anonymously (and must conform to the instructions for double-blind review). All papers will be refereed through a double-blind peer review process by at least three reviewers, with final acceptance decisions made by the workshop organisers. Scientific papers will be evaluated based on relevance, significance of contribution, impact, technical quality, scholarship, and quality of presentation. Attendance At least one author of each accepted paper is required to participate in the conference and present the work. Important Dates * Friday February 23, 2024: Paper submission * Friday March 29, 2024: Notification of acceptance * Friday April 12, 2024: Camera-ready papers due * Tuesday May 21, 2024: Workshop Workshop organisers: Gavin Abercrombie, Heriot-Watt University Valerio Basile, University of Turin Davide Bernardi, Amazon Alexa Shiran Dudy, Northeastern University Simona Frenda, University of Turin Lucy Havens, University of Edinburgh Sara Tonelli, Fondazione Bruno Kessler Contact us at g.abercrombie(a)hw.ac.uk <mailto:g.abercrombie@hw.ac.uk> if you have any questions. Website: https://nlperspectives.di.unito.it/ -- -- Le informazioni contenute nella presente comunicazione sono di natura privata e come tali sono da considerarsi riservate ed indirizzate esclusivamente ai destinatari indicati e per le finalità strettamente legate al relativo contenuto. Se avete ricevuto questo messaggio per errore, vi preghiamo di eliminarlo e di inviare una comunicazione all’indirizzo e-mail del mittente. -- The information transmitted is intended only for the person or entity to which it is addressed and may contain confidential and/or privileged material. If you received this in error, please contact the sender and delete the material.

1 0

CfP 5th workshop on Resources for African Indigenous Language (RAIL) @ LREC-COLING
by Menno Van Zaanen 14 Dec '23

14 Dec '23

First call for papers The fifth workshop on Resources for African Indigenous Language (RAIL) Colocated with LREC-COLING 2024 https://bit.ly/rail2024 Conference dates: 20-25 May 2024 Workshop date: 25 May 2024 Venue: Lingotto Conference Centre, Torino (Italy) The fifth RAIL workshop website: https://bit.ly/rail2024 LREC-COLING 2024 website: https://lrec-coling-2024.org/ The fifth Resources for African Indigenous Languages (RAIL) workshop will be co-located with LREC-COLING 2024 in Lingotto Conference Centre, Torino, Italy on 25 May 2024. The RAIL workshop is an interdisciplinary platform for researchers working on resources (data collections, tools, etc.) specifically targeted towards African indigenous languages. In particular, it aims to create the conditions for the emergence of a scientific community of practice that focuses on data, as well as computational linguistic tools specifically designed for or applied to indigenous languages found in Africa. Many African languages are under-resourced while only a few of them are somewhat better resourced. These languages often share interesting properties such as writing systems, or tone, making them different from most high-resourced languages. From a computational perspective, these languages lack enough corpora to undertake high level development of Human Language Technologies (HLT) and Natural Language Processing (NLP) tools, which in turn impedes the development of African languages in these areas. During previous workshops, it has become clear that the problems and solutions presented are not only applicable to African languages but are also relevant to many other low-resource languages. Because these languages share similar challenges, this workshop provides researchers with opportunities to work collaboratively on issues of language resource development and learn from each other. The RAIL workshop has several aims. First, the workshop brings together researchers who work on African indigenous languages, forming a community of practice for people working on indigenous languages. Second, the workshop aims to reveal currently unknown or unpublished existing resources (corpora, NLP tools, and applications), resulting in a better overview of the current state-of-the-art, and also allows for discussions on novel, desired resources for future research in this area. Third, it enhances sharing of knowledge on the development of low-resource languages. Finally, it enables discussions on how to improve the quality as well as availability of the resources. The workshop has “Creating resources for less-resourced languages” as its theme, but submissions on any topic related to properties of African indigenous languages (including non-African languages) may be accepted. Suggested topics include (but are not limited to) the following: * Digital representations of linguistic structures * Descriptions of corpora or other data sets of African indigenous languages * Building resources for (under resourced) African indigenous languages * Developing and using African indigenous languages in the digital age * Effectiveness of digital technologies for the development of African indigenous languages * Revealing unknown or unpublished existing resources for African indigenous languages * Developing desired resources for African indigenous languages * Improving quality, availability and accessibility of African indigenous language resources Submission requirements: We invite papers on original, unpublished work related to the topics of the workshop. Submissions, presenting completed work, may consist of up to eight (8) pages of content plus additional pages of references. The final camera-ready version of accepted long papers are allowed one additional page of content (up to 9 pages) so that reviewers’ feedback can be incorporated. Papers should be formatted according to the LREC- COLING style sheet (https://lrec-coling-2024.org/authors-kit/), which is provided on the LREC-COLING 2024 website (https://lrec-coling-2024.org/). Reviewing is double-blind, so make sure to anonymise your submission (e.g., do not provide author names, affiliations, project names, etc.) Limit the amount of self citations (anonymised citations should not be used). The RAIL workshop follows the LREC-COLING submission requirements. Please submit papers in PDF format to the START account (the submission link will be available soon). Accepted papers will be published in proceedings linked to the LREC-COLING conference. Important dates: Submission deadline: 16 February 2024 Date of notification: 15 March 2024 Camera ready deadline: 29 March 2024 RAIL workshop: 25 May 2024 Organising Committee Rooweither Mabuya, South African Centre for Digital Language Resources (SADiLaR), South Africa Muzi Matfunjwa, South African Centre for Digital Language Resources (SADiLaR), South Africa Mmasibidi Setaka, South African Centre for Digital Language Resources (SADiLaR), South Africa Menno van Zaanen, South African Centre for Digital Language Resources (SADiLaR), South Africa -- Prof Menno van Zaanen menno.vanzaanen(a)nwu.ac.za Professor in Digital Humanities South African Centre for Digital Language Resources https://www.sadilar.org ________________________________ NWU PRIVACY STATEMENT: http://www.nwu.ac.za/it/gov-man/disclaimer.html DISCLAIMER: This e-mail message and attachments thereto are intended solely for the recipient(s) and may contain confidential and privileged information. Any unauthorised review, use, disclosure, or distribution is prohibited. If you have received the e-mail by mistake, please contact the sender or reply e-mail and delete the e-mail and its attachments (where appropriate) from your system. ________________________________

1 0

Junior Research Group Leader “AI and NLP for Mental Health”, Department of Computer Science, Technical University Darmstadt, Germany
by Niemann, Elisabeth 14 Dec '23

14 Dec '23

Are you interested in doing cutting-edge research in Natural Language Processing, Machine Learning, and AI for mental health? Do you want your research to have a real-world impact in the fight against mental disorders - some of the most common and serious diseases in the world? The Department of Computer Science at the Technical University Darmstadt is recruiting a Junior Research Group Leader “AI and NLP for Mental Health” (equivalent to an Assistant Professor) as part of DYNAMIC, the newly approved interdisciplinary LOEWE-funded center "Dynamic Network Approach of Mental Health to Stimulate Innovations for Change". The Junior Research Group will work closely with the research labs of Prof. Iryna Gurevych and Prof. Kristian Kersting. Further collaboration opportunities exist with the labs of Prof. Marcus Rohrbach, Prof. Anna Rohrbach, Prof. Carsten Binnig and others. Join our diverse and interdisciplinary team to tackle some of the hardest and most exciting research challenges, including representation learning and multimodality! The research position is funded from 01 October 2024 to 31 December 2027 - applications are possible until 31 January 2024 and should include a CV, PhD certificate, publication list, letter of motivation, research and teaching statement. Learn more here: https://www.informatik.tu-darmstadt.de/ukp/ukp_home/jobs_ukp/group_leader_m… -------------------------------------------------------------------- Prof. Dr. Iryna Gurevych UKP Lab Technical University Darmstadt, Germany http://www.ukp.tu-darmstadt.de/

1 0

2026

2025

2024

2023

2022

Corpora