March 2023 - Corpora

[SEBD 2023] Doctoral Consortium - 2nd Call for Papers
by Stefano Marchesin 09 Mar '23

09 Mar '23

[APOLOGIES FOR MULTIPLE POSTINGS] SEBD 2023 Doctoral Consortium - 2nd Call for Papers ==================================================================== Important dates Doctoral Consortium Submission Deadline: Friday, March 31, 2023 (AoE) Papers Notification: Wednesday, April 26, 2023 (AoE) Camera-Ready Submission Deadline: Thursday, June 01, 2023 (AoE) Doctoral Consortium Day: Sunday, July 02, 2023 Submission Link: https://cmt3.research.microsoft.com/SEBD2023/ ================================================= The SEBD 2023 Doctoral Consortium will take place in a dedicated session during the 31st Italian Symposium on Advanced Database Systems (SEBD 2023), Galzignano Terme, Padova (Italy), July 02-05, 2023, http://sebd2023.dei.unipd.it/. The goal is to provide a forum for PhD candidates to present their ongoing research and receive feedback from renowned and experienced members of the research community. The Consortium fosters a collaborative environment, encouraging constructive discussions and sharing of ideas. It will be an excellent opportunity for developing person-to-person networks to the benefit of the PhD students in their future careers – as well as of the community. Submissions from students who are in the early stages of their research should provide a clear description of the problem to be addressed and the planned methodology. Submissions from students who are in the middle or final stages of their PhD research should clearly indicate the contributions made to date and future work directions. Each doctoral symposium paper must be single-authored by a current PhD student or a PhD student who submitted the thesis between September and December 2022. The paper should be written in English and must be 6-7 pages long, including selected references. Submissions must be formatted in PDF, prepared in CEUR-ART Column 1 Style (http://ceur-ws.org/Vol-XXX/CEURART.zip), and submitted electronically via the submission system: https://cmt3.research.microsoft.com/SEBD2023/ Submissions will be reviewed by the Doctoral Consortium Program Committee (appointed by the Doctoral Consortium Chairs). All papers will be reviewed with respect to the overall presentation quality, the potential for the future impact of the research on the field, and the expected benefit to the other doctoral students attending the conference. The accepted papers will be published as part of the SEBD 2023 proceedings on WS-CEUR.org and indexed in Scopus, DBLP and Google Scholar. ================================================= Topics The SEBD Symposium and its Doctoral Consortium cover a broad range of topics, including traditional database management, as well as new challenges for data management in any possible domain. Suggested topics include (but are not limited to) the following ones: - Big Data and Smart Computing; - Data integration, Heterogeneous and Federated DBMS; - Data mining, knowledge discovery, information extraction, and machine learning; - Data visualization; - Data warehousing; - Distributed and parallel databases; - Grid, peer-to-peer databases, and Cloud Computing; - Incompleteness, inconsistency, and other aspects of data quality; - Uncertainty in databases; - Ethical problems posed by Big Data Analysis; - Keyword-based and natural language access to structured, semistructured, and unstructured data; - Knowledge representation and reasoning; - Ontology-based data management; - Privacy, security and trust management; - Query processing and optimization, approximate query answering; - Real-time, embedded, sensor, and mobile databases; - Scientific and Statistical Databases; - Semantic Web and Open Linked data; - Social networks and Graph databases; - Transaction and workflow management, interoperability and Web services. ================================================= Contact For any questions regarding Doctoral Consortium submissions, please email the Doctoral Consortium Chairs: - Letizia Tanca (letizia.tanca(a)polimi.it) - Stefano Marchesin (stefano.marchesin(a)unipd.it) -- Stefano Marchesin, PhD Postdoctoral Researcher Information Management Systems (IMS) Group Department of Information Engineering University of Padua Via Gradenigo 6/a, 35131 Padua, Italy Home page: http://www.dei.unipd.it/~marches1/

1 0

Quechua to Spanish Dialectal and Low-resource track at IWSLT 2023
by John Ortega 09 Mar '23

09 Mar '23

=== Apologies in advance for cross-posting == We need your help to preserve indigenous languages! Due to the overwhelming success in previous workshops like LoResMT and AmericasNLP we have decided to continue to push the needle for Quechua to Spanish translations. Please participate in the first edition of the QUE-SPA speech translation shared task being held at IWSLT 2023. This low-resource task will help increase language preservation for low-resource languages. We invite advanced research and approaches of all types so bring your rule-based, statistical, neural, and more! IMPORTANT LINKS Dialectal and Low-resource webpage: https://iwslt.org/2022/low-resource Data webpage: https://github.com/Llamacha/IWSLT2023_Quechua_data Google Group: https://groups.google.com/g/iwslt-evaluation-campaign IWSLT conference webpage: https://iwslt.org/2023/ HOW TO PARTICIPATE Please join the IWSLT Evaluation Campaign Google Group and access the registration using the following link: https://groups.google.com/g/iwslt-evaluation-campaign The QUE-SPA data set can be downloaded here: https://github.com/Llamacha/IWSLT2023_Quechua_data Task submissions can be uploaded to GitHub, please email the organizers below for more details. Evaluation scripts based on BLEU are made available via the IWSLT 2023 website. IMPORTANT DATES Jan 14, 2023 Release of shared task training and dev data Apr 1-15, 2023 Evaluation period Apr 24, 2023 Paper submission deadline (all papers) Apr 30, 2023 System papers update deadline May 22, 2023 Notification of acceptance May 31, 2023 Camera ready paper due July 12, 2023 Pre-recorded video due July 13-14, 2023 IWSLT conference ORGANIZING COMMITTEE John E. Ortega (Northeastern University) William Chen (Carnegie Mellon University) Rodolfo Zevallos (Universitat Pompeu Fabra)

1 0

Quechua to Spanish Dialectal and Low-resource track at IWSLT 2023
by John Ortega 09 Mar '23

09 Mar '23

=== Apologies in advance for cross-posting == We need your help to preserve indigenous languages! Due to the overwhelming success in previous workshops like LoResMT and AmericasNLP we have decided to continue to push the needle for Quechua to Spanish translations. Please participate in the first edition of the QUE-SPA speech translation shared task being held at IWSLT 2023. This low-resource task will help increase language preservation for low-resource languages. We invite advanced research and approaches of all types so bring your rule-based, statistical, neural, and more! IMPORTANT LINKS Dialectal and Low-resource webpage: https://iwslt.org/2022/low-resource Data webpage: https://github.com/Llamacha/IWSLT2023_Quechua_data Google Group: https://groups.google.com/g/iwslt-evaluation-campaign IWSLT conference webpage: https://iwslt.org/2023/ HOW TO PARTICIPATE Please join the IWSLT Evaluation Campaign Google Group and access the registration using the following link: https://groups.google.com/g/iwslt-evaluation-campaign The QUE-SPA data set can be downloaded here: https://github.com/Llamacha/IWSLT2023_Quechua_data Task submissions can be uploaded to GitHub, please email the organizers below for more details. Evaluation scripts based on BLEU are made available via the IWSLT 2023 website. IMPORTANT DATES Jan 14, 2023 Release of shared task training and dev data Apr 1-15, 2023 Evaluation period Apr 24, 2023 Paper submission deadline (all papers) Apr 30, 2023 System papers update deadline May 22, 2023 Notification of acceptance May 31, 2023 Camera ready paper due July 12, 2023 Pre-recorded video due July 13-14, 2023 IWSLT conference ORGANIZING COMMITTEE John E. Ortega (Northeastern University) William Chen (Carnegie Mellon University) Rodolfo Zevallos (Universitat Pompeu Fabra)

1 0

USAS Tagger: same output for Python version and web demo?
by Tony Berber-Sardinha 09 Mar '23

09 Mar '23

Dear all I'm using the python implementation of the USAS tagger, pymusas. I noitced that the output from pymusas is different from the web demo version. For example, the phrase: 'the characteristics of the network' is tagged like this by pymusas: the the DET ['Z5'] characteristics characteristic NOUN ['Df/A5.1+++mfnc'] of of ADP ['Df/A5.1+++mfnc'] the the DET ['Df/A5.1+++mfnc'] network network NOUN ['Df/A5.1+++mfnc'] that is, the same tag is applied to the whole noun phrase. but is tagged like this on the web: 0000003 010 AT the Z5 0000003 020 NN2 characteristics O4.1 A4.2+ N2 0000003 030 IO of Z5 0000003 040 AT the Z5 0000003 050 NN1 network S5+c Q4.3 Y2 in this case, each word in the noun phrase receives its own tag. or: 'on the table' pymusas: on on ADP ['N6'] the the DET ['N6'] table table NOUN ['N6'] web: 0000003 010 II on N6[i1.3.1 Z5 0000003 020 AT the N6[i1.3.2 Z5 0000003 030 NN1 table N6[i1.3.3 H5 Q1.2 N2 I'm wondering if it's possible for pymusas to generate output similar to the web demo's output. Specifically, I'd like to obtain individual tags for each word, rather than just the tag for the entire multiword expression. I've used the following python code: import spacy # We exclude the following components as we do not need them. nlp = spacy.load('en_core_web_sm', exclude=['parser', 'ner']) # Load the English PyMUSAS rule based tagger in a separate spaCy pipeline english_tagger_pipeline = spacy.load('en_dual_none_contextual') # Adds the English PyMUSAS rule based tagger to the main spaCy pipeline nlp.add_pipe('pymusas_rule_based_tagger', source=english_tagger_pipeline) output_doc = nlp(text) print(f'Text\tLemma\tPOS\tUSAS Tags') for token in output_doc: print(f'{token.text}\t{token.lemma_}\t{token.pos_}\t{token._.pymusas_tags}') thank you ahead! Tony Berber Sardinha

2 3

Call-for-Participation: 1st Recommending Task @ ImageCLEF 2023 (Cultural Heritage Content-based Recommendation Task)
by Bogdan Ionescu 09 Mar '23

09 Mar '23

[Apologies for multiple postings] ImageCLEFrecommending (1st edition) Registration: https://www.imageclef.org/2023/recommending Run submission: May 10, 2023 Working notes submission: June 5, 2023 CLEF 2023 conference: September 18-21, Thessaloniki, Greece *** CALL FOR PARTICIPATION *** In recent years cultural heritage organisations have made considerable efforts to digitise their collections, and this trend is expected to continue due to organisational goals and national cultural policies. Thus media archives have not only exponentially increased in size, but now hold contents in various modalities, e.g., video, image, text. Even when structured metadata is available it is still difficult to discover the contents of media archives and allow users to navigate multiperspectivity in media collections. Content-based recommendation systems can help but there is limited understanding how well these perform and how relevant they are for the end-users. Moreover, the system used so far have not addressed the new user requirements of more transparency and explainability of the algorithms used. The task targets a key infrastructure for researchers and heritage professionals: Europeana. With over 53 million records, the single search bar that served as the main access point was identified as a bottleneck by many users. Thus, the strategy has gradually shifted towards exploration of the available collections based on themes. Now users can explore over 60 curated digital exhibitions, countless galleries and blog posts. While there is a system in place to recommend individual items given a query item, the recommendations for editorials are done at the moment only manually. For instance when a new blog is created, the author would manually provide a list of related galleries, blogs or exhibitions that have been already published. *** TASK *** The task requires participants to devise recommendation methods and systems, apply them in the supplied data set gathered from Europeana and provide a series of recommendations in two scenarios: (i) given a list of items provide a list of recommended items; (ii) given an editorial (Europeana blog or gallery) provide a list of recommended editorials. *** DATA SET *** For the task a new dataset based on Europeana items and editorials will be provided to the participants. The individual items in the dataset will include a wealth of metadata based on the Europeana Data Model (EDM) schema. Editorials will be either Galleries (containing a title, optional description and list of items which make it up), or blog posts (containing a title, text in English and a number of items). It should be noted that although all data items follow EDM the quality of the metadata is not perfect, with some data fields being potentially somewhat ambiguous, or at least used sometimes in a creative way by the original data providers (especially with some overlap sometimes in what ends up in "format", "medium", "type" and "subject". *** METRICS *** Performance will be evaluated on the basis of the recommendations that are provided computing Mean Average Precision at X (Map@X) compared to the ground truth. Moreover, the systems competing in this task that can provide an explanation for the results provided will be preferred. *** IMPORTANT DATES *** - Run submission: May 10, 2023 - Working notes submission: June 5, 2023 - CLEF 2023 conference: September 18-21, Thessaloniki, Greece (https://clef2023.clef-initiative.eu/) *** OVERALL COORDINATION *** Alexandru Stan, IN2 Digital Innovations, Germany George Ioannidis, IN2 Digital Innovations, Germany Bogdan Ionescu, Politehnica University of Bucharest, Romania Hugo Manguinhas, Europeana Foundation, Netherlands *** ACKNOWLEDGEMENT *** The task is supported under the H2020 AI4Media â€œA European Excellence Centre for Media, Society and Democracyâ€ project, contract #951911 https://www.ai4media.eu/. On behalf of the Organizers, Bogdan Ionescu https://www.AIMultimediaLab.ro/

1 0

First Call For papers: 16th International Natural Language Generation Conference INLG 2023
by Simeon Schüz 09 Mar '23

09 Mar '23

*First Call For papers: 16th International Natural Language Generation Conference INLG 2023* We invite the submission of long and short papers, as well as system demonstrations, related to all aspects of Natural Language Generation (NLG), including data-to-text, concept-to-text, text-to-text and vision-to-text approaches. Accepted papers will be presented as oral talks or posters. The event is organized under the auspices of the Special Interest Group on Natural Language Generation (SIGGEN) (https://aclweb.org/aclwiki/SIGGEN) of the Association for Computational Linguistics (ACL) (https://aclweb.org/). The event will be held from 11-15 September in Prague, Czech Republic. INLG 2023 will be (jointly) colocated with SIGDial 2023. *Important dates* All deadlines are Anywhere on Earth (UTC-12) - START system regular paper submission deadline: May 22, 2023 - ARR commitment to INLG deadline via START system: June 15, 2023 - START system demo paper submission deadline: June 15, 2023 - Notification: July 11, 2023 - Camera ready: July 25, 2023 - Conference: 11-15 September 2023 *Topics* INLG 2023 solicits papers on any topic related to NLG. General topics of interest include, but are not limited to: - Affect/emotion generation - Analysis and detection of automatically generated text - Bias and fairness in NLG systems - Cognitive modelling of language production - Computational efficiency of NLG models - Content and text planning - Corpora and resources for NLG - Ethical considerations of NLG - Evaluation and error analysis of NLG systems - Explainability and Trustworthiness of NLG systems - Generalizability of NLG systems - Grounded language generation - Large Language Models for NLG - Lexicalisation - Multimedia and multimodality in generation - Natural language understanding techniques for NLG - NLG and accessibility - NLG in speech synthesis and spoken language models - NLG in dialogue - NLG for human-robot interaction - NLG for low-resourced languages - NLG for real-world applications - Paraphrasing, summarization and translation - Personalisation and variation in text - Referring expression generation - Storytelling and narrative generation - Surface realisation - System architectures *Submissions & Format* Three kinds of papers can be submitted: - Long papers are most appropriate for presenting substantial research results and must not exceed eight (8) pages of content, plus unlimited pages of ethical considerations, supplementary material statements, and references. The supplementary material statement provides detailed descriptions to support the reproduction of the results presented in the paper (see below for details). The final versions of long papers will be given one additional page of content (up to 9 pages) so that reviewers' comments can be taken into account. - Short papers are more appropriate for presenting an ongoing research effort and must not exceed four (4) pages, plus unlimited pages of ethical considerations, supplementary material statements, and references. The final versions of short papers will be given one additional page of content (up to 5 pages) so that reviewers' comments can be taken into account. - Demo papers should be no more than two (2) pages, including references, and should describe implemented systems relevant to the NLG community. It also should include a link to a short screencast of the working software. In addition, authors of demo papers must be willing to present a demo of their system during INLG 2023. Submissions should follow ACL Author Guidelines (https://www.aclweb.org/adminwiki/index.php?title=ACL_Author_Guidelines) and policies for submission, review and citation, and be anonymised for double blind reviewing. Please use ACL 2023 style files; LaTeX style files and Microsoft Word templates are available at https://2023.aclweb.org/calls/style_and_formatting/. Authors must honour the ethical code set out in the ACL Code of Ethics (https://www.aclweb.org/portal/content/acl-code-ethics). If your work raises any ethical issues, you should include an explicit discussion of those issues. This will also be taken into account in the review process. You may find the following checklist of use: https://aclrollingreview.org/responsibleNLPresearch/ Authors are strongly encouraged to ensure that their work is reproducible; see, e.g., the following reproducibility checklist (https://2021.aclweb.org/calls/reproducibility-checklist/). Papers involving any kind of experimental results (human judgments, system outputs, etc) should incorporate a data availability statement into their paper. Authors are asked to indicate whether the data is made publicly available. If the data is not made available, authors should provide a brief explanation why. (E.g. because the data contains proprietary information.) A statement guide is available on the INLG 2023 website (https://inlg2023.github.io/). To submit a long or short paper to INLG 2023, authors can either submit directly or commit a paper previously reviewed by ARR via the same paper submission site (https://softconf.com/n/inlg2023/). For direct submissions, the deadline for submitting papers is May 22, 2023, 11:59:59 AOE. If committing an ARR paper to INLG, the submission is also made through the INLG 2023 paper submission site, indicating the link of the paper on OpenReview. The deadline for committing an ARR paper to INLG is June 15, 2023, 11:59:59 AOE, and the last eligible ARR paper submission deadline for INLG 2023 is April 15, 2023. It is important to note that when committing an ARR paper to INLG, it should be submitted through the INLG 2023 paper submission site, just like a direct submission paper, with the only difference being the need to provide the OpenReview link to the paper and to provide an optional author response to reviews. Demo papers should be submitted directly through the INLG 2023 paper submission site (https://softconf.com/n/inlg2023/) by June 15, 2023, 11:59:59 AOE. All accepted papers will be published in the INLG 2023 proceedings and included in the ACL anthology. A paper accepted for presentation at INLG 2023 must not have been presented at any other meeting with publicly available proceedings. Dual submission to other conferences is permitted, provided that authors clearly indicate this in the submission form. If the paper is accepted at both venues, the authors will need to choose which venue to present at, since they can not present the same paper twice. *Awards* INLG 2023 will present several awards to recognize outstanding achievements in the field. These awards are: - Best Long Paper Award: This award will be given to the best long paper submission based on its originality, impact, and contribution to the field of NLG. - Best Short Paper Award: This award will be given to the best short paper submission based on its originality, impact, and contribution to the field of NLG. - Best Demo Paper Award: This award will recognize the best demo paper submitted to the conference. This award considers not only the paper's quality but also the demonstration given at the conference. The demonstration will play a significant role in the judging process. - Best Evaluation Award: The award is a new addition to INLG 2023. This award is designed to honour authors who have demonstrated the most comprehensive and insightful analysis in evaluating their results. This award aims to highlight papers where the authors have gone the extra mile in providing a thorough and detailed analysis of their results, offering a nuanced understanding of their findings.

1 0

ELE 2 project: survey on computing facilities for NLP/LT
by gaurish thakkar 09 Mar '23

09 Mar '23

*Apologies for cross-posting* Within the European Language Equality 2 project (https://european-language-equality.eu/) we are collecting information about various computing facilities and requirements for NLP/LT. The analysis of collected information will result in a snapshot of the current situation and relevant recommendations for High Performance Computing use in NLP/LT. If you are an NLP/LT researcher who uses HPC (e.g. GPUs, clusters, grids) in your work, we would appreciate if you could fill out this short survey: https://forms.gle/vcMF8nPmMSR9BZo27. Please, complete the survey until 2023-03-22. Here (https://european-language-equality.eu/) you can find more about the project and on the background of this survey. Thank you in advance for your valuable insights. Marko Tadić -- Marko Tadić, professor University of Zagreb Faculty of Humanities and Social Sciences Department of Linguistics Ivana Lučića 3 HR-10000 Zagreb Croatia w: http://www.ffzg.unizg.hr/oling/?page_id=88 o: https://orcid.org/0000-0001-6325-820X --

1 0

[CfP] SEMANTiCS 2023 – Workshop & Tutorial Track|| Extended Submission Deadline March 15, 2023
by Anisa Rula & Jennifer D'Souza 09 Mar '23

09 Mar '23

==== SEMANTiCS - 19th International Conference on Semantic Systems Leipzig, Germany Workshops and Tutorials September 20 - 22, 2023 https://2023-eu.semantics.cc/page/cfp_ws ==== SEMANTiCS 2023 is a major venue for research and industrial innovation and features a workshop and tutorial program addressing the diverse practical interests of its audience. This program is intended to offer a rich diversity of topics to conference attendees and local participants seeking to pick up new skills and stay up-to-date regarding the latest developments in the community. We encourage submissions of proposals on all topics in the general areas of SEMANTiCS 2023 and proposals bridging or introducing new perspectives in these areas. Workshops and tutorials may incorporate panel discussions, lightning talks, meetings, networking or hands-on sessions, hackathons and other practical formats where applicable. Rooms for business or project meetings are available upon request as well. =Important Dates for Workshops= * Proposals WS *Extended* Deadline: March 15, 2023 (11:59 pm, Hawaii time) * Notification of Acceptance: March 22, 2023 (11:59 pm, Hawaii time) =Important Dates for Tutorials (and other meetings, e.g. seminars, show-cases, etc., without call for papers)= * Proposals Tutorial Deadline: June 06, 2023 (11:59 pm, Hawaii time) * Notification of Acceptance: June 20, 2023 (11:59 pm, Hawaii time) Submission via Easychair on https://easychair.org/conferences/?conf=sem23 =Scope & Goals= Workshops and tutorials at SEMANTiCS 2023 allow your organisation or project to advance and promote your topics and gain increased visibility. The workshops and tutorials will be announced on the SEMANTiCS website and they will be seen by all participants. SEMANTiCS 2023 workshops and tutorials can be incubators for industrial and scientific communities that form and share a particular research and development agenda. They provide a forum for presenting contributions and findings to a diverse and knowledgeable community. Furthermore, the event can be used as a dissemination activity in the scope of large research projects or as a closed format for research/commercial project consortia meetings. =Setup and Requirements= SEMANTiCS 2023 workshops and tutorials may be either half or full day long. Workshops and tutorials take place on the days before and/or after the main SEMANTiCS 2023 EU conference (20th, 21st, and/or 22nd of September 2023). Details will be communicated on time. Organizers of workshops and tutorials will be granted three free tickets (only for the workshop & tutorial day) for organization purposes or keynotes. Participants of workshops and tutorials will be charged a marginal fee to cover the basic costs. Workshop and tutorials proposals must include the following information: * outline of the themes and goals of the event, including a title and a brief abstract (less than 200 words) intended for the SEMANTiCS 2023 website * a statement addressing why the event is important, why the event is timely, how it is relevant to SEMANTiCS 2023 and the field of semantic web. For the tutorials, why the presenters are qualified for a high-quality introduction of the topic * related workshops and conferences, i.e., specifying if this is a continuation of a workshop series or is a new workshop to address an emerging issue. Please provide information about past versions of this workshop and other related workshops (including URLs and submission/acceptance counts, if available). * a statement addressing the quality assurance criterion that will be used by the event organizers to select the papers for the workshops and the presenters for the tutorials (e.g., peer review or review/evaluation by event organizers). If a peer review process is chosen as a quality assurance criterion for the workshops, the organizers will be responsible for their own reviewing process. Workshop organizers will be responsible also for their own publicity (e.g., website, timelines and call for papers) and proceedings production. * structure of the event and plans for generating and stimulating discussion; how will the interaction be organized in case of a hybrid event * desired minimum and maximum number of event participants, expected number of participants, and (in case of previously held events) number of registered attendees and web site for previous editions of the event * a description of the intended audience and the expected learning outcomes * desired prerequisite knowledge of the audience * proposed duration of the event (i.e., half or full day), different sessions if applicable (final time slot will be assigned in accordance with the SEMANTiCS program) * any equipment, room capacity, or other logistic constraints * full contact information of all organizers of the event and main contact person; a brief description of each organizer's background, including relevant past experience in organizing events Proposals for workshop and tutorial proposals must be submitted via Easychair: https://easychair.org/my/conference?conf=sem23 =Review and Evaluation Criteria= Workshop and tutorial proposals will be reviewed by the SEMANTiCS 2023 Workshop Chairs, as well as by the SEMANTiCS 2023 organizing committee, according to the following criteria: * The potential to advance the state of semantic web research and practice * The quality assurance criterion proposed by the organizers to select high-quality papers for workshops and presenters for tutorials * The organizers' experience and ability to lead a successful event * Timeliness and expected interest in the event topics * The balance and synergy between all SEMANTiCS 2023 events =Topics of interest include (but are not limited to)= * Web Semantics & Linked (Open) Data * Enterprise Knowledge Graphs, Graph Data Management and Deep Semantics * Machine Learning & Deep Learning Techniques * Semantic Information Management & Knowledge Integration * Terminology, Thesaurus & Ontology Management * Data Mining and Knowledge Discovery * Reasoning, Rules and Policies * Natural Language Processing and Computational Linguistics * Social and Human aspects of Semantic Web * Data Quality Management and Assurance * Explainable Artificial Intelligence * Semantics in Data Science * Semantics of Blockchain & Distributed Ledger Technologies * Trust, Data Privacy, and Security with Semantic Technologies * Economics of Data, Data Services and Data Ecosystems * Applications of Semantic Web technologies in domains such as law, medicine, life sciences, digital humanities, mobility and smart cities, etc. We especially invite contributions that illustrate the applicability of the topics mentioned above for industrial purposes and/or illustrate the business relevance of their contribution for specific industries. Workshop proposals on emerging themes for the topics listed above are encouraged. In case you have additional questions concerning the submission process, please do not hesitate to contact us via Easychair. We are looking forward to your contribution! Jennifer D’Souza - jennifer.dsouza(a)tib.eu Anisa Rula - anisa.rula(a)unibs.it Workshop & Tutorial Chairs

1 0

CfP: 3rd Workshop DL4LD: Addressing Deep Learning, Relation Extraction, and Linguistic Data
by Radovan Garabik 08 Mar '23

08 Mar '23

Call for papers 3rd Workshop DL4LD (Deep Learning, Relation Extraction and Linguistic Data with a Case Study on BATS) as a continuation of the series of DL4LD (Deep Learning and Neural Approaches for Linguistic Linked Data) workshops University of Vienna, Vienna, Austria 13 September 2023 Website: http://dl4ld2023.mruni.eu/ The fourth biennial conference on Language, Data and Knowledge (LDK 2023) ( http://2023.ldk-conf.org/ ) and Cost Action CA18209 NexusLinguarum (https://nexuslinguarum.eu) are glad to announce the 3rd workshop DL4LD (Deep Learning, Relation Extraction and Linguistic Data with a Case Study on BATS): Addressing Deep Learning, Relation Extraction, and Linguistic Data with a Case Study on The Bigger Analogy Test Set (BATS). The workshop will be held in a hybrid mode, so speakers and attendees can choose to participate onsite or online. Conference aims and topics The workshop welcomes contributions from scholars and researchers working in computational linguistics, data science, linguistics, computer science, etc. This workshop aims at bringing together relation extraction, deep learning, and neural approaches with linguistic linked data. We invite research papers, application descriptions, system demonstrations, and position papers that discuss the interconnection of both areas. The workshop is going to include a twofold session with the first part focusing on the workshop presentations and the second part poster presentations on the multilingual linguistic data preparation for NLP experiments focusing on the case study of BATS. We suggest the researchers working on related languages join teams and present rich comparative case studies. Panel discussion includes “unorthodox” new ideas, overviews of the challenges and opportunities of multilingual data preparation for the BATS experiment. The workshop presents an excellent opportunity for the exchange of ideas, insights, and the latest research. The workshop topics are (but not limited to) the following: Topics: • Deep Learning for Linguistic Linked (Open) Data, modelling, resources & interlinking • LLOD and Deep Learning for Digital Humanities • Enhancement of language models with structured linguistic data • Use cases combining language models and structured linguistic data • Deep Learning and LLOD in NLP • Deep learning and relation extraction • Deep learning and knowledge graphs • Multilingual data preparation for the BATS experiment Deep learning and neural network approaches are indispensable in modern Natural Language Processing and generally in all kinds of linguistic data analysis approaches. Artificial intelligence integrating knowledge is one of the core topics in the current research which focuses on providing human thinking for AI to solve complex tasks. One of the important techniques for supporting this research is knowledge acquisition or so-called relation extraction. Relation extraction and deep learning can serve the understanding of the specificities of linguistic data, to be better exploited and combined with linked data mechanisms. Knowledge is a way of understanding the world, aiming to provide a human-level cognition and intelligence for the next-generation artificial intelligence. One way of knowledge representation is semantic relations between entities. Relation Extraction ensures an effective way to automatically acquire important knowledge of semantic relations. It is a sub-task of information extraction and plays an essential role in Natural Language Processing. Its purpose is to identify semantic relations between entities from natural language text. Concerning the current research, there is a field of studies for relation extraction which have described the techniques based on Deep Neural Networks used as a prevailing technique in the research. The workshop intends to be an event of discussion for researchers interested in addressing the peculiarities of the interrelated research areas mentioned before and in advancing the state of the art in deep learning, relation extraction, and linguistic data science. Program: The Scientific Program will include an invited talk and research presentations, followed by the panel discussion. Invited Speaker: Michael Cochez, Vrije Universiteit Amsterdam Submissions and dates Submissions can be in the form of: • short papers: 4–6 pages; • long papers: 9–12 pages. All submission lengths are given including references. Accepted submissions will be published by ACL in an open-access conference proceedings volume, free of charge for authors. The ACL templates should therefore be used for all conference submissions (https://github.com/acl-org/acl-style-files/). As the reviewing process is single-blind, submissions should not be anonymised. The workshop will be hybrid. At least one author of each accepted paper is required to register for the workshop and present their work (either remotely or on-site). There will be no registration fee for participation. Submissions must be submitted via EasyChair: https://easychair.org/conferences/?conf=dl4ld2023/ Important dates: Time Zone: Anywhere on Earth Papers due: May, 19, 2023 Papers acceptance notifications: June, 16, 2023 Camera-ready papers due: June, 30, 2023 Registration for participation without submissions deadline: August 30th, 2023 Program committee Andrius Utka, Vytautas Magnus University, Lithuania Chaya Liebeskind, Jerusalem College of Technology, Israel Ciprian-Octavian Truica, Uppsala University, Sweden Cosimo Palma, University of Naples “L’Orientale” – University of Pisa, Italy Dagmar Gromann, University of Vienna, Austria Enriketa Sogutlu, University College Bedër, Albania Giedre Valunaite Oleskeviciene, Mykolas Romeris University, Lithuania Hugo Gonçalo Oliveira, University of Coimbra, Portugal Jorge Gracia del Río University of Zaragoza, Spain Mariana Damova, Mozaika, Bulgaria Michael Cochez, Vrije Universiteit Amsterdam, Netherlands Purificação Silvano, University of Porto, Portugal Radovan Garabík, Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences, Slovakia Sigita Rackevičienė, Mykolas Romeris University, Lithuania Organizing committee Giedrė Valūnaitė Oleškevičienė, Mykolas Romeris University, Lithuania Radovan Garabík, Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences, Slovakia Chaya Liebeskind, Jerusalem College of Technology, Israel Cosimo Palma, University of Naples “L’Orientale” – University of Pisa, Italy Enriketa Sogutlu, University College Bedër, Albania Purificação Silvano, University of Porto, Portugal Sigita Rackevičienė, Mykolas Romeris University, Lithuania Contact: gvalunaite(a)mruni.eu

1 0

CFP Dataset Released - ClinAIS Shared Task at IberLEF2023
by iker.delaiglesia＠ehu.eus 08 Mar '23

08 Mar '23

IberLEF 2023 Task ClinAIS: Automatic Identification of Sections in Clinical Documents Website: https://ixa2.si.ehu.eus/clinais/ ClinAIS: will be organized as part of IberLEF 2023, at the SEPLN 2023 Conference (Jaen, September 2023). TRAIN AND DEV DATA AVAILABLE NOW ! The ClinAIS task presented at IberLEF 2023 aims to tackle the problem of automatic identification of sections in unstructured Spanish clinical documents. The task is focused on identifying 7 predefined medical sections: Present Illness, Derived from/to, Past Medical History, Family history, Exploration, Treatment and Evolution in ECNs, mainly progress notes. The successful resolution of this task will enable the improvement of higher level applications that can extract valuable, actionable information from clinical documents, such as medical entity recognition, patient cohort retrieval, and temporal relation extraction. This will ultimately improve patient care and clinical decision-making. IMPORTANT DATES ✔ March 2023 Release of Train + Dev Sets and Evaluation Library - April 2023 Release of Test and Background Set - May 2023 Submission of Results. - May 2023 System Paper Submission Deadline. - June 2023 Notification to Authors. - June 2023 Camera Ready Submission Deadline. - September 2023 Publication of Proceedings. - September 2023 IberLEF within SEPLN 2023. ORGANIZERS HiTZ Center Iker de la Iglesia, Research Scientist Aitziber Atutxa, Associate Professor at UPV/EHU Koldo Gojenola, Associate Professor at UPV/EHU Esther Miranda, Technical Staff IOMED María Vivó, NLP Data Scientist Paula Chocrón, NLP Engineer & Researcher Gabriel de Maeztu, Co-founder & CTO at IOMED Contact: ixa.iomed-clinais(a)ehu.es Registration (please register in both the official website and CodaLab): https://ixa2.si.ehu.eus/clinais/registration https://codalab.lisn.upsaclay.fr/competitions/10751

1 0

2026

2025

2024

2023

2022

Corpora March 2023