** apologies for cross-posting **
Linking Lexicographic and Language Learning Resources (4LR)
Workshop at LDK 2023 – Call for Papers
Workshop website: https://lexicala.com/4lr/
The workshop ‘Linking Lexicographic and Language Learning Resources’ (4LR) will be held in conjunction with LDK 2023 – 4th conference on Language, Data and Knowledge – (http://2023.ldk-conf.org/) at the University of Vienna, Austria, on September 13 (tentative), in hybrid mode.
The aim of this workshop is to explore linguistic linked (open) data and knowledge management methods and technologies for linking lexicographic and language learning resources, tools and applications in general and dictionaries and CEFR lists in particular.
Our starting point is, on the one hand, enhancing CEFR-graded language proficiency lists with lexicographic content and, on the other hand, incorporating CEFR labels in learner’s dictionaries. CEFR – the Common European Framework of Reference for Languages – is a generally established international standard for describing language proficiency, and CEFR-graded resources have been developed for many languages in Europe. However, incorporating their information is still not a common practice in modern lexicography for most languages, except for notably two English dictionaries for advanced learners (Cambridge and Oxford). There are substantial unsolved issues, such as inconsistencies in vocabulary size per level between languages; no, or limited, sense disambiguation in CEFR resources; words from a higher CEFR level in definitions and example sentences. Moreover, there has been limited collaboration and interoperability so far among the related fields of lexicography, language acquisition, and linguistic linked data, whether regarding research, development, or practical application.
4LR will feature an overview by the organizers, as well as an invited talk by Jorge Gracia from University of Zaragoza and chair of NexusLinguarum CA on Linked Data for Lexicographic Resources.
In addition, we invite submissions for papers (20 minutes, plus discussion) on the following topics:
• Linking lexicographic content to CEFR-graded vocabularies
• Pedagogical lexicography and knowledge graphs
• Attributing CEFR labels in learner’s dictionaries
• Incorporating vocabulary and grammar profiles in lexicographic resources
• Creating and linking crosslingual concept-based CEFR resources
• Multilingual knowledge management and language learning applications and tools
SUBMISSION AND DATES
Please submit your abstract including 300-500 words via EasyChair [https://easychair.org/conferences/?conf=4lr2023].
19 May 2023 Deadline for abstract submission
29 May 2023 Deadline for notification for abstract submission
30 June 2023 Deadline for camera-ready paper submission
13 Sep 2023 (tentative) 4LR workshop
14–15 Sep LDK 2023 conference
ORGANIZERS AND CONTACT
Kris Heylen. Dutch Language Institute (Kris DOT Heylen AT ivdnt DOT org)
Jelena Kallas. Institute of the Estonian Language
Ilan Kernerman. Lexicala by K Dictionaries
Carole Tiberius. Dutch Language Institute
Website: https://lexicala.com/4lr/
4LR is supported by NexusLinguarum COST Action (CA18209) – European network for Web-centered linguistic data science.
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4LR workshop at LDK 2023 will follow a related workshop ‘Lexicography and CEFR: Linking Lexicographic Resources and Language Proficiency Levels’ that will be held in conjunction with eLex 2023 on June 29 in Brno, Czech Republic.
[apologies for cross-posting]
Call for Participation in Clinkart@EVALITA2023
Linking a Lab Result to its Test Event in the Clinical Domain
***SYSTEM SUBMISSION DEADLINE: MAY 19, 2023***
We invite you to participate in the Clinkart task on the extraction of
relations from clinical cases. Clinkart is organised in the context of
Evalita 2023 and the task mainly consists in identifying test results and
measurements and linking them to the textual mentions of the laboratory
tests and measurements from which they were obtained, as in the example
below:
All’ECG RS 66 bpm, deviazione assiale sinistra, BBD incompleto.
66 bpm → RS
Check the website for more detailed information and for downloading the
data: https://e3c.fbk.eu/clinkart.
Participants are invited to submit their results by May 19th (by email to
clinkart[at]fbk.eu). Each team is allowed to submit up to 2 different runs.
Assessment of the results will be returned to the participants by May 30th.
The evaluation dataset is based on the European Clinical Case Corpus (E3C),
which consists of clinical cases in five European languages and is freely
available (CC-BY-NC-4.0).
Participants are encouraged to also use the data of the TESTLINK task
(which focuses on Spanish and Basque) to train their systems (TESTLINK
website: https://e3c.fbk.eu/testlinkiberlef).
Join us!
The Clinkart organising team
#Sorry for the multiple posts
========================================================================
Call for Papers
2023 IEEE International Conference on Big Data (IEEE BigData 2023)
http://bigdataieee.org/BigData2023/
December 15-18, 2023 – Sorrento, Italy
========================================================================
In recent years, “Big Data” has become a new ubiquitous term. Big Data is
transforming science, engineering, medicine, healthcare, finance, business,
and ultimately our society itself. The IEEE Big Data conference series
started in 2013 has established itself as the top tier research conference
in Big Data.
•The first conference IEEE Big Data 2013 had more than 400 registered
participants from 40 countries
( http://bigdataieee.org/BigData2013/) and the regular paper acceptance
rate is 17.0%.
•The IEEE Big Data 2020 (http://bigdataieee.org/BigData2020/, regular paper
acceptance rate: 15.7%) was held online, Dec 10-13, 2020 with close to 1100
registered participants from 50 countries
•The IEEE Big Data 2021 (http://bigdataieee.org/BigData2021/, regular paper
acceptance rate: 19.9%) was held online, Dec 15-18, 2021 with close to 1089
registered participants from 52 countries
•The IEEE Big Data 2022 (http://bigdataieee.org/BigData2022/, regular paper
acceptance rate: 19.2%) was held in Osaka, Japan, Dec 17-20, 2022 with
close to 1250 registered participants from 54 countries.
The 2023 IEEE International Conference on Big Data (IEEE BigData 2023) will
continue the success of the previous IEEE Big Data conferences. It will
provide a leading forum for disseminating the latest results in Big Data
Research, Development, and Applications.
We solicit high-quality original research papers (and significant
work-in-progress papers) in any aspect of Big Data with emphasis on 5Vs
(Volume, Velocity, Variety, Value and Veracity), including the Big Data
challenges in scientific and engineering, social, sensor/IoT/IoE, and
multimedia (audio, video, image, etc.) big data systems and applications.
The conference adopts single-blind review policy. We expect to have a very
high quality and exciting technical program at Osaka, Japan this year.
Example topics of interest includes but is not limited to the following:
1.Big Data Science and Foundations
a.Novel Theoretical Models for Big Data
b.New Computational Models for Big Data
c.Data and Information Quality for Big Data
d.New Data Standards
2.Big Data Infrastructure
a.Cloud/Grid/Stream Computing for Big Data
b.High Performance/Parallel Computing Platforms for Big Data
c.Autonomic Computing and Cyber-infrastructure, System Architectures,
Design and Deployment
d.Energy-efficient Computing for Big Data
e.Programming Models and Environments for Cluster, Cloud, and Grid
Computing to Support Big Data
f.Software Techniques and Architectures in Cloud/Grid/Stream Computing
g.Big Data Open Platforms
h.New Programming Models for Big Data beyond Hadoop/MapReduce, STORM
i.Software Systems to Support Big Data Computing
3.Big Data Management
a.Data Acquisition, Integration, Cleaning, and Best Practices
b.Computational Modeling and Data Integration
c.Large-scale Recommendation Systems and Social Media Systems
d.Cloud/Grid/Stream Data Mining- Big Velocity Data
e.Mobility and Big Data
f.Multimedia and Multi-structured Data- Big Variety Data
g.Compliance and Governance for Big Data
4.Big Data Search and Mining
a.Social Web Search and Mining
b.Web Search
c.Algorithms and Systems for Big Data Search
d.Distributed, and Peer-to-peer Search
e.Big Data Search Architectures, Scalability and Efficiency
f.Link and Graph Mining
g.Semantic-based Data Mining and Data Pre-processing
h.Search and Mining of variety of data including scientific and
engineering, social, sensor/IoT/IoE, and multimedia data
5.Big Data Learning and Analytics
a.Predictive analytics on Big Data
b.Machine learning algorithms for Big Data
c.Deep learning for Big Data
d.Feature representation learning for Big Data
e.Dimension reduction for Big Data
f.Physics informed Big Data learning
g.Visualization Analytics for Big Data
6.Data Ecosystem
a.Data ecosystem concepts, theory, structure, and process
b.Ecosystem services and management
c.Methods for data exchange, monetization, and pricing
d.Trust, resilience, privacy, and security issues
e.Privacy preserving Big Data collection/analytics
f.Trust management in Big Data systems
g.Ecosystem assessment, valuation, and sustainability
h.Experimental studies of fairness, diversity, accountability, and
transparency
7.Big Data Applications
a.Complex Big Data Applications in Science, Engineering, Medicine,
Healthcare, Finance, Business, Law, Education, Transportation, Retailing,
Telecommunication
b.Big Data Analytics in Small Business Enterprises (SMEs),
c.Big Data Analytics in Government, Public Sector and Society in General
d.Real-life Case Studies of Value Creation through Big Data Analytics
e.Big Data as a Service
f.Big Data Industry Standards
g.Experiences with Big Data Project Deployments
INDUSTRIAL & Government Track
------------------------------
The Industrial Track solicits papers describing implementations of Big Data
solutions relevant to industrial settings. The focus of industry track is
on papers that address the practical, applied, or pragmatic or new research
challenge issues related to the use of Big Data in industry. We accept full
papers (up to 10 pages) and extended abstracts (2-4 pages).
The Government Track welcomes papers discussing the usefulness and need for
publicly-contribution big data and open data and their use. Specifically,
data utilization scenarios, needs analysis, data utilization obstacle
analysis and solutions, data integration processes, interfaces as data
utilization solutions, visualization, use cases, evidence-based policy
making, building an ecosystem for solving social issues, analyzing their
cases, comparing international and regional differences, and conducting
comparative surveys before and after specific events (like Covid-19). We
are also looking for other big data solutions related to national and local
governments, and public services.
Please submit an extended abstract (2-4 pages) OR a full-length paper (up
to 10 pages) through the online submission page (Industrial & Government
Track dedicated page)
Paper Submission:
-----------------
Please submit a full-length paper (up to 10 page IEEE 2-column format,
reference pages counted in the 10 pages) through the online submission
system.
https://wi-lab.com/cyberchair/2023/bigdata23/index.php
Papers should be formatted to IEEE Computer Society Proceedings Manuscript
Formatting Guidelines (see link to "formatting instructions" below).
https://www.ieee.org/conferences/publishing/templates.html
Important Dates:
----------------
- Electronic submission of full papers: Sept 3, 2023
- Notification of paper acceptance: Oct 27, 2023
- Camera-ready of accepted papers: Nov 17, 2023
- Conference: Dec 15-18, 2023
========================================================================
Tsuyoshi Okita
Kyushu Institute of Technology
AI department
*** Apologies for cross-posting ***
++ DEADLINE EXTENSION ++
=============================================================================================================================
The 1st International Workshop on Implicit Author Characterization from Texts for Search and Retrieval (IACT’23)
The workshop will be held in conjunction with the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval.
Workshop website: https://en.sce.ac.il/news/iact23
Date: July 27, 2023
Location: Taipei, Taiwan.
Paper submission deadline: Extended to May 23, 2023, AoE
Submission link: https://easychair.org/conferences/?conf=iact23
To bring the research community's attention to the limitations of current models in recognizing and characterizing AI vs. human authors, we organize the first edition of IACT workshops under the umbrella of the SIGIR conference. Research works submitted to the workshop should foster scientific advances in all aspects of author characterization.
Submission Guidelines:
All papers must be original and not simultaneously submitted to another journal or conference. The following paper categories are welcome:
- Full research papers: up to 8 pages. Original and high-quality unpublished contributions to the theory and practical aspects of the workshop topics.
- Short research papers: up to 5 pages. It can describe ongoing research, resources, and demos.
- Negative results papers: up to 5 pages. Highlighting tested hypotheses that did not get the expected outcome is also welcomed.
- Position papers: up to 5 pages. Discussing current and future research directions.
The length constraints do not include references.
The submissions must be anonymous and will be peer-reviewed by at least two program committee members.
Workshop Format:
The authors of accepted papers will be given 15 minutes for a short oral presentation. The workshop will run as a hybrid event to allow virtual attendance and meet the SIGIR format.
Workshop Topics:
Research works submitted to the workshop should foster the scientific advance on all aspects of implicit author information extraction from text, including but not limited to the following:
- Differentiation between AI-generated content and human-generated content and bot profiling
- Characterization of conversational agents
- Feature detection of authors for human vs. AI determination
- Prompt understanding and recognition in language models
- Personalized question-answering and conversation generation
- Troll identification on social media
- Review authenticity estimation
- Multi-modal, multi-genre, and multilingual author analysis
- Character analysis, description, and representation in narrative texts
- Detecting implicit expressions of sentiment, emotion, opinion, and bias
- Transfer learning for implicit author characterization
- Implicit author characterization annotation schema
- Evaluation of implicit author characterization
- Author characterization in low-resource languages and under-studied domains
- Accountability and regulation of AI-based information extraction, retrieval, and content generation
- Copyright issues of AI-generated content
- Ethical and privacy implications of author characterization and implicit information extraction
- Fairness and bias of AI-generated content
Organizing Committee:
Marina Litvak - marinal(a)ac.sce.ac.il; Shamoon College of Engineering Beer Sheva; Israel
Irina Rabaev - irinar(a)ac.sce.ac.il; Shamoon College of Engineering Beer Sheva; Israel
Alípio Mário Jorge - amjorge(a)fc.up.pt; University of Porto; Porto, Portugal
Ricardo Campos - ricardo.campos(a)ipt.pt; Polytechnic Institute of Tomar INESC TEC, Portugal; Porto, Portugal
Adam Jatowt - adam.jatowt(a)uibk.ac.at; University of Innsbruck; Innsbruck, Austria
Invited Speakers:
Prof. Mark Last - Ben-Gurion University of the Negev, Israel
Prof. Dr. Valia Kordoni - Humboldt-Universität Berlin, Germany
Contacts:
Dr. Marina Litvak: litvak.marina(a)gmail.com
Dr. Irina Rabaev: irinar(a)ac.sce.ac.il
All the best,
Hugo Sousa on behalf of the IATC'23 Organizers
[Apologies for multiple postings]
LREC-COLING 2024
The 2024 Joint International Conference on Computational Linguistics,
Language Resources and Evaluation
Lingotto Conference Centre - Torino (Italy)
20-25 May, 2024
https://lrec-coling-2024.lrec-conf.org
Twitter: @LrecColing2024 <https://twitter.com/LrecColing2024>
*First Call for papers***
Two international key players in the area of computational linguistics,
the ELRA Language Resources Association (ELRA) and the International
Committee on Computational Linguistics (ICCL), are joining forces to
organize the 2024 Joint International Conference on Computational
Linguistics, Language Resources and Evaluation (LREC-COLING 2024) to be
held in Torino, Italy on 20-25 May, 2024.
*IMPORTANT DATES*
(All deadlines are 11:59PM UTC-12:00 (“anywhere on Earth”)
* 22 September 2023: Paper anonymity period starts
* 13 October 2023: Final submissions due (long, short and position
papers)
* 13 October 2023: Workshop/Tutorial proposal submissions due
* 22–29 January 2024: Author rebuttal period
* 5 February 2024: Final reviewing
* 19 February 2024: Notification of acceptance
* 25 March 2024: Camera-ready due
* 20-25 May 2024: LREC-COLING2024 conference
*SUBMISSION TOPICS *
LREC-COLING 2024 invites the submission of long and short papers
featuring substantial, original, and unpublished research in all aspects
of natural language and computation, language resources (LRs) and
evaluation, including spoken and sign language and multimodal
interaction. Submissions are invited in five broad categories: (i)
theories, algorithms, and models, (ii) NLP applications, (iii) language
resources, (iv) NLP evaluation and (v) topics of general interest.
Submissions that span multiple categories are particularly welcome.
*(i) Theories, algorithms, and models*
* Discourse and Pragmatics
* Explainability and Interpretability of Large Language Models
* Language Modeling
* CL/NLP and Linguistic Theories
* CL/NLP for Cognitive Modeling and Psycholinguistics
* Machine Learning for CL/NLP
* Morphology and Word Segmentation
* Semantics
* Tagging, Chunking, Syntax and Parsing
* Textual Inference
*(ii) NLP applications*
* Applications (including BioNLP and eHealth, NLP for legal purposes,
NLP for Social Media and Journalism, etc.)
* Dialogue and Interactive Systems
* Document Classification, Topic Modeling, Information Retrieval and
Cross-Lingual Retrieval
* Information Extraction, Text Mining, and Knowledge Graph Derivation
from Texts
* Machine Translation for Spoken/Written/Sign Languages, and
Translation Aids
* Sentiment Analysis, Opinion and Argument Mining
* Speech Recognition/Synthesis and Spoken Language Understanding
* Natural Language Generation, Summarization and Simplification
* Question Answering
* Offensive Speech Detection and Analysis
* Vision, Robotics, Multimodal and Grounded Language Acquisition
*(iii) Language resource design, creation, and use: text, speech, sign,
gesture, image, in single or multimodal/multimedia data*
* Guidelines, standards, best practices and models for LRs,
interoperability
* Methodologies and tools for LRs construction, annotation, and
acquisition
* Ontologies, terminology and knowledge representation
* LRs and Semantic Web (including Linked Data, Knowledge Graphs, etc.)
* LRs and Crowdsourcing
* Metadata for LRs and semantic/content mark-up
* LRs in systems and applications such as information extraction,
information retrieval, audio-visual and multimedia search, speech
dictation, meeting transcription, Computer-Aided Language Learning,
training and education, mobile communication, machine translation,
speech translation, summarisation, semantic search, text mining,
inferencing, reasoning, sentiment analysis/opinion mining,
(speech-based) dialogue systems, natural language and
multimodal/multisensory interactions, chatbots, voice-activated
services, etc.
* Use of (multilingual) LRs in various fields of application like
e-government, e-participation, e-culture, e-health, mobile
applications, digital humanities, social sciences, etc.
* LRs in the age of deep neural networks
* Open, linked and shared data and tools, open and collaborative
architectures
* Bias in language resources
* User needs, LT for accessibility
*(iv) NLP evaluation methodologies*
* NLP evaluation methodologies, protocols and measures
* Benchmarking of systems and products
* Evaluation metrics in Machine Learning
* Usability evaluation of HLT-based user interfaces and dialogue systems
* *User satisfaction evaluation*
**
*(v) Topics of general interest*
* Multilingual issues, language coverage and diversity, less-resourced
languages
* Replicability and reproducibility issues
* Organisational, economical, ethical and legal issues
* Priorities, perspectives, strategies in national and international
policies
* International and national activities, projects and initiatives
LREC-COLING 2024 invites high-quality submissions written in
English. Submissions of three forms of papers will be considered:
A. Regular long papers - up to eight (8) pages maximum*, presenting
substantial, original, completed, and unpublished work.
B. Short papers - up to four (4) pages*, describing a small focused
contribution, negative results, system demonstrations, etc.
C. Position papers - up to eight (8) pages*, discussing key hot topics,
challenges and open issues, as well as cross-fertilization between
computational linguistics and other disciplines.
* Excluding any number of additional pages for references, ethical
consideration, conflict-of-interest, as well as data and code
availability statements.
Appendices or supplementary material will be allowed ONLY in the final,
camera-ready version, but not during submission, as papers should be
reviewed without the need to refer to any supplementary materials.
Linguistic examples, if any, should be presented in the original
language but also glossed into English to allow accessibility for a
broader audience.
Note that paper types are decisions made orthogonal to the eventual,
final form of presentation (i.e., oral versus poster).
*AUTHOR RESPONSIBILITIES*
Papers must be of original, previously-unpublished work. Papers must be
anonymized to support double-blind reviewing. Submissions thus must not
include authors’ names and affiliations. The submissions should also
avoid links to non-anonymized repositories: the code should be either
submitted as supplementary material in the final version of the paper,
or as a link to an anonymized repository (e.g., Anonymous GitHub
<https://anonymous.4open.science/> or Anonym Share
<https://anonymshare.com/>). Papers that do not conform to these
requirements will be rejected without review.
If the paper is available as a preprint, this must be indicated on the
submission form but not in the paper itself. In addition, LREC-COLING
2024 will follow the same policy as ACL conferences establishing an
anonymity period during which non-anonymous posting of preprints is not
allowed.
More specifically, direct submissions to LREC-COLING 2024 may not be
made available online (e.g. via a preprint server) in a non-anonymized
form after September 22, 11:59PM UTC-12:00 (for arXiv, note that this
refers to submission time).
Also included in that policy are instructions to reviewers to not rate
papers down for not citing recent preprints. Authors are asked to cite
published versions of papers instead of preprint versions when possible.
Papers that have been or will be under consideration for other venues at
the same time must be declared at submission time. If a paper is
accepted for publication at LREC-COLING 2024, it must be immediately
withdrawn from other venues. If a paper under review at LREC-COLING 2024
is accepted elsewhere and authors intend to proceed there, the
LREC-COLING 2024 committee must be notified immediately.
*ETHICS STATEMENT *
We encourage all authors submitting to LREC-COLING 2024 to include an
explicit ethics statement on the broader impact of their work, or other
ethical considerations after the conclusion but before the references.
The ethics statement will not count toward the page limit (8 pages for
long, 4 pages for short papers).
*PRESENTATION REQUIREMENT *
All papers accepted to the main conference track must be presented at
the conference to appear in the proceedings, and at least one author
must register for LREC-COLING2024.
All papers accepted to the main conference will be required to submit a
presentation video. The conference will be hybrid, with an emphasis on
encouraging interaction between the online and in-person modalities, and
thus presentations can be either on-site or virtual.
Dear all,
due to several requests we decided to extend the deadlines for the R&I
track.
The new dates are as follows:
Abstract Submission Deadline: May 23, 2023 (11:59 pm, Hawaii
time - originally May 9)
Paper Submission Deadline: May 30, 2023 (11:59 pm, Hawaii time
- originally May 16)
Notification of Acceptance: June 20, 2023 (11:59 pm, Hawaii time)
Camera-Ready Paper: July 04, 2023 (11:59 pm, Hawaii time)
For details please go to: https://2023-eu.semantics.cc/page/cfp_rev_rep
Looking forward to your submissions! Stay tuned and stay safe!
With kind regards,
Maribel Acosta & Silvio Peroni
-- R&I Track Chairs --
Free registration is now open https://bit.ly/3pwUwlG - tickets are limited and open to non-authors.
Call for Abstracts: 1st Workshop on Readability for Low Resourced Languages (RLRL 2023)
Dear all, join us for an exciting online workshop where experts in natural language processing will come together to discuss the latest research and innovative approaches to assessing the readability of low-resource languages. The workshop will take place as a free online event on September 5, 2023, and is being hosted jointly by Lancaster University, Sheffield Hallam University and King Saud University.
We welcome researchers and practitioners to submit abstract proposals of up to 500 words for talks related to the development of a Readability Framework for low-resource languages. The extended versions of the accepted abstracts will appear in the Computing Research Repository (CoRR), subject to the number of abstracts received and being in English.
Although the workshop will be conducted in English, for the first time, we are accepting submissions in a different language, starting with Arabic. Arabic speaking authors, are encouraged to submit their abstracts in Arabic. Presentations will be recorded with subtitles pre-added in by the authors. The organisers will live translate the Q&As. We plan to extend this option to other languages in future events. Due to lack of resources we are unable to provide live translation from English to Arabic during the workshop but we are open for free-of-charge solutions (please send suggestion to the organisers directly).
The ultimate goal of the workshop is to discuss best practices and state-of-the-art AI-based approaches to create mathematical representations of expected readability levels at different school grade or cognitive ability levels. The workshop will also focus on utilising classifiers that are intuitive for humans to understand and adjust, enabling the analysis and improvement of the decision-making criteria. We welcome abstracts on work that is still in progress or that does not yet have conclusive results. We encourage authors to share their work at various stages of development to facilitate discussions and collaboration during the workshop.
Important Dates:
- Due date for workshop abstract submission: July 17, 2023
- Notification of abstract acceptance to authors: August 1, 2023
- Workshop date: September 5, 2023 (online event)
We are pleased to announce the following keynote speakers for the workshop:
- Professor Laurence Anthony - Faculty of Science and Engineering at Waseda University, Japan.
- Dr Violetta Cavalli-Sforza - School of Science and Engineering at Al Akhawayn University, Morocco.
The main objectives of the workshop are three-fold:
1- Increase awareness of the importance of readability in low-resource languages and its impact on language learning and literacy.
2- Discuss the challenges of readability in low-resource languages, such as limited resources and lack of standardization, and brainstorm strategies for addressing these challenges.
3- Foster a community of practice among participants, allowing them to share their experiences and best practices for addressing readability issues in low-resource languages.
Abstract submission:
Abstract submission page is now open, please submit abstracts of no more than 500 words either in English or Arabic https://easychair.org/conferences/?conf=rlrl2023
Topics of interest include, but are not limited to:
- Machine learning for text readability
- Applications of readability assessment
- Readability in low-resource languages
- Comprehensibility measures
- Mathematical representations of readability levels
- Text simplification for low-resource languages
- Readability and comprehensibility in language learning
- The effects of text simplification on readability
- Readability frameworks for indigenous languages
- Updating readability representations
We look forward to your contributions and to a productive and enlightening workshop on September 5, 2023.
RLRL 2023 Organisers:
- Dr Mo El-Haj (SCC/DSI/UCREL, Lancaster University)
- Dr Abdel-Karim Al Tamimi (CSSE, Sheffield Hallam University)
- Prof. Hend Al Khalifa (iWAN, King Saud University)
https://wp.lancs.ac.uk/acc/rlrl2023/
Best wishes,
Mahmoud
---------------------
Dr Mo El-Haj
Senior Lecturer in NLP
Co-Director of UCREL NLP Group
Strategic Lead of Arabic and Financial NLP Research
School of Computing and Communications, Lancaster University
https://www.lancaster.ac.uk/staff/elhaj
*Updated the deadlines*
- Paper submission deadline: May 21, 2023 AoE
- Submission notification date: May 31, 2023
*==============================================================================================*
*The SIGIR '23 Workshop on Knowledge Discovery from Unstructured Data in
Financial Services (KDF)*
Artificial intelligence (AI) and information retrieval (IR) systems and
techniques have been widely adopted in financial services to tackle various
tasks, such as information retrieval from business documents, retrieval
from non-textual content like tables and graphs, recommending financial
products and services to customers, providing decision support for
investment practices, automating of due diligence protocols, detecting
fraudulent transactions, financial sentiment analysis on social media, and
understanding Environmental, Social and Governance (ESG) impact on
investment practices.
Knowledge from IR systems can help augment human intelligence. However,
discovering and extracting the knowledge conveyed inside unstructured
financial data, like SEC filings, prospectuses, business reports, and other
enterprise documents are extremely challenging due to the massive volume of
data, large variation in the data format, low signal-to-noise ratio,
scarcity of expert annotated datasets, task ambiguity, hurdles regarding
data integrity and privacy, robustness against domain shift, and
high-performance requirements set by industry and regulatory standards.
Manual extraction of knowledge is usually inefficient, error-prone, and
inconsistent, so it is one of the key technical bottlenecks for financial
services companies to accelerate their operating productivity. These
challenges and issues call for robust artificial intelligence, information
retrieval, and machine learning algorithms and systems to help. The
automated processing of unstructured data to discover knowledge from
complex financial documents requires bringing together a suite of
techniques such as natural language processing, information retrieval,
semantic analysis, and complex reasoning. In addition, how knowledge is
captured and represented, synthesized across diverse sources, and used
within AI systems, is crucial to developing effective solutions in
financial services.
Furthermore, based on the reflections and feedback from our past KDF
workshops, the 2023 workshop is particularly interested in multi-modal
understanding of financial documents, retrieving and reasoning over tabular
data within financial documents, and financial domain-specific
representation learning. The workshop will be composed of three components:
invited talks, paper presentations, along with a shared task competition.
We cordially welcome researchers, practitioners, and students from academic
and industrial communities who are interested in the topics to participate
and/or submit their original work. *The workshop will be a hybrid event –
supporting both in-person and virtual participation.*
Topics of Interest
The topics of the workshop include, but are not limited to, the following
areas:
- AI and IR technologies for business document understanding for
financial corpora, including searching and question answering systems,
understanding and reasoning over non-textual content such as tables and
graphs;
- representation learning, and distributed representation learning and
encoding in natural language processing for financial documents;
- language modeling on financial corpora including tabular and numerical
data, and multi-modal modeling;
- multi-source knowledge integration and fusion, and knowledge alignment
and integration from heterogeneous data;
- reconciling unstructured knowledge with structured knowledge and human
expertise;
- named-entity disambiguation, recognition, resolution, relationship
discovery, ontology learning and extraction in financial and business
documents;
- AI-assisted domain data tagging, labeling, and annotation for IR
tasks; automatic data extraction from financial filings and quality
verification;
- corporate ESG event discovery, evaluation, and impact assessment;
- event discovery from alternative data and impact on corporate equity
pricing;
- AI and IR systems for financial risk assessment on financial legal
documents such as contracts and prospectuses;
- verifying facts and statements generated by large pre-trained language
models using IR and knowledge discovery;
- IR or QA techniques and applications on financial documents leveraging
large language models.
Submission GuidelinesWe invite submissions of relevant work that be of
interest to the workshop. All submissions must be original contributions
that have not been previously published and that are not currently under
review by other conferences or journals. Submissions will be peer reviewed,
single-blinded. Submissions will be assessed based on their novelty,
technical quality, significance of impact, interest, clarity, relevance,
and reproducibility. All submissions must be in PDF format and follow the
current ACM two-column conference format
https://www.acm.org/publications/proceedings-template. We accept two types
of submissions:· full research paper: no longer than 9 pages
(including references, proofs, and appendixes).· short/poster
paper: no longer than 4 pages (including references, proofs, and
appendixes).Submission will be accepted via Microsoft CMT
https://cmt3.research.microsoft.com/KDF2023/. All accepted submissions will
be presented in the workshop. Submission will be non-archival, and the
authors may post their work on arXiv or other online repositories.Important
Dates
- Paper abstract due (optional): May 1, 2023 AoE
- Paper submission deadline: May 21, 2023 AoE
- Submission notification date: May 31, 2023
- Workshop: July 27, 2023
Organizing Committee· Sameena Shah - JPMorgan AI Research·
Xiaodan
Zhu - Queen's University· Wenhu Chen - University of Waterloo·
Manling Li - University of Illinois Urbana-Champaign· Armineh
Nourbakhsh - JPMorgan AI Research· Xiaomo Liu - JPMorgan AI Research
· Zhiqiang Ma - JPMorgan AI Research· Charese Smiley -
JPMorgan AI Research· Yulong Pei - JPMorgan AI Research· Akshat
Gupta - JPMorgan AI ResearchWorkshop Website
http://kdf-workshop.github.io/kdf23
*Contact*
For general inquiries about KDF, please write to the organizers at
kdf.workshop(a)gmail.com.
--
Best,
Xiaomo