Dear colleagues,
The UCSC Natural Language Processing (NLP) master’s degree program
<http://nlp.ucsc.edu/> provides both depth and breadth in core algorithms
and methods for NLP. Taught intensively over 15-18 months, our program
design combines theoretical learning with hands-on practice to ensure our
students have the right skill set to prepare for a professional career in
this fast-growing field. We are currently accepting applications for Fall
2023 admission consideration.
Program Highlights:
-
A 15-18 month program with a Capstone project mentored by NLP faculty
and industry experts.
-
NLP students receive an in-depth, systematic education in NLP, machine
learning, and data science and analytics with faculty who have both
academic and industry experience.
-
Our Industry Advisory Board provides insight and career advice through
mentoring, guest lectures, and attendance at networking and professional
development events.
-
All our courses are exclusive to NLP students and are designed with
input from our Industry Advisory Board ensuring the content is current,
relevant, and grounded in real-world context.
-
Our program is based at state-of-the-art facilities at the UCSC Silicon
Valley Campus, located in Santa Clara, California.
Applying to the NLP MS Program
This program is intended for students with a strong background in computer
science. We are looking for multifaceted individuals with solid skills in
programming, algorithms, machine learning, probability, statistics, and
linguistics. Visit our Admissions page <http://nlp.ucsc.edu/admissions> to
review admission requirements and tips for applying, and to connect with
our support team.
Applications for Fall 2023 admission consideration are now open. Apply by
March 1, 2023 <https://applygrad.ucsc.edu/apply/>.
If you have questions about the program or the application process, please
contact the NLP Support Team.
The NLP Program Team
Natural Language Processing MS Program <http://nlp.ucsc.edu/>
Baskin Engineering
University of California, Santa Cruz
Hi everyone,
We’re looking for a summer intern Computational Linguist to join our team
at Linkedin!
https://www.linkedin.com/jobs/view/3448033554
Please feel free to reach out if you have any enquiries about this
internship, or pass this along to any student you think would enjoy working
in Computational Linguistics and Language Technologies this summer.
Many thanks,
Kind regards,
Lamia
DMR 2023, the Fourth International Workshop on Designing Meaning
Representations, will be co-located with IWCS 2023
<http://iwcs2023.loria.fr/>. It will be held on June 20th, 2023 in Nancy,
France.
While deep learning methods have led to many breakthroughs in practical
natural language applications, most notably in Machine Translation, Machine
Reading, Question Answering, Recognizing Textual Entailment, and so on,
there is still a sense among many NLP researchers that we have a long way
to go before we can develop systems that can actually “understand” human
language and explain the decisions they make. Indeed, “understanding”
natural language entails many different human-like capabilities, and they
include but are not limited to the ability to track entities in a text,
understand the relations between these entities, track events and their
participants described in a text, understand how events unfold in time, and
distinguish events that have actually happened from events that are planned
or intended, are uncertain, or did not happen at all. We believe a critical
step in achieving natural language understanding is to design meaning
representations for text that have the necessary meaning “ingredients” that
help us achieve these capabilities. Such meaning representations can also
potentially be used to evaluate the compositional generalization capacity
of deep learning models.
There has been a growing body of research devoted to the design,
annotation, and parsing of meaning representations in recent years. The
meaning representations that have been used for semantic parsing research
are developed with different linguistic perspectives and practical goals in
mind and have different formal properties. Formal meaning representation
frameworks such as Minimal Recursion Semantics (MRS) and Discourse
Representation Theory (as exemplified in the Parallel Meaning Bank) are
developed with the goal of supporting logical inference in reasoning-based
AI systems and are therefore easily translatable into first-order logic,
requiring proper representation of semantic components such as
quantification, negation, tense, and modality. Other meaning representation
frameworks such as Abstract Meaning Representation (AMR), Tecto-grammatical
Representation (TR) in Prague Dependency Treebanks and the Universal
Conceptual Cognitive Annotation (UCCA), put more emphasis on the
representation of core predicate-argument structure, lexical semantic
information such as semantic roles and word senses, or named entities and
relations. There is also a more recent effort in developing a Uniform
Meaning Representation (UMR) that is based on AMR but extends it to
cross-linguistic settings and enhances it to represent document-level
semantic content. The automatic parsing of natural language text into these
meaning representations and the generation of natural language text from
these meaning representations are also very active areas of research, and a
wide range of technical approaches and learning methods have been applied
to these problems.
This workshop will bring together researchers who are producers and
consumers of meaning representations, and through their interaction develop
a deeper understanding of the key elements of meaning representations that
are the most valuable to the NLP community. The workshop will also provide
an opportunity for meaning representation researchers to critically examine
existing frameworks with the goal of using their findings to inform the
design of next-generation meaning representations. A third goal of the
workshop is to explore opportunities and identify challenges in the design
and use of meaning representations in multilingual settings. A final goal
of the workshop is to understand the relationship between distributed
meaning representations trained on large data sets using network models,
and the symbolic meaning representations that are carefully designed and
annotated by NLP researchers and gain a deeper understanding of areas where
each type of meaning representation is the most effective.
The workshop will solicit papers that address one or more of the following
topics:
— Design and annotation of meaning representations;
— Cross-framework comparison of meaning representations;
— Challenges and techniques in automatic parsing of meaning representations;
— Challenges and techniques in automatically generating text from meaning
representations;
— Meaning representation evaluation metrics;
— Lexical resources, ontologies, and grounding in relation to meaning
representations;
— Real-world applications of meaning representations;
— Issues in applying meaning representations to multilingual settings and
lower-resourced languages;
— The relationship between symbolic meaning representations and distributed
semantic representations;
— Formal properties of meaning representations;
— Any other topics that address the design, processing, and use of meaning
representations.
Important dates:
All deadlines are 11:59PM UTC-12:00 (“anywhere on Earth”).
Paper due: April 3, 2023
Notification of acceptance: May 1, 2023
Camera-ready deadline: June 1, 2023
Workshop date: June 20, 2023
Submission instructions:
Submission site: https://softconf.com/iwcs2023/dmr2023
Submissions should report original and unpublished research on topics of
interest to the workshop. Accepted papers are expected to be presented at
the workshop and will be published in the workshop proceedings on the ACL
Anthology. They should emphasize obtained results rather than intended
work, and should indicate clearly the state of completion of the reported
results. A paper accepted for presentation at the workshop must not be or
have been presented at any other meeting with publicly available
proceedings.
Submission is electronic, using the Softconf START conference management
system. Here is the link <https://softconf.com/iwcs2023/dmr2023> to the DMR
submission site. Submissions must adhere to the two-column format of ACL
venues: please use our specific style-files
<https://iwcs2021.github.io/download/iwcs2021-templates.zip> or the Overleaf
template
<https://www.overleaf.com/latex/templates/instructions-for-iwcs-2021-proceed…>,
taken from ACL 2021. Similar to ACL 2021, initial submissions should be
fully anonymous to ensure double-blind reviewing. Long papers must not
exceed eight (8) pages of content. Short papers and demonstration papers
must not exceed four (4) pages of content. If a paper is accepted, it will
be given an additional page to address reviewers’ comments in the final
version of the paper. References and appendices do not count against these
limits.
Reviewing of papers will be double-blind. Therefore, the paper must not
include the authors' names and affiliations or self-references that reveal
the author's identity--e.g., "We previously showed (Smith, 1991) ..."
should be replaced with citations such as "Smith (1991) previously showed
...". Papers that do not conform to these requirements will be rejected
without review.
Authors of papers that have been or will be submitted to other meetings or
publications must provide this information to the workshop organizers (
dmr2023-chairs(a)googlegroups.com). Authors of accepted papers must notify
the program chairs within 10 days of acceptance if the paper is withdrawn
for any reason.
DMR 2023 does not have an anonymity period. However, we ask you to be
reasonable and not publicly advertise your preprint during (or right
before) review.
More information will be posted to the DMR 2023 website:
http://iwcs2023.loria.fr/dmr-2023-the-fourth-international-workshop-on-desi…
Dear Corpora members,
Is anybody aware of any corpora focussing on the topic of sustainability?
Possibly with different genres of texts, but not necessarily.
Thank you!
Laura Narisano
We are looking for a PhD student to work with Jindřich Libovický on a 4-year
project focusing on a better understanding of multilingual language
representations, improving their language neutrality and language coverage.
The prospective PhD thesis topic is: Study if and how cultural aspects
of meaning (e.g., moral values) are encoded in the representations and
methods to make multilingual more culture-aware. However, the exact topic
is negotiable.
The Institute of Formal and Applied Linguistics (ÚFAL) has an active
community working on cutting-edge natural language processing. There
will be significant interaction with the other researchers at the
institute, especially the groups of Ondřej Bojar, Daniel Zeman, Pavel
Pecina, Zdeněk Žabokrtský, and Jan Hajič. Charles University is the
top-ranking university in Czechia, attracting strong talents locally and
internationally.
If you are interested in applying, please send me:
* A cover letter (clearly indicate which position you are applying for
and include possible starting dates);
* A CV;
* A short research plan addressing past, current, and future interests;
* A brief statement of teaching experience and interest;
* The names and addresses of two references.
The working language is English, and no knowledge of Czech is required.
Candidates with a strong background in natural language processing and
machine learning are preferred, but other backgrounds are also welcome.
Please submit your applications by March 15, 2023, via email to
<my_surname> at ufal dot mff dot cuni dot cz. The expected starting date
is October 1, 2023.
Thanks,
Jindřich Libovický
------
Charles Univeristy, Institute of Formal and Applied Linguistics
Malostranské náměstí 25
118 00 Praha
Czech Republic
Email: <my_surname> at ufal dot mff dot cuni dot cz
Web: https://ufal.mff.cuni.cz/jindrich-libovicky
Event Notification Type: Call for Participation
Website: <a href="https://pan.webis.de/clef23/pan23-web/author-profiling.html">https://pan.webis.de/clef23/pan23-web/author-profiling.htmlt</a>
Software submission deadline: May 29, 2023
Submission Deadline: June 05, 2023
We kindly invite you to participate at the Pan-2023 Task 2 - “Profiling Cryptocurrency Influencers with Few-Shot Learning”.
This task is being held as part of CLEF 2023, and all participating teams will be able to publish their system description paper at the CLEF proceedings.
This shared task focuses on the author profiling of cryptocurrency influencers in social media from a low-resource perspective, that is, with little training data. Moreover, we propose to profile types of influencers also using a low-resource setting.
Specifically, we focus on English Twitter posts for three different sub-tasks:
Low-resource influencer profiling (subtask-1): profile authors according to their degree of influence (null, nano, micro, macro, mega).
Low-resource influencer interest profiling (subtask-2): profile authors according to their main interests or areas of influence (technical information, price update, trading matters, gaming, other).
Low-resource influencer intent profiling (subtask-3): profile authors according to the intent of their messages (subjective opinion, financial information, advertising, announcement).
Important Links
Task Website
Dataset site
Slack workspace
Important Dates
February 20, 2023: Training data ready
May 10, 2023: Early bird software submission phase (optional)
May 29, 2023: Software submission deadline
June 05, 2023: Participant paper submission
September 18-21, 2023: Conference
Task organizers
Francisco Rangel (Symanto)
Mara Chinea-Rios (Symanto) Contact Email: mara.chinea(a)symanto.com
Marc Franco-Salvador (Symanto)
Paolo Rosso (Universidad Politécnica de Valencia)
Please reach out to the organizers at crypto-influencers-pan-organizers(a)googlegroups.com, or join the Slack workspace (https://pan2023profil-0q48349.slack.com) to connect with the other participants and organizers.
Third call for papers
Fourth workshop on Resources for African Indigenous Language (RAIL)
https://bit.ly/rail2023
Note: deadline extension and submission system information
The 4th RAIL (Resources for African Indigenous* Languages) workshop
will be co-located with EACL 2023 in Dubrovnik, Croatia. The Resources
for African Indigenous Languages (RAIL) workshop is an
interdisciplinary platform for researchers working on resources (data
collections, tools, etc.) specifically targeted towards African
indigenous languages. In particular, it aims to create the conditions
for the emergence of a scientific community of practice that focuses on
data, as well as computational linguistic tools specifically designed
for or applied to indigenous languages found in Africa.
Previous workshops showed that the presented problems (and solutions)
are not only applicable to African languages. Many issues are also
relevant to other low-resource languages, such as different scripts and
properties like tone. As such, these languages share similar
challenges. This allows for researchers working on these languages with
such properties (including non-African languages) to learn from each
other, especially on issues pertaining to language resource
development.
The RAIL workshop has several aims. First, it brings together
researchers working on African indigenous languages, forming a
community of practice for people working on indigenous languages.
Second, the workshop aims to reveal currently unknown or unpublished
existing resources (corpora, NLP tools, and applications), resulting in
a better overview of the current state-of-the-art, and also allows for
discussions on novel, desired resources for future research in this
area. Third, it enhances sharing of knowledge on the development of
low-resource languages. Finally, it enables discussions on how to
improve the quality as well as availability of the resources.
The workshop has “Impact of impairments on language resources” as its
theme, but submissions on any topic related to properties of African
indigenous languages (including non-African languages) may be accepted.
Suggested topics include (but are not limited to) the following:
Digital representations of linguistic structures
Descriptions of corpora or other data sets of African indigenous
languages
Building resources for (under resourced) African indigenous languages
Developing and using African indigenous languages in the digital age
Effectiveness of digital technologies for the development of African
indigenous languages
Revealing unknown or unpublished existing resources for African
indigenous languages
Developing desired resources for African indigenous languages
Improving quality, availability and accessibility of African indigenous
language resources
*: The term indigenous languages used in the RAIL workshop is intended
to refer to non-colonial languages (in this case those used in Africa).
In no way is this term used to cause any harm or discomfort to anyone.
Many of these languages were or are still marginalised, and the aim of
the workshop is to bring attention to the creation, curation, and
development of resources for these languages in Africa.
Submission requirements:
We invite papers on original, unpublished work related to the topics of
the workshop. Submissions, presenting completed work, may consist of up
to eight (8) pages of content plus additional pages of references. The
final camera-ready version of accepted long papers are allowed one
additional page of content (so up to 9 pages) so that reviewers’
feedback can be incorporated.
Submissions need to use the EACL stylesheets. These can be found at
https://2023.eacl.org/calls/styles. Submission is electronic in PDF
through the START system which can be found at
https://softconf.com/eacl2023/RAIL2023. Reviewing is double-blind, so
make sure to anonymize your submission (e.g., do not provide author
names, affiliations, project names, etc.) Limit the amount of self
citations (anonymized citations should not be used). Accepted papers
will be published in the ACL workshop proceedings.
Please make sure you also go through the responsible NLP checklist
(https://aclrollingreview.org/responsibleNLPresearch/). Also,
submissions should have a section titled “Limitations” (as described in
the stylesheets). Authors are also encouraged to include an explicit
ethics statement.
Important dates:
Submission deadline 20 February 2023
Date of notification 13 March 2023
Camera ready deadline 27 March 2023
RAIL workshop 5 or 6 May 2023
Organising Committee
Rooweither Mabuya, South African Centre for Digital Language Resources
(SADiLaR), South Africa
Don Mthobela, Cam Foundation
Mmasibidi Setaka, South African Centre for Digital Language Resources
(SADiLaR), South Africa
Menno van Zaanen, South African Centre for Digital Language Resources
(SADiLaR), South Africa
--
Prof Menno van Zaanen menno.vanzaanen(a)nwu.ac.za
Professor in Digital Humanities
South African Centre for Digital Language Resources
https://www.sadilar.org
________________________________
NWU PRIVACY STATEMENT:
http://www.nwu.ac.za/it/gov-man/disclaimer.html
DISCLAIMER: This e-mail message and attachments thereto are intended solely for the recipient(s) and may contain confidential and privileged information. Any unauthorised review, use, disclosure, or distribution is prohibited. If you have received the e-mail by mistake, please contact the sender or reply e-mail and delete the e-mail and its attachments (where appropriate) from your system.
________________________________
Call for Papers
*Combating Health Misinformation for Social Wellbeing*Workshop @ ICWSM 2023,
the 17th International Conference on Web and Social MediaJune 5th – 8th
2023, Limassol, Cyprus
https://truehealth.disco.unimib.it/
------------------------------
*Scope and topics*
In recent years, *people have increasingly referred to the Web and social
media as sources of information about health-related problems and solutions*,
as confirmed by the U.S. Pew Research Center, and other European and
international studies. Although, on the one hand, these platforms favor
easier and more direct access to information sources by users without the
intermediation of experts, on the other hand, it is precisely such
democratization of health information that constitutes a potential danger
for people. As we have seen especially in the last period, linked to the
pandemic, the proliferation of false information, conspiracy theories, and
unreliable remedies risk compromising the health not only of individuals
but that of the community as a whole.
From this perspective, it becomes necessary to *study and propose
technological solutions* to help users come into contact with genuine
information, especially in a critical domain such as health, for social
well-being.
To this end, it is essential to promote *research of an interdisciplinary
nature*, involving computer scientists, physicians, lawyers, and
communication experts who can *address the problem of health
misinformation* from
different points of view by combining their expertise.
The* topics of interest* of the TrueHealth 2023 Workshop at ICWSM include,
but are not limited to:
- Assessing the genuineness of Online Health Information (OHI);
- Consumer Health Search (CHS) and genuine information access;
- Debunking health misinformation;
- Fake news/rumors and healthcare;
- Measures, evaluation methods, and datasets for health misinformation
detection;
- Health misinformation detection;
- Health literacy and information genuineness;
- Fact-checking in Online Health Information (OHI);
- Misinformation and public opinion on health;
- Relationship between access to non-genuine information and danger to
public health;
- Relationship between psychological characteristics and perceptions of
health misinformation;
- Techniques for accessing and retrieving genuine Online Health
Information (OHI).
------------------------------
*Submission Instructions*
Instructions can be found on the official ICWSM website:
https://www.icwsm.org/2023/index.html/call_for_submissions.html
------------------------------
*Important Dates*
- *Workshop Papers Submissions*: March 27, 2023
- *Workshop Paper Acceptance Notification*: April 10, 2023
- *Workshop Final Camera-Ready Paper Due*: May 6, 2023
- *ICWSM-2023 Workshops Day*: June 5, 2023
------------------------------
*Organizers*
- *Gabriella Pasi*
<https://ikr3.disco.unimib.it/people/gabriella-pasi/> (Full
Professor), University of Milano-Bicocca, Milan, Italy
- *Rishabh Upadhyay
<https://en.unimib.it/rishabh-gyanendra-upadhyay>* (Research
Fellow), University of Milano-Bicocca, Milan, Italy
- *Marco Viviani
<https://ikr3.disco.unimib.it/people/marco-viviani/>* (Associate
Professor), University of Milano-Bicocca, Milan, Italy
------------------------------
*Program Committee*
- *Sanda Harabagiu* <https://personal.utdallas.edu/~sanda/>, The
University of Texas at Dallas, USA
- *Liadh Kelly
<https://www.maynoothuniversity.ie/faculty-science-engineering/our-people/li…>*,
Maynooth University, Ireland
- *Dongwon Lee <https://ist.psu.edu/directory/dul13>*, Penn State: The
Pennsylvania State University, USA
- *Yelena Mejova <https://www.isi.it/en/people/yelena-mejova>,* ISI
Foundation, Italy
- *Michael Sirivianos* <https://netsysci.cut.ac.cy/michael.sirivianos/>,
Cyprus University of Technology, Cyprus
- *Angelo Spognardi <https://angelospognardi.site.uniroma1.it/>*,
Sapienza University of Rome, Italy
- *Bei Yu <https://ischool.syr.edu/bei-yu/>*, Syracuse University, USA
- *Arkaitz Zubiaga <http://www.zubiaga.org/>*, Queen Mary University of
London, UK
- Other members are going to be added in the coming days
---
Rishabh Upadhyay,
IKR3 Lab, UNIMIB,
Milan, Italy
The CLINKART and TESTLINK twin evaluation tasks
We invite you to participate in the CLINKART and TESTLINK twin tasks on the
extraction of relations from clinical cases.
Both tasks mainly consist in identifying test results and measurements and
linking them to the textual mentions of the laboratory tests and
measurements from which they were obtained.
CLINKART (https://e3c.fbk.eu/clinkart) is organised in the context of
Evalita 2023 and focuses on Italian. Training data is already available.
TESTLINK (https://e3c.fbk.eu/testlinkiberlef) is organised in the context
of IberLEF 2023 and proposes three different challenges: two tasks on
monolingual datasets in Spanish and Basque and a third one requiring a
multilingual approach. Training data due on February 27.
The datasets of both tasks are based on the European Clinical Case Corpus
(E3C), which consists of clinical cases in five European languages and is
freely available (CC-BY-NC-4.0).
Join us in one or many of the proposed challenges!
The CLINKART and TESTLINK organisation teams
*NLP4Disability: Call for Papers *
*The First Workshop on Natural Language Processing (NLP) for Disability @
PETRA 2023 <http://www.petrae.org/>*. July 5-7, 2023. CORFU ISLAND, GREECE
*Website*: https://nlp4disability.github.io/
*Submission link*: https://easychair.org/conferences/?conf=petra2023
*Topics of interest*
This workshop will explore how natural language processing can be used to
improve the lives of people with disabilities. Topics of interest include,
but are not limited to:
- Automatic identification of disabilities from text
- Development of accessible natural language interfaces
- Generating alternative text descriptions of images for people with
visual impairments - Improving automatic speech recognition for people with
hearing impairments
- Accessible natural language interfaces
- Assistive technologies for people with disabilities
- Computational linguistics for people with disabilities - Language
processing for people with disabilities
- Text processing for people with disabilities
- NLP Bias Against Disabled People
*Important Dates*
Workshop paper submission deadline: *March 10, 2023*
Review Period: March 11 - April 07, 2023
Notification date: April 10, 2023
Camera-Ready: April 30, 2023
*Publications*
All accepted papers will appear in the workshop proceedings and will be
published with the *ACM Digital Library as part of the ACM ICPS program*.
Also, Authors of selected papers will also be invited to submit an extended
and improved version to a Special Issue published in Nafath newsletter
(ISSN: 2789-9144) indexed in DOAJ and Google Scholar (
https://nafath.mada.org.qa).
*Workshop Organizers*
- *Hend Al-Khalifa*. Faculty at the Information Technology Department,
College of Computer and Information Sciences, King Saud University, Riyadh,
Saudi Arabia
- *Zainab AlMeraj*. Faculty at the Information Science Department,
College of Life Sciences, Kuwait University, Kuwait
- *Achraf Othman*. Mada Center, Qatar
- *Dena Al-Thani*. College of Science and Engineering, Hamad bin Khalifa
University, Qatar