Dear List Members,
ADAPT Centre (https://www.adaptcentre.ie/), Munster Technological
University (MTU), Ireland, and Bluetensor S.r.l., Italy
(https://bluetensor.ai/), have formed a partnership to fully fund a 4-year
PhD position in the area of Artificial Intelligence. The general focus of
the PhD will be *Multilingual Multimodal Information Extraction*. The
successful candidate will work closely with a team of mentors from academia
and industry.
*Why ADAPT Centre?*
Contribute to the ADAPT research agenda that pioneers and combines research
in AI driven technologies: Natural Language Processing,
Video/Text/Image/Speech processing, digital engagement & HCI, semantic
modeling, personalisation, privacy & data governance.
Work with our interdisciplinary team of leading experts from the
complementary fields of Social Sciences, Communications, Commerce/Fintech,
Ethics, Law, Health, Environment and Sustainability.
Leverage our success. ADAPT’s researchers have signed 43 collaborative
research projects, 52 licence agreements and oversee 16 active
commercialisation funds and 52 commercialisation awards. ADAPT has won 40
competitive EU research projects and obtained €18.5 million in
non-exchequer non-commercial funding. Additionally, six spinout companies
have been formed. ADAPT’s researchers have produced over 1,500 journal and
conference publications and nearly 100 PhD students have been trained.
As an ADAPT-funded PhD researcher, you will have access to a network of 85
global experts and over 250 staff as well as a wide multi-disciplinary
ecosystem across 8 leading Irish universities. We can influence and inform
your work, share our networks and collaborate with you to increase your
impact, and accelerate your career opportunities. Specifically, we offer:
- Opportunity to build your profile at international conferences and global
events.
- A solid career pathway through formalised training & development, expert
one-on-one supervision and exposure to top specialists.
- A fully funded, 4-year PhD studentship that includes a tax-free stipend of
approx. €18,500 per year for up to four years, with EU tuition fees,
research and equipment costs, and all training-related costs covered.
*Why Bluetensor?*
Combine research with field experience. Every day we apply the latest
technological innovations by selecting those that are best suited to the
business, developing customised solutions to solve our customers' concrete
problems.
Engage with different sectors. See how your research is
indispensable in different businesses. BlueTensor develops artificial
intelligence projects in industry, education, law, media, etc.,
participating in public and private tenders.
Exploit synergies. Technologies become even more powerful when combined
with others. BlueTensor has expertise in deep learning, machine learning,
predictive analysis, computer vision and natural language processing. In
each project we know whether and how to combine algorithms in the best
possible way. You will be able to work with our highly skilled team of data
scientists, front-end developers, full-stack developers and AI architects.
*Minimum qualifications*
- Minimum 2.1 honours undergraduate degree in Computer Science,
Computer Engineering, Electrical and Electronic Engineering or a related
discipline, with strong programming skills.
- Expertise and interest in Machine Learning/Natural Language
Processing/Data Mining.
- Previous scientific publication experience preferred.
- Excellent written and verbal communication and interpersonal skills.
*Application Process*
Interested candidates can send an application with the following documents
directly to Mohammed Hasanuzzaman (mohammed.hasanuzzaman(a)adaptcentre.ie):
- Detailed curriculum vitae, including, if applicable, relevant
publications.
- Transcripts of degrees.
- The names and email contacts of two academic referees.
- A cover letter/letter of introduction (max 2000 words). In the letter,
applicants should include the following details:
  - An explanation of your interest in the research to be conducted and
  why you believe you are suitable for the position.
  - Details of your final year undergraduate project (if applicable).
  - Details of your MSc project (if applicable).
  - Details of any relevant modules previously taken, at undergraduate
  and/or Master's level.
  - Details of any relevant work experience (if applicable).
*Diversity*
ADAPT is committed to achieving better diversity and gender representation
at all levels of the organisation, across leadership, academic, operations,
research staff and studentship levels. ADAPT is committed to the continued
development of employment policies, procedures and practices that promote
gender equality. On that basis we encourage and welcome talented people
from all backgrounds to join ADAPT.
*About the ADAPT Centre*
ADAPT is the world-leading SFI research centre for AI Driven Digital
Content Technology hosted by Trinity College Dublin. ADAPT’s partner
institutions include Dublin City University, University College Dublin,
Technological University Dublin, Maynooth University, Munster Technological
University, Athlone Institute of Technology, and the National University of
Ireland Galway. ADAPT's research vision is to pioneer new forms of
proactive, scalable, and integrated AI-driven Digital Content Technology
that empower individuals and society to engage in digital experiences with
control, inclusion, and accountability with the long term goal of a
balanced digital society by 2030. ADAPT is pioneering new Human Centric AI
techniques and technologies including personalisation, natural language
processing, data analytics, intelligent machine translation, human-computer
interaction, as well as setting the standards for data governance, privacy
and ethics for digital content.
*Our Research Vision*
Governments and civil society are starting to recognise the need for urgent
and concerted action to address the societal impact of the accelerating
pace of digital content technologies and the AI techniques that underpin
them. ADAPT provides an ambitious, ground-breaking, integrated research
programme that assembles three interlocking Strands that together are
capable of addressing this challenge. Each of these complementary and
reinforcing research Strands takes one of the different perspectives on the
provision of personalised, immersive, multimodal digital engagement, i.e.
the individual’s experience and control of the engagement, the algorithms
underlying digital content processing, and the balanced governance by
enterprise and societal stakeholders.
*About Bluetensor*
Bluetensor was founded in Trento (Italy) in 2018 and today its team
consists of 13 people. By combining Project Management with the Agile
method, Bluetensor meets clients' needs by developing solutions that have
an impact on business time, costs and processes.
The projects carried out over the past three years have earned Bluetensor an
excellent international reputation. One of its reasoning systems, for
example, is being beta tested by teams of experts on five continents.
Moreover, thanks to its strong expertise and challenging projects,
Bluetensor has created two new start-ups to apply artificial intelligence in
particular sectors.
Bluetensor’s founders believe in the importance of scientific research.
That's why Bluetensor is a sponsor of the Doctorate in Industrial Innovation
at the University of Trento and a partner in various initiatives with Italian
and foreign universities.
BlueTensor’s vision: “Promote a smarter way of working and a better
lifestyle by empowering people and organizations through Artificial
Intelligence.”
Best regards,
Mohammed
--
Dr. Mohammed Hasanuzzaman, Lecturer, Munster Technological University <https://www.mtu.ie/>
Funded Investigator, ADAPT Centre, a World-Leading SFI Research Centre <https://www.adaptcentre.ie/>
Member, Lero, the SFI Research Centre for Software <https://lero.ie/>
Chercheur Associé, GREYC UMR CNRS 6072 Research Centre, France <https://www.greyc.fr/en/home/>
Associate Editor/EBM: IEEE Transactions on Affective Computing, Nature
Scientific Reports, IEEE Transactions on Computational Social Systems, ACM
TALLIP, Computer Speech and Language
Dept. of CS
Munster Technological University
Bishopstown campus
Cork, Ireland
e: mohammed.hasanuzzaman(a)adaptcentre.ie
https://mohammedhasanuzzaman.github.io/
Date: Thu, 4 Aug 2022 16:04:09 +0000
From: Jose Camacho Collados <CamachoColladosJ(a)cardiff.ac.uk>
Subject: [Corpora-List] TempoWiC shared task at EMNLP EvoNLP Workshop: Evaluation starts (deadline: September 12)
************************************
Call for participation: TempoWiC shared task at the EvoNLP workshop (co-located with EMNLP)
Training and test data available!
Shared Task website: https://sites.google.com/view/evonlp/shared-task
Codalab evaluation page: https://codalab.lisn.upsaclay.fr/competitions/5360
Important dates:
* 1 August 2022: Test data released and evaluation phase starts
* 12 September 2022: Evaluation phase ends
* 16 September 2022: Results released
* 10 October 2022: System description paper deadline
************************************
TempoWiC is the shared task of "EvoNLP: The First Workshop on Ever Evolving NLP", co-located with EMNLP 2022. In this novel temporal meaning shift task, participants are given a pair of sentences (in this case, tweets) and a target word (e.g. delta), and must decide whether the target word has the same meaning in both. The framing is the same binary classification as the original WiC (Word-in-Context) task, adapted to take the temporal aspect into account (the tweets in each pair were selected from different time periods).
For example, we can observe a meaning shift in the word folklore in the following instance, where it refers to a recent music album in the second example.
(1) There's a thunderstorm outside so clearly it's the perfect time to watch videos about folklore monsters. (August 2019)
(2) Cardigan on folklore is my favorite song. I wish @taylorswift13 would love me (August 2020)
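To make the task framing concrete, here is a minimal Python sketch of how one instance and a trivial baseline might be represented. The class name, field names, date format and the use of accuracy as the score are all assumptions made for illustration; the official data format and evaluation metric are defined on the shared task website and may differ.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TempoWiCInstance:
    """Hypothetical container for one example: two dated tweets sharing a target word."""
    target: str
    text_1: str
    date_1: str  # assumed "YYYY-MM" format, for illustration only
    text_2: str
    date_2: str
    same_meaning: bool  # gold label: True if the target word keeps its meaning


def majority_baseline(train: list[TempoWiCInstance]) -> bool:
    """Return the most frequent gold label in the training data (ties go to True)."""
    positives = sum(inst.same_meaning for inst in train)
    return positives * 2 >= len(train)


def accuracy(predictions: list[bool], gold: list[TempoWiCInstance]) -> float:
    """Fraction of instances whose predicted label matches the gold label."""
    correct = sum(p == g.same_meaning for p, g in zip(predictions, gold))
    return correct / len(gold)


# The "folklore" pair from the announcement: the meaning shifts, so the label is False.
folklore = TempoWiCInstance(
    target="folklore",
    text_1="There's a thunderstorm outside so clearly it's the perfect time "
           "to watch videos about folklore monsters.",
    date_1="2019-08",
    text_2="Cardigan on folklore is my favorite song. I wish @taylorswift13 would love me",
    date_2="2020-08",
    same_meaning=False,
)
```

A real system would replace majority_baseline with a classifier over the two tweets and the target word; the baseline only shows the binary decision the task asks for.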
--
Jose Camacho Collados
http://www.josecamachocollados.com
*ICNLSP 2022: 3rd call for papers*
Dear all,
This is the 3rd call for papers for ICNLSP 2022
<https://www.icnlsp.org/2022welcome/>, the 5th edition of the
International Conference on Natural Language and Speech Processing. Hosted
by DataScientia (University of Trento)
<http://datascientia.disi.unitn.it/events/> for the third time, it will be
held online on 16-17 December 2022.
Authors are invited to present work relevant to the topics of the
conference. Topics of ICNLSP 2022 <https://www.icnlsp.org/2022welcome/>
include, but are not limited to:
Signal processing, acoustic modeling
Architecture of speech recognition system
Deep learning for speech recognition
Analysis of speech
Paralinguistics in Speech and Language
Pathological speech and language
Speech coding
Speech comprehension
Summarization
Speech Translation
Speech synthesis
Speaker and language identification
Phonetics, phonology and prosody
Cognition and natural language processing
Text categorization
Sentiment analysis and opinion mining
Computational Social Web
Arabic dialects processing
Under-resourced languages: tools and corpora
New language models
Arabic OCR
Lexical semantics and knowledge representation
Requirements engineering and NLP
NLP tools for software requirements and engineering
Knowledge fundamentals
Knowledge management systems
Information extraction
Data mining and information retrieval
Machine translation
NLP for Arabic heritage documents
*Important dates*
*Submission deadline*: *30 August 2022*
*Notification of acceptance*: *31 October 2022*
*Camera-ready paper due*: *20 November 2022*
*Conference dates*: *16-17 December 2022*
*Publication*
1- All accepted papers will be published in ACL Anthology, and indexed in
DBLP.
2- Selected papers will be published in Signals and Communication
Technology (Springer) (https://www.springer.com/series/4748), indexed in
Scopus.
*Keynote speakers*
1. Eric Laporte, Gustave Eiffel University, France.
2. Jan Niehues, University of Maastricht, Netherlands.
3. Ahmed Ali, QCRI, Qatar.
*Workshop: NSURL 2022*
The workshop on NLP Solutions for Under Resourced Languages NSURL
<http://nsurl.org> will be held with ICNLSP 2022
<https://www.icnlsp.org/2022welcome/>. The workshop aims to be a forum for
solving NLP tasks concerning Arabic and its dialects, as well as
under-resourced languages such as African languages, Persian, etc.
We look forward to welcoming you to ICNLSP 2022
<https://www.icnlsp.org/2022welcome/>, which will be an opportunity to get
acquainted with the latest research in the field of natural language and
speech processing. We hope your active participation will make it a
success.
*Contact*
icnlsp2022(a)easychair.org
Query Performance Prediction (QPP) is currently primarily used for ad-hoc
retrieval tasks. The Information Retrieval (IR) field is reaching new
heights thanks to recent advances in large language models and neural
networks, as well as emerging new ways of searching, such as conversational
search. Such advancements are quickly spreading to adjacent research areas,
including QPP, necessitating a reconsideration of how we perform and
evaluate QPP.
Important Dates
---------------------
Submission deadline: September 2, 2022
Notification of acceptance: September 27, 2022
Camera ready: October 06, 2022
Conference days: October 17-20, 2022
Workshop day: October 21, 2022
Call for Papers
-------------------
This workshop aims at stimulating discussion on three main aspects
concerning the future of QPP:
- What are the emerging QPP challenges posed by new methods and
technologies, including but not limited to dense retrieval, contextualized
embeddings, and conversational search?
- How might these new techniques be used to improve the quality of QPP?
- Can we claim that the current techniques for evaluating QPP are
effective in all arising scenarios? Can we envision new evaluation
protocols capable of granting generalizability in new domains?
We plan to foster the discussion via two focus groups led by the workshop's
organizers.
The first focus group will identify the possibilities QPP offers
for new research models and IR tasks, primary considerations, issues
linked to different aspects of QPP, and the potential provided by
new tools.
The second focus group will gather the community’s concerns and solutions
regarding QPP evaluation, especially in emerging
domains.
Themes and Topics
-------------------
The workshop will focus on the following themes:
- Query performance prediction applied to new tasks:
Can existing QPP techniques be exploited, or which new QPP theories and
models need to be devised for new tasks, such as passage-retrieval, Q&A,
and conversational search?
- Query performance prediction exploiting new techniques:
How can new technologies, such as contextualized embeddings, large
language models, and neural networks be exploited to improve QPP?
- Evaluation of query performance prediction:
How should QPP techniques be evaluated, including best practices,
datasets, and resources, and, in particular, should QPP be evaluated the
same for different IR tasks?
It is possible to submit three main categories of manuscripts to the
workshop:
- Full papers: up to 6 pages.
- Short papers: up to 3 pages.
- Discussion papers: up to 3 pages.
All manuscripts are expected to address the workshop's themes as mentioned
above.
Full and short papers should contain innovative ideas and their
experimental evaluation. We are also interested in works containing
(methodologically sound) preliminary results and incremental endeavours.
Discussion papers should include work with or without preliminary results,
position papers, and papers describing failures. Such papers should foster
the discussion and thus are not required to contain full-fledged results.
In this sense, the experimental evaluation of the submitted discussion
paper is appreciated but not required. We are also interested in receiving
contributions regarding (methodologically sound) failed experiments; since
the workshop will focus on new research directions, we consider it
necessary also to discuss the reasons and causes of failures.
Each manuscript will be peer-reviewed by at least two program committee
members.
Accepted papers will be published online as a volume of the CEUR-WS
proceedings series.
Website
--------------
qpp2022.dei.unipd.it
Organizers
--------------
Guglielmo Faggioli, University of Padova, Italy, faggioli(a)dei.unipd.it
Nicola Ferro, University of Padova, Italy, ferro(a)unipd.it
Josiane Mothe, Université de Toulouse, IRIT, France, josiane.mothe(a)irit.fr
Fiana Raiber, Yahoo Research, Israel, fiana(a)yahooinc.com
----------
Guglielmo Faggioli
Dipartimento di Ingegneria Informatica, University of Padua
Via Gradenigo 6/b, 35138, Padua, Italy
--
Apologies for cross-posting.
--
Have you recently completed, or do you expect to complete very soon, an MSc
or equivalent degree
in computer science, artificial intelligence, computational linguistics,
engineering, or a related area? Are you interested in carrying out research
on Speech Translation during the next few years? Are you excited to spend a
part of your life in a pleasant city in the heart of the Italian Alps?
WE ARE LOOKING FOR YOU!!!
The Machine Translation <https://ict.fbk.eu/units/hlt-mt/> (MT) group at
Fondazione Bruno Kessler (Trento, Italy) in conjunction with the ICT
International Doctorate School of the University of Trento
<https://iecs.unitn.it/> is pleased to announce the availability of the
following fully-funded Ph.D. position in Speech Translation.
TITLE: Application-oriented Speech Translation
DESCRIPTION:
The need to translate audio input from one language into text in a target
language has dramatically increased in the last few years with the growth
of audiovisual content freely available on the Web. Current speech
translation (ST) systems are now required to be flexible and robust enough
to operate in different application scenarios. On one side, the industry
calls for key features like real-time processing, domain adaptability,
extended language coverage, and the capability to meet application-specific
constraints. On the other side, society calls for new efforts towards
inclusiveness with respect to specific categories and groups (e.g.
gender-sensitivity, customization to the needs of impaired users). Both
industry and society face the orthogonal challenges posed by the
variability of audio conditions (e.g. background noise, strong speaker
accents, overlapping speakers). The objective of this Ph.D. is to make ST
flexible and robust to these and other factors.
CONTACT: negri(a)fbk.eu
COMPLETE DETAILS AVAILABLE AT:
https://iecs.unitn.it/education/admission/call-for-application
IMPORTANT DATES:
The deadline for application is September 6, 2022, at 4:00 PM (CEST).
Potential candidates are strongly invited to contact us in advance for
preliminary interviews. Precedence for interviews will be given to
short-listed candidates who send us a complete CV via email
(negri(a)fbk.eu) by August 18, 2022.
Candidate profile
The ideal candidate has recently completed, or expects to complete very soon, an MSc
or equivalent degree in computer science, artificial intelligence,
computational linguistics, engineering, or a closely related area. In
addition, the applicant should:
- Have an interest in Machine and Speech Translation
- Have experience in deep learning and machine learning in general
- Have good programming skills in Python and experience in PyTorch
- Enjoy working with real-world problems and large data sets
- Have good knowledge of written and spoken English
- Enjoy working in a closely collaborating team
Working Environment
The doctoral student will be employed at the MT group at Fondazione Bruno
Kessler, Trento, Italy. The group (about 10 people including staff and
students) has a long tradition in research on machine and speech
translation and is currently involved in several projects. Former students
are now employed at leading IT companies worldwide.
Benefits
Fondazione Bruno Kessler offers an attractive benefits package, including a
flexible work week, full reimbursement for conferences and summer schools,
a competitive salary, an excellent team of supervisors and mentors, help
for housing, full health insurance, the possibility of Italian courses, and
sporting facilities.
Further Information
For preliminary interviews, and should you need further information about
the position, please contact Matteo Negri (negri(a)fbk.eu).
Best Regards,
Matteo Negri
******************************************************
*********** EVALITA 2023: Call for tasks ***********
******************************************************
*EVALITA 2023* is an initiative of *AILC* (Associazione Italiana di
Linguistica Computazionale, https://www.ai-lc.it/).
As in the previous editions (https://www.evalita.it/), EVALITA 2023 will
be organized along a few selected tasks, which provide participants with
opportunities to discuss and explore both emerging and traditional areas
of *Natural Language Processing and Speech*. Participation is
encouraged for teams from both academic institutions and
industrial organizations.
*TASK PROPOSAL SUBMISSION*
Task proposals should be no longer than 4 pages and should include:
- task title and acronym;
- names and affiliation of the organizers (minimum 2 organizers);
- brief task description, including motivations and state of the art;
- explanation of the international relevance of the task;
- description and examples of the data, including information about
their availability, development stage, and issues concerning privacy and
data sensitivity. The examples are mandatory because they are intended
to give potential participants an idea of what the task data will look
like, how it’ll be formatted, etc.
- expected number of participants and attendees;
- names and contact information of the organizers.
/In submitting your proposal, please bear in mind that we encourage:/
- *challenging tasks* involving linguistic analysis, e.g., beyond
“simple” classification problems;
- tasks focused on multimodality, e.g., considering both textual and
visual information;
- tasks characterized by *different levels of complexity*, e.g., with a
straightforward main subtask and one or more sophisticated additional
subtasks;
- the re-annotation/expansion of datasets from previous years with new
annotation levels, and texts from publicly available corpora;
- both new tasks and re-runs: for new tasks, organizers will have to
specify in the proposal why it would attract a reasonable number of
participants, and why it is needed;
- application-oriented tasks, that is, tasks that showcase a clearly
defined end-user application;
- *multilingual tasks*, i.e. with data both in Italian and in other
languages;
- *industrial tasks*, i.e. tasks with real data provided by companies.
The organizers of the accepted tasks should take care of planning,
according to the scheduled deadlines (see below):
- the development and distribution of datasets needed for the contest,
i.e. data for training and development, and data for testing; the scorer
to be used to evaluate the submitted systems should be included in the
release of development data;
- the development of task guidelines, where all the instructions for the
participation are made clear together with a detailed description of
data and evaluation metrics applied for the evaluation of the
participant results;
- the collection of participants' results;
- the evaluation of participants' results according to standard metrics
and baseline(s);
- the solicitation of participation and of submissions;
- the reviewing process of the papers describing the participants'
approaches and results (according to the template to be made available by
the EVALITA 2023 chairs);
- the production of a paper describing the task (according to the
template to be made available by the EVALITA 2023 chairs).
**** Email your proposal in PDF format to evalita2023(a)gmail.com with
"EVALITA 2023 TASK Proposal" as the subject line by the submission
deadline: October 4th 2022. ****
Please feel free to contact the EVALITA 2023 chairs at
evalita2023(a)gmail.com in case of any questions or suggestions.
Task proposal deadlines:
- October 4th 2022: submission of task proposals
- October 18th 2022: notification of task proposal acceptance
*Tentative timeline of EVALITA 2023:*
- 17th January 2023: development data available to participants
- 13th April 2023: registration closes
- 14th - 27th April 2023: evaluation windows
- 4th May 2023: assessment returned to participants
- first half of July: final workshop, location to be announced shortly
*EVALITA 2023 CHAIRS*
Mirko Lai (Università di Torino)
Stefano Menini (Fondazione Bruno Kessler)
Marco Polignano (Università di Bari Aldo Moro)
Valentina Russo (Logogramma SRL)
Rachele Sprugnoli (Università degli Studi di Parma)
Giulia Venturi (Istituto di Linguistica Computazionale “A. Zampolli” - CNR)
************************************
Call for participation: TempoWiC shared task at the EvoNLP workshop (co-located with EMNLP 2022)
Training and test data available!
Shared Task website: https://sites.google.com/view/evonlp/shared-task
Codalab evaluation page: https://codalab.lisn.upsaclay.fr/competitions/5360
Important dates:
* 1 August 2022: Test data released and evaluation phase starts
* 12 September 2022: Evaluation phase ends
* 16 September 2022: Results released
* 10 October 2022: System description paper deadline
************************************
TempoWiC is the shared task of "EvoNLP: The First Workshop on Ever Evolving NLP", co-located with EMNLP 2022. In this novel temporal meaning shift task, participants are given a pair of sentences (in this case, tweets) and a target word (e.g. delta), and must decide whether the target word has the same meaning in both. The framing is the same binary classification as the original WiC (Word-in-Context) task, adapted to take the temporal aspect into account: the tweets in each pair were selected from different time periods.
For example, we can observe a meaning shift in the word folklore in the following instance, where it refers to a recent music album in the second tweet.
(1) There's a thunderstorm outside so clearly it's the perfect time to watch videos about folklore monsters. (August 2019)
(2) Cardigan on folklore is my favorite song. I wish @taylorswift13 would love me (August 2020)
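As a rough illustration of the task format described above, one instance could be represented as in the sketch below. The class name, field names, and the boolean label are hypothetical and are not the official TempoWiC data schema; the shared task website documents the actual format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TempoWiCInstance:
    """One TempoWiC-style example (hypothetical field names)."""
    target: str         # target word whose meaning is compared
    tweet_1: str        # tweet from the earlier time period
    date_1: str         # time period of the first tweet
    tweet_2: str        # tweet from the later time period
    date_2: str         # time period of the second tweet
    same_meaning: bool  # binary label: True if the meaning is unchanged


def target_in_both(instance: TempoWiCInstance) -> bool:
    """Sanity check: the target word occurs in both tweets (case-insensitive)."""
    word = instance.target.lower()
    return word in instance.tweet_1.lower() and word in instance.tweet_2.lower()


# The "folklore" pair from the announcement, labeled as a meaning shift.
example = TempoWiCInstance(
    target="folklore",
    tweet_1=("There's a thunderstorm outside so clearly it's the perfect "
             "time to watch videos about folklore monsters."),
    date_1="August 2019",
    tweet_2=("Cardigan on folklore is my favorite song. "
             "I wish @taylorswift13 would love me"),
    date_2="August 2020",
    same_meaning=False,
)
```

A system for this task would map each such pair to a binary prediction, which is then compared against the gold label.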
--
Jose Camacho Collados
http://www.josecamachocollados.com/
The Department of Computer Science at the Technical University of Darmstadt keeps growing - we are now looking for an
Independent Research Group Leader (equivalent to the assistant professor level) in NLP
to join our team!
Are you interested in doing cutting-edge research on Natural Language Processing and AI, often in collaboration with top academic and industrial partners? We are a diverse team working on some of the hardest and most exciting machine learning challenges, including representation learning, neural IR, explainable NLP, continual learning, multi-task learning, self-supervised learning, and more.
Visit our website and read more about the opening:
https://www.informatik.tu-darmstadt.de/ukp/ukp_home/jobs_ukp/2022_independe…
--------------------------------------------------------------------
Prof. Dr. Iryna Gurevych
UKP Lab
Computer Science Department
Technical University Darmstadt, Germany
http://www.ukp.tu-darmstadt.de/
The Ubiquitous Knowledge Processing (UKP) Lab at the Department of Computer Science, Technical University Darmstadt in Germany has several ERC-, HORIZON- or DFG-funded openings for an
Associate Research Scientist (PostDoc- or PhD-level; for an initial term of two to three years) in Machine Learning and Natural Language Processing.
As an interdisciplinary team, we are happy to accommodate a wide variety of research questions and backgrounds, ranging from machine learning to dataset creation. We particularly welcome proposals on neuro-symbolic reasoning, multilingual and multimodal language modeling, and creative ideas on modeling cross-document relations.
Interested in becoming part of our diverse, ambitious and successful team doing cutting-edge research? Visit our website and read more about the openings:
https://www.informatik.tu-darmstadt.de/ukp/ukp_home/jobs_ukp/2022_associiat…
--------------------------------------------------------------------
Prof. Dr. Iryna Gurevych
UKP Lab
Technical University Darmstadt, Germany
http://www.ukp.tu-darmstadt.de/
Important UPDATES/EXTENSION: ClinSpEn sub-track (Biomedical WMT Task,
EMNLP 2022)
Machine Translation of Clinical cases, ontologies & medical entities:
Spanish - English
https://temu.bsc.es/clinspen/
Evaluation period extended: test and background data are available on
Zenodo, and CodaLab submission is open.
The ClinSpEn track of the Biomedical WMT 2022 shared task tries to address
a pressing need and emerging research topic related to the development and
exploitation of multilingual clinical NLP and text mining applications.
Recent advances in neural machine translation (MT) approaches adapted to
specific domains and text genres have yielded promising results that
facilitate processing of healthcare and clinical data beyond language
silos.
The ClinSpEn sub-track tries to promote the use of advanced machine
translation technologies applied to three high impact healthcare
application scenarios:
(1) automatic translation of clinical case documents, important for
examining how MT could be further applied to cope with clinical records
(2) automatic translation of clinical terms and entity mentions extracted
directly from medical records and literature, to improve multilingual
semantic annotation technologies
(3) automatic translation of ontologies and controlled vocabulary concepts,
of utmost importance for multilingual data and concept normalization
These three scenarios will be addressed by three specific benchmark data
collections used for evaluation purposes by the ClinSpEn biomedical WMT
track:
ClinSpEn-CC (Clinical Cases): EN>ES translation of clinical case documents.
ClinSpEn-CT (Clinical Terms): ES>EN translation of clinical terms and
entity mentions extracted from records and literature.
ClinSpEn-OC (Ontology Concepts): EN>ES translation of highly used open
clinical controlled vocabularies and ontology concepts.
Important links:
- ClinSpEn web: https://temu.bsc.es/clinspen/
- Biomedical WMT web: https://statmt.org/wmt22/biomedical-translation-task.html
- WMT2022: https://statmt.org/wmt22/
- EMNLP conference: https://2022.emnlp.org/
- Data (NEW!):
  - Clinical Cases: https://doi.org/10.5281/zenodo.6497350
  - Clinical Terms: https://doi.org/10.5281/zenodo.6497372
  - Ontology Concepts: https://doi.org/10.5281/zenodo.6497388
- CodaLab: https://codalab.lisn.upsaclay.fr/competitions/6696
- Team Registration (mandatory): https://temu.bsc.es/clinspen/registration/
For the ClinSpEn track, gold-standard manual translations have been
produced by professional medical translators to evaluate the submissions of
participating teams. The primary evaluation metric for this track will be
SacreBLEU.
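To give an idea of what a BLEU-style score measures, here is a minimal pure-Python sketch of corpus-level BLEU. It is not SacreBLEU itself and not the official scorer: it assumes whitespace tokenization, a single reference per segment, and no smoothing, whereas SacreBLEU applies its own standardized tokenization. The function names and the example sentences are made up for illustration.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of order n in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def simple_corpus_bleu(hypotheses, references, max_n=4):
    """Simplified corpus-level BLEU on a 0-100 scale.

    Assumptions (not SacreBLEU): whitespace tokenization, one reference
    per hypothesis, no smoothing.
    """
    matches = [0] * max_n
    totals = [0] * max_n
    hyp_len = ref_len = 0
    for hyp, ref in zip(hypotheses, references):
        hyp_tokens, ref_tokens = hyp.split(), ref.split()
        hyp_len += len(hyp_tokens)
        ref_len += len(ref_tokens)
        for n in range(1, max_n + 1):
            hyp_counts = ngrams(hyp_tokens, n)
            ref_counts = ngrams(ref_tokens, n)
            # Clip each hypothesis n-gram count by its count in the reference.
            matches[n - 1] += sum((hyp_counts & ref_counts).values())
            totals[n - 1] += sum(hyp_counts.values())
    if hyp_len == 0 or min(matches) == 0:
        return 0.0  # without smoothing, any zero precision gives a score of 0
    # Geometric mean of the clipped n-gram precisions.
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    # Penalize output that is shorter than the reference.
    brevity_penalty = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return 100.0 * brevity_penalty * math.exp(log_precision)


# Made-up sentences, for illustration only.
reference = ["the patient was admitted with chest pain"]
exact = simple_corpus_bleu(reference, reference)
partial = simple_corpus_bleu(
    ["the patient was admitted with chest pain today"], reference
)
```

An exact match scores the maximum, while the hypothesis with one extra word scores lower because its n-gram precisions drop. Official results should be computed with the SacreBLEU tool and the task's own evaluation setup.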
Participants will also have access to a larger background collection to
promote scalability and robustness assessment of machine translation
technology.
Updated schedule:
- Participant Predictions Due: August 30th, 2022 (UPDATED EXTENSION!)
- Paper Submission: September 7th, 2022
- Acceptance notification: October 9th, 2022
- Camera-ready version: October 16th, 2022
- WMT workshop at EMNLP: December 7th and 8th, 2022
Publications and workshop
Participating teams will be invited to contribute a system description
paper for the WMT 2022 Working Notes proceedings. This workshop will be
part of the prestigious EMNLP 2022 conference. More information on paper
specifications, formatting guidelines and the review process is available at:
https://statmt.org/wmt22/index.html.
ClinSpEn Track Organizers
- Salvador Lima-López (BSC)
- Darryl Johan Estrada (BSC)
- Eulàlia Farré-Maduell (BSC)
- Martin Krallinger (BSC)
Biomedical WMT Organizers
- Rachel Bawden (University of Edinburgh, UK)
- Giorgio Maria Di Nunzio (University of Padua, Italy)
- Darryl Johan Estrada (Barcelona Supercomputing Center, Spain)
- Eulàlia Farré-Maduell (Barcelona Supercomputing Center, Spain)
- Cristian Grozea (Fraunhofer Institute, Germany)
- Antonio Jimeno Yepes (University of Melbourne, Australia)
- Salvador Lima-López (Barcelona Supercomputing Center, Spain)
- Martin Krallinger (Barcelona Supercomputing Center, Spain)
- Aurélie Névéol (Université Paris Saclay, CNRS, LISN, France)
- Mariana Neves (German Federal Institute for Risk Assessment, Germany)
- Roland Roller (DFKI, Germany)
- Amy Siu (Beuth University of Applied Sciences, Germany)
- Philippe Thomas (DFKI, Germany)
- Federica Vezzani (University of Padua, Italy)
- Maika Vicente Navarro (Maika Spanish Translator, Melbourne, Australia)
- Dina Wiemann (Novartis, Switzerland)
- Lana Yeganova (NCBI/NLM/NIH, USA)
--
Salvador Lima Lopez
RESEARCH ENGINEER
Life Sciences - Text Mining, BSC-CNS
Barcelona, Spain