
[Commlist] Call for papers: 2nd Workshop on Novel Evaluation Approaches for Text Classification Systems

Tue Feb 14 04:44:15 GMT 2023




2nd Workshop on Novel Evaluation Approaches for Text Classification Systems

Co-located with ICWSM 2023, 5 June 2023, Limassol, Cyprus

https://neatclass-workshop.github.io/

The automatic or semi-automatic analysis of textual data is a key approach to analysing the massive amounts of user-generated content online, from the identification of sentiment in text and topic classification to the detection of abusive language, misinformation or propaganda. However, the development of such systems faces a crucial challenge. Static benchmarking datasets and performance metrics are the primary method for measuring progress in the field, and the publication of research on new systems typically requires demonstrating an improvement over state-of-the-art approaches in this way. Yet these performance metrics can obscure critical failings in current models. Improvements in metrics often do not reflect improvements in the real-world performance of models. There is clearly a need to rethink performance evaluation if text classification and analysis systems are to be usable and trustworthy.

If unreliable systems achieve astonishing scores with traditional metrics, how do we recognise progress when we see it? The goal of the Workshop on Novel Evaluation Approaches for Text Classification Systems (NEATCLasS) is to promote the development and use of novel metrics for abuse detection, hate speech recognition, sentiment analysis and similar tasks within the community, to be better able to measure whether models really improve upon the state of the art, and to encourage a wide range of models to be tested on these new metrics.

Recently there have been attempts to address the problem of benchmarks and metrics that do not represent performance well. For example, in abusive language detection, there are both static datasets of hard-to-detect examples (Röttger et al. 2021) and dynamic approaches for generating such examples (Calabrese et al. 2021). On the platform DynaBench (Kiela et al. 2021), benchmarks are dynamic and constantly updated with hard-to-classify examples, avoiding overfitting to a predetermined dataset. However, these approaches only capture a tiny fraction of issues with benchmarking. There is still much work to do.

We welcome submissions discussing such new evaluation approaches, introducing new or refining existing ones, and promoting the use of novel metrics for abuse detection, sentiment analysis and similar tasks within the community. Furthermore, the workshop will promote discussion on the importance, potential and danger of disagreement in tasks that require subjective judgements. This discussion will also focus on how to evaluate human annotations, and how to find the most suitable set of annotators (if any) for a given instance and task. The workshop will solicit, among others, research papers about:

* Issues with current evaluation metrics and benchmarking datasets
* New evaluation metrics
* User-centred (qualitative or quantitative) evaluation of social media text analysis tools
* Adaptations and translations of novel evaluation metrics for other languages
* New datasets for benchmarking
* Increasing data quality in benchmarking datasets, e.g., avoidance of selection bias, identification of suitable expert human annotators for tasks involving subjective judgements
* Systems that facilitate dynamic evaluation and benchmarking
* Models that perform better at hard-to-classify instances and novel evaluation metrics such as AAA, DynaBench and HateCheck
* Bias, error analysis and model diagnostics
* Phenomena not captured by existing evaluation metrics (such as models making the right predictions for the wrong reason)
* Approaches to mitigating bias and common errors
* Alternative designs for NLP competitions that evaluate a wide range of model characteristics (such as bias, error analysis, cross-domain performance)
* Challenges of downstream applications (in industry, computational social science and elsewhere) and reflections on how these challenges can be captured in evaluation metrics


Format and Submissions

We invite research papers (8 pages), position and short papers (4 pages), and demo papers (2 pages). Detailed submission instructions can be found on the workshop website.

The workshop will take place as a half-day meeting on 5 June. We are looking forward to an exciting mix of activities including invited talks, paper presentations and a group discussion. Authors of accepted papers will be invited to trial an innovative format for paper presentations: presenters will be given 5 minutes to describe their research questions and hypotheses, and a group discussion will start after that. Then, presenters will be given 5 more minutes to describe their method and results, followed by a new group discussion about the interpretation and implications of these results. The group discussions are intended to bring researchers together and collect ideas for new evaluation approaches and future work in the field.

While we encourage attending the workshop in person, we are also planning to live stream the workshop on Zoom and record talks to allow as many people as possible to participate.

Authors of accepted papers will have the opportunity to publish their papers through workshop proceedings by the AAAI Press. Submission instructions will be uploaded to the workshop web page in due course: https://neatclass-workshop.github.io/


Timeline
* Submission link: TBD – see https://neatclass-workshop.github.io/
* Paper submission deadline: 17 April 2023
* Paper acceptance notification: 30 April 2023
* Final camera-ready paper due: 6 May 2023
* Workshop Day: 5 June 2023

Organisers

Björn Ross, University of Edinburgh (Contact: (b.ross /at/ ed.ac.uk))
Roberto Navigli, Sapienza University of Rome
Agostina Calabrese, University of Edinburgh
Sheikh Muhammad Sarwar, Amazon

