[Previous message][Next message][Back to index]

[Commlist] Call for papers: 1st Workshop on Novel Evaluation Approaches for Text Classification Systems on Social Media

Sun Feb 13 10:05:55 GMT 2022

1st Workshop on Novel Evaluation Approaches for Text ClassificationSystems on Social Media

Co-located with ICWSM 2022, 6 June 2022, Hybrid format - Atlanta,Georgia (US) and online


https://neatclass-workshop.github.io/

The automatic or semiautomatic analysis of textual data is a keyapproach to analyse the massive amounts of user-generated contentonline, from the identification of sentiment in text and topicclassification to the detection of abusive language, misinformation orpropaganda. However, the development of such systems faces a crucialchallenge. Static benchmarking datasets and performance metrics are theprimary method for measuring progress in the field, and the publicationof research on new systems typically requires demonstrating animprovement over state-of-the-art approaches in this way. Yet, theseperformance metrics can obscure critical failings in current models.Improvements in metrics often do not reflect improvements in thereal-world performance of models. There is clearly a need to rethinkperformance evaluation for text classification and analysis systems tobe usable and trustable.

If unreliable systems achieve astonishing scores with traditionalmetrics, how do we recognise progress when we see it? The goal of theWorkshop on Novel Evaluation Approaches for Text Classification Systemson Social Media (NEATCLasS) is to promote the development and use ofnovel metrics for abuse detection, hate speech recognition, sentimentanalysis and similar tasks within the community, to better be able tomeasure whether models really improve upon the state of the art, and toencourage a wide range of models to be tested on these new metrics.

Recently there have been attempts to address the problem of benchmarksand metrics that do not represent performance well. For example, inabusive language detection, there are both static datasets ofhard-to-detect examples (Röttger et al. 2021) and dynamic approaches forgenerating such examples (Calabrese et al. 2021). On the platformDynaBench (Kiela et al. 2021), benchmarks are dynamic and constantlyupdated with hard-to-classify examples, avoiding overfitting apredetermined dataset. However, these approaches only capture a tinyfraction of issues with benchmarking. There is still much work to do.

For the first edition of the workshop on Novel Evaluation Approaches forText Classification Systems (NEATCLasS) we welcome submissionsdiscussing such new evaluation approaches, introducing new or refiningexisting ones, promoting the use of novel metrics for abuse detection,sentiment analysis and similar tasks within the community. Furthermore,the workshop will promote discussion on the importance, potential anddanger of disagreement in tasks that require subjective judgements. Thisdiscussion will also focus on how to evaluate human annotations, and howto find the most suitable set of annotators (if any) for a giveninstance and task. The workshop will solicit, among others, researchpapers about

* Issues with current evaluation metrics and benchmarking datasets
* New evaluation metrics

* User-centred (qualitative or quantitative) evaluation of social mediatext analysis tools* Adaptations and translations of novel evaluation metrics for otherlanguages

* New datasets for benchmarking

* Increasing data quality in benchmarking datasets, e.g., avoidance ofselection bias, identification of suitable expert human annotators fortasks involving subjective judgements

* Systems that facilitate dynamic evaluation and benchmarking

* Models that perform better at hard-to-classify instances and novelevaluation metrics such as AAA, DynaBench and HateCheck

* Bias, error analysis and model diagnostics

* Phenomena not captured by existing evaluation metrics (such as modelsmaking the right predictions for the wrong reason)

* Approaches to mitigating bias and common errors

* Alternative designs for NLP competitions that evaluate a wide range ofmodel characteristics (such as bias, error analysis, cross-domainperformance)* Challenges of downstream applications (in industry, computationalsocial science, computational communication science, and others) andreflections on how these challenges can be captured in evaluation metrics


Format and Submissions

The workshop will take place as a full-day meeting on 6 June.Participants will be invited to trial an innovative format for paperpresentations: presenters will be given 5 minutes to describe theirresearch questions and hypotheses, and a group discussion will startafter that. Then, presenters will be given 5 more minutes to describetheir method and results, followed by a new group discussion about theinterpretation and implications of such results. In the afternoon therewill be collaborative group activities to bring researchers together andcollect ideas for new evaluation approaches and future work in thefield. We will discuss how we can organise competitions when there aremultiple evaluation metrics and benchmarking datasets are dynamic.

We invite research papers (8 pages), position and short papers (4pages), and demo papers (2 pages). Submissions must be original andshould not have been published previously or be under consideration forpublication while being evaluated for this workshop. Submissions will beevaluated by the program committee based on the quality of the work andits fit to the workshop themes. All submissions should be double-blindand a high-resolution PDF of the paper should be uploaded to theEasyChair submission site (link TBD) before the paper submissiondeadline. All papers must be submitted, and formatted in AAAItwo-column, camera-ready style. Authors of accepted papers will have theopportunity to publish their papers through workshop proceedings by theAAAI Press. Submission instructions will be uploaded to the workshop webpage in due course: https://neatclass-workshop.github.io/


Timeline
* Submission link: TBD – see https://neatclass-workshop.github.io/
* Papers submission deadline: March 27, 2022
* Paper acceptance notification: April 10, 2022
* Final camera-ready paper due: April 17, 2022
* Workshop Day: June 6, 2022

Organisers

Björn Ross, University of Edinburgh (Contact: (b.ross /at/ ed.ac.uk))
Roberto Navigli, Sapienza University of Rome
Agostina Calabrese, University of Edinburgh

---------------
The COMMLIST
---------------
This mailing list is a free service offered by Nico Carpentier. Please use it responsibly and wisely.
--
To subscribe or unsubscribe, please visit http://commlist.org/
--
Before sending a posting request, please always read the guidelines at http://commlist.org/
--
To contact the mailing list manager:
Email: (nico.carpentier /at/ vub.ac.be)
URL: http://nicocarpentier.net
---------------

[Previous message][Next message][Back to index]

Archive for 2022

[Commlist] Call for papers: 1st Workshop on Novel Evaluation Approaches for Text Classification Systems on Social Media

Sun Feb 13 10:05:55 GMT 2022