Contributo in atti di convegno, 2022, ENG, 10.1007/978-3-031-13643-6_23

A concise overview of LeQua@CLEF 2022: Learning to Quantify

Esuli A.; Moreo A.; Sebastiani F.; Sperduti G.

CNR-ISTI, Pisa, Italy; CNR-ISTI, Pisa, Italy; CNR-ISTI, Pisa, Italy; CNR-ISTI, Pisa, Italy

LeQua 2022 is a new lab for the evaluation of methods for "learning to quantify" in textual datasets, i.e., for training predictors of the relative frequencies of the classes of interest Y={y1,...,yn} in sets of unlabelled textual documents. While these predictions could be easily achieved by first classifying all documents via a text classifier and then counting the numbers of documents assigned to the classes, a growing body of literature has shown this approach to be suboptimal, and has proposed better methods. The goal of this lab is to provide a setting for the comparative evaluation of methods for learning to quantify, both in the binary setting and in the single-label multiclass setting; this is the first time that an evaluation exercise solely dedicated to quantification is organized. For both the binary setting and the single-label multiclass setting, data were provided to participants both in ready-made vector form and in raw document form. In this overview article we describe the structure of the lab, we report the results obtained by the participants in the four proposed tasks and subtasks, and we comment on the lessons that can be learned from these results.

CLEF 2022 - 13th Conference and Labs of the Evaluation Forum, pp. 362–381, Bologna, Italy, 5-8/9/2022

Keywords

Quantification

CNR authors

Sperduti Gianluca, Esuli Andrea, Moreo Fernandez Alejandro David, Sebastiani Fabrizio

CNR institutes

ISTI – Istituto di scienza e tecnologie dell'informazione "Alessandro Faedo"

ID: 470263

Year: 2022

Type: Contributo in atti di convegno

Creation: 2022-08-30 17:16:54.000

Last update: 2022-11-03 16:18:15.000

External IDs

CNR OAI-PMH: oai:it.cnr:prodotti:470263

DOI: 10.1007/978-3-031-13643-6_23

Scopus: 2-s2.0-85137990750

ISI Web of Science (WOS): 000870333000023