Articolo in rivista, 2023, ENG, 10.1007/s10618-023-00961-5
Molinari A.; Esuli A.
CNR-ISTI, Pisa, Italy; CNR-ISTI, Pisa, Italy
In high recall retrieval tasks, human experts review a large pool of documents with the goal of satisfying an information need. Documents are prioritized for review through an active learning policy, and the process is usually referred to as Technology-Assisted Review (TAR). TAR tasks also aim to stop the review process once the target recall is achieved to minimize the annotation cost. In this paper, we introduce a new stopping rule called SALR? (SLD for Active Learning), a modified version of the Saerens-Latinne-Decaestecker algorithm (SLD) that has been adapted for use in active learning. Experiments show that our algorithm stops the review well ahead of the current state-of-the-art methods, while providing the same guarantees of achieving the target recall.
Data mining and knowledge discovery (Dordrecht. Online)
Technology assisted review, Machine Learning, Text classification, Active learning
Molinari Alessio, Esuli Andrea
ISTI – Istituto di scienza e tecnologie dell'informazione "Alessandro Faedo"
ID: 486036
Year: 2023
Type: Articolo in rivista
Creation: 2023-09-05 14:33:30.000
Last update: 2023-09-05 17:25:14.000
CNR authors
External links
OAI-PMH: Dublin Core
OAI-PMH: Mods
OAI-PMH: RDF
DOI: 10.1007/s10618-023-00961-5
URL: https://link.springer.com/article/10.1007/s10618-023-00961-5
External IDs
CNR OAI-PMH: oai:it.cnr:prodotti:486036
DOI: 10.1007/s10618-023-00961-5
Scopus: 2-s2.0-85168873636