Annotating reliability to enhance disinformation detection: annotation scheme, resource and evaluation

Bonet-Jover, Alba; Sepúlveda-Torres, Robiert; Saquete Boró, Estela; Martínez-Barco, Patricio

Annotating reliability to enhance disinformation detection: annotation scheme, resource and evaluation

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/133233

Información del item - Informació de l'item - Item information
Título:	Annotating reliability to enhance disinformation detection: annotation scheme, resource and evaluation
Título alternativo:	Anotando la confiabilidad para mejorar la tarea de detección de desinformación: esquema de anotación, recurso y evaluación
Autor/es:	Bonet-Jover, Alba \| Sepúlveda-Torres, Robiert \| Saquete Boró, Estela \| Martínez-Barco, Patricio
Grupo/s de investigación o GITE:	Procesamiento del Lenguaje y Sistemas de Información (GPLSI)
Centro, Departamento o Servicio:	Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos
Palabras clave:	Natural Language Processing \| Annotation Guideline \| Dataset Annotation \| Reliability Detection \| Disinformation Detection \| Procesamiento Lenguaje Natural \| Guía Anotación \| Anotación Corpus \| Detección Confiabilidad \| Detección Desinformación
Fecha de publicación:	mar-2023
Editor:	Sociedad Española para el Procesamiento del Lenguaje Natural
Cita bibliográfica:	Procesamiento del Lenguaje Natural. 2023, 70: 15-26. https://doi.org/10.26342/2023-70-1
Resumen:	Disinformation is a critical problem in our society. The COVID-19 pandemic and the Russia-Ukraine war have been key events for the spreading of fake news. Assuming that fake news mixes reliable and unreliable information, we propose RUN-AS (Reliable and Unreliable Annotation Scheme), a fine-grained annotation scheme that labels the structural parts and essential content elements of a news item to enable their classification into Reliable and Unreliable. This type of annotation will be used for training systems to automatically classify the reliability of a news item. To this end, RUN dataset in Spanish was built and annotated with RUN-AS. A set of experiments were conducted to validate the annotation scheme. The experiments evidence the validity of the annotation scheme proposed, obtaining the best F1m, i.e., 0.948. \| La desinformación es un problema crítico en nuestra sociedad. La pandemia de covid-19 y la guerra entre Rusia y Ucrania han sido escenarios clave para la difusión de noticias falsas. Partiendo de la base de que las noticias falsas mezclan información confiable y no confiable, proponemos RUN-AS (Reliable and Unreliable Annotation Scheme), un esquema de anotación de grano fino que etiqueta las partes estructurales y los elementos de contenido esenciales de una noticia y permite clasificarlos en Confiable y No confiable. Esta anotación será usada en el entrenamiento de sistemas para la clasificación automática de la confiabilidad de una noticia. Para ello, se construyó el corpus RUN en español y se anotó con RUN-AS. Se llevó a cabo un conjunto de experimentos para validar el esquema de anotación. Los experimentos evidencian la validez del esquema de anotación propuesto, obteniendo el mejor F1m 0,948.
Patrocinador/es:	This research work is funded by MCIN/AEI/10.13039/501100011033 and European Union NextGenerationEU/PRTR through the projects “TRIVIAL” (PID2021-122263OB-C22) and “SocialTrust” (PDC2022-133146-C22). It is also supported by Generalitat Valenciana through the project “NL4DISMIS” (CIPROM/2021/21) and Consellería de Innovación, Universidades, Ciencia y Sociedad Digital (ACIF/2020/177).
URI:	http://hdl.handle.net/10045/133233
ISSN:	1135-5948
DOI:	10.26342/2023-70-1
Idioma:	eng
Tipo:	info:eu-repo/semantics/article
Derechos:	© Sociedad Española para el Procesamiento del Lenguaje Natural. Distribuido bajo Licencia Creative Commons Reconocimiento-NoComercial-SinObraDerivada 4.0
Revisión científica:	si
Versión del editor:	https://doi.org/10.26342/2023-70-1
Aparece en las colecciones:	Procesamiento del Lenguaje Natural - Nº 70 (2023) INV - GPLSI - Artículos de Revistas

Archivos en este ítem:

Archivos en este ítem:
Archivo	Descripción	Tamaño	Formato
PLN_70_01.pdf		1,29 MB	Adobe PDF	Abrir Vista previa Cerrar vista previa

Ver citas en Google Académico

Muestra el registro completo

Este ítem está licenciado bajo Licencia Creative Commons