Ayuda
Ir al contenido

Dialnet


Dataset and Baselines for IID and OOD Image Classification Considering Data Quality and Evolving Environments

  • Zhuo Zhang [1] ; Yang Li [2] ; Yicheng Gong [1] ; Yue Yang [1] ; Shukun Ma [1] ; Xiaolan Guo [1] ; Sezai Ercisli [3]
    1. [1] Tianjin University

      Tianjin University

      China

    2. [2] Shihezi University

      Shihezi University

      China

    3. [3] Atatürk University

      Atatürk University

      Turquía

  • Localización: IJIMAI, ISSN-e 1989-1660, Vol. 8, Nº. 1, 2023 (Ejemplar dedicado a: Special Issue on AI-driven Algorithms and Applications in the Dynamic and Evolving Environments), págs. 6-12
  • Idioma: inglés
  • Enlaces
  • Resumen
    • At present, artificial intelligence is in a period of rapid development, and deep learning has begun to be applied in various fields. Data, as a key part of the deep learning, its efficiency and stability, will directly affect the performance of the model, so it is valued by people. In order to make the dataset efficient, many active learning methods have been proposed, the dataset containing independent identically distribution (IID) samples is reduced with excellent performance; in order to make the dataset more stable, it should be solved that the model encounters out-of-distribution (OOD) samples to improve generalization performance. However, the current active learning method design and the method of adding OOD samples lack guidance, and people do not know what samples should be selected and which OOD samples will be added to better improve the generalization performance. In this paper, we propose a dataset containing a variety of elements called a dataset with Complete Sample Elements(CSE), the labels such as rotation angle and distance in addition to the common classification labels. These labels can help people analyze the distribution characteristics of each element of an efficient dataset, thereby inspiring new active learning methods; we also construct a corresponding OOD test set, which can not only detect the generalization performance of the model, but also helps explore metrics between OOD samples and existing dataset to guide the selected method of OOD samples, so that it can improve generalization efficiently. In this paper, we explore the distribution characteristics of efficient datasets in terms of angle element, and confirm that an efficient dataset tends to contain samples with different appearance. At the same time, experiments have proved the positive influence of the addition of OOD samples on the generalization performance of dataset.


Fundación Dialnet

Dialnet Plus

  • Más información sobre Dialnet Plus

Opciones de compartir

Opciones de entorno