Ayuda
Ir al contenido

Dialnet


CrossClus: user-guided multi-relational clustering

    1. [1] University of Illinois at Urbana Champaign

      University of Illinois at Urbana Champaign

      Township of Cunningham, Estados Unidos

    2. [2] IBM Research – Thomas J. Watson Research Center

      IBM Research – Thomas J. Watson Research Center

      Town of Yorktown, Estados Unidos

  • Localización: Data mining and knowledge discovery, ISSN 1384-5810, Vol. 15, Nº 3, 2007, págs. 321-348
  • Idioma: inglés
  • Texto completo no disponible (Saber más ...)
  • Resumen
    • Most structured data in real-life applications are stored in relational databases containing multiple semantically linked relations. Unlike clustering in a single table, when clustering objects in relational databases there are usually a large number of features conveying very different semantic information, and using all features indiscriminately is unlikely to generate meaningful results. Because the user knows her goal of clustering, we propose a new approach called CrossClus, which performs multi-relational clustering under user’s guidance. Unlike semi-supervised clustering which requires the user to provide a training set, we minimize the user’s effort by using a very simple form of user guidance. The user is only required to select one or a small set of features that are pertinent to the clustering goal, and CrossClus searches for other pertinent features in multiple relations. Each feature is evaluated by whether it clusters objects in a similar way with the user specified features. We design efficient and accurate approaches for both feature selection and object clustering. Our comprehensive experiments demonstrate the effectiveness and scalability of CrossClus.


Fundación Dialnet

Dialnet Plus

  • Más información sobre Dialnet Plus

Opciones de compartir

Opciones de entorno