Title:
|
Unsupervised document clustering by weighted combination
|
Author:
|
González Pellicer, Edgar; Turmo Borras, Jorge
|
Other authors:
|
Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics; Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural |
Abstract:
|
This report proposes a novel unsupervised document clustering approach based on weighted combination of individual clusterings. Two non-weighted combination methods are adapted to work in a weighted fashion: a graph based method and a probability based one. The performance of the weighted approach is evaluated on real-world collections, and compared to that of individual clustering and non-weighted combination. The results of this evaluation confirm that graph based weighted combination consistently outperforms the other approaches. |
Subject(s):
|
-Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial -Document clustering -Weighted combination |
Rights:
|
|
Document type:
|
Article - Published version Report |
Share:
|
|