Documents Clustering techniques

Łukasz Machnik

Abstract


Documents Clustering is a technique in which relationships between sets of documents are being automatically discovered and documents are divided into groups of similar specimens. The groups that are created during the process of clustering should be specified by the high degree of similarity between the elements that belong to the same group and low degree of similarity between the elements that belong to different groups. Such way of organizing documents allows the user to review content quickly and makes it easier to retrieve particularly interesting information. The following article describes the most popular documents clustering techniques and issues associated with it, like: text documents representation and similarity measure of documents. Additionally, the author is going to introduce his own concept of new effective method of documents clustering based on Ant System.

Full Text:

PDF


DOI: http://dx.doi.org/10.17951/ai.2004.2.1.401-411
Data publikacji: 2015-01-04 00:00:00
Data złożenia artykułu: 2016-04-27 10:11:24


Statistics


Total abstract view - 188
Downloads (from 2020-06-17) - PDF - 0

Indicators



Refbacks

  • There are currently no refbacks.


Copyright (c) 2015 Annales UMCS Sectio AI Informatica

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.