Implémentation de l'algorithme de clustering k-means en utilisant le framework Hadoop version 3.1.3 (MapReduce).
-
Updated
Mar 4, 2021 - Java
Implémentation de l'algorithme de clustering k-means en utilisant le framework Hadoop version 3.1.3 (MapReduce).
DocClusterizer is a Java desktop application designed to analyze and cluster documents based on their content similarity. The application utilizes Lucene and Tika libraries to process various file extensions such as txt, pdf, docx, and pptx.
Exploration of the different phases of Data Mining: Data visualization, their preprocessing and the implementation of multiple algorithms for Data Mining.
Add a description, image, and links to the unsupervised-clustering topic page so that developers can more easily learn about it.
To associate your repository with the unsupervised-clustering topic, visit your repo's landing page and select "manage topics."