This project implements the online k-clustering algorithm described in this paper (http://cseweb.ucsd.edu/~dasgupta/291/lec6.pdf). It produces real-time k-clustering on an infinite stream of data. It is implemented on top of Twitter Storm and uses Cassandra as its database. It deals with 2-dimensional matrices and clusters them in Euclidean space.
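To make the idea of clustering a stream concrete, here is a minimal sketch of one common online scheme, sequential k-means, for 2-dimensional points in Euclidean space. It is an illustration only and may differ from the algorithm this project actually uses; it runs in a single process and involves neither Storm nor Cassandra. The class name `OnlineKMeans` and its `update` method are hypothetical names chosen for this example.

```python
import math


class OnlineKMeans:
    """Sequential k-means over a stream of 2-D points (illustrative sketch)."""

    def __init__(self, k):
        self.k = k
        self.centers = []  # current cluster centers, each stored as [x, y]
        self.counts = []   # number of points absorbed by each center

    def update(self, point):
        """Consume one point from the stream and return its cluster index."""
        x, y = point

        # Seeding: the first k points of the stream become the initial centers.
        if len(self.centers) < self.k:
            self.centers.append([float(x), float(y)])
            self.counts.append(1)
            return len(self.centers) - 1

        # Assign the point to the nearest center by Euclidean distance.
        nearest = min(
            range(self.k),
            key=lambda i: math.dist(self.centers[i], (x, y)),
        )

        # Move that center toward the point so that it remains the running
        # mean of every point assigned to it so far.
        self.counts[nearest] += 1
        step = 1.0 / self.counts[nearest]
        center = self.centers[nearest]
        center[0] += step * (x - center[0])
        center[1] += step * (y - center[1])
        return nearest


if __name__ == "__main__":
    model = OnlineKMeans(k=2)
    for p in [(0, 0), (10, 10), (2, 0), (10, 12)]:
        print(p, "-> cluster", model.update(p))
    print("centers:", model.centers)
```

Each point is processed once and then discarded, so memory use depends only on k and not on the length of the stream, which is what makes this style of update suitable for unbounded input.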