This project implements online-k-clustering algorithm as mentioned in this paper(http://cseweb.ucsd.edu/~dasgupta/291/lec6.pdf). It produces REALTIME k-clustering on an infinite stream of data. It is implemented on top of twitter storm and uses cassandra as database. It deals with 2-dimensional matrices and clusters in Euclidean space.
30 Day SummaryJan 15 2026 — Feb 14 2026
|
12 Month SummaryFeb 14 2025 — Feb 14 2026
|