A Grid-Coupled Clustering Algorithm with Soft Constraints for Mixed-Attribute Data Streams

Abstract

Mixed-attribute data contains both numerical and categorical attributes, posing challenges for traditional clustering algorithms in managing its dynamics and concept drift. This article proposes a hybrid attribute data stream clustering algorithm that combines soft constraints. Firstly, normalize the mixed-attribute data stream and apply local linear embedding for dimensionality reduction; Secondly, design a mixed-attribute sliding window, based on the idea of grid coupling update, to analyze changes in grid centroids to adapt to dynamic data flows; Finally, fuzzy mathematics is introduced to set soft constraints on interval boundaries (width) and grid cell density, restricting high-frequency cluster shifts. In the experimental section, a comparison was made between the time dimension feature extraction method based on unsupervised learning and the dual interactive generative adversarial network method. On the Forest Cover Type, GMD-4C2D800 Linear, and KDD CUP 99 datasets, the proposed method achieved a minimum CMM value of 0.894 and a minimum Purity value of 0.856, with an accuracy of up to 99.94% and a maximum NMI value of 1, all of which were superior to the comparison methods. The results indicate that the proposed algorithm can effectively adapt to changes in data flow distribution, enhance both clustering accuracy and computational efficiency.

Author Biography

Wenbo Wu, Minnan Science and Technology College

School of Computer Information

Authors

  • Wenbo Wu Minnan Science and Technology College

DOI:

https://doi.org/10.31449/inf.v50i13.10911

Downloads

Published

05/18/2026

How to Cite

Wu, W. (2026). A Grid-Coupled Clustering Algorithm with Soft Constraints for Mixed-Attribute Data Streams. Informatica, 50(13). https://doi.org/10.31449/inf.v50i13.10911