Integrating Apriori Mining and Speech Recognition for Intelligent and Secure Online Classroom Interaction

Wei Zhao

doi:10.31449/inf.v49i37.11444

Abstract

Real-time structuring of speech and anomaly detection remain critical challenges in online classroom interactions, especially under noisy and multi-speaker conditions. This study proposes an integrated framework that combines Automatic Speech Recognition (ASR) with Apriori-based pattern mining to enhance the intelligence and security of online classrooms. The system first applies multi-speaker ASR with acoustic feature separation to achieve robust transcription, speaker labeling, and noise suppression. The transcribed text is pre-processed through Chinese word segmentation and stop-word filtering to construct a transactional dataset. Frequent Pattern Growth (FP-Growth) is then employed to generate frequent itemsets and extract high-confidence association rules, forming a reference speech-pattern library. Local anomaly factors are introduced to quantify deviations in support and confidence between new corpora and the rule library, thereby enabling early detection of sensitive or off-topic speech. Experimental validation on 120 classroom sessions demonstrates an 80.2% recognition accuracy in highly noisy environments, with rule coverage and confidence reaching 87% and 89%, respectively. The proposed framework significantly improves anomaly warning efficiency, reducing sensitive-speech combinations by 87.0%. These results highlight the feasibility and effectiveness of integrating ASR and Apriori mining for intelligent speech structuring, pattern extraction, and anomaly detection in secure online classroom environments.

References

.Chen Y, Zhang J, Yuan X, et al. Sok: A modularized approach to study the security of automatic speech recognition systems[J]. ACM Transactions on Privacy and Security, 2022, 25(3): 1-31.

.Aldarwbi MY, Lashkari AH, Ghorbani A A. The sound of intrusion: A novel network intrusion detection system[J]. Computers and Electrical Engineering, 2022, 104: 108455.

.Zhang L. Data mining and learning behavior analysis of French online education data-driven teaching based on generative adversarial network improvement Apriori algorithm[J]. International Journal of Wireless and Mobile Computing, 2025, 28(2): 205-215.

.Onishi S, Yasumori T, Shiina H. Classroom Utterance Analysis and Visualization Using a Generative Deep Neural Networks for Dialogue Model[J]. International Journal of Smart Computing and Artificial Intelligence, 2024, 8(2).

.Chen Z, Han B, Wang S, et al. Attention-based encoder-decoder end-to-end neural diarization with embedding enhancer[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024, 32: 1636-1649.

.Gomez A, Pattichis MS, Celedón- Pattichis S. Speaker diarization and identification from single channel classroom audio recordings using virtual microphones[J]. IEEE Access, 2022, 10: 56256-56266.

.Ge Y, Zhao L, Wang Q, et al. Advddos : Zero-query adversarial attacks against commercial speech recognition systems[J]. IEEE Transactions on Information Forensics and Security, 2023, 18: 3647-3661.

.Lamichhane B. Speaker Diarization With Embeddings From a VGGish Model[J]. Dihardchallenge . Github . Io, 2024: 12-14.

.Kothalkar PV, Hansen JHL, Irvin D, et al. Child-adult speech diarization in naturalistic conditions of preschool classrooms using room-independent ResNet model and automatic speech recognition-based re-segmentation[J]. The Journal of the Acoustical Society of America, 2024, 155(2): 1198-1215.

.Xylogiannis P, Vryzas N, Vrysis L, et al. Multisensory fusion for unsupervised spatiotemporal speaker diarization [J]. Sensors, 2024, 24(13): 4229.

.Liu R, Shi J, Chen X, et al. Network anomaly detection and security defense technology based on machine learning: A review[J]. Computers and Electrical Engineering, 2024, 119: 109581.

.Bagui S S , Khan MP, Valmyr C, et al. Model Retraining upon Concept Drift Detection in Network Traffic Big Data[J]. Future Internet, 2025, 17(8): 328.

.Zheng G. Construction of ideological and political education in universities based on intelligent digital education[J]. Advances in Educational Technology and Psychology, 2024, 8(1): 45-54.

.Serafini L, Cornell S, Morrone G, et al. An experimental review of speaker diarization methods with application to two-speaker conversational telephone speech recordings[J]. Computer speech & language, 2023, 82: 101534.

.Southwell R, Pugh S, Perkoff M, et al. Challenges and feasibility of automatic speech recognition for modeling student collaborative discourse in classrooms[J]. International Educational Data Mining Society, 2022.

.Luo Y. Artificial intelligence model for real-time monitoring of ideological and political teaching system[J]. Journal of Intelligent & Fuzzy Systems, 2021, 40(2): 3585-3594.

.Zhan H, Meng X, Asif M. Risk early warning of a dynamic ideological and political education system based on LSTM-MLP: Online education data processing and optimization[J]. Mobile Networks and Applications, 2024, 29(2): 1 -13 .

.Cheng M, Lin Y, Li M. Sequence-to-sequence neural diarization with automatic speaker detection and representation[J]. IEEE Transactions on Audio, Speech and Language Processing, 2025.

.Bois A, Tervil B, Oudre L. A persistent homology-based algorithm for unsupervised anomaly detection in time series[J]. Transactions on Machine Learning Research, 2024.

.Wang J., Dudy S., He X., Wang Z., Southwell R., & Whitehill J. Optimizing Speaker Diarization for the Classroom: Applications in Timing Student Speech and Distinguishing Teachers from Children[J]. Journal of Educational Data Mining, 2025, 17(1): 98-125.

.Cai J, Li Y. Fuzzy association rule mining for Personalized English Language Teaching from higher education[J]. Journal of Computational Methods in Sciences and Engineering, 2024, 24(6): 3617-3631.

.Essalmi H, El Affar A. Dynamic Algorithm for Mining Relevant Association Rules via Meta-Patterns and Refinement-Based Measures[J]. Information, 2025, 16(6): 438 -467 .

.Sun Z, Peng Q, Mou X, et al. An artificial intelligence-based real-time monitoring framework for time series[J]. Journal of Intelligent & Fuzzy Systems, 2021, 40(6): 10401-10415.

.Ryan S, Djukic P, Morris T, et al. Pattern detection in time-series data: US Patent 11,620,528[P]. 2023-4-4.

.Janský J, Koldovský Z, Málek J, et al. Auxiliary function-based algorithm for blind extraction of a moving speaker[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2022, 2022(1): 1.

.Schwartz A, Schwartz O, Chazan SE, et al. Multi-microphone simultaneous speakers detection and localization of multi-sources for separation and noise reduction[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2024, 2024(1): 50.

.Tian J, Yu J, Weng C, et al. Improving mandarin end-to-end speech recognition with word n-gram language model[J]. IEEE Signal Processing Letters, 2022, 29: 812-816.

.Kim M, Jang G J. Speaker-Attributed Training for Multi-Speaker Speech Recognition Using Multi-Stage Encoders and Attention-Weighted Speaker Embedding[J]. Applied Sciences, 2024, 14(18): 8138.

.Huang Z, Delcroix M, Garcia LP, et al. Joint speaker diarization and speech recognition based on region proposal networks[J]. Computer Speech & Language, 2022, 72: 101316.

.He MK, Du J, Liu QF, et al. Ansd -ma- mse : Adaptive neural speaker diarization using memory-aware multi-speaker embedding[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, 31: 1561-1573.

Integrating Apriori Mining and Speech Recognition for Intelligent and Secure Online Classroom Interaction

Abstract

References

Authors

DOI:

Downloads

Published

Issue

Section

License

How to Cite

Developed By

Information