Optimizing Social Media Analytics with the DQEA Framework for Superior Data Quality Management

Karthick B, Meyappan T

Abstract


This paper introduces the Data Quality Enhancement and Analytics (DQEA) framework, which uses modern data analytics techniques to improve data quality in social media analytics. Departing from the previous BDMS approach, the DQEA framework targets three data quality issues: noise, bias, and incompleteness. The framework's content features were validated against human coders on Amazon Mechanical Turk, achieving an inter-coder reliability score of 0.85, which indicates high agreement. Two case studies on a large social media dataset from Tumblr further demonstrate the effectiveness of the proposed content features. In the first case study, the DQEA framework reduced data noise by 30% and bias by 25% while increasing completeness by 20%. In the second, it improved data consistency by 35% and the overall data quality score by 28%. A comparative analysis against state-of-the-art models, including Random Forest and Support Vector Machines (SVM), showed significant improvements in data reliability and decision-making accuracy: the DQEA framework outperformed the Random Forest model by 15% in accuracy and 20% in true positive rate, and the SVM model by 10% in error rate reduction and 18% in reliability. Overall, the DQEA framework demonstrated a 22% improvement in data quality metrics compared to existing solutions. These results support the framework's ability to enhance data quality in social media analytics and to address critical data quality challenges. The research contributes to the field of business intelligence by offering a comprehensive framework that can be easily integrated into existing data analytics workflows, supporting more reliable and accurate decision-making based on social media data. The results also underscore the potential of advanced data analytics tools to turn social media data into a valuable asset for organizations, and they point to practical implications and future research directions in this domain.

DOI: https://doi.org/10.31449/inf.v49i3.8306

This work is licensed under a Creative Commons Attribution 3.0 License.