INTERACTIVE TWITTER BOT DETECTION THROUGH NETWORK GRAPH ANALYSIS (2023)

  • Developed an interactive approach to identify and annotate Twitter bots by integrating natural language processing (NLP) models, network graph analysis, and expert human judgment. Utilized the Twibot-20 and Twibot-22 datasets to train graph embedding models, followed by hierarchical clustering to identify accounts with similar behavioral patterns. Simultaneously implemented an NLP model to generate bot likelihood scores based on tweet content analysis.

  • The interactive bot detection system operates through a multi-stage process: retrieving unlabeled clusters from an SQLite database, presenting account characteristics via t-SNE visualization, and enabling users to examine individual profiles alongside pre-calculated NLP-based bot likelihood scores. This human-in-the-loop approach allows for informed decision-making regarding account bot status, with user feedback continuously improving the system’s precision.

  • Detailed methodology and results are available in the research report. The ultimate objective is to deploy this interactive bot detection system as a web application, demonstrating superior accuracy and efficiency compared to existing bot detection solutions.