Skip to content

Final Project for CS221 - Detecting anomalous vertices within a large, directed graph dataset representing a network of users on Twitter

Notifications You must be signed in to change notification settings

vsawal/Network-Anomaly-Detection

 
 

Repository files navigation

Network-Anomaly-Detection

Final Project for CS221 - Detecting anomalous vertices within a large, directed graph dataset representing a network of users on Twitter

Final Report: https://github.com/anish-saha/Network-Anomaly-Detection/blob/master/CS_221_Project_Report.pdf

CodaLab Worksheet: https://worksheets.codalab.org/worksheets/0xb972380293b449d896f3c0ade600ee05

In this project, we try to construct a robust prediction algorithm to predict whether or not a social network user is a Sybil (fake account) using artificial intelligence. Our original dataset is a network dataset of over 5.3 million Twitter users.

Graph

Models

Three models were used to perform node classification:

  • A two-step link prediction/aggregation pipeline

Two

  • Graph Convolutional Network (GCN)

GCN

  • Graph Attention Networks (GAT)

GAT

Setup

pip3 install --user -r requirements.txt

Usage

Two-stage Classifier

python3 eda.py

python3 sybil_detect.py

GCN

cd pygcn/

python3 train.py

GAT

cd pygat/

python3 train.py --sparse

About

Final Project for CS221 - Detecting anomalous vertices within a large, directed graph dataset representing a network of users on Twitter

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 64.8%
  • Python 35.2%