Skip to content

Latest commit

 

History

History
84 lines (60 loc) · 7.48 KB

README.md

File metadata and controls

84 lines (60 loc) · 7.48 KB

benchmark-coordination Python 3.11+ codecov

benchmark-coordination

This project is an integrated collection of network-based approaches for coordination detection, as found in the scientific literature. The coordination detection methods are abstracted and implemented in a modular and extensible way, allowing the common building blocks to be reused and combined in different ways.

StructureCoordination Detection MethodsDevelopmentLicense

🧩 Structure

  • features_builder/: functions to extract features from the raw data (e.g. text preprocessing, tf-idf, etc.)
  • network_builder/: functions to create networks based on similarity, and to filter relevant edges
  • pipeline/: abstractions to combine multiple steps into a single workflow
  • similarity_calculator/: implementation of similarity scores
  • types/: utilities for consistent code typing and datetime formatting
  • utils/: general utilities that are re-used throughout the codebase (e.g. logging)
  • windowing/: functions to slice the data (e.g. sliding time windows approaches)

🎓 Coordination Detection Methods

Reference Trace (Features) Filter Window Type Metric (Threshold) Community Detection / Classification Detection Scores
Pacheco, Diogo, et al. "Uncovering coordinated networks on social media: methods and case studies." Proceedings of the international AAAI conference on web and social media. Vol. 15. 2021. handle handle used by >= 2 users entire time cardinality - -
Pacheco, Diogo, et al. "Uncovering coordinated networks on social media: methods and case studies." Proceedings of the international AAAI conference on web and social media. Vol. 15. 2021. image (RGB vector) users who shared >= 3 images entire time Jaccard similarity (>= 99th percentile) - -
Pacheco, Diogo, et al. "Uncovering coordinated networks on social media: methods and case studies." Proceedings of the international AAAI conference on web and social media. Vol. 15. 2021. hashtag sequence users with >= 5 posts and >=5 unique hashtags entire time Jaccard similarity MinHash (> 0.8) - -
Pacheco, Diogo, et al. "Uncovering coordinated networks on social media: methods and case studies." Proceedings of the international AAAI conference on web and social media. Vol. 15. 2021. retweet - entire time cosine similarity (> 0.9) - -
Pacheco, Diogo, et al. "Uncovering coordinated networks on social media: methods and case studies." Proceedings of the international AAAI conference on web and social media. Vol. 15. 2021. tweet time (30m bins) - entire time cosine similarity (> 0.9) - -
Vishnuprasad, Padinjaredath Suresh, et al. "Tracking fringe and coordinated activity on Twitter leading up to the US Capitol attack." Proceedings of the international AAAI conference on web and social media. Vol. 18. 2024. retweet - time (len: , stride: ) cardinality (> 1) - -
Vishnuprasad, Padinjaredath Suresh, et al. "Tracking fringe and coordinated activity on Twitter leading up to the US Capitol attack." Proceedings of the international AAAI conference on web and social media. Vol. 18. 2024. text - activity (len: 10, stride: 1) Ratclif-Obershelp similarity (> 0.7) - -
Hristakieva, Kristina, et al. "The spread of propaganda by coordinated communities on social media." Proceedings of the 14th ACM Web Science Conference 2022. 2022. retweet - entire time cosine similarity Louvain -

🛠️ Development

python

This project is developed and tested on multiple python versions (3.11+). Dependencies are managed with pipenv and security vulnerabilities are scanned with pip_audit.

pipenv can be installed via pip:

$ pip install pipenv

Before running the project, the virtual environment, including development dependencies, needs to be installed and then activated:

# install dependencies (including dev)
$ pipenv install --dev

# activate environment
$ pipenv shell

The code in this repository follows PEP 8 style guide. Code can be formatted and linted with:

python -m black .
python -m ruff check .

Also see Black.

Additionally, static typing can be checked with:

python -m mypy

Unit tests are implemented with pytest and can be run with:

python -m pytest tests/

📜 License

Apache License 2.0