
Sensor Reading CSV Data Processing

This repo implements a work queue system that processes multiple large CSV files in a distributed way.

To scale the system up, increase the number of worker replicas or the worker concurrency. Remember to increase the database connection pool size as well, because more concurrent workers open more database connections.
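As a rough illustration of why the pool size has to grow with the workers, the sketch below estimates how many database connections a deployment might open at once. The function names, the `per_process` parameter, and the assumption that each worker process holds at most one connection at a time are hypothetical and not taken from this repo. The limit of 100 with 3 reserved connections matches the usual PostgreSQL defaults, but check your own server settings.

```python
def connections_needed(worker_replicas: int, concurrency: int, per_process: int = 1) -> int:
    """Upper bound on simultaneous DB connections opened by all workers.

    Assumption: every worker process holds at most `per_process` connections.
    """
    if worker_replicas < 1 or concurrency < 1 or per_process < 1:
        raise ValueError("all arguments must be positive integers")
    return worker_replicas * concurrency * per_process


def fits_database_limit(needed: int, max_connections: int = 100, reserved: int = 3) -> bool:
    """Check the estimate against the server limit, minus reserved slots."""
    return needed <= max_connections - reserved


# Example: 4 replicas with concurrency 8 need up to 32 connections.
needed = connections_needed(worker_replicas=4, concurrency=8)
print(needed, fits_database_limit(needed))
```

If the estimate exceeds the limit, either raise the database limit and pool size or lower the replicas or concurrency.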

TL;DR

make setup
make start
make enqueue ARGS="data"

Setup

./scripts/setup.sh

or

make setup

Start the work queue

docker compose up -d

or

make start

Enqueue the jobs

docker compose run --rm --no-deps enqueue [-h] [--dbhost DBHOST] [--dbname DBNAME] [--dbuser DBUSER] [--dbpass DBPASS] [--dbport DBPORT] directory

or

make enqueue ARGS="[-h] [--dbhost DBHOST] [--dbname DBNAME] [--dbuser DBUSER] [--dbpass DBPASS] [--dbport DBPORT] directory"
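The usage line above can be read as an argument parser. The following is a minimal sketch of one, not the actual contents of enqueue.py: the flag names come from the usage line, while the help texts, the integer type for `--dbport`, the absence of defaults, and the `find_csv_files` helper (which assumes one job per CSV file in the directory) are assumptions made for illustration.

```python
import argparse
from pathlib import Path


def build_parser() -> argparse.ArgumentParser:
    """Parser mirroring the documented flags; -h is added by argparse itself."""
    parser = argparse.ArgumentParser(prog="enqueue")
    parser.add_argument("--dbhost", help="database host (hypothetical help text)")
    parser.add_argument("--dbname", help="database name")
    parser.add_argument("--dbuser", help="database user")
    parser.add_argument("--dbpass", help="database password")
    parser.add_argument("--dbport", type=int, help="database port")
    parser.add_argument("directory", help="directory containing the CSV files")
    return parser


def find_csv_files(directory: str) -> list[Path]:
    """Hypothetical helper: list the CSV files that would each become a job."""
    return sorted(
        p for p in Path(directory).iterdir()
        if p.is_file() and p.suffix.lower() == ".csv"
    )


# Parse an explicit argument list instead of sys.argv for demonstration.
args = build_parser().parse_args(["--dbhost", "localhost", "--dbport", "5432", "data"])
print(args.directory, args.dbhost, args.dbport)
```

Flags that are not passed come back as `None` in this sketch, so the real script's defaults (for example values read from `.env`) are left open.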

Generate Data

docker compose run --rm --no-deps worker python ./scripts/generate_data.py [--size] [--output]

or

make generate-data ARGS="[--size] [--output]"
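To give a sense of what a data generator with `--size` and `--output` options might do, here is a small sketch. It is not the repo's scripts/generate_data.py: it assumes `size` means the number of rows and `output` is a file path, and the column names (`sensor_id`, `timestamp`, `value`) are hypothetical, since the real table structure is not shown here.

```python
import csv
import random
from datetime import datetime, timedelta, timezone
from pathlib import Path

# Hypothetical columns; the real schema may differ.
FIELDNAMES = ["sensor_id", "timestamp", "value"]


def generate_rows(size: int, seed: int = 0):
    """Yield `size` synthetic sensor readings, one per second."""
    rng = random.Random(seed)  # seeded so repeated runs produce the same data
    start = datetime(2024, 1, 1, tzinfo=timezone.utc)
    for i in range(size):
        yield {
            "sensor_id": f"sensor-{rng.randint(1, 10)}",
            "timestamp": (start + timedelta(seconds=i)).isoformat(),
            "value": round(rng.uniform(0.0, 100.0), 3),
        }


def write_csv(output: str, size: int, seed: int = 0) -> Path:
    """Write a header plus `size` rows to `output`, creating parent folders."""
    path = Path(output)
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("w", newline="") as handle:
        writer = csv.DictWriter(handle, fieldnames=FIELDNAMES)
        writer.writeheader()
        writer.writerows(generate_rows(size, seed))
    return path
```

Calling `write_csv("data/sample.csv", size=1000)` would then produce a file that can be placed in the directory passed to the enqueue step.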

Test

make test

Monitor the job status

1. Start the monitor: docker compose up monitor
2. Open the panel at http://localhost:5555/

ER diagram

Table Structure

Tech Stack

Celery
RabbitMQ
Postgres
Docker
SQLAlchemy

