
Genomic data processing platform for the Global Parkinson's Genetics Program (GP2) - featuring QC tools, carrier analysis, data visualization, and cohort management services


datatecnica/genotools-server


GenoTools Server

Monorepo containing genomic data processing services and applications for the Global Parkinson's Genetics Program (GP2).

Applications

GP2 Browser

Streamlit web application for browsing GP2 cohort data releases.

  • Location: apps/gp2_browser/
  • Features: Data releases, quality control, ancestry analysis, SNP metrics, rare variants

Microservices

GenoTools API

RESTful API interface for genomic data quality control and analysis.

  • Location: microservices/genotools_api/
  • Features: GenoTools command execution, GCS integration, API key authentication
  • Deployment: Docker, Kubernetes (GKE)
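As a rough illustration of the API key authentication listed above, the check might look something like the sketch below. This is not the service's actual code: the header name `X-API-KEY`, the function name `is_authorized`, and the plain-dictionary request headers are all assumptions made for the example.

```python
import hmac


def is_authorized(headers, expected_key):
    """Return True if the request headers carry the expected API key.

    `headers` is a plain dict here; the header name "X-API-KEY" is a
    hypothetical choice for this sketch, not taken from the service.
    """
    supplied = headers.get("X-API-KEY", "")
    # compare_digest runs in constant time, which avoids leaking
    # how many leading characters of the key were correct.
    return hmac.compare_digest(
        supplied.encode("utf-8"), expected_key.encode("utf-8")
    )


# Example: one valid request and one with a missing key.
print(is_authorized({"X-API-KEY": "example-key"}, "example-key"))
print(is_authorized({}, "example-key"))
```

In a real deployment the expected key would normally come from a secret store or environment variable rather than a literal in code.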

GenoTracker

FastAPI + Streamlit application for managing genomic cohort data.

  • Location: microservices/genotracker/
  • Features: Data visualization, local/GCS data support, interactive exploration

Carriers API

API service for genetic carrier analysis.

  • Location: microservices/carriers_api/
  • Features: Carrier status processing, PLINK integration

IDAT Utils

Illumina IDAT file processing toolkit for SNP metrics generation.

  • Location: microservices/idat_utils/
  • Features: IDAT-to-VCF conversion, SNP metrics extraction, Parquet output

SNP Checksums

Duplicate detection service using SNP-based hashing.

  • Location: microservices/snp_checksums/
  • Features: Genotype hashing, duplicate identification, PLINK file processing
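To make the idea of SNP-based hashing concrete, here is a minimal sketch of how duplicate samples could be found by hashing genotype calls. It is an assumption-laden illustration, not the service's implementation: the function names, the 0/1/2 allele-count encoding, the use of SHA-256, and the in-memory `samples` dictionary are all hypothetical, and it assumes every sample is genotyped on the same ordered SNP panel.

```python
import hashlib
from collections import defaultdict


def genotype_checksum(genotypes):
    """Hash one sample's genotype calls over a fixed, ordered SNP panel.

    Each call is an allele count (0, 1, or 2); None marks a missing call.
    """
    # Encode each call as a single character so positions stay aligned.
    encoded = "".join("N" if g is None else str(g) for g in genotypes)
    return hashlib.sha256(encoded.encode("ascii")).hexdigest()


def find_duplicates(samples):
    """Group sample IDs that share a checksum; return groups of size > 1."""
    groups = defaultdict(list)
    for sample_id, genotypes in samples.items():
        groups[genotype_checksum(genotypes)].append(sample_id)
    return [sorted(ids) for ids in groups.values() if len(ids) > 1]


# Hypothetical toy data: S1 and S2 have identical calls.
samples = {
    "S1": [0, 1, 2, 0, 1],
    "S2": [0, 1, 2, 0, 1],
    "S3": [2, 1, 0, 0, 1],
}
duplicates = find_duplicates(samples)
print(duplicates)
```

Exact hashing like this only flags samples whose calls match at every SNP; a single genotyping error or missing call would produce a different checksum, so a production service would likely need a more tolerant scheme.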

Infrastructure

  • Batch Services: batch_services/
  • Deployment: deploy/

Getting Started

Each service contains its own README with specific setup instructions. Services are containerized with Docker and can be deployed to Google Cloud Platform.

