Skip to content

mdjaffardjy/Snakemake_workflow_analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

MIT licensed Version 1.0.1

Snakemake_workflow_analysis

In this repository is presented a Snakemake workflow analyser. It takes one or several snakefiles and it has three major libraries :

  • The snakemake parser that allows users to extract information (info on processes or and the structure) of a single or multiple Snakemake workflows. It is found in the folder src. It extracts the snakemake components of a workflow (notably processors) and helps extracting info on the workflow such as the number of processors (here, rules), as well as their code, name, inputs and outputs. It doesn't extract the structure. It also finds the tools present in each processor.
  • The second one, search_biotools finds tools matches and informations in bio.tools using a dump found in the inputs.
  • The last one, get_characteristics compiles in a dataframethe extracted information about the processors (rules) previously parsed with parsing_snkmk.

Contribute

Please submit GitHub issues to provide feedback or ask for new features, and contact us for any related question.

Install and Run

Conda environment

The conda environment can be found here.

Source data

The source data can be found in the crawl file. This file contains git metadata as well as links for the workflows. One can download the Snakefiles from this json. It is heavily recommended to store the data in the form ///.snakefile

To run

The analyzer needs as input the address of a folder containing all the snakefiles to analyze.

In a script, import parsing_snkmk and use these two functions:

import parsing_snkmk as ps

path_wf = "path/to/snakefiles/folder"

# workflow parsing
wf_list = ps.parse_wf_folder(path_wf)
dict_tool, total_extr = ps.make_dict_wf_with_biotools_dump(wf)

Your results will be in the object wf_list. You will have additional information about the tools you extracted in dict_tool and total_extr.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published