Refactored Snakemake workflow and generate example output #41

Closed
12 commits

Commits on Jul 9, 2024

  1. Update Dockerfile to use snakemake image

    Updated the Dockerfile to use a two-stage build. The Snakemake image is used as the runtime environment.
    Supports / Partially addresses #17
    alexhambley committed Jul 9, 2024
    6629446
  2. Update snakemake workflow to fix module not found bug

    Updated the Snakemake workflow to work with the Dockerfile. Also fixes the module-not-found error by exposing PYTHONPATH. Partially addresses #17
    alexhambley committed Jul 9, 2024
    13d60f4
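The second Jul 9 commit says the module-not-found error was fixed by exposing PYTHONPATH, but does not say where the variable is set. As a minimal sketch only, assuming the project root has to be importable by a child Python process, the idea can be expressed as below. The helper name `env_with_pythonpath` and the `/app` path are hypothetical and do not come from the repository.

```python
import os
from typing import Dict, Mapping, Optional


def env_with_pythonpath(
    project_root: str, base_env: Optional[Mapping[str, str]] = None
) -> Dict[str, str]:
    """Return a copy of the environment with project_root first on PYTHONPATH.

    Hypothetical helper: the commit only states that PYTHONPATH is exposed.
    """
    env = dict(os.environ if base_env is None else base_env)
    existing = env.get("PYTHONPATH", "")
    # Prepend so that the project's own modules win over same-named ones.
    if existing:
        env["PYTHONPATH"] = project_root + os.pathsep + existing
    else:
        env["PYTHONPATH"] = project_root
    return env


if __name__ == "__main__":
    # "/app" is an assumed container path, used here only for illustration.
    print(env_with_pythonpath("/app", {})["PYTHONPATH"])
```

The returned mapping can be passed as `env=` to `subprocess.run` so that scripts launched by the workflow can import the project's packages.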

Commits on Jul 11, 2024

  1. Update source_crates.py to work with all workflows, not just a range

    Partially addresses #28 and #17.
    Users can now provide a range (e.g. --workflow-ids 1-40) or omit the argument to download all workflows.
    alexhambley committed Jul 11, 2024
    08ca081
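The optional `--workflow-ids` range described in the Jul 11 commit could be parsed roughly as follows. This is a sketch under assumptions, not the code in source_crates.py: the flag name and the `1-40` form come from the commit message, while `parse_id_range`, `build_parser`, the inclusive-range interpretation, and the use of `None` to mean "all workflows" are illustrative choices.

```python
import argparse
from typing import Tuple


def parse_id_range(text: str) -> Tuple[int, int]:
    """Parse a range such as '1-40' into (1, 40). Hypothetical helper."""
    start_text, sep, end_text = text.partition("-")
    if not sep:
        raise argparse.ArgumentTypeError(f"expected START-END, got {text!r}")
    try:
        start, end = int(start_text), int(end_text)
    except ValueError:
        raise argparse.ArgumentTypeError(f"non-integer bound in {text!r}") from None
    if start < 1 or end < start:
        raise argparse.ArgumentTypeError(f"invalid range {text!r}")
    return start, end


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="Download workflow crates")
    # Omitting the flag leaves workflow_ids as None, read here as "all workflows".
    parser.add_argument(
        "--workflow-ids",
        type=parse_id_range,
        default=None,
        help="range of workflow IDs, e.g. 1-40; omit to download all",
    )
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print("all workflows" if args.workflow_ids is None else args.workflow_ids)
```

Validating the range inside the `type=` callable lets argparse report a malformed value as a normal usage error instead of a traceback.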

Commits on Jul 12, 2024

  1. Temporary fix for ValueError with multiple context URLs (#34)

    - Returns an empty context for URLs that don't match allowed_urls_pattern, as a temporary fix for the ValueError raised with multiple context URLs (#34).
    alexhambley committed Jul 12, 2024
    0d78980
  2. Addressed FromAsCasing warning in Dockerfile

    Fixes the FromAsCasing build warning: the FROM and AS keywords should use the same case.
    alexhambley committed Jul 12, 2024
    cc23121
  3. Find maximum ID from sourced workflows when sourcing all workflows

    - Added the get_max_id_from_files() method.
    - If the --workflow-ids argument is not provided, created_files.json should list every downloaded workflow; get_max_id_from_files() finds the maximum ID among them.
    - Part of a wider effort to address #17.
    alexhambley committed Jul 12, 2024
    66c47ed
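The temporary fix for #34 in the first Jul 12 commit returns an empty context for URLs that do not match `allowed_urls_pattern`. One way to picture that behaviour is sketched below. Only the name `allowed_urls_pattern` comes from the commit message; the regular expression, the `load_context` function, and the injected `fetch` callable are assumptions made for illustration.

```python
import re
from typing import Any, Callable, Dict

# Assumed pattern: accept only versioned RO-Crate context URLs.
allowed_urls_pattern = re.compile(r"^https://w3id\.org/ro/crate/\d+\.\d+/context$")


def load_context(url: str, fetch: Callable[[str], Dict[str, Any]]) -> Dict[str, Any]:
    """Return the context for an allowed URL, or an empty one otherwise.

    Hypothetical function: `fetch` stands in for whatever retrieves the document.
    """
    if not allowed_urls_pattern.match(url):
        # Temporary behaviour: ignore unknown contexts instead of raising.
        return {}
    return fetch(url)


if __name__ == "__main__":
    fake_fetch = lambda url: {"@context": {"name": "http://schema.org/name"}}
    print(load_context("https://w3id.org/ro/crate/1.1/context", fake_fetch))
    print(load_context("https://example.org/other-context", fake_fetch))
```

Returning an empty context avoids the error but silently drops any terms defined by the rejected URL, which fits the commit's description of the fix as temporary.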
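The maximum-ID lookup described in the third Jul 12 commit might look like the sketch below. The names `get_max_id_from_files` and created_files.json come from the commit message. The layout of that file is not shown there, so this version assumes a JSON list of file paths and takes the first integer in each file name as the workflow ID.

```python
import json
import re
from pathlib import Path
from typing import Union


def get_max_id_from_files(created_files_json: Union[str, Path]) -> int:
    """Return the largest workflow ID among the files listed in the JSON file.

    Assumes a JSON list of paths whose file names contain the workflow ID as
    their first integer, e.g. 'workflow-12-2.crate.zip' (hypothetical naming).
    """
    paths = json.loads(Path(created_files_json).read_text(encoding="utf-8"))
    ids = []
    for path in paths:
        match = re.search(r"\d+", Path(path).name)
        if match:
            ids.append(int(match.group()))
    if not ids:
        raise ValueError("no workflow IDs found in created files")
    return max(ids)


if __name__ == "__main__":
    import tempfile

    with tempfile.TemporaryDirectory() as tmp:
        listing = Path(tmp) / "created_files.json"
        listing.write_text(json.dumps(["data/workflow-3-1.crate.zip"]))
        print(get_max_id_from_files(listing))
```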

Commits on Jul 15, 2024

  1. Tidy up Snakefile

    Removes the list_expected_files() method.
    alexhambley committed Jul 15, 2024
    eead0f7
  2. b01db25

Commits on Jul 16, 2024

  1. Progress bar added and logging edited (#33)

    - Added a progress bar and removed line-by-line logging so that download and processing progress is easier to follow.
    - Addresses #33
    alexhambley committed Jul 16, 2024
    caa9e75
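The Jul 16 commit replaces per-item log lines with a single progress bar. The commit does not name the library used, so the following is a standard-library sketch of the idea only. `render_bar` and `with_progress` are hypothetical names.

```python
import sys
from typing import Iterable, Iterator, TextIO, TypeVar

T = TypeVar("T")


def render_bar(done: int, total: int, width: int = 30) -> str:
    """Render a fixed-width text bar such as '[#####-----] 5/10'."""
    if total <= 0:
        raise ValueError("total must be positive")
    filled = width * done // total
    return "[" + "#" * filled + "-" * (width - filled) + f"] {done}/{total}"


def with_progress(
    items: Iterable[T], total: int, stream: TextIO = sys.stderr
) -> Iterator[T]:
    """Yield items unchanged while redrawing one progress line on stream."""
    for done, item in enumerate(items, start=1):
        yield item
        # A carriage return redraws the same line instead of adding a new one.
        stream.write("\r" + render_bar(done, total))
        stream.flush()
    stream.write("\n")


if __name__ == "__main__":
    for workflow_id in with_progress(range(1, 41), total=40):
        pass  # download or process the workflow here
```

Writing the bar to stderr keeps it separate from any output the workflow writes to stdout.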

Commits on Jul 30, 2024

  1. Preliminary RO Crate functionality added (#37)

    - Added a preliminary create_ro_crate module that creates an RO-Crate for a given file.
    - Addresses #37
    alexhambley committed Jul 30, 2024
    39da012
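To illustrate what creating an RO-Crate for a given file involves, here is a minimal sketch that writes an RO-Crate 1.1 metadata file by hand with the standard library. It is not the create_ro_crate module from the Jul 30 commit. The function names and the example file name are hypothetical, and a complete crate would also describe properties such as the date published and the licence.

```python
import json
from pathlib import Path
from typing import Any, Dict, Union


def build_ro_crate_metadata(file_name: str, name: str) -> Dict[str, Any]:
    """Build a minimal RO-Crate 1.1 metadata document describing one file."""
    return {
        "@context": "https://w3id.org/ro/crate/1.1/context",
        "@graph": [
            {
                # Metadata descriptor: links this JSON file to the root dataset.
                "@id": "ro-crate-metadata.json",
                "@type": "CreativeWork",
                "conformsTo": {"@id": "https://w3id.org/ro/crate/1.1"},
                "about": {"@id": "./"},
            },
            {
                # Root dataset: the crate itself, listing the file it contains.
                "@id": "./",
                "@type": "Dataset",
                "name": name,
                "hasPart": [{"@id": file_name}],
            },
            {"@id": file_name, "@type": "File"},
        ],
    }


def write_ro_crate(crate_dir: Union[str, Path], file_name: str, name: str) -> Path:
    """Write ro-crate-metadata.json into crate_dir and return its path."""
    target = Path(crate_dir) / "ro-crate-metadata.json"
    target.write_text(
        json.dumps(build_ro_crate_metadata(file_name, name), indent=2),
        encoding="utf-8",
    )
    return target


if __name__ == "__main__":
    # "merged.ttl" is an assumed output file name, used only for illustration.
    print(json.dumps(build_ro_crate_metadata("merged.ttl", "Example crate"), indent=2))
```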

Commits on Aug 5, 2024

  1. Added create_ro_crate rule

    - Added a rule to the Snakemake workflow to create an RO-Crate.
    - Addresses #37
    alexhambley committed Aug 5, 2024
    32a3413
  2. Example RO crate

    Addresses #37
    alexhambley committed Aug 5, 2024
    51bd5c8