    Repositories list

    • Great Expectations Airflow operator
      Python
      Apache License 2.0
      56000 Updated Apr 3, 2022
    • 🛸 spaCy pipelines for pre-trained BERT, XLNet and GPT-2
      Python
      MIT License
      166200 Updated Sep 17, 2020
    • A conda-smithy repository for fasttext.
      Shell
      BSD 3-Clause "New" or "Revised" License
      12000 Updated Jan 29, 2020
    • Terraform Google Cloud Platform provider
      Go
      Mozilla Public License 2.0
      1.8k000 Updated Nov 19, 2019
    • Apache Airflow (Incubating)
      Python
      Apache License 2.0
      14k000 Updated Sep 15, 2019
    • dragnet

      Public
      Just the facts -- web page content extraction
      Python
      MIT License
      180100 Updated Jul 20, 2019
    • A place to submit conda recipes before they become fully fledged conda-forge feedstocks
      Python
      BSD 3-Clause "New" or "Revised" License
      5k000 Updated Feb 20, 2019
    • Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
      967000 Updated Jan 31, 2019
    • Bitbucket Branch Source Plugin
      Java
      354000 Updated Dec 18, 2018
    • Mirror of Apache Beam
      Java
      Apache License 2.0
      4.3k000 Updated Oct 28, 2018
    • Jenkins plugin to run dynamic slaves in a Kubernetes/Docker environment
      Java
      Apache License 2.0
      1.3k000 Updated Oct 25, 2018
    • Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
      Shell
      Apache License 2.0
      512000 Updated Sep 26, 2018
    • pubsub-to-bigquery

      Public archive
      A highly configurable Google Cloud Dataflow pipeline that writes data into Google Big Query table from Pub/Sub
      Java
      Apache License 2.0
      76700 Updated Apr 23, 2018
    • Multilingual word vectors in 78 languages
      Jupyter Notebook
      BSD 3-Clause "New" or "Revised" License
      121000 Updated Feb 7, 2018
    • Command line tool for generating a changelog from git tags and commit history
      JavaScript
      MIT License
      158000 Updated Jan 19, 2018
    • skorch

      Public
      A scikit-learn compatible neural network library that wraps pytorch
      Python
      BSD 3-Clause "New" or "Revised" License
      393000 Updated Dec 21, 2017
    • Google Cloud Client Library for Python
      Python
      Apache License 2.0
      1.5k000 Updated Oct 23, 2017
    • Code samples used on cloud.google.com
      Python
      Apache License 2.0
      6.5k000 Updated Oct 27, 2016
    • Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
      Java
      Apache License 2.0
      322000 Updated Sep 28, 2016
    • bombora-tutorials

      Public archive
      Notebooks for learning general concepts regarding Bombora interfaces, services and data schemas.
      Jupyter Notebook
      MIT License
      2100 Updated Jul 14, 2016
    • nbsphinx

      Public
      Sphinx source parser for *.ipynb files
      Python
      MIT License
      132000 Updated Jul 11, 2016
    • findspark

      Public
      Python
      BSD 3-Clause "New" or "Revised" License
      72000 Updated May 15, 2016
    • Example unit tests for Apache Spark Python scripts using the py.test framework
      Other
      45000 Updated Mar 22, 2016