Google Summer Of Code

Spend your summer doing something exciting and valuable for the open-source community, and join Google Summer of Code. Read more about how the program works on this page.

OpenVINO Toolkit has been a mentoring organization since 2022!

Announcements

Please subscribe this discussion and check it regularly for important announcements.

Prerequisite task

We require one pull request sent to our OpenVINO repository from each potential GSoC contributor before accepting participation for GSoC. We would like to see if you know how to code, use git and GitHub, and your coding style. To fulfill this requirement, please:

Visit the Good First Issues board.
Select one of the unassigned tickets ("Contributors Needed" column) and ask for the assignment.
Discuss the solution with the OpenVINO developers.
Implement it according to the contribution guide.
If you encounter any issues talk to our developers on Discord.
Create a new pull request with your work.
Wait for the review and eventual merge.

Please note the above task is mandatory. However, we reserve the right to review and merge only the selected PRs. Not merging or closing your PR doesn't change your chances of being accepted for GSoC. Due to the expected large number of requests, the review process can be delayed, so please be patient.

If you're unfamiliar with git and GitHub, check out this blog. The blog is about contributing to OpenVINO core project, but the workflow is same for all projects.

Application Template

Your application should consist of the following parts:

About you
1. Your full name
2. Your university/current enrollment
3. The timezone you live in
4. Short bio
5. Your experience in programming (especially C++ and Python)
6. Your experience in ML and DL
About the project
1. What is your choice?
2. Why did you choose this specific idea?
3. How much time do you plan to invest in the project?
4. Provide an abstract of the solution
5. Provide a detailed timeline of how you want to implement the project (include the main points you want to cover and dates)
General questions
1. How do you know OpenVINO?
2. What do you know about OpenVINO?
3. Have you already contributed to the OpenVINO project? (please include links)
4. How could you apply it to your professional development?
5. Describe any other career development plan you have for the summer in addition to GSoC.
6. Why should we pick you?
Tasks
1. Link to your pull request (for the prerequisite task – the top part of this document), even if it is already merged or closed

Proposal examples can be found here and here. Please get in touch with us early to discuss your application with the mentor.

Project ideas for 2024

1. OpenVINO Extension for Automatic1111 Stable Diffusion WebUI

Short description: Automatic1111 is a powerful web user interface based on Gradio library specifically designed for Stable Diffusion. It’s most popular open-source Stable Diffusion WebUI on GitHub with 119K+ stars, which supports a lot of features like text-to-image, image-to-image, inpainting, Lora models, custom models from model hubs like civitai.com and huggingface etc. OpenVINO support for Automatic1111 enables Stable Diffusion run on Intel CPUs and GPUs, this solution is currently supported using a custom script. Implementing OpenVINO through the Automatic1111 extension will provide an easier way to use OpenVINO. This project will also aim to provide support for more AUTOMATIC111 features with OpenVINO. Task Description:

Develop a built-in extension for Automatic1111 SD WebUI based on the existing OpenVINO custom script leveraging Diffusers library.
Support some of the new features like Hires upscalers, new samplers, tiling, face restoration etc. Develop test scripts to evaluate these features.
Evaluate with different Stable Diffusion variants (V1.5, V2.1, XL, LCM etc.) on Intel CPUs and GPUs.
Optional: Evaluate compatibility with other extensions like ControlNet.

Expected outcomes:

Raise a PR with all the contributions to the OpenVINO fork and eventually to the mainstream Automatic1111 repo.
Documentation with clear description of features in the extension and demo videos.
Medium/OpenVINO blogs.

Skills required/preferred: Python, good understanding of Stable diffusion architectures, experience with Hugging Face and Diffusers libraries, experience with PyTorch (OpenVINO is a plus), Git.

Mentors: Anna Likholat, Mustafa Cavus

Size of project: 350 hours

Difficulty: Medium to hard

2. Inference of Neural model in Node.js environment using OpenVINO

Short description: There is a new OpenVINO NPM package. You can work with a neural model directly from your Node.js application now. We propose to rework existing samples that use Python API to Node.js API, or even implement new examples using OpenVINO JS API.

Expected outcomes:

Several reworked samples from documentation
Project uses a neural model to solve the task
Project uses OpenVINO in Node.js environment

Skills required/preferred: JavaScript, Node.js, Python

Mentors: Nikolai Vishniakov, Alicja Miłoszewska

Size of project: 90 hours

Difficulty: Easy to medium

3. Desktop Chat-Bot application

Short description: Neural Language Model can work locally without the internet. You will write your own Chatbot desktop cross-platform application using OpenVINO and Electron (or analogs). The Chatbot may be general or crafted to your needs (subject to the NLP model).

Expected outcomes:

Desktop Chat-Bot application works without the internet
Project uses an NLP model
Project uses OpenVINO in Electron environment

Skills required/preferred: JavaScript, Electron

Mentors: Nikolai Vishniakov, Alicja Miłoszewska

Size of project: 175 hours

Difficulty: Medium

4. Add Image-to-Image and Image-to-Text generation with GUI support for OpenVINO GenAI

Short description: Currently, openvino.genai has C++ image generation pipeline with Stable Diffusion and LCM, but only Text-to-Image is supported. This project would add Image-to-Image generation and Image-to-Text generation support with C++ cross platform GUI.

Expected outcomes:

Enable Image-to-Image Generation with ControlNet Conditioning(Canny edge or OpenPose), based on the python OpenVINO notebook and C++ Text-to-Image pipeline.
Enable Image-to-Text generation pipeline, based on the Python CLIP image classification OpenVINO notebook.

Skills required/preferred: C++ programming, knowledge of PyTorch and Qt Creator

Mentors: Su Yang, Fiona Zhao

Size of project: 175 hours

Difficulty: Medium

5. OpenVINO CPU plugin functional test cases WASM enablement

Short description: OpenVINO has supported WASM build, but there are hundreds of thousands of CPU plugin functional test cases that cannot run based on WASM build due to there are many native issues that need to be resolved, such as multi-threads, local file access, dynamic libraries, memory sharing. We propose talents to participate in this activity to enable more WASM tests of the OpenVINO CPU plugin.

Expected outcomes:

Learn to build and apply OpenVINO WASM libraries with a C/C++ project
Enable more than 80% CPU plugin C++ functional tests based on WASM

Skills required/preferred: C++, JS, WASM and Emscripten

Mentors: River Li, Xuejun Zhai

Size of project: 350 hours

Difficulty: Medium to hard

6. OpenVINO Conditional Compilation improvement

Short description: OpenVINO adopts conditional compilation to optimize package binaries size, which has achieved considerable binarie size reduction. But conditional compilation has a limitation - it is device dependent, that to say the conditional compilation packages generated in one CPU platform cannot always run on another different CPU platform. We encourage talents to research a solution to support device agnostic condition compilation(permit slight binaries size increase), which can support the conditional compilation packages generated in one CPU platform and can run on other Intel CPU platforms.

Expected outcomes:

Learn conditional compilation and provide more improvement as possible
Enable device agnostic conditional compilation with binaries size increasing < 5%

Skills required/preferred: C++

Mentors: River Li, Xuejun Zhai

Size of project: 175 hours

Difficulty: Medium

7. OpenVINO adapters - PyTorch adapter

Short description: OpenVINO adapters' goal is to provide a lightweight layer of Python code that enables easy switching between already used framework (PyTorch) and OpenVINO - targeting inference purposes. The goal is to showcase the potential performance benefits of using OV. The change on the user side should look like import torch --> import openvino_adapters.torch and... that's it! (maybe some minor changes like excluding cuda() calls:)) The project includes the whole development cycle: creating POC, productization of the solution, adding tests, making an installable package, and creating documentation.

Expected outcomes:

openvino_adapters package that might be the full fledge open-sourced solution
Learning OpenVINO and PyTorch frameworks and creating translations between different APIs
Creating the first adapter for PyTorch

Skills required/preferred: Python, PyTorch knowledge

Mentors: Jan Iwaszkiewicz, Anastasia Kuporosova

Size of project: 175 hours

Difficulty: Medium to hard

8. Improve Generative AI workload performance on ARM devices with OpenVINO

Short description: The goal of this project is to implement set of optimizations inside OpenVINO runtime which target GenAI models (e.g. text-generation, diffusers). Optimizations should include improvements in terms of latency/throughput metrics, faster model compilation time and lower memory consumption.

Expected outcomes:

Better GenAI workloads adoption in OpenVINO on ARM devices
Better compilation and latency/throughput performance on GenAI models
Lower memory requirements to run GenAI applications

Skills required/preferred: Mac device with M1 / M2 / M3 chip, C++

Mentors: Alexandr Voron, Dmitry Gorokhov

Size of project: 350 hours

Difficulty: Medium

Projects already implemented

The projects below have already been implemented and are here just for reference. Don't select any project below for your application.

1. Train a DL model for synthetic data generation for model optimization

Short description: In most cases, DL model optimization requires the presence of real data that the user should provide for the optimization method (e.g. quantization or pruning). However, it was noticed that such methods can perform quite well on synthetic data which are not even relevant to the use case. In this task, we will create a synthetic GAN or VAE-based DL model that is capable of generating synthetic images based on the text hints provided by the user. This data will be used to evaluate the 8-bit post-training quantization method on a wide range of Computer Vision models.

Expected outcomes: DL model that generates synthetic data based on the text input.

Skills required/preferred: DL model training, understanding of GAN, VAE architectures

Mentors: Mansi Sharma

Size of project: 350 hours

Difficulty: Medium

Year: 2022

Implemented by: ThanosM97

The result: Code | Article 1 | Article 2 | Technical report

2. Object Detection demo application to run AI on ARM-based Android devices

Short description: ARM CPUs support has been added to Inference Engine via the dedicated ARM CPU plugin. ARM processors are widely used in Android smartphones, so we want to develop an Android demo application that demonstrates plugin possibilities on this platform. The demo should be written in Java and use Java wrappers to reach Inference Engine public API. We suggest reviewing the functionality of OMZ object detection demo and propagating its core functionality to the Android demo.

Expected outcomes: Android demo application with object detection functionality. Any model (or several models) from the list supported by the plugin could be used.

Skills required/preferred: Practical experience with developing Java apps for Android, at least a basic understanding of computer vision and deep learning technologies, enough to run a network and make it run at real-time speed

Mentors: Junwen Wu

Size of project: 350 hours

Difficulty: Medium

Year: 2022

Implemented by: IRONICBo

The result: Code | Article

3. 3D Jupyter Notebooks and WebGL Integration for OpenVINO

Short description: The world is 3D, but in AI development, we often work on flat 2D displays with flat data visualization plots and charts. In machine learning, many of the tasks may be much better understood if we provide a 3D or 4D (space and time) perspective. In this project, we will answer this question by providing the beginner-friendly Jupyter 3D engine for machine learning visualization along with WebGL integration. Our main goal is not only to make 3D or 4D datasets easier to visualize, but also to make machine learning easier to understand in a more humanistic way.

Expected outcomes: Working 3D support for Jupyter Notebooks running OpenVINO (AI inference). For example, visualizing 3D body pose and characters, visualization 2D-3D mapping, and also providing a clean interface to set up these without a 3D engine or graphics programming background.

Skills required/preferred: Understanding of graphics pipelines (GPU programming is a plus), Software Engineer background and code releasing, Python, C++

Mentors: Raymond Lo

Size of project: 350 hours

Difficulty: Medium to hard

Year: 2022

Implemented by: spencergotowork

The result: Code | Article

4. Showcase performance of PyTorch Image Models (timm) with OpenVINO

Short description: DL model inference performance has been a hop trend in recent years aimed at bringing AI into real-world applications. And OpenVINO is a famous DL inference solution that delivers best-in-class performance on Intel Architectures. In this task, we will provide a Jupyter notebook/tutorial where we showcase inference performance on the set of popular DL models that come from the PyTorch Image Models project on GitHub (timm). The tutorial will provide details on installing OpenVINO, converting models to OpenVINO Intermediate Representation, and properly benchmarking under the various settings.

Expected outcomes: Pull-request with the tutorial into PyTorch Image Models.

Skills required/preferred: DL basics, understanding of ML model optimization

Mentors: Alexander Kozlov, Liubov Talamanova

Size of project: 175 hours

Difficulty: Easy

Year: 2023

Implemented by: sawradip

The result: Code | Article

5. Add OpenVINO support to John Snow Labs Spark NLP

Short description: Spark NLP is an open-source NLP library in production that offers state-of-the-art transformers at scale by extending Apache Spark natively. It supports Python, R as well as JVM ecosystems (Java, Scala, and Kotlin). It is available on PyPI, Conda, and Maven and ships with many NLP features, pre-trained models and pipelines. Currently, CPU optimization is realized via Intel Optimized Tensorflow. The goal of this project is to add support for OpenVINO in Spark NLP, such that users can create and deploy optimized NLP models on Intel hardware through OpenVINO. The project will involve the following tasks:

Explore OpenVINO for Java solution and use JNI to load and infer OpenVINO IR models in Java. Identify additional ops and configs or additional engineering work that is needed for this JNI implementation.
Benchmarking on some representative pre-trained models.

This project would benefit the Spark-NLP developers to take advantage of OpenVINO's optimization capabilities on Intel hardware. It would also benefit Intel and the OpenVINO community by expanding the reach of OpenVINO to a wider range of use cases and developers.

Expected outcomes:

Pull request to the Spark-NLP repo for OpenVINO integration
Documentation and sample scripts to demonstrate how to use OpenVINO with Spark-NLP
Summary on model/operator coverages for Spark-NLP OpenVINO integration (optional)

Skills required/preferred: Java knowledge, basic knowledge of OpenVINO framework, basic knowledge of Spark. Experience with NLP deep learning models, such as BERT etc.

Mentors: Junwen Wu, Ravi Panchumarthy

Size of project: 350 hours

Difficulty: Medium

Year: 2023

Implemented by: rajatkrishna

The result: Code | Article 1 | Article 2 | Article 3

6. Industrial meter reading with OpenVINO

Short description: Automatic industrial meter reading could help utility providers (such as gas or electricity providers) and manufacturers monitor the real-time status data from industrial meters and enable further diagnosis of possible anomalies without periodic trips to each physical location to read a meter. We have provided code examples of automatic meter reading using the PaddlePaddle DL framework for analog and digital meters. To extend the automatic meter reading solutions, we’d like to see solutions using the DL framework on TensorFlow or PyTorch and supporting multi-task input with GUI. The solution pipeline could follow our code examples on the OpenVINO Notebooks repository.

Expected outcomes: In the first stage, we expect a demo repository including testing datasets, demo codes, and test results for metering reading with OpenVINO, using DL models from TensorFlow or PyTorch frameworks. The result should cover contributions to Open Model Zoo demos and can be represented as a notebook as well. We can follow the workflow and reuse the functions in 203-meter-reader and 405-paddle-ocr-webcam, and combine the new work with these notebooks.

In the second stage, this demo can be refined to support multiple camera inputs with GUI, making it more like a real solution for the industry. In this case, the features of OpenVINO, like async API and performance hints, can help improve multi-task workload performance. With GUI, we can demonstrate a live demo with some web/IP cameras for a better user experience. Tools like Flask, HTML, or Java Script can help us quickly create an interactive user interface.

Skills required/preferred: Python programming, knowledge of popular frameworks for DL - PyTorch or/and TensorFlow, Flask, HTML, Java Script

Mentors: Ethan Yang, Zhuo Wu

Size of project: 350 hours

Difficulty: Medium

Year: 2023

Implemented by: ashish-2005

The result: Report | Code | Article

7. Add ensembling methods for tiling to Anomalib

Short description: Detecting small defects in high-resolution images can be a challenging task for anomaly detection models. Anomalib currently has a tiling mechanism in place to improve the detection capabilities for such datasets, which involves dividing the input images into a grid of tiles which are then processed separately by the model. A limitation of the tiling mechanism is that a single model is trained on all tile locations combined, leaving the approach ineffective for locally-aware models that require a fixed position and orientation of the objects in the images. For such models, an ensemble approach would be required. The idea is to train separate models for each of the tile locations and combine the predictions of the models in the post-processing stage. This project involves adding such an ensemble approach to tiling to the Anomalib library.

Expected outcomes: Data-to-prediction pipeline incorporating ensemble-based tiling

Skills required/preferred: ML basics, Python

Mentors: Dick Ameln, Samet Akcay

Size of project: 175 hours

Difficulty: Medium

Year: 2023

Implemented by: blaz-r

The result: Report | Code | Article

8. Anomaly Segmentation Metrics for anomalib

Short description: This project proposes novel evaluation metrics for anomaly segmentation in computer vision, taking into account pixel-level and spatial information. The aim is to provide a more comprehensive evaluation of anomaly segmentation algorithms, aiding researchers and practitioners in selecting and fine-tuning models. The first proposed metric is the False Positive Blob Relative Volume (FP-BRV), which accounts for the visual nuisance of false positive pixels, complementing the Per-Region Overlap (PRO). The proposed metric will be evaluated on popular anomaly segmentation public datasets and visually validated. Milestones: prototype implementation, testing and validation on public datasets, production implementation, optimization/unit testing/documentation, and research paper writing. See "Section 5 Detailed project proposal" in the PDF for details and a timeline.

Expected outcomes: A new metric for segmentation prediction

Skills required/preferred: ML basics, Python

Mentors: Samet Akcay, Dick Ameln

Size of project: 175 hours

Difficulty: Medium

Year: 2023

Implemented by: jpcbertoldo

The result: Report | Demo

Contribution guidelines

Contribution guidelines can be found here.

Contact us

Open OpenVINO discussions tab
Start a new discussion by pushing the green button (if you cannot see the button, it means you're not logged in)
Select a "Google Summer of Code" category and add the "gsoc" label
Ask your question (please be aware everything you post there is publicly available)

Please get in touch with us early to discuss your application with the mentor. Mentors will do their best to reply to all contributors, but due to a large contributor interest this year, they may not be able to respond to all inquiries

Home
General resources
- Getting started
- Contribute
  - Google Summer of Code
How to build
Developer documentation
- Inference Engine architecture
- CPU plugin
- GPU plugin
- HETERO plugin architecture
- Snippets
- Sample for IE C++/C/Python API
- Proxy plugin (Concept)
Tests

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Google Summer Of Code

Announcements

Prerequisite task

Application Template

Project ideas for 2024

1. OpenVINO Extension for Automatic1111 Stable Diffusion WebUI

2. Inference of Neural model in Node.js environment using OpenVINO

3. Desktop Chat-Bot application

4. Add Image-to-Image and Image-to-Text generation with GUI support for OpenVINO GenAI

5. OpenVINO CPU plugin functional test cases WASM enablement

6. OpenVINO Conditional Compilation improvement

7. OpenVINO adapters - PyTorch adapter

8. Improve Generative AI workload performance on ARM devices with OpenVINO

Projects already implemented

1. Train a DL model for synthetic data generation for model optimization

2. Object Detection demo application to run AI on ARM-based Android devices

3. 3D Jupyter Notebooks and WebGL Integration for OpenVINO

4. Showcase performance of PyTorch Image Models (timm) with OpenVINO

5. Add OpenVINO support to John Snow Labs Spark NLP

6. Industrial meter reading with OpenVINO

7. Add ensembling methods for tiling to Anomalib

8. Anomaly Segmentation Metrics for anomalib

Contribution guidelines

Contact us

Clone this wiki locally