name | description | authors | links | colaboratory | update |
---|---|---|---|---|---|
DiffBIR | Towards Blind Image Restoration with Generative Diffusion Prior | 23.10.2023 | |||
ESM | Evolutionary Scale Modeling: Pretrained language models for proteins | 20.10.2023 | |||
Show-1 | Hybrid model, dubbed as Show-1, which marries pixel-based and latent-based VDMs for text-to-video generation | 15.10.2023 | |||
LLaVA | Large Language and Vision Assistant, an end-to-end trained large multimodal model that connects a vision encoder and LLM for general-purpose visual and language understanding | 14.10.2023 | |||
AudioSep | Foundation model for open-domain audio source separation with natural language queries | 12.10.2023 | |||
DA-CLIP | Degradation-aware vision-language model to better transfer pretrained vision-language models to low-level vision tasks as a universal framework for image restoration | 11.10.2023 | |||
SadTalker | Generates 3D motion coefficients of the 3DMM from audio and implicitly modulates a novel 3D-aware face render for talking head generation | 10.10.2023 | |||
Musika | Music generation system that can be trained on hundreds of hours of music using a single consumer GPU, and that allows for much faster than real-time generation of music of arbitrary length on a consumer CPU | 09.10.2023 | |||
YOLOv6 | Single-stage object detection framework dedicated to industrial applications | 08.10.2023 | |||
DreamGaussian | Algorithm to convert 3D Gaussians into textured meshes and apply a fine-tuning stage to refine the details | 04.10.2023 | |||
Qwen-VL | Set of large-scale vision-language models designed to perceive and understand both text and images | 22.09.2023 | |||
Würstchen | Architecture for text-to-image synthesis that combines competitive performance with unprecedented cost-effectiveness for large-scale text-to-image diffusion models | 21.09.2023 | |||
ICON | Given a set of images, method estimates a detailed 3D surface from each image and then combines these into an animatable avatar | 31.08.2023 | |||
DINOv2 | Produce high-performance visual features that can be directly employed with classifiers as simple as linear layers on a variety of computer vision tasks; these visual features are robust and perform well across domains without any requirement for fine-tuning | 31.08.2023 | |||
Neuralangelo | Framework for high-fidelity 3D surface reconstruction from RGB video captures | 27.08.2023 | |||
GHOST | One-shot pipeline for image-to-image and image-to-video face swap solutions | 22.08.2023 | |||
DALL·E Mini | Generate images from a text prompt | 22.08.2023 | |||
OWL-ViT | Simple Open-Vocabulary Object Detection with Vision Transformers | 21.08.2023 | |||
Wav2Lip | A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild | 19.08.2023 | |||
VALL-E X | Cross-lingual neural codec language model for cross-lingual speech synthesis | 14.08.2023 | |||
StyleGAN3 | Alias-Free Generative Adversarial Networks | 13.08.2023 | |||
Gaussian Splatting | Achieves state-of-the-art visual quality while maintaining competitive training times and, importantly, allows high-quality real-time (≥ 100 fps) novel-view synthesis at 1080p resolution | 12.08.2023 | |||
Kandinsky 2.1 | Uses a CLIP model as the text and image encoder and a diffusion image prior between the latent spaces of the CLIP modalities | 07.08.2023 | |||
Parallel WaveGAN | State-of-the-art non-autoregressive models to build your own great vocoder | Tomoki Hayashi | 07.08.2023 | ||
Big GAN | Large Scale GAN Training for High Fidelity Natural Image Synthesis | 03.08.2023 | |||
FILM | A frame interpolation algorithm that synthesizes multiple intermediate frames from two input images with large in-between motion | 03.08.2023 | |||
CoTracker | Architecture that jointly tracks multiple points throughout an entire video | 30.07.2023 | |||
AudioLDM | Text-to-audio system that is built on a latent space to learn the continuous audio representations from contrastive language-audio pretraining latents | 25.07.2023 | |||
HiDT | A generative image-to-image model and a new upsampling scheme that allows image translation to be applied at high resolution | 24.07.2023 | |||
CutLER | Simple approach for training unsupervised object detection and segmentation models | 24.07.2023 | |||
AlphaFold | Highly accurate protein structure prediction | 20.07.2023 | |||
Recognize Anything & Tag2Text | Vision language pre-training framework, which introduces image tagging into vision-language models to guide the learning of visual-linguistic features | 09.07.2023 | |||
Thin-Plate Spline Motion Model | End-to-end unsupervised motion transfer framework | 07.07.2023 | |||
DragGAN | Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold | 03.07.2023 | |||
Fast Segment Anything | CNN Segment Anything Model trained using only 2% of the SA-1B dataset published by SAM authors | 30.06.2023 | |||
MobileSAM | Towards Lightweight SAM for Mobile Applications | 30.06.2023 | |||
Grounding DINO | Marrying DINO with Grounded Pre-Training for Open-Set Object Detection | 28.06.2023 | |||
T5X | Modular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models at many scales | 27.06.2023 | |||
First Order Motion Model for Image Animation | Transferring facial movements from video to image | Aliaksandr Siarohin | 04.06.2023 | ||
TabPFN | Neural network that learned to do tabular data prediction | 31.05.2023 | |||
MMS | The Massively Multilingual Speech project expands speech technology from about 100 languages to over 1000 by building a single multilingual speech recognition model supporting over 1100 languages, language identification models able to identify over 4000 languages, pretrained models supporting over 1400 languages, and text-to-speech models for over 1100 languages | 26.05.2023 | |||
DFL-Colab | This project provides an IPython notebook for using DeepFaceLab | chervonij | 30.04.2023 | ||
FAB | Flow AIS Bootstrap uses AIS to generate samples in regions where the flow is a poor approximation of the target, facilitating the discovery of new modes | 29.04.2023 | |||
MiniGPT-4 | Enhancing Vision-language Understanding with Advanced Large Language Models | 23.04.2023 | |||
CodeFormer | Transformer-based prediction network to model global composition and context of the low-quality faces for code prediction, enabling the discovery of natural faces that closely approximate the target faces even when the inputs are severely degraded | 21.04.2023 | |||
Text2Video-Zero | Text-to-Image Diffusion Models are Zero-Shot Video Generators | 11.04.2023 | |||
Segment Anything | The Segment Anything Model produces high quality object masks from input prompts such as points or boxes, and it can be used to generate masks for all objects in an image | 10.04.2023 | |||
EVA3D | High-quality unconditional 3D human generative model that only requires 2D image collections for training | 06.04.2023 | |||
Stable Dreamfusion | Using a pretrained 2D text-to-image diffusion model to perform text-to-3D synthesis | 04.04.2023 | |||
UniFormer | Unified Transformer for Efficient Spatiotemporal Representation Learning | 31.03.2023 | |||
PIFuHD | Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization | 26.03.2023 | |||
AudioLM | Framework for high-quality audio generation with long-term consistency | 23.03.2023 | |||
Visual ChatGPT | Connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting | 15.03.2023 | |||
Tune-A-Video | One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation | 23.02.2023 | |||
LaMa | Resolution-robust Large Mask Inpainting with Fourier Convolutions | 15.02.2023 | |||
GPEN | GAN Prior Embedded Network for Blind Face Restoration in the Wild | 15.02.2023 | |||
Disco Diffusion | A frankensteinian amalgamation of notebooks, models and techniques for the generation of AI Art and Animations | 11.02.2023 | |||
Open-Unmix | A deep neural network reference implementation for music source separation, applicable for researchers, audio engineers and artists | 09.02.2023 | |||
GrooVAE | Some applications of machine learning for generating and manipulating beats and drum performances | 01.02.2023 | |||
Multitrack MusicVAE | The models in this notebook are capable of encoding and decoding single measures of up to 8 tracks, optionally conditioned on an underlying chord | 01.02.2023 | |||
MusicVAE | A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music | 01.02.2023 | |||
Learning to Paint | Learning to Paint With Model-based Deep Reinforcement Learning | Manuel Romero | 01.02.2023 | ||
VALL-E | Language modeling approach for text to speech synthesis | 18.01.2023 | |||
Instant-NGP | Instant Neural Graphics Primitives with a Multiresolution Hash Encoding | 18.01.2023 | |||
Fourier Feature Networks | Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains | 17.01.2023 | |||
HybrIK | Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation | 01.01.2023 | |||
Demucs | Hybrid Spectrogram and Waveform Source Separation | Alexandre Défossez | 21.11.2022 | ||
MotionDiffuse | The first diffusion model-based text-driven motion generation framework, which demonstrates several desired properties over existing methods | 13.10.2022 | |||
VToonify | Leverages the mid- and high-resolution layers of StyleGAN to render high-quality artistic portraits based on the multi-scale content features extracted by an encoder to better preserve the frame details | 07.10.2022 | |||
PyMAF | Pyramidal Mesh Alignment Feedback loop in regression network for well-aligned body mesh recovery and extend it for the recovery of expressive full-body models | 06.10.2022 | |||
AlphaTensor | Discovering faster matrix multiplication algorithms with reinforcement learning | 04.10.2022 | |||
Swin2SR | Novel Swin Transformer V2, to improve SwinIR for image super-resolution, and in particular, the compressed input scenario | 03.10.2022 | |||
Functa | From data to functa: Your data point is a function and you can treat it like one | 24.09.2022 | |||
Whisper | Automatic speech recognition system trained on 680,000 hours of multilingual and multitask supervised data collected from the web | 21.09.2022 | |||
DeOldify (video) | Colorize your own videos! | Jason Antic | 19.09.2022 | ||
DeOldify (photo) | Colorize your own photos! | 19.09.2022 | |||
Real-ESRGAN | Extend the powerful ESRGAN to a practical restoration application, which is trained with pure synthetic data | 18.09.2022 | |||
IDE-3D | Interactive Disentangled Editing for High-Resolution 3D-aware Portrait Synthesis | 08.09.2022 | |||
Decision Transformers | An architecture that casts the problem of RL as conditional sequence modeling | 06.09.2022 | |||
Dream Fields | Zero-Shot Text-Guided Object Generation | 05.09.2022 | |||
GANgealing | Framework for learning discriminative models and their GAN-generated training data jointly end-to-end | 01.09.2022 | |||
textual-inversion | An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion | 21.08.2022 | |||
StyleGAN-Human | A Data-Centric Odyssey of Human Generation | 19.08.2022 | |||
Make-A-Scene | Scene-Based Text-to-Image Generation with Human Priors | 12.08.2022 | |||
StyleGAN-NADA | Zero-Shot non-adversarial domain adaptation of pre-trained generators | 09.08.2022 | |||
YOLOv7 | Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors | 09.08.2022 | |||
GLIP | Grounded language-image pre-training model for learning object-level, language-aware, and semantic-rich visual representations | 30.07.2022 | |||
Anycost GAN | Interactive natural image editing | 20.07.2022 | |||
GFPGAN | Towards Real-World Blind Face Restoration with Generative Facial Prior | 13.07.2022 | |||
EPro-PnP | Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation | 12.07.2022 | |||
VQ-Diffusion | Based on a VQ-VAE whose latent space is modeled by a conditional variant of the recently developed Denoising Diffusion Probabilistic Model | 30.06.2022 | |||
OPT | Open Pre-trained Transformers is a family of NLP models trained on billions of tokens of text obtained from the internet | 29.06.2022 | |||
Customizing a Transformer Encoder | We will learn how to customize the encoder to employ new network architectures | Chen Chen | 22.06.2022 | ||
MTTR | End-to-End Referring Video Object Segmentation with Multimodal Transformers | 20.06.2022 | |||
SwinIR | Image Restoration Using Swin Transformer | 17.06.2022 | |||
VRT | A Video Restoration Transformer | 15.06.2022 | |||
Omnivore | A single model which excels at classifying images, videos, and single-view 3D data using exactly the same model parameters | 14.06.2022 | |||
Detic | Detecting Twenty-thousand Classes using Image-level Supervision | 07.06.2022 | |||
AMARETTO | Multiscale and multimodal inference of regulatory networks to identify cell circuits and their drivers shared and distinct within and across biological systems of human disease | 01.06.2022 | |||
T0 | Multitask Prompted Training Enables Zero-Shot Task Generalization | 29.05.2022 | |||
AvatarCLIP | A zero-shot text-driven framework for 3D avatar generation and animation | 15.05.2022 | |||
Text2Mesh | Text-Driven Neural Stylization for Meshes | 14.05.2022 | |||
T5 | Text-To-Text Transfer Transformer | 11.05.2022 | |||
XLS-R | Self-supervised Cross-lingual Speech Representation Learning at Scale | 10.05.2022 | |||
DiffCSE | Unsupervised contrastive learning framework for learning sentence embeddings | 24.04.2022 | |||
ViDT+ | An Extendable, Efficient and Effective Transformer-based Object Detector | 20.04.2022 | |||
NAFNet | Nonlinear Activation Free Network for Image Restoration | 15.04.2022 | |||
Panini-Net | GAN Prior based Degradation-Aware Feature Interpolation for Face Restoration | 13.04.2022 | |||
Deep Painterly Harmonization | Algorithm that produces significantly better results than photo compositing or global stylization techniques and enables creative painterly edits that would otherwise be difficult to achieve | 07.04.2022 | |||
E2FGVI | An End-to-End framework for Flow-Guided Video Inpainting through elaborately designed three trainable modules, namely, flow completion, feature propagation, and content hallucination modules | 06.04.2022 | |||
LDM | High-Resolution Image Synthesis with Latent Diffusion Models | 04.04.2022 | |||
GP-UNIT | Novel framework, Generative Prior-guided UNsupervised Image-to-image Translation, to improve the overall quality and applicability of the translation algorithm | 02.04.2022 | |||
DualStyleGAN | More challenging exemplar-based high-resolution portrait style transfer by introducing a novel DualStyleGAN with flexible control of dual styles of the original face domain and the extended artistic portrait domain | 24.03.2022 | |||
CLIPasso | Semantically-Aware Object Sketching | 21.03.2022 | |||
StyleSDF | A high resolution, 3D-consistent image and shape generation technique | 05.03.2022 | |||
VideoGPT | A conceptually simple architecture for scaling likelihood based generative modeling to natural videos | 02.03.2022 | |||
Disentangled Lifespan Face Synthesis | LFS model is proposed to disentangle the key face characteristics including shape, texture and identity so that the unique shape and texture age transformations can be modeled effectively | 22.02.2022 | |||
Mask2Former | Masked-attention Mask Transformer for Universal Image Segmentation | 09.02.2022 | |||
SpecVQGAN | Taming the visually guided sound generation by shrinking a training dataset to a set of representative vectors | 03.02.2022 | |||
JoJoGAN | One Shot Face Stylization | 02.02.2022 | |||
Pose with Style | Detail-Preserving Pose-Guided Image Synthesis with Conditional StyleGAN | 19.01.2022 | |||
ConvNeXt | A pure ConvNet model constructed entirely from standard ConvNet modules | 19.01.2022 | |||
diffsort | Differentiable Sorting Networks | 17.01.2022 | |||
Taming Transformers for High-Resolution Image Synthesis | We combine the efficiency of convolutional approaches with the expressivity of transformers by introducing a convolutional VQGAN, which learns a codebook of context-rich visual parts, whose composition is modeled with an autoregressive transformer | 13.01.2022 | |||
FuseDream | Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization | 02.01.2022 | |||
GLIDE | Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models | 22.12.2021 | |||
Music Composer | Synthesizing symbolic music in MIDI format using the Music Transformer model | bazanovvanya | 20.12.2021 | ||
PoolFormer | MetaFormer Is Actually What You Need for Vision | 05.12.2021 | |||
HyperStyle | A hypernetwork that learns to modulate StyleGAN's weights to faithfully express a given image in editable regions of the latent space | 03.12.2021 | |||
encoder4editing | Designing an Encoder for StyleGAN Image Manipulation | 02.12.2021 | |||
StyleCariGAN | Caricature Generation via StyleGAN Feature Map Modulation | 30.11.2021 | |||
CartoonGAN | The implementation of the cartoon GAN model with PyTorch | Tobias Sunderdiek | 24.11.2021 | ||
SimSwap | An efficient framework, called Simple Swap, aiming for generalized and high fidelity face swapping | 24.11.2021 | |||
RVM | Robust High-Resolution Video Matting with Temporal Guidance | 24.11.2021 | |||
AnimeGANv2 | An improved version of AnimeGAN - it prevents the generation of high-frequency artifacts by simply changing the normalization of features in the network | 17.11.2021 | |||
SOAT | StyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGAN | 13.11.2021 | |||
Arnheim | Generative Art Using Neural Visual Grammars and Dual Encoders | 11.11.2021 | |||
Reformer | Performs on par with Transformer models while being much more memory-efficient and much faster on long sequences | 07.11.2021 | |||
StyleGAN 2 | Generation of faces, cars, etc. | Mikael Christensen | 05.11.2021 | ||
ruDALL·E | Generate images from texts in Russian | Alex Shonenkov | 03.11.2021 | ||
ByteTrack | Multi-Object Tracking by Associating Every Detection Box | 30.10.2021 | |||
GPT-2 | Retrain an advanced text generating neural network on any text dataset using gpt-2-simple! | Max Woolf | 18.10.2021 | ||
ConvMixer | An extremely simple model that is similar in spirit to the ViT and the even-more-basic MLP-Mixer in that it operates directly on patches as input, separates the mixing of spatial and channel dimensions, and maintains equal size and resolution throughout the network | 05.10.2021 | |||
IC-GAN | Instance-Conditioned GAN | 01.10.2021 | |||
Skillful Precipitation Nowcasting Using Deep Generative Models of Radar | Open-sourced dataset and model snapshot for precipitation nowcasting | 29.09.2021 | |||
Live Speech Portraits | Real-Time Photorealistic Talking-Head Animation | 26.09.2021 | |||
StylEx | Training a GAN to explain a classifier in StyleSpace | 25.08.2021 | |||
VITS | Parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models | 23.08.2021 | |||
Bringing Old Photo Back to Life | Restoring old photos that suffer from severe degradation through a deep learning approach | 13.07.2021 | |||
PTI | Pivotal Tuning Inversion enables employing off-the-shelf latent based semantic editing techniques on real images using StyleGAN | 01.07.2021 | |||
TediGAN | Framework for multi-modal image generation and manipulation with textual descriptions | 30.06.2021 | |||
GANs N' Roses | Stable, Controllable, Diverse Image to Image Translation | 19.06.2021 | |||
Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes | A method to stylize images by optimizing parameterized brushstrokes instead of pixels | 02.06.2021 | |||
Pixel2Style2Pixel | Encoding in Style: A StyleGAN Encoder for Image-to-Image Translation | 01.06.2021 | |||
Fine-tuning a BERT | We will work through fine-tuning a BERT model using the tensorflow-models PIP package | 24.05.2021 | |||
ReStyle | A Residual-Based StyleGAN Encoder via Iterative Refinement | 21.05.2021 | |||
Motion Representations for Articulated Animation | Novel motion representations for animating articulated objects consisting of distinct parts | 29.04.2021 | |||
SAM | Age Transformation Using a Style-Based Regression Model | 26.04.2021 | |||
SkinDeep | Remove Body Tattoo Using Deep Learning | Vijish Madhavan | 24.04.2021 | ||
Geometry-Free View Synthesis | Is a geometric model required to synthesize novel views from a single image? | 22.04.2021 | |||
NeRViS | An algorithm for full-frame video stabilization by first estimating dense warp fields | 11.04.2021 | |||
NeX | View synthesis based on enhancements of multiplane image that can reproduce NeXt-level view-dependent effects in real time | 25.03.2021 | |||
Score SDE | Score-Based Generative Modeling through Stochastic Differential Equations | 18.03.2021 | |||
Big Sleep | Text to image generation, using OpenAI's CLIP and a BigGAN | Phil Wang | 17.03.2021 | ||
Deep Daze | Text to image generation using OpenAI's CLIP and Siren | Phil Wang | 17.03.2021 | ||
Talking Head Anime from a Single Image | The network takes as input an image of an anime character's face and a desired pose, and it outputs another image of the same character in the given pose | Pramook Khungurn | 23.02.2021 | ||
NFNet | An adaptive gradient clipping technique, a significantly improved class of Normalizer-Free ResNets | 17.02.2021 | |||
CLIP | A neural network which efficiently learns visual concepts from natural language supervision | 29.01.2021 | |||
Adversarial Patch | A method to create universal, robust, targeted adversarial image patches in the real world | Tom Brown | 27.01.2021 | ||
MSG-Net | Multi-style Generative Network with a novel Inspiration Layer, which retains the functionality of optimization-based approaches and has the fast speed of feed-forward networks | 25.01.2021 | |||
Neural Style Transfer | Implementation of Neural Style Transfer in Keras 2.0+ | Somshubra Majumdar | 22.01.2021 | ||
SkyAR | A vision-based method for video sky replacement and harmonization, which can automatically generate realistic and dramatic sky backgrounds in videos with controllable styles | Zhengxia Zou | 18.01.2021 | ||
MusicXML Documentation | The goal of this notebook is to explore one of the magenta libraries for music | 08.01.2021 | |||
SVG VAE | A colab demo for the SVG VAE model | Raphael Gontijo Lopes | 08.01.2021 | ||
Neural Magic Eye | Learning to See and Understand the Scene Behind an Autostereogram | 01.01.2021 | |||
Flow-edge Guided Video Completion | Method first extracts and completes motion edges, and then uses them to guide piecewise-smooth flow completion with sharp edges | 30.12.2020 | |||
ArtLine | A Deep Learning based project for creating line art portraits | Vijish Madhavan | 24.12.2020 | ||
WikiArt (stylegan2-ada) | Generation of paintings of different styles and genres | Doron Adler | 08.12.2020 | ||
GANSpace | A simple technique to analyze GANs and create interpretable controls for image synthesis, such as change of viewpoint, aging, lighting, and time of day | 06.12.2020 | |||
SeFa | A closed-form approach for unsupervised latent semantic factorization in GANs | 06.12.2020 | |||
Stylized Neural Painting | An image-to-painting translation method that generates vivid and realistic painting artworks with controllable styles | 01.12.2020 | |||
MakeItTalk | A method that generates expressive talking-head videos from a single facial image with audio as the only input | 10.11.2020 | |||
LaSAFT | Latent Source Attentive Frequency Transformation for Conditioned Source Separation | Woosung Choi | 01.11.2020 | ||
Lifespan Age Transformation Synthesis | Multi-domain image-to-image generative adversarial network architecture, whose learned latent space models a continuous bi-directional aging process | 31.10.2020 | |||
HiGAN | Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis | 14.10.2020 | |||
InterFaceGAN | Interpreting the Latent Space of GANs for Semantic Face Editing | 13.10.2020 | |||
Faceswap-GAN | A minimum demo for faceswap-GAN v2.2 | shaoanlu | 12.09.2020 | ||
Instance-aware Image Colorization | Novel deep learning framework to achieve instance-aware colorization | Jheng-Wei Su | 30.08.2020 | ||
MoCo | Momentum Contrast for unsupervised visual representation learning | 20.08.2020 | |||
Rewriting a Deep Generative Model | We ask if a deep network can be reprogrammed to follow different rules, by enabling a user to directly change the weights, instead of training with a data set | 31.07.2020 | |||
BERT score | An automatic evaluation metric for text generation | Tianyi Zhang | 17.07.2020 | ||
SIREN | Implicit Neural Representations with Periodic Activation Functions | 24.06.2020 | |||
PIFu | Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization | 17.06.2020 | |||
3D Ken Burns | A reference implementation of 3D Ken Burns Effect from a Single Image using PyTorch - given a single input image, it animates this still image with a virtual camera scan and zoom subject to motion parallax | Manuel Romero | 13.06.2020 | ||
HRFAE | An encoder-decoder architecture for face age editing | 14.05.2020 | |||
Jukebox | A neural net that generates music, including rudimentary singing, as raw audio in a variety of genres and artist styles | Christine McLeavey | 04.05.2020 | ||
3D Photo Inpainting | Method for converting a single RGB-D input image into a 3D photo, i.e., a multi-layer representation for novel view synthesis that contains hallucinated color and depth structures in regions occluded in the original view | 04.05.2020 | |||
Global Flow Local Attention | Differentiable global-flow local-attention framework to reassemble the inputs at the feature level | 30.04.2020 | |||
Motion Supervised co-part Segmentation | A self-supervised deep learning method for co-part segmentation | 07.04.2020 | |||
Onsets and Frames | Onsets and Frames is an automatic music transcription framework with piano and drums models | 02.04.2020 | |||
WikiArt (stylegan2) | Generation of paintings of different styles and genres | Doron Adler | 27.01.2020 | ||
Siamese NN | Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task | Tomasz Latkowski | 19.12.2019 | ||
Generating Piano Music with Transformer | This Colab notebook lets you play with pretrained Transformer models for piano music generation, based on the Music Transformer | 16.09.2019 | |||
GMCNN | Generative Multi-column Convolutional Neural Networks inpainting model in Keras | 09.08.2019 | |||
BERT with TPU | Using a free Colab Cloud TPU to fine-tune sentence and sentence-pair classification tasks built on top of pretrained BERT models and run predictions on tuned model | Sourabh Bajaj | 29.03.2019 | ||
GANSynth | This notebook is a demo of GANSynth, which generates audio with Generative Adversarial Networks | Jesse Engel | 25.02.2019 | ||
Latent Constraints | Conditional Generation from Unconditional Generative Models | 27.11.2017 | |||
Performance RNN | This notebook shows you how to generate new performed compositions from a trained model | 11.07.2017 | |||
NSynth | This colab notebook has everything you need to upload your own sounds and use NSynth models to reconstruct and interpolate between them | 06.04.2017 |
name | description | authors | links | colaboratory | update |
---|---|---|---|---|---|
AnimateDiff | Practical framework to animate most of the existing personalized text-to-image models once and for all, saving efforts in model-specific tuning | 30.10.2023 | |||
Building Your Own Federated Learning Algorithm | We discuss how to implement federated learning algorithms without deferring to the tff.learning API | Zachary Charles | 25.10.2023 | ||
Federated Learning for Image Classification | We use the classic MNIST training example to introduce the Federated Learning API layer of TFF, tff.learning - a set of higher-level interfaces that can be used to perform common types of federated learning tasks, such as federated training, against user-supplied models implemented in TensorFlow | Krzysztof Ostrowski | 25.10.2023 | ||
Federated Learning for Text Generation | We start with an RNN that generates ASCII characters, and refine it via federated learning | Krzysztof Ostrowski | 25.10.2023 | ||
Custom Federated Algorithms, Part 1: Introduction to the Federated Core | This tutorial is the first part of a two-part series that demonstrates how to implement custom types of federated algorithms in TensorFlow Federated using the Federated Core - a set of lower-level interfaces that serve as a foundation upon which we have implemented the Federated Learning layer | Krzysztof Ostrowski | 25.10.2023 | ||
Custom Federated Algorithms, Part 2: Implementing Federated Averaging | This tutorial is the second part of a two-part series that demonstrates how to implement custom types of federated algorithms in TFF using the Federated Core, which serves as a foundation for the Federated Learning layer | Krzysztof Ostrowski | 25.10.2023 | ||
TFF for Federated Learning Research: Model and Update Compression | We use the EMNIST dataset to demonstrate how to enable lossy compression algorithms to reduce communication cost in the Federated Averaging algorithm | Weikang Song | 25.10.2023 | ||
High-performance simulations with TFF | This tutorial will describe how to set up high-performance simulations with TFF in a variety of common scenarios | Krzysztof Ostrowski | 25.10.2023 | ||
Bark | Transformer-based text-to-audio model | suno | 25.10.2023 | ||
AutoGen | Framework that enables development of LLM applications using multiple agents that can converse with each other to solve tasks | microsoft | 23.10.2023 | ||
highway-env | A collection of environments for autonomous driving and tactical decision-making tasks | Edouard Leurent | 22.10.2023 | ||
Gorilla | Finetuned LLaMA-based model that surpasses the performance of GPT-4 on writing API calls | 21.10.2023 | |||
Deforum Stable Diffusion | Open source project is designed to be free to use and easy to modify for custom needs and pipelines | 20.10.2023 | |||
dm_control | DeepMind Infrastructure for Physics-Based Simulation | 18.10.2023 | |||
MuJoCo | A general purpose physics engine that aims to facilitate research and development in robotics, biomechanics, graphics and animation, machine learning, and other areas which demand fast and accurate simulation of articulated structures interacting with their environment | 18.10.2023 | |||
ComfyUI | Powerful and modular stable diffusion GUI and backend | comfyanonymous | 17.10.2023 | ||
Open Interpreter | An open-source, locally running implementation of OpenAI's Code Interpreter | Killian Lucas | 12.10.2023 | ||
SAGE | Methodology for generative spelling correction, which was tested on English and Russian languages and potentially can be extended to any language with minor changes | 11.10.2023 | |||
Mistral Transformer | The most powerful language model for its size to date | 09.10.2023 | |||
Fooocus | Image generating software | Lvmin Zhang | 03.10.2023 | ||
Actor-Critic | This tutorial demonstrates how to implement the Actor-Critic method using TensorFlow to train an agent on the OpenAI Gym CartPole-v0 environment | Mark Daoust | 27.09.2023 | ||
Simple audio recognition | This tutorial will show you how to build a basic speech recognition network that recognizes ten different words | 27.09.2023 | |||
YOLOv8 | State-of-the-art model that builds upon the success of previous YOLO versions and introduces new features and improvements to further boost performance and flexibility | Glenn Jocher | 26.09.2023 | ||
Feast | An open source feature store for machine learning | | 21.09.2023 | ||
YOLOv5 | You Only Look Once | Glenn Jocher | | 10.09.2023 | |
YOLOv3 | You Only Look Once | Glenn Jocher | | 10.09.2023 | |
MMAction2 | An open-source toolbox for video understanding based on PyTorch | MMAction2 Contributors | 06.09.2023 | ||
Ray | Unified framework for scaling AI and Python applications | | 06.09.2023 | ||
Transfer learning and fine-tuning | You will learn how to classify images of cats and dogs by using transfer learning from a pre-trained network | François Chollet | 31.08.2023 | ||
Neural style transfer | This tutorial uses deep learning to compose one image in the style of another image | Billy Lamberta | 31.08.2023 | ||
CycleGAN | This notebook demonstrates unpaired image-to-image translation using conditional GANs | Billy Lamberta | 31.08.2023 | ||
Pix2Pix | This notebook demonstrates image-to-image translation using conditional GANs | Billy Lamberta | 31.08.2023 | ||
Image classification | This tutorial shows how to classify images of flowers | Billy Lamberta | 31.08.2023 | ||
Home Robot | Low-level API for controlling various home robots | Chris Paxton | 30.08.2023 | ||
Stable Diffusion 2 | New stable diffusion model at 768x768 resolution. Same number of parameters in the U-Net as 1.5, but uses OpenCLIP-ViT/H as the text encoder and is trained from scratch | 26.08.2023 | |||
Brax | A differentiable physics engine that simulates environments made up of rigid bodies, joints, and actuators | 25.08.2023 | |||
The Autodiff Cookbook | You'll go through a whole bunch of neat autodiff ideas that you can cherry pick for your own work, starting with the basics | 18.08.2023 | |||
Composer | PyTorch library that enables you to train neural networks faster, at lower cost, and to higher accuracy | The Mosaic ML Team | 16.08.2023 | ||
Integrated gradients | This tutorial demonstrates how to implement Integrated Gradients, an Explainable AI technique | | 16.08.2023 | ||
Autoencoders | This tutorial introduces autoencoders with three examples: the basics, image denoising, and anomaly detection | 14.08.2023 | |||
xFormers | Toolbox to Accelerate Research on Transformers | 11.08.2023 | |||
Deep RL Course | The Hugging Face Deep Reinforcement Learning Course | | 08.08.2023 | ||
Classify text with BERT | This tutorial contains complete code to fine-tune BERT to perform sentiment analysis on a dataset of plain-text IMDB movie reviews | | 08.08.2023 | ||
SoftVC VITS | Singing Voice Conversion | svc develop team | 31.07.2023 | ||
Image captioning | Given an image our goal is to generate a caption | Billy Lamberta | 25.07.2023 | ||
Word2Vec | Word2Vec is not a singular algorithm, rather, it is a family of model architectures and optimizations that can be used to learn word embeddings from large datasets | 25.07.2023 | |||
Word embeddings | This tutorial contains an introduction to word embeddings | Billy Lamberta | 25.07.2023 | ||
Tortoise | A multi-voice TTS system trained with an emphasis on quality | James Betker | | 15.07.2023 | |
TRL | Set of tools to train transformer language models with Reinforcement Learning, from the Supervised Fine-tuning step, Reward Modeling step to the Proximal Policy Optimization step | 14.07.2023 | |||
Petals | Run 100B+ language models at home, BitTorrent-style | BigScience | | 05.07.2023 | |
PEFT | Parameter-Efficient Fine-Tuning methods enable efficient adaptation of pre-trained language models to various downstream applications without fine-tuning all the model's parameters | | 28.06.2023 | ||
Epistemic Neural Networks | A library for neural networks that know what they don't know | 26.06.2023 | |||
DeepFloyd IF | State-of-the-art open-source text-to-image model with a high degree of photorealism and language understanding | | 26.06.2023 | ||
normflows | PyTorch implementation of discrete normalizing flows | 26.06.2023 | |||
MyoSuite | A collection of musculoskeletal environments and tasks simulated with the MuJoCo physics engine and wrapped in the OpenAI gym API to enable the application of Machine Learning to bio-mechanic control problems | 16.06.2023 | |||
Audiocraft | PyTorch library for deep learning research on audio generation | | 11.06.2023 | ||
Nerfstudio | API that allows for a simplified end-to-end process of creating, training, and testing NeRFs | | 05.06.2023 | ||
Transformer | This tutorial trains a Transformer model to translate Portuguese to English | Billy Lamberta | | 02.06.2023 | |
Detectron2 | FAIR's next-generation platform for object detection and segmentation | Yuxin Wu | 26.05.2023 | ||
Anomalib | Deep learning library that aims to collect state-of-the-art anomaly detection algorithms for benchmarking on both public and private datasets | | 24.05.2023 | ||
ChatRWKV | Like ChatGPT but powered by the RWKV (100% RNN) language model, which is the only RNN that can match transformers in quality and scaling, while being faster and saving VRAM | | 08.05.2023 | ||
Python Data Science Handbook | Jupyter notebook version of the Python Data Science Handbook by Jake VanderPlas | Jake VanderPlas | 05.05.2023 | ||
PGMax | General factor graphs for discrete probabilistic graphical models, and hardware-accelerated differentiable loopy belief propagation in JAX | 05.05.2023 | |||
StableLM | Stability AI Language Models | Stability AI | | 27.04.2023 | |
TTS | A library for advanced Text-to-Speech generation, built on the latest research and designed to achieve the best trade-off among ease-of-training, speed and quality | 26.04.2023 | |||
OpenCLIP | An open source implementation of CLIP | 16.04.2023 | |||
Stable Baselines3 | Set of reliable implementations of reinforcement learning algorithms in PyTorch | | 14.04.2023 | ||
RL Baselines3 Zoo | Training Framework for Stable Baselines3 Reinforcement Learning Agents | Antonin Raffin | 14.04.2023 | ||
Grounded-SAM | Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything | IDEA-Research | 12.04.2023 | ||
SentencePiece | An unsupervised text tokenizer and detokenizer mainly for Neural Network-based text generation systems where the vocabulary size is predetermined prior to the neural model training | 08.04.2023 | |||
TorchGeo | PyTorch domain library that provides datasets, transforms, samplers, and pre-trained models specific to geospatial data | 29.03.2023 | |||
LAVIS | Python deep learning library for LAnguage-and-VISion intelligence research and applications | | 24.03.2023 | ||
Hello, many worlds | This tutorial shows how a classical neural network can learn to correct qubit calibration errors | Michael Broughton | 20.03.2023 | ||
Image segmentation | This tutorial focuses on the task of image segmentation, using a modified U-Net | Billy Lamberta | 17.03.2023 | ||
Tzer | Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation | 09.03.2023 | |||
Haiku | A library built on top of JAX designed to provide simple, composable abstractions for machine learning research | 02.03.2023 | |||
Data augmentation | This tutorial demonstrates data augmentation: a technique to increase the diversity of your training set by applying random transformations such as image rotation | Billy Lamberta | 02.03.2023 | ||
SAHI | A lightweight vision library for performing large scale object detection & instance segmentation | 23.02.2023 | |||
AmpliGraph | A suite of neural machine learning models for relational learning, a branch of machine learning that deals with supervised learning on knowledge graphs | 23.02.2023 | |||
NMT with attention | This notebook trains a seq2seq model for Spanish to English translation | Billy Lamberta | | 15.02.2023 | |
GLUE using BERT on TPU | This tutorial contains complete end-to-end code to train models on a TPU | 15.02.2023 | |||
Kornia | Library composed of a subset of packages containing operators that can be inserted within neural networks to train models to perform image transformations, epipolar geometry, depth estimation, and low-level image processing such as filtering and edge detection that operate directly on tensors | 11.02.2023 | |||
TensorBoard | Suite of web applications for inspecting and understanding your TensorFlow runs and graphs | Yuan Tang | | 10.02.2023 | |
High-performance Simulation with Kubernetes | This tutorial will describe how to set up high-performance simulation using a TFF runtime running on Kubernetes | Jason Roselander | 31.01.2023 | ||
Compel | Text prompt weighting and blending library for transformers-type text embedding systems | Damian Stewart | 26.01.2023 | ||
DALL·E Flow | An interactive workflow for generating high-definition images from text prompts | 26.01.2023 | |||
Diffusers | Provides pretrained diffusion models across multiple modalities, such as vision and audio, and serves as a modular toolbox for inference and training of diffusion models | Hugging Face | 17.01.2023 | ||
Sample Factory | One of the fastest RL libraries focused on very efficient synchronous and asynchronous implementations of policy gradients | 17.01.2023 | |||
Open-Assistant | Chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so | | 14.01.2023 | ||
CleanRL | Deep Reinforcement Learning library that provides high-quality single-file implementations with research-friendly features | | 12.01.2023 | ||
NeMo | A conversational AI toolkit built for researchers working on automatic speech recognition, natural language processing, and text-to-speech synthesis | 05.01.2023 | |||
BANMo | Given multiple casual videos capturing a deformable object, BANMo reconstructs an animatable 3D model, including an implicit canonical 3D shape, appearance, skinning weights, and time-varying articulations, without pre-defined shape templates or registered cameras | | 30.12.2022 | ||
TF-Agents | A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning | 15.12.2022 | |||
PyG | Library built upon PyTorch to easily write and train Graph Neural Networks for a wide range of applications related to structured data | 08.12.2022 | |||
ruGPT3 | Example of inference of RuGPT3XL | Anton Emelyanov | 07.12.2022 | ||
Stable Diffusion Videos | Create videos with Stable Diffusion by exploring the latent space and morphing between text prompts | Nathan Raw | 05.12.2022 | ||
PyTerrier | A Python framework for performing information retrieval experiments | 02.11.2022 | |||
DSP theory | Theory of digital signal processing: signals, filtration (IIR, FIR, CIC, MAF), transforms (FFT, DFT, Hilbert, Z-transform) etc | 18.10.2022 | |||
Mubert | Prompt-based music generation via Mubert API | Ilya Belikov | | 18.10.2022 | |
Batch RL | Offline RL using the DQN replay dataset comprising the entire replay experience of a DQN agent on 60 Atari 2600 games | 04.10.2022 | |||
EfficientDet | New family of object detectors, called EfficientDet, which consistently achieve much better efficiency than prior art across a wide spectrum of resource constraints | 27.09.2022 | |||
ACME | A library of reinforcement learning components and agents | | 26.09.2022 | ||
RWKV | Reinventing RNNs for the Transformer Era | 21.09.2022 | |||
NetKet | Open-source project delivering cutting-edge methods for the study of many-body quantum systems with artificial neural networks and machine learning techniques | others | | 15.09.2022 | |
pymdp | Package for simulating Active Inference agents in Markov Decision Process environments | 24.08.2022 | |||
Stable Diffusion | A latent text-to-image diffusion model | 10.08.2022 | |||
Deep-MAC | Welcome to the Novel class segmentation demo | Vighnesh Birodkar | 09.08.2022 | ||
NL-Augmenter | A collaborative effort intended to add transformations of datasets dealing with natural language | others | 06.08.2022 | ||
Accelerate | A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision | Hugging Face | 27.07.2022 | ||
YOLOv5 on Custom Objects | This notebook shows training on your own custom objects | Jacob Solawetz | 20.07.2022 | ||
MindsEye | Graphical user interface built to run multimodal AI art models for free from Google Colab, without needing to edit a single line of code or know any programming | 06.07.2022 | |||
py-irt | Fitting Item Response Theory models using variational inference | 30.06.2022 | |||
SberSwap | A new face swap method for image and video domains | 29.06.2022 | |||
BIG-bench | A collaborative benchmark intended to probe large language models and extrapolate their future capabilities | 27.06.2022 | |||
HuggingArtists | Choose your favorite Artist and train a language model to write new lyrics based on their unique voice | Aleksey Korshuk | 25.06.2022 | ||
Introduction to the TensorFlow Models NLP library | You will learn how to build transformer-based models for common NLP tasks including pretraining, span labelling and classification using the building blocks from the NLP modeling library | Chen Chen | 22.06.2022 | ||
Cirq | A python framework for creating, editing, and invoking Noisy Intermediate Scale Quantum circuits | 21.06.2022 | |||
CLIP-as-service | A low-latency high-scalability service for embedding images and text | Han Xiao | 19.06.2022 | ||
Jina | MLOps framework that empowers anyone to build cross-modal and multi-modal applications on the cloud | Han Xiao | 11.06.2022 | ||
Flashlight | Fast, flexible machine learning library written entirely in C++ | 01.06.2022 | |||
Evidently | An open-source framework to evaluate, test and monitor ML models in production | 30.05.2022 | |||
RL Unplugged | Suite of benchmarks for offline reinforcement learning | | 26.05.2022 | ||
Text generation with RNN | This tutorial demonstrates how to generate text using a character-based RNN | Billy Lamberta | 02.05.2022 | ||
CLIPDraw | Synthesize drawings to match a text prompt | | 28.04.2022 | ||
deep-significance | Easy-to-use package containing different significance tests and utility functions specifically tailored towards research needs and usability | | 12.04.2022 | ||
Text classification with RNN | This text classification tutorial trains a recurrent neural network on the IMDB large movie review dataset for sentiment analysis | Billy Lamberta | 17.03.2022 | ||
RLDS | Reinforcement Learning Datasets, an ecosystem of tools to store, retrieve and manipulate episodic data in the context of Sequential Decision Making including RL, Learning from Demonstrations, Offline RL or Imitation Learning | | 16.03.2022 | ||
Real-Time Voice Cloning | SV2TTS with a vocoder that works in real-time | 07.03.2022 | |||
BLIP | VLP framework which transfers flexibly to both vision-language understanding and generation tasks | | 02.03.2022 | ||
Silero Models | Pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple | Silero team | 27.02.2022 | ||
ArcaneGAN | Process video in the style of the Arcane animated series | Alexander Spirin | 17.02.2022 | ||
textlesslib | A library aimed to facilitate research in Textless NLP | 15.02.2022 | |||
AV-HuBERT | Self-supervised representation learning framework for audio-visual speech | | 12.02.2022 | ||
Lingvo | Framework for building neural networks in TensorFlow, particularly sequence models | 28.01.2022 | |||
RuDOLPH | A fast and light text-image-text transformer designed for a quick and easy fine-tuning setup for the solution of various tasks: from generating images by text description and image classification to visual question answering and more | 14.01.2022 | |||
DeepDream | This tutorial contains a minimal implementation of DeepDream: an experiment that visualizes the patterns learned by a neural network | Billy Lamberta | 13.01.2022 | ||
MLP | The most basic neural network architecture, a multilayer perceptron, also known as a feedforward network | Ben Trevett | 26.12.2021 | ||
AlexNet | A neural network model that uses convolutional neural network layers and was designed for the ImageNet challenge | Ben Trevett | 26.12.2021 | ||
VGG | Very Deep Convolutional Networks for Large-Scale Image Recognition | Ben Trevett | 26.12.2021 | ||
LeNet | A neural network model that uses convolutional neural network layers and was designed for classifying handwritten characters | Ben Trevett | 26.12.2021 | ||
FLAML | Lightweight Python library that finds accurate machine learning models automatically, efficiently and economically | | 17.12.2021 | ||
CompilerGym | A reinforcement learning toolkit for compiler optimizations | 16.11.2021 | |||
DeepStyle | The Neural Style algorithm synthesizes a pastiche by separating and combining the content of one image with the style of another image using convolutional neural networks | | 01.10.2021 | ||
Text2Animation | Generate images from text phrases with VQGAN and CLIP with animation and keyframes | 29.09.2021 | |||
EfficientNetV2 | A family of image classification models, which achieve better parameter efficiency and faster training speed than prior art | 24.09.2021 | |||
Droidlet | A modular embodied agent architecture and platform for building embodied agents | 15.09.2021 | |||
GPT-J-6B | A 6 billion parameter, autoregressive text generation model trained on The Pile | 15.09.2021 | |||
Sentence Transformers | Multilingual Sentence, Paragraph, and Image Embeddings using BERT & Co | 13.09.2021 | |||
Machine learning course | This course is broad and shallow, but the author will provide additional links so that you can deepen your understanding of the ML method you need | Тимчишин Віталій | | 02.09.2021 | |
Lucid Sonic Dreams | Syncs GAN-generated visuals to music | Mikael Alafriz | 24.08.2021 | ||
textgenrnn | Generate text using a pretrained neural network with a few lines of code, or easily train your own text-generating neural network of any size and complexity | Max Woolf | 13.07.2021 | ||
TensorRT | SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications | nvidia | 10.06.2021 | ||
BasicSR | Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. | 07.06.2021 | |||
Hyperopt | Python library for serial and parallel optimization over awkward search spaces, which may include real-valued, discrete, and conditional dimensions | | 01.06.2021 | ||
CNN | This tutorial demonstrates training a simple Convolutional Neural Network to classify CIFAR images | Billy Lamberta | 21.05.2021 | ||
Custom GPT-2 + Tokenizer | Train a custom GPT-2 model for free on a GPU using aitextgen! | Max Woolf | 17.05.2021 | ||
Train a GPT-2 Text-Generating Model | Retrain an advanced text generating neural network on any text dataset for free on a GPU using Colaboratory using aitextgen! | Max Woolf | 17.05.2021 | ||
EasyNMT | Easy to use, state-of-the-art machine translation for more than 100 languages | Nils Reimers | | 26.04.2021 | |
OCTIS | Framework for training, analyzing, and comparing Topic Models, whose optimal hyper-parameters are estimated using a Bayesian Optimization approach | 19.04.2021 | |||
PyTorchVideo | Deep learning library with a focus on video understanding work | 13.04.2021 | |||
GPT Neo | An implementation of model & data parallel GPT2 & GPT3-like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library | EleutherAI | | 28.03.2021 | |
CVAE | This notebook demonstrates how to train a Variational Autoencoder on the MNIST dataset | Billy Lamberta | 22.03.2021 | ||
DCGAN | This tutorial demonstrates how to generate images of handwritten digits using a Deep Convolutional Generative Adversarial Network | Billy Lamberta | 12.03.2021 | ||
Adversarial FGSM | This tutorial creates an adversarial example using the Fast Gradient Sign Method attack. This was one of the first and most popular attacks to fool a neural network. | Billy Lamberta | 12.03.2021 | ||
GAN steerability | We will navigate in GAN latent space to simulate various camera transformations | | 04.03.2021 | ||
Trax | End-to-end library for deep learning that focuses on clear code and speed | | 17.02.2021 | ||
bsuite | A collection of carefully-designed experiments that investigate core capabilities of an RL agent with two main objectives | 13.02.2021 | |||
TF-Ranking | End-to-end walkthrough of training a TensorFlow Ranking neural network model which incorporates sparse textual features | Rama Kumar | | 04.02.2021 | |
Toon-Me | A fun project to toon portrait images | Vijish Madhavan | 22.01.2021 | ||
TensorNetwork | A library for easy and efficient manipulation of tensor networks | Chase Roberts | 21.01.2021 | ||
Spleeter | Deezer source separation library including pretrained models | 10.01.2021 | |||
Person Remover | Project that combines Pix2Pix and YOLO architectures in order to remove people or other objects from photos | 22.08.2020 | |||
Semantic Segmentation | PyTorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset | | 21.08.2020 | ||
Gin Config | Lightweight configuration framework for Python, based on dependency injection | 13.08.2020 | |||
CoVoST | A Large-Scale Multilingual Speech-To-Text Translation Corpus |