Skip to content

Usable implementation of Mogrifier, a circuit for enhancing LSTMs and potentially other networks, from Deepmind

License

Notifications You must be signed in to change notification settings

lucidrains/mogrifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PyPI version

Mogrifier

A complete implementation of Mogrifier, a circuit for enhancing LSTMs and potentially other networks. It allows two vectors to modulate each other by having each gate the other in an interleaved, iterative fashion.

Install

$ pip install mogrifier

Usage

import torch
from mogrifier import Mogrifier

mogrify = Mogrifier(
    dim = 512,
    dim_hidden = 256,
    iters = 5,          # number of iterations, defaults to 5 as paper recommended for LSTM
    factorize_k = 16    # factorize weight matrices into (dim x k) and (k x dim), if specified
)

x = torch.randn(1, 16, 512)
h = torch.randn(1, 16, 256)

out, hidden_out = mogrify(x, h) # (1, 16, 512), (1, 16, 256)

assert out.shape == x.shape
assert hidden_out.shape == h.shape

Citation

@inproceedings{Melis2020Mogrifier,
    title   = {Mogrifier LSTM},
    author  = {Gábor Melis and Tomáš Kočiský and Phil Blunsom},
    booktitle = {International Conference on Learning Representations},
    year    = {2020},
    url     = {https://openreview.net/forum?id=SJe5P6EYvS}
}

About

Usable implementation of Mogrifier, a circuit for enhancing LSTMs and potentially other networks, from Deepmind

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages