Skip to content

ExpressNet is a ready-to-use, weightless text classification model architecture that you can import and start training immediately.

Notifications You must be signed in to change notification settings

anarlavrenov/ExpressNet

Repository files navigation

ExpressNet

image

ExpressNet is an autoregressive BiLSTM-based decoder-only model with Additive Attention Mechanism (Bahdanau Attention). Developed by Anar Lavrenov, Head of AI at SPUNCH. For now ExpressNet is made for binary/multi classification tasks and there are plans for adding other tasks as well. The main distinguishing feature of this model is high perfomance without text preprocessing.

Architecture

image

Quick Start in Google Colab Open In Colab

Clone this repository

!git clone https://github.com/anarlavrenov/ExpressNet
%cd ExpressNet

Import model

from ExpressNet.model import ExpressNet

Initialize model with your own hyperparameters

model = ExpressNet(
    d_model=256,
    vocab_size=len(vocab),
    classification_type="multiclass",
    n_classes=4
).to(device)

Validation

ExpressNet showed decent results on validatation on most of torchtext datasets. General parameters were used:

  1. No text preprocessing at all: no stopwords removal, no lemmatization etc.
  2. basic english torch tokenizer everywhere
  3. d_model: 256 everywhere

image

Usage Purposes

  1. Primarily usage: playground for Machine Learning Researches and Data Scientists. You are very welcome to share your insights and recommendations.
  2. Secondary usage: baseline for most of classification tasks without any text preprocessing. If you want to achieve instant high validation accuracy - you are welcome to use ExpressNet.

About

ExpressNet is a ready-to-use, weightless text classification model architecture that you can import and start training immediately.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published