
NLU-NLG phase 1

No due date 0% complete

Description

This phase will focus exclusively on NLU (natural language understanding).
NLU engines perform two tasks:

  1. intent matching,
  2. entity extraction.

How can developers with no NLU experience approach these topics? What data can they use for training and testing NLU engines? How can a developer, or even a user, easily fix a defect (e.g., an utterance routed to the wrong intent)?
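To make the two tasks concrete, here is a toy sketch of what an NLU engine produces for an utterance. The intents, keywords, and entity patterns are illustrative assumptions; real engines learn these mappings from annotated data instead of hard-coding them.

```python
import re

# Hypothetical toy intents and entities (assumptions, not the project's real schema).
INTENT_KEYWORDS = {
    "SetTimer": {"timer", "countdown"},
    "GetWeather": {"weather", "forecast", "rain"},
}
ENTITY_PATTERNS = {
    "duration": re.compile(r"\b(\d+)\s*(minutes?|seconds?|hours?)\b"),
    "city": re.compile(r"\bin\s+([A-Z][a-z]+)\b"),
}

def parse(utterance: str) -> dict:
    tokens = set(utterance.lower().split())
    # Task 1, intent matching: pick the intent with the most keyword overlap.
    intent = max(INTENT_KEYWORDS, key=lambda i: len(INTENT_KEYWORDS[i] & tokens))
    if not INTENT_KEYWORDS[intent] & tokens:
        intent = "None"
    # Task 2, entity extraction: collect every pattern that matches.
    entities = {name: m.group(0) for name, p in ENTITY_PATTERNS.items()
                if (m := p.search(utterance))}
    return {"intent": intent, "entities": entities}

print(parse("set a timer for 5 minutes"))
# → {'intent': 'SetTimer', 'entities': {'duration': '5 minutes'}}
```

A defect like "the utterance is routed to the wrong intent" then amounts to `parse` returning the wrong `intent` label, which is exactly what the benchmarks below measure.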

We want to help developers understand NLU engines through example engines, benchmark these examples, and deep dive into current popular engines.

Developers can either clean their own dataset with the tools provided, or use the comprehensive, quality NLU dataset from Secret Sauce AI.
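A data-cleaning step might look like the following sketch. The specific rules (lowercasing, punctuation stripping, whitespace collapsing, deduplication) are assumptions about what such a pipeline would do, not the project's actual tool.

```python
import string

# Toy cleaning pipeline sketch: normalise case, strip punctuation,
# collapse whitespace, and drop exact-duplicate utterances.
def clean(utterances: list) -> list:
    seen, cleaned = set(), []
    for utt in utterances:
        norm = utt.lower().translate(str.maketrans("", "", string.punctuation))
        norm = " ".join(norm.split())
        if norm and norm not in seen:
            seen.add(norm)
            cleaned.append(norm)
    return cleaned

print(clean(["Play Jazz!", "play jazz", "  stop the MUSIC  "]))
# → ['play jazz', 'stop the music']
```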

High-level user stories

As a developer,

  • I want a dataset of many annotated utterances that I can convert to other formats, so that I can use it for my domains/skills.
  • I want several NLU engines benchmarked so that I can pick the right one for me.
  • I want easy-to-follow examples of how an NLU engine works so that I have a basic understanding.
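Converting the dataset to another format could work roughly as below. The source record layout (character-span entity annotations) and the Snips-style chunked output are assumptions for illustration; the real Secret Sauce AI schema may differ.

```python
# Sketch of a format converter: turn a record with character-span entity
# annotations into a Snips-style list of text/entity chunks.
def to_chunks(record: dict) -> dict:
    text = record["text"]
    spans = sorted(record["entities"], key=lambda e: e["start"])
    data, cursor = [], 0
    for e in spans:
        if e["start"] > cursor:                      # plain text before the entity
            data.append({"text": text[cursor:e["start"]]})
        data.append({"text": text[e["start"]:e["end"]],
                     "entity": e["entity"],
                     "slot_name": e["entity"]})
        cursor = e["end"]
    if cursor < len(text):                            # trailing plain text
        data.append({"text": text[cursor:]})
    return {"intent": record["intent"], "data": data}

record = {"text": "play jazz in the kitchen",
          "intent": "PlayMusic",
          "entities": [{"start": 5, "end": 9, "entity": "genre"},
                       {"start": 17, "end": 24, "entity": "room"}]}
print(to_chunks(record))
```

One converter per target engine keeps the annotated dataset itself engine-neutral.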

Prototype deliverables

NLU-engine-prototype-benchmarks repo

  • Find the best possible intent and entity dataset across many domains (skills)
  • Explore possible current solutions for NLG that include grammar agreement
  • Create notebook using the most basic engine components (intent and entity extraction) to demonstrate the process and benchmark solutions
  • Deep dive into a popular NLU engine (e.g. Snips), write a dataset converter, and benchmark results
  • Create a data cleaning pipeline
  • Make an intent and entity classifier with DistilBERT and benchmark the results
  • Write an article with a summary, links to the dataset, notebooks, models, etc.
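The benchmarking deliverables above reduce to a harness like this sketch: run each engine over a labelled test set and report a per-engine score. The stub engines and test set are placeholders; the real engines (Snips, the DistilBERT classifier, etc.) and the real dataset plug in the same way.

```python
# Minimal benchmark harness sketch: each engine is a callable that maps
# an utterance to a predicted intent label.
def benchmark(engines: dict, test_set: list) -> dict:
    scores = {}
    for name, predict in engines.items():
        correct = sum(predict(utt) == intent for utt, intent in test_set)
        scores[name] = correct / len(test_set)
    return scores

# Placeholder engines and test set (assumptions for illustration only).
test_set = [("set a timer for 5 minutes", "SetTimer"),
            ("will it rain tomorrow", "GetWeather")]
engines = {
    "keyword": lambda u: "SetTimer" if "timer" in u else "GetWeather",
    "always_weather": lambda u: "GetWeather",
}
print(benchmark(engines, test_set))
# → {'keyword': 1.0, 'always_weather': 0.5}
```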

DoD (definition of done)

See NLU Engine Prototype Benchmark milestones

KPIs: intent and entity tagging

  • clean dataset
  • intent f1 score for the whole dataset (all intents)
  • entity extraction (entity tagging) f1 score for the whole dataset
  • intent and entity extraction f1 scores per domain
  • documentation
    • prototype engine
    • onboarding
    • results
    • cleaning tool
    • DistilBERT engine
  • document
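The f1 KPIs above can be computed as a macro average over intent labels. This pure-Python sketch shows the calculation; in practice a library routine such as scikit-learn's `f1_score` would be used, and the gold/predicted labels here are made-up examples.

```python
from collections import Counter

# Macro-averaged F1 over (gold, predicted) intent label pairs.
def macro_f1(gold: list, pred: list) -> float:
    labels = set(gold) | set(pred)
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1
            fn[g] += 1
    f1s = []
    for label in labels:
        precision = tp[label] / (tp[label] + fp[label]) if tp[label] + fp[label] else 0.0
        recall = tp[label] / (tp[label] + fn[label]) if tp[label] + fn[label] else 0.0
        f1s.append(2 * precision * recall / (precision + recall) if precision + recall else 0.0)
    return sum(f1s) / len(labels)

gold = ["SetTimer", "GetWeather", "SetTimer", "GetWeather"]
pred = ["SetTimer", "SetTimer", "SetTimer", "GetWeather"]
print(round(macro_f1(gold, pred), 3))
```

The same routine applied per domain yields the per-domain f1 scores listed above.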