Mask vs sequence #5565

evgeniiaraz · 2020-04-02T13:20:41Z

Proposed changes:

store sequence length instead of mask; compute mask when needed;

Status (please check what you already did):

added some tests for the functionality
updated the documentation
updated the changelog (please check changelog for instructions)
reformat files using black (please check Readme for instructions)

tabergma

Looks good so far 👍. Left some comments.

Did you saw any performance improvements?

rasa/utils/tensorflow/model_data.py

rasa/nlu/classifiers/diet_classifier.py

evgeniiaraz · 2020-04-02T16:10:41Z

I haven't seen performance improvements F1-accuracy-wise (it's the same), and I wouldn't expect that because we're essentially doing the same thing. Would need to check memory / time!
I think I've addressed previous comments.

tabergma

👍 Thanks for improving this!

Sorry, I meant performance = time/memory. F1 score should not change. Would be great is we could to a quick comparison time wise at least. I think we are fine, but we should make sure that we don't merge something that slows things down.

tabergma · 2020-04-03T06:17:14Z

rasa/nlu/classifiers/diet_classifier.py

@@ -1255,10 +1251,12 @@ def _create_sequence(

    def _create_all_labels(self) -> Tuple[tf.Tensor, tf.Tensor]:
        all_label_ids = self.tf_label_data[LABEL_IDS][0]
+
+        label_lengths = tf.cast(self.tf_label_data[LABEL_SEQ_LENGTH][0], dtype=tf.int32)


Should we add a helper method for this? Something like sequence_length_for? This logic is repeated all over the place.

tabergma · 2020-04-03T06:20:22Z

rasa/nlu/classifiers/diet_classifier.py

@@ -1352,13 +1350,19 @@ def _calculate_entity_loss(

        return loss, f1

+    def _compute_mask(self, sequence_lengths: tf.Tensor) -> tf.Tensor:


Can be static.

rasa/utils/tensorflow/model_data.py

Co-Authored-By: Tanja <tabergma@gmail.com>

…o mask-vs-sequence

evgeniiaraz · 2020-04-03T11:27:25Z

Timewise it is the same as the previous way of doing it.

evgeniiaraz added 2 commits April 2, 2020 07:43

merge conflict

8d8f421

small changes

d4564ec

evgeniiaraz requested a review from tabergma April 2, 2020 13:21

tabergma requested changes Apr 2, 2020

View reviewed changes

rasa/utils/tensorflow/model_data.py Outdated Show resolved Hide resolved

rasa/utils/tensorflow/model_data.py Outdated Show resolved Hide resolved

rasa/nlu/classifiers/diet_classifier.py Outdated Show resolved Hide resolved

rasa/nlu/classifiers/diet_classifier.py Show resolved Hide resolved

evgeniiaraz added 2 commits April 2, 2020 11:33

small changed by comments

18e4b79

None return type

e0201f1

tabergma approved these changes Apr 3, 2020

View reviewed changes

tabergma reviewed Apr 3, 2020

View reviewed changes

rasa/utils/tensorflow/model_data.py Outdated Show resolved Hide resolved

evgeniiaraz and others added 4 commits April 3, 2020 03:40

Update docstring

d4b7b51

Co-Authored-By: Tanja <tabergma@gmail.com>

sequence length as a separate method

ae6dd7b

Merge branch 'mask-vs-sequence' of https://github.com/RasaHQ/rasa int…

2bf5f23

…o mask-vs-sequence

black

75ac652

evgeniiaraz merged commit 993893a into 1.9.x Apr 15, 2020

erohmensing mentioned this pull request Apr 23, 2020

Revert model breaking changes in 1.9.x #5709

Merged

tmbo deleted the mask-vs-sequence branch May 1, 2020 10:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mask vs sequence #5565

Mask vs sequence #5565

evgeniiaraz commented Apr 2, 2020 •

edited

Loading

tabergma left a comment

evgeniiaraz commented Apr 2, 2020

tabergma left a comment

tabergma Apr 3, 2020

tabergma Apr 3, 2020

evgeniiaraz commented Apr 3, 2020

		@@ -1352,13 +1350,19 @@ def _calculate_entity_loss(

		return loss, f1

		def _compute_mask(self, sequence_lengths: tf.Tensor) -> tf.Tensor:

Mask vs sequence #5565

Mask vs sequence #5565

Conversation

evgeniiaraz commented Apr 2, 2020 • edited Loading

tabergma left a comment

Choose a reason for hiding this comment

evgeniiaraz commented Apr 2, 2020

tabergma left a comment

Choose a reason for hiding this comment

tabergma Apr 3, 2020

Choose a reason for hiding this comment

tabergma Apr 3, 2020

Choose a reason for hiding this comment

evgeniiaraz commented Apr 3, 2020

evgeniiaraz commented Apr 2, 2020 •

edited

Loading