Randperm on PyTorch and MxNet #2084
Conversation
Codecov Report. Base: 72.08% // Head: 71.36% // Decreases project coverage by 0.72%.
Additional details and impacted files:

@@             Coverage Diff             @@
##              master     #2084     +/- ##
============================================
- Coverage      72.08%    71.36%   -0.72%
- Complexity      5126      6279    +1153
============================================
  Files            473       624     +151
  Lines          21970     27806    +5836
  Branches        2351      2997     +646
============================================
+ Hits           15838     19845    +4007
- Misses          4925      6501    +1576
- Partials       1207      1460     +253

☔ View full report at Codecov.
engines/pytorch/pytorch-engine/src/main/java/ai/djl/pytorch/engine/PtModel.java (outdated review comment, resolved)
for (int i = 0; i < extraFileKeys.length; i++) {
    properties.put(extraFileKeys[i], extraFileValues[i]);
}
// Freeze the parameters if not retrain
for (Pair<String, Parameter> paramPair : block.getParameters()) {
Is there a reason that retrain has to be added as an option to the model? Can we not just direct users to call freeze() themselves if they don't wish to retrain the model? And as part of that, I am not sure that freezing the parameters is the default that users would expect.
I see. Yes, this should be replaced by baseBlock.freezeParameters(false). I'll edit it in PR #2070.
The reason the default is to freeze the baseBlock is that even if the pretrained model is not frozen, its learning rate cannot be very large, or the same as that of the subsequent blocks; i.e. the pretrained parameters should not be trained too much. So it is safe to freeze those parameters by default. Maybe we can add some comments about this.
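For illustration, a minimal sketch of freezing the pretrained base block by default when assembling the transfer model. The class, variable names, and the boolean convention of freezeParameters are assumptions for this sketch, not the exact code in #2070:

import ai.djl.nn.Block;
import ai.djl.nn.SequentialBlock;

final class FreezeSketch {

    /**
     * Assembles the transfer model. The pretrained base block is frozen by default
     * so its parameters are not updated too aggressively; callers who really want
     * to fine-tune everything can unfreeze it afterwards.
     */
    static Block buildTransferModel(Block baseBlock, Block newHead) {
        // Assumed convention: freezeParameters(true) stops gradient updates for the
        // block's parameters (check the DJL Block javadoc for the exact semantics).
        baseBlock.freezeParameters(true);
        return new SequentialBlock().add(baseBlock).add(newHead);
    }
}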
@@ -56,6 +57,9 @@ protected Sgd(Builder builder) {

    /** {@inheritDoc} */
    @Override
    public void update(String parameterId, NDArray weight, NDArray grad) {
        if (learningRateTracker instanceof FixedPerVarTracker) {
            ((FixedPerVarTracker) learningRateTracker).setParameterId(parameterId);
It isn't great to use a system like this to pass a special-case argument like parameterId here. Instead, maybe we could add a new kind of tracker named something like ParameterTracker. The ParameterTracker can require the parameter to get a new value, and all standard Trackers would also be ParameterTrackers if we have Tracker extend ParameterTracker. If we leave it as a special case like this, it would make it difficult if a new tracker that also required parameters were created, or if new optimizers were created without the special handling for the FixedPerVarTracker.
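A rough sketch of the shape that proposal could take; the interface and method names below are illustrative only and were not part of DJL at the time of this review:

/**
 * Sketch of the proposed tracker hierarchy (names are illustrative).
 * A ParameterTracker may return a different value per parameter; plain
 * Trackers ignore the parameter id, so Tracker can extend ParameterTracker
 * with a default method and every existing tracker keeps working.
 */
interface ParameterTracker {
    float getNewValue(String parameterId, int numUpdate);
}

interface Tracker extends ParameterTracker {
    float getNewValue(int numUpdate);

    /** Standard trackers do not depend on which parameter is being updated. */
    @Override
    default float getNewValue(String parameterId, int numUpdate) {
        return getNewValue(numUpdate);
    }
}

With something like this, the Sgd update could simply ask learningRateTracker for a value with the parameter id and drop the instanceof FixedPerVarTracker check.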
@@ -44,7 +44,25 @@ Java_ai_djl_pytorch_jni_PyTorchLibrary_moduleLoad__Ljava_lang_String_2_3IZ_3Ljav
        map[name] = "";
    }

    JITCallGuard guard;
    if (!jretrain) {
I couldn't find the difference in behavior between with and without retrain. Is there one?
examples/src/main/java/ai/djl/examples/training/transferlearning/TransferFreshFruit.java (outdated review comment, resolved)
        .addTrainingListeners(listener);
}

private static class SoftmaxCrossEntropy extends Loss {
Why not use the standard SoftmaxCrossEntropyLoss?
The standard SoftmaxCrossEntropyLoss doesn't take the output of the softmax function; it currently takes either the values before applying softmax, or the values after applying logit, i.e. log(softmax(*)).
Here, I built the network to output softmax probabilities, mainly because they are natural to understand (as probabilities). Correspondingly, I also corrected the calculation in Accuracy.java; you can take a look to check whether it is correct. (It looks like the calculation there did not work well before this edit.)
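For reference, a minimal sketch of a loss that consumes softmax probabilities directly; this is illustrative only, not the exact SoftmaxCrossEntropy implementation in this example:

import ai.djl.ndarray.NDArray;
import ai.djl.ndarray.NDList;
import ai.djl.training.loss.Loss;

/**
 * Cross-entropy over probabilities: expects the prediction to already be the
 * output of softmax and picks the probability of the true class via one-hot.
 */
class SoftmaxCrossEntropySketch extends Loss {

    SoftmaxCrossEntropySketch() {
        super("SoftmaxCrossEntropySketch");
    }

    /** {@inheritDoc} */
    @Override
    public NDArray evaluate(NDList labels, NDList predictions) {
        NDArray prob = predictions.singletonOrThrow(); // shape (batch, classes), softmax output
        int numClasses = (int) prob.getShape().get(1);
        NDArray oneHot =
                labels.singletonOrThrow()
                        .flatten()
                        .oneHot(numClasses)
                        .toType(prob.getDataType(), false);
        // -log p(true class), clipped for numerical safety, averaged over the batch
        NDArray loss = prob.clip(1e-12, 1.0).log().mul(oneHot).sum(new int[] {1}).neg();
        return loss.mean();
    }
}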
Description
randPerm is added.
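As a usage sketch (the description above only states that randPerm is added; the Java-side method name randomPermutation and its placement on NDManager below are assumptions about how the new operator is exposed):

import ai.djl.ndarray.NDArray;
import ai.djl.ndarray.NDManager;

public final class RandPermExample {
    public static void main(String[] args) {
        try (NDManager manager = NDManager.newBaseManager()) {
            // Assumed entry point: a random permutation of the integers 0..9
            NDArray perm = manager.randomPermutation(10);
            System.out.println(perm); // e.g. [3, 7, 0, 9, 1, 5, 8, 2, 6, 4]
        }
    }
}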