release/530-release-candidate #14164

maziyarpanahi · 2024-02-06T14:21:51Z

…#14158)

* introducing LLAMA2
* Added option to read model from model path to onnx wrapper
* Added option to read model from model path to onnx wrapper
* updated text description
* LLAMA2 python API
* added method to save onnx_data
* added position ids
* - updated Generate.scala to accept onnx tensors
  - added beam search support for LLAMA2
* updated max input length
* updated python default params changed test to slow test
* fixed serialization bug

…4155)

* introducing LLAMA2
* Added option to read model from model path to onnx wrapper
* Added option to read model from model path to onnx wrapper
* updated text description
* LLAMA2 python API
* added method to save onnx_data
* added position ids
* - updated Generate.scala to accept onnx tensors
  - added beam search support for LLAMA2
* updated max input length
* updated python default params changed test to slow test
* fixed serialization bug
* Added Scala code for M2M100
* Documentation for scala code
* Python API for M2M100
* added more tests for scala
* added tests for python
* added pretrained
* rewording
* fixed serialization bug
* fixed serialization bug

---------

Co-authored-by: Maziyar Panahi <maziyar.panahi@iscpif.fr>

Fixes `java.lang.IllegalArgumentException: No Operation named [init_all_tables] in the Graph`, which is thrown when the model needs to be deserialized. Deserialization is skipped when the model is already loaded, so the error only appears on the worker nodes and not on the driver. GPT2 does not contain tables and so does not require this command.
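
One way to picture this kind of guard is sketched below. It is a minimal illustration under stated assumptions, not the Spark NLP implementation: `GraphStub` and `init_tables_if_present` are hypothetical names, and only the operation name `init_all_tables` is taken from the error message above.

```python
from dataclasses import dataclass, field

INIT_OP = "init_all_tables"  # op name taken from the reported error message


@dataclass
class GraphStub:
    """Hypothetical stand-in for a deserialized model graph."""
    op_names: set
    executed: list = field(default_factory=list)

    def run(self, op_name: str) -> None:
        # Mimic the reported failure when the op is missing from the graph.
        if op_name not in self.op_names:
            raise ValueError(f"No Operation named [{op_name}] in the Graph")
        self.executed.append(op_name)


def init_tables_if_present(graph: GraphStub) -> bool:
    """Run the table initializer only if the graph actually defines it."""
    if INIT_OP in graph.op_names:
        graph.run(INIT_OP)
        return True
    return False


# A graph with lookup tables and one without (hypothetical op lists).
with_tables = GraphStub(op_names={"init_all_tables", "encoder"})
without_tables = GraphStub(op_names={"decoder"})

ran_with = init_tables_if_present(with_tables)
ran_without = init_tables_if_present(without_tables)
```

In this sketch a graph without tables is simply left alone, rather than failing when the initializer is requested.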

ahmedlone127 and others added 11 commits February 1, 2024 00:12

fixed all sbt warnings

283be9a

remove file system url prefix (#14132)

9377bb3

SPARKNLP-942: MPNet Classifiers (#14147)

db55524

* SPARKNLP-942: MPNetForSequenceClassification
* SPARKNLP-942: MPNetForQuestionAnswering
* SPARKNLP-942: MPNet Classifiers Documentation
* Restore RobertaforQA bugfix

adding import notebook + changing default model + adding onnx support (…

37c4df2

…#14158)

Doc sim rank as retriever (#14149)

54d4605

* Added retrieval interface to the doc sim rank approach
* Added Python interface as retriever in doc sim ranker

---------

Co-authored-by: Stefano Lori <s.lori@izicap.com>

812 implement de berta for zero shot classification annotator (#14151)

6566239

* adding code
* adding notebook for import

---------

Co-authored-by: Maziyar Panahi <maziyar.panahi@iscpif.fr>

Add notebook for fine tuning sbert (#14152)

2e8410a

[SPARKNLP-986] Fixing optional input col validations (#14153)

c97e877

[SPARKNLP-984] Fixing Deberta notebooks URIs (#14154)

0e01a2c

maziyarpanahi added the enhancement, documentation, bug-fix, new-feature (Introducing a new feature), new model, and DON'T MERGE (Do not merge this PR) labels Feb 6, 2024

maziyarpanahi self-assigned this Feb 6, 2024

SPARKNLP-985: Add flexible naming for onnx_data (#14165)

2efa215

Some annotators might have different naming schemes for their files. Added a parameter to control this.
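
As a rough illustration of what a configurable naming scheme could look like, the sketch below derives the external data file path from the model file path. This is an assumption-laden example, not the code from this commit: the function `onnx_data_path` and its `data_file_suffix` parameter are hypothetical names chosen for illustration.

```python
from pathlib import Path


def onnx_data_path(model_file: str, data_file_suffix: str = "_data") -> Path:
    """Build the external data file path next to the model file.

    `data_file_suffix` is a hypothetical parameter: it is appended to the
    model file name so callers can match a different naming scheme.
    """
    model_path = Path(model_file)
    return model_path.with_name(model_path.name + data_file_suffix)


# Default scheme: <model file name> + "_data"
default_path = onnx_data_path("models/decoder.onnx")

# A different scheme selected through the parameter
custom_path = onnx_data_path("models/decoder.onnx", data_file_suffix=".data")
```

Keeping the suffix as a parameter with a default means existing callers are unaffected, while annotators whose files follow another scheme can override it.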

maziyarpanahi changed the title ~~remove file system url prefix (#14132)~~ release/530-release-candidate Feb 8, 2024

maziyarpanahi and others added 10 commits February 8, 2024 09:53

Add LLAMA2Transformer and M2M100Transformer to annotator

8d66d3b

Add LLAMA2Transformer and M2M100Transformer to ResourceDownloader

41d2e1b

Merge branch 'release/530-release-candidate' of github.com:johnsnowla…

bb9f58b

…bs/spark-nlp into release/530-release-candidate

bump version to 5.3.0 [skip test]

08e9211

SPARKNLP-999: Fix remote model loading for some onnx models

6010244

used filesystem to check for the onnx_data file (#14169)

0e9b54d

[SPARKNLP-940] Adding changes to correctly copy cluster index storage… (

219fc19

#14167)

* [SPARKNLP-940] Adding changes to correctly copy cluster index storage when defined
* [SPARKNLP-940] Moving local mode control to its right place
* [SPARKNLP-940] Refactoring sentToCLuster method

[SPARKNLP-988] Updating EntityRuler documentation (#14168)

f00f11a

[SPARKNLP-940] Adding changes to support storage temp directory (clus…

1175050

…ter_tmp_dir)

maziyarpanahi linked an issue Feb 19, 2024 that may be closed by this pull request

No Operation named [init_all_tables] in the Graph when using GPT2Transformer #14110

Closed

1 task

ahmedlone127 and others added 19 commits February 19, 2024 15:47

fixes python documentation (#14172)

3cff1f8

revert MarianTransformer.scala

4e59301

revert HasBatchedAnnotate.scala

47ab709

revert Preprocessor.scala

e5cfd63

Revert ViTClassifier.scala

1bf9220

disable hard exception

eb91fde

Merge pull request #14156 from JohnSnowLabs/SPARKNLP-975-Fix-all-the-…

5067417

…warnings-in-SBT-build fixed all sbt warnings

Replace hard exception with soft logs (#14179)

94f6900

This reverts commit eb91fde.

move the example from root to examples/ [skip test]

59e98b3

Cleanup some code [skip test]

67917f0

Update onnxruntime to 1.17.0 [skip test]

e4f3310

Fix M2M100 default model's name [skip test]

318c3b2

Update docs [run doc]

e38f15e

Update Scala and Python APIs

bbbddd3

Fix unit test for DocSim [skip test]

71ee817

Merge branch 'release/530-release-candidate' of github.com:johnsnowla…

89457f0

…bs/spark-nlp into release/530-release-candidate

Fix onnx try/catch for MPNet classifier [ski test]

e9fdbe6

Update CHANGELOG [run doc]

af1536b

Publish 5.3.0 on Conda [skip test]

fa2cb23

maziyarpanahi merged commit ad5a4ea into master Feb 27, 2024
5 checks passed
