From bb19a1c185072ffa2d567c9bce1652e4807529f0 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:10:20 +0700 Subject: [PATCH 001/155] Add model 2023-08-18-mpnet_embedding_mpnet_snli_en --- ...023-08-18-mpnet_embedding_mpnet_snli_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_snli_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_snli_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_snli_en.md new file mode 100644 index 00000000000000..d2e332fcb2bdd7 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_snli_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_mpnet_snli TFMPNetModel from jamescalam +author: John Snow Labs +name: mpnet_embedding_mpnet_snli +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_mpnet_snli` is a English model originally trained by jamescalam. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_snli_en_5.1.0_3.0_1692374834042.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_snli_en_5.1.0_3.0_1692374834042.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_mpnet_snli", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_mpnet_snli", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, speechToText)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_mpnet_snli| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|399.4 MB| \ No newline at end of file From 9e0ef487e9fc25cfb37829ad017cea6031ec71ad Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:11:20 +0700 Subject: [PATCH 002/155] Add model 2023-08-18-mpnet_embedding_Setfit_few_shot_classifier_en --- ...embedding_Setfit_few_shot_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_Setfit_few_shot_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_Setfit_few_shot_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_Setfit_few_shot_classifier_en.md new file mode 100644 index 00000000000000..083a38728d8496 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_Setfit_few_shot_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_Setfit_few_shot_classifier TFMPNetModel from Kuaaangwen +author: John Snow Labs +name: mpnet_embedding_Setfit_few_shot_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_Setfit_few_shot_classifier` is a English model originally trained by Kuaaangwen. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_Setfit_few_shot_classifier_en_5.1.0_3.0_1692374847627.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_Setfit_few_shot_classifier_en_5.1.0_3.0_1692374847627.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_Setfit_few_shot_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_Setfit_few_shot_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, speechToText)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_Setfit_few_shot_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From b079d181daac9da0702591f30a8cf615c80bbf88 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:12:21 +0700 Subject: [PATCH 003/155] Add model 2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en --- ...ng_multi_qa_mpnet_base_dot_v1_eclass_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en.md new file mode 100644 index 00000000000000..3dcf873eaf3a57 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass TFMPNetModel from JoBeer +author: John Snow Labs +name: mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass` is a English model originally trained by JoBeer. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en_5.1.0_3.0_1692374614697.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en_5.1.0_3.0_1692374614697.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, speechToText)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 2ca6d72bf09161f132e0f7ec1848858c47dcbd9f Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:13:21 +0700 Subject: [PATCH 004/155] Add model 2023-08-18-mpnet_embedding_all_mpnet_base_v2_embedding_all_en --- ...ding_all_mpnet_base_v2_embedding_all_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_embedding_all_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_embedding_all_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_embedding_all_en.md new file mode 100644 index 00000000000000..a0ca15be2a9423 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_embedding_all_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_all_mpnet_base_v2_embedding_all TFMPNetModel from LLukas22 +author: John Snow Labs +name: mpnet_embedding_all_mpnet_base_v2_embedding_all +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_all_mpnet_base_v2_embedding_all` is a English model originally trained by LLukas22. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_embedding_all_en_5.1.0_3.0_1692375164501.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_embedding_all_en_5.1.0_3.0_1692375164501.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_all_mpnet_base_v2_embedding_all", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_all_mpnet_base_v2_embedding_all", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_all_mpnet_base_v2_embedding_all| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.3 MB| \ No newline at end of file From 21ce86838b1a6483f8af9588548bd893c9d2572e Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:14:21 +0700 Subject: [PATCH 005/155] Add model 2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en --- ...-mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en.md index 3dcf873eaf3a57..695ca8c2aefc6e 100644 --- a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en.md +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en.md @@ -24,8 +24,8 @@ Pretrained mpnet model, adapted from Hugging Face and curated to provide scalab {:.btn-box} -[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en_5.1.0_3.0_1692374614697.zip){:.button.button-orange.button-orange-trans.arr.button-icon} -[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en_5.1.0_3.0_1692374614697.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en_5.1.0_3.0_1692375204024.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_mpnet_base_dot_v1_eclass_en_5.1.0_3.0_1692375204024.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} ## How to use @@ -64,7 +64,7 @@ val instruction = MPNetEmbeddings .setInputCols(Array("documents")) .setOutputCol("mpnet_embeddings") -val pipeline = new Pipeline().setStages(Array(document_assembler, speechToText)) +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) val pipelineModel = pipeline.fit(data) From 3c993947143ab55f19f274f41a9669e102231cec Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:15:22 +0700 Subject: [PATCH 006/155] Add model 2023-08-18-mpnet_embedding_PatentSBERTa_en --- ...3-08-18-mpnet_embedding_PatentSBERTa_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_PatentSBERTa_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_PatentSBERTa_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_PatentSBERTa_en.md new file mode 100644 index 00000000000000..070692a5b3a101 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_PatentSBERTa_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_PatentSBERTa TFMPNetModel from AI-Growth-Lab +author: John Snow Labs +name: mpnet_embedding_PatentSBERTa +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_PatentSBERTa` is a English model originally trained by AI-Growth-Lab. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_PatentSBERTa_en_5.1.0_3.0_1692375218186.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_PatentSBERTa_en_5.1.0_3.0_1692375218186.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_PatentSBERTa", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_PatentSBERTa", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_PatentSBERTa| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.0 MB| \ No newline at end of file From a38546f09e5b1d1cff0efe442349ed6408af2d81 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:16:22 +0700 Subject: [PATCH 007/155] Add model 2023-08-18-mpnet_embedding_ecolo_pas_ecolo_v0.1_en --- ...mpnet_embedding_ecolo_pas_ecolo_v0.1_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ecolo_pas_ecolo_v0.1_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ecolo_pas_ecolo_v0.1_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ecolo_pas_ecolo_v0.1_en.md new file mode 100644 index 00000000000000..764601fa7edb81 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ecolo_pas_ecolo_v0.1_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_ecolo_pas_ecolo_v0.1 TFMPNetModel from eclaircies +author: John Snow Labs +name: mpnet_embedding_ecolo_pas_ecolo_v0.1 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_ecolo_pas_ecolo_v0.1` is a English model originally trained by eclaircies. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_ecolo_pas_ecolo_v0.1_en_5.1.0_3.0_1692375351351.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_ecolo_pas_ecolo_v0.1_en_5.1.0_3.0_1692375351351.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_ecolo_pas_ecolo_v0.1", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_ecolo_pas_ecolo_v0.1", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_ecolo_pas_ecolo_v0.1| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 169ad488324f14a3dda2285f0a26f6ddd3a31882 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:17:22 +0700 Subject: [PATCH 008/155] Add model 2023-08-18-mpnet_embedding_Setfit_few_shot_classifier_en --- ...3-08-18-mpnet_embedding_Setfit_few_shot_classifier_en.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_Setfit_few_shot_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_Setfit_few_shot_classifier_en.md index 083a38728d8496..a5136966704ae5 100644 --- a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_Setfit_few_shot_classifier_en.md +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_Setfit_few_shot_classifier_en.md @@ -24,8 +24,8 @@ Pretrained mpnet model, adapted from Hugging Face and curated to provide scalab {:.btn-box} -[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_Setfit_few_shot_classifier_en_5.1.0_3.0_1692374847627.zip){:.button.button-orange.button-orange-trans.arr.button-icon} -[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_Setfit_few_shot_classifier_en_5.1.0_3.0_1692374847627.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_Setfit_few_shot_classifier_en_5.1.0_3.0_1692375392849.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_Setfit_few_shot_classifier_en_5.1.0_3.0_1692375392849.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} ## How to use @@ -64,7 +64,7 @@ val instruction = MPNetEmbeddings .setInputCols(Array("documents")) .setOutputCol("mpnet_embeddings") -val pipeline = new Pipeline().setStages(Array(document_assembler, speechToText)) +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) val pipelineModel = pipeline.fit(data) From a9bd62b99f301525f27265195885caaf8f143a94 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:18:22 +0700 Subject: [PATCH 009/155] Add model 2023-08-18-mpnet_embedding_all_mpnet_base_v2_finetuned_v2_en --- ...dding_all_mpnet_base_v2_finetuned_v2_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_finetuned_v2_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_finetuned_v2_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_finetuned_v2_en.md new file mode 100644 index 00000000000000..465721618eee23 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_finetuned_v2_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_all_mpnet_base_v2_finetuned_v2 TFMPNetModel from Humair +author: John Snow Labs +name: mpnet_embedding_all_mpnet_base_v2_finetuned_v2 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_all_mpnet_base_v2_finetuned_v2` is a English model originally trained by Humair. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_finetuned_v2_en_5.1.0_3.0_1692375397544.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_finetuned_v2_en_5.1.0_3.0_1692375397544.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_all_mpnet_base_v2_finetuned_v2", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_all_mpnet_base_v2_finetuned_v2", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_all_mpnet_base_v2_finetuned_v2| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.6 MB| \ No newline at end of file From 49355d7f3eea70d0ce54ad580ec049815fef192a Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:19:23 +0700 Subject: [PATCH 010/155] Add model 2023-08-18-mpnet_embedding_FewShotIssueClassifier_NLBSE23_en --- ...dding_FewShotIssueClassifier_NLBSE23_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_FewShotIssueClassifier_NLBSE23_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_FewShotIssueClassifier_NLBSE23_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_FewShotIssueClassifier_NLBSE23_en.md new file mode 100644 index 00000000000000..7ce867459489e7 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_FewShotIssueClassifier_NLBSE23_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_FewShotIssueClassifier_NLBSE23 TFMPNetModel from PeppoCola +author: John Snow Labs +name: mpnet_embedding_FewShotIssueClassifier_NLBSE23 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_FewShotIssueClassifier_NLBSE23` is a English model originally trained by PeppoCola. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_FewShotIssueClassifier_NLBSE23_en_5.1.0_3.0_1692375535519.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_FewShotIssueClassifier_NLBSE23_en_5.1.0_3.0_1692375535519.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_FewShotIssueClassifier_NLBSE23", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_FewShotIssueClassifier_NLBSE23", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_FewShotIssueClassifier_NLBSE23| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.0 MB| \ No newline at end of file From 28140456fd07b0969c2b6a2159d985ffbbc2b823 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:20:23 +0700 Subject: [PATCH 011/155] Add model 2023-08-18-mpnet_embedding_nooks_amd_detection_v2_full_en --- ...mbedding_nooks_amd_detection_v2_full_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nooks_amd_detection_v2_full_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nooks_amd_detection_v2_full_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nooks_amd_detection_v2_full_en.md new file mode 100644 index 00000000000000..dff60b46f15b81 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nooks_amd_detection_v2_full_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_nooks_amd_detection_v2_full TFMPNetModel from nikcheerla +author: John Snow Labs +name: mpnet_embedding_nooks_amd_detection_v2_full +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_nooks_amd_detection_v2_full` is a English model originally trained by nikcheerla. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_nooks_amd_detection_v2_full_en_5.1.0_3.0_1692375579854.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_nooks_amd_detection_v2_full_en_5.1.0_3.0_1692375579854.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_nooks_amd_detection_v2_full", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_nooks_amd_detection_v2_full", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_nooks_amd_detection_v2_full| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.0 MB| \ No newline at end of file From 46560a43f2b22563873cf04a63663b8cc0424a61 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:21:24 +0700 Subject: [PATCH 012/155] Add model 2023-08-18-mpnet_embedding_action_policy_plans_classifier_en --- ...dding_action_policy_plans_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_action_policy_plans_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_action_policy_plans_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_action_policy_plans_classifier_en.md new file mode 100644 index 00000000000000..0287ba11282f9e --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_action_policy_plans_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_action_policy_plans_classifier TFMPNetModel from ppsingh +author: John Snow Labs +name: mpnet_embedding_action_policy_plans_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_action_policy_plans_classifier` is a English model originally trained by ppsingh. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_action_policy_plans_classifier_en_5.1.0_3.0_1692375624301.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_action_policy_plans_classifier_en_5.1.0_3.0_1692375624301.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_action_policy_plans_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_action_policy_plans_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_action_policy_plans_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From c9fdd356d7e56de8df7caecf9d075d228c6f50c6 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:25:25 +0700 Subject: [PATCH 013/155] Add model 2023-08-18-mpnet_embedding_cross_all_mpnet_base_v2_finetuned_WebNLG2020_metric_average_en --- ..._finetuned_WebNLG2020_metric_average_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_cross_all_mpnet_base_v2_finetuned_WebNLG2020_metric_average_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_cross_all_mpnet_base_v2_finetuned_WebNLG2020_metric_average_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_cross_all_mpnet_base_v2_finetuned_WebNLG2020_metric_average_en.md new file mode 100644 index 00000000000000..cb67773aa568f7 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_cross_all_mpnet_base_v2_finetuned_WebNLG2020_metric_average_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_cross_all_mpnet_base_v2_finetuned_WebNLG2020_metric_average TFMPNetModel from teven +author: John Snow Labs +name: mpnet_embedding_cross_all_mpnet_base_v2_finetuned_WebNLG2020_metric_average +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_cross_all_mpnet_base_v2_finetuned_WebNLG2020_metric_average` is a English model originally trained by teven. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_cross_all_mpnet_base_v2_finetuned_WebNLG2020_metric_average_en_5.1.0_3.0_1692375601480.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_cross_all_mpnet_base_v2_finetuned_WebNLG2020_metric_average_en_5.1.0_3.0_1692375601480.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_cross_all_mpnet_base_v2_finetuned_WebNLG2020_metric_average", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_cross_all_mpnet_base_v2_finetuned_WebNLG2020_metric_average", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_cross_all_mpnet_base_v2_finetuned_WebNLG2020_metric_average| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From fad6f8dd6ceb4ec63b01e5ef0bbdb0b7fb4d7d59 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:26:25 +0700 Subject: [PATCH 014/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_func_en --- ...ero_shot_classification_pbsp_p3_func_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_func_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_func_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_func_en.md new file mode 100644 index 00000000000000..5f32be3634d7a6 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_func_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_func TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_func +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_func` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_func_en_5.1.0_3.0_1692375940356.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_func_en_5.1.0_3.0_1692375940356.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_func", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_func", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_func| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From fd167edec837259b7f0e006047ee0396c3470cd3 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:27:25 +0700 Subject: [PATCH 015/155] Add model 2023-08-18-mpnet_embedding_ouvrage_classif_en --- ...8-18-mpnet_embedding_ouvrage_classif_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ouvrage_classif_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ouvrage_classif_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ouvrage_classif_en.md new file mode 100644 index 00000000000000..f17a6564e9a9e8 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ouvrage_classif_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_ouvrage_classif TFMPNetModel from TomPWM +author: John Snow Labs +name: mpnet_embedding_ouvrage_classif +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_ouvrage_classif` is a English model originally trained by TomPWM. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_ouvrage_classif_en_5.1.0_3.0_1692376002579.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_ouvrage_classif_en_5.1.0_3.0_1692376002579.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_ouvrage_classif", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_ouvrage_classif", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_ouvrage_classif| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From ecea0caf1df4c0d6fe4a8a28eb12618a5f11be60 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:28:26 +0700 Subject: [PATCH 016/155] Add model 2023-08-18-mpnet_embedding_tiny_random_MPNetForTokenClassification_en --- ...y_random_MPNetForTokenClassification_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForTokenClassification_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForTokenClassification_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForTokenClassification_en.md new file mode 100644 index 00000000000000..095e0d6663c917 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForTokenClassification_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_tiny_random_MPNetForTokenClassification TFMPNetModel from hf-tiny-model-private +author: John Snow Labs +name: mpnet_embedding_tiny_random_MPNetForTokenClassification +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_tiny_random_MPNetForTokenClassification` is a English model originally trained by hf-tiny-model-private. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_tiny_random_MPNetForTokenClassification_en_5.1.0_3.0_1692376082863.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_tiny_random_MPNetForTokenClassification_en_5.1.0_3.0_1692376082863.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_tiny_random_MPNetForTokenClassification", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_tiny_random_MPNetForTokenClassification", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_tiny_random_MPNetForTokenClassification| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|1.2 MB| \ No newline at end of file From f09fbb59be8f0b701ccac4f627f199c9da4d18a0 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:29:26 +0700 Subject: [PATCH 017/155] Add model 2023-08-18-mpnet_embedding_review_intent_20230116_en --- ...net_embedding_review_intent_20230116_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_review_intent_20230116_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_review_intent_20230116_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_review_intent_20230116_en.md new file mode 100644 index 00000000000000..9c09ccabbc29b2 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_review_intent_20230116_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_review_intent_20230116 TFMPNetModel from meichen91 +author: John Snow Labs +name: mpnet_embedding_review_intent_20230116 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_review_intent_20230116` is a English model originally trained by meichen91. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_review_intent_20230116_en_5.1.0_3.0_1692376123645.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_review_intent_20230116_en_5.1.0_3.0_1692376123645.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_review_intent_20230116", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_review_intent_20230116", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_review_intent_20230116| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 0bca523ff9230d88f074c7adc6bf1480e7c0e2c0 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:30:26 +0700 Subject: [PATCH 018/155] Add model 2023-08-18-mpnet_embedding_spiced_en --- .../2023-08-18-mpnet_embedding_spiced_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_spiced_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_spiced_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_spiced_en.md new file mode 100644 index 00000000000000..d7339e83a36d92 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_spiced_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_spiced TFMPNetModel from copenlu +author: John Snow Labs +name: mpnet_embedding_spiced +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_spiced` is a English model originally trained by copenlu. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_spiced_en_5.1.0_3.0_1692376162204.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_spiced_en_5.1.0_3.0_1692376162204.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_spiced", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_spiced", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_spiced| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 2bb7410ab6ad112fb455d836796154f137627d42 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:31:27 +0700 Subject: [PATCH 019/155] Add model 2023-08-18-mpnet_embedding_mpnet_snli_negatives_en --- ...mpnet_embedding_mpnet_snli_negatives_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_snli_negatives_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_snli_negatives_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_snli_negatives_en.md new file mode 100644 index 00000000000000..38434753e80c99 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_snli_negatives_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_mpnet_snli_negatives TFMPNetModel from jamescalam +author: John Snow Labs +name: mpnet_embedding_mpnet_snli_negatives +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_mpnet_snli_negatives` is a English model originally trained by jamescalam. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_snli_negatives_en_5.1.0_3.0_1692375780359.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_snli_negatives_en_5.1.0_3.0_1692375780359.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_mpnet_snli_negatives", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_mpnet_snli_negatives", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_mpnet_snli_negatives| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|401.1 MB| \ No newline at end of file From bba436b323dd2c866e6a0f405c8f3d396166ac4a Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:32:27 +0700 Subject: [PATCH 020/155] Add model 2023-08-18-mpnet_embedding_review_multiclass_20230116_en --- ...embedding_review_multiclass_20230116_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_review_multiclass_20230116_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_review_multiclass_20230116_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_review_multiclass_20230116_en.md new file mode 100644 index 00000000000000..e6ea0df2da10da --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_review_multiclass_20230116_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_review_multiclass_20230116 TFMPNetModel from meichen91 +author: John Snow Labs +name: mpnet_embedding_review_multiclass_20230116 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_review_multiclass_20230116` is a English model originally trained by meichen91. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_review_multiclass_20230116_en_5.1.0_3.0_1692376308718.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_review_multiclass_20230116_en_5.1.0_3.0_1692376308718.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_review_multiclass_20230116", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_review_multiclass_20230116", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_review_multiclass_20230116| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 1bd1c8d2e13eea76531bad7970574c4e16d67aef Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:33:27 +0700 Subject: [PATCH 021/155] Add model 2023-08-18-mpnet_embedding_contradiction_psb_lds_en --- ...pnet_embedding_contradiction_psb_lds_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_contradiction_psb_lds_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_contradiction_psb_lds_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_contradiction_psb_lds_en.md new file mode 100644 index 00000000000000..e7e7b02497623b --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_contradiction_psb_lds_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_contradiction_psb_lds TFMPNetModel from nategro +author: John Snow Labs +name: mpnet_embedding_contradiction_psb_lds +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_contradiction_psb_lds` is a English model originally trained by nategro. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_contradiction_psb_lds_en_5.1.0_3.0_1692376358959.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_contradiction_psb_lds_en_5.1.0_3.0_1692376358959.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_contradiction_psb_lds", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_contradiction_psb_lds", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_contradiction_psb_lds| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.1 MB| \ No newline at end of file From c71e4a2837e5386b51e9192070c1f91972f03c10 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:34:27 +0700 Subject: [PATCH 022/155] Add model 2023-08-18-mpnet_embedding_setfit_alpaca_es_unprocessable_sample_detection_es --- ...ca_es_unprocessable_sample_detection_es.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_alpaca_es_unprocessable_sample_detection_es.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_alpaca_es_unprocessable_sample_detection_es.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_alpaca_es_unprocessable_sample_detection_es.md new file mode 100644 index 00000000000000..fba091d7f83686 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_alpaca_es_unprocessable_sample_detection_es.md @@ -0,0 +1,88 @@ +--- +layout: model +title: Castilian, Spanish mpnet_embedding_setfit_alpaca_es_unprocessable_sample_detection TFMPNetModel from hackathon-somos-nlp-2023 +author: John Snow Labs +name: mpnet_embedding_setfit_alpaca_es_unprocessable_sample_detection +date: 2023-08-18 +tags: [mpnet, es, open_source, tensorflow] +task: Embeddings +language: es +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_alpaca_es_unprocessable_sample_detection` is a Castilian, Spanish model originally trained by hackathon-somos-nlp-2023. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_alpaca_es_unprocessable_sample_detection_es_5.1.0_3.0_1692376401533.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_alpaca_es_unprocessable_sample_detection_es_5.1.0_3.0_1692376401533.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_alpaca_es_unprocessable_sample_detection", "es")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_alpaca_es_unprocessable_sample_detection", "es") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_alpaca_es_unprocessable_sample_detection| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|es| +|Size:|410.2 MB| \ No newline at end of file From e4225f78eb59fab3dcd0a74368e6d7385ab9373a Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:35:28 +0700 Subject: [PATCH 023/155] Add model 2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_legal_finetune_en --- ..._qa_mpnet_base_dot_v1_legal_finetune_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_legal_finetune_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_legal_finetune_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_legal_finetune_en.md new file mode 100644 index 00000000000000..3af998f5f8239d --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_legal_finetune_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_multi_qa_mpnet_base_dot_v1_legal_finetune TFMPNetModel from oliviamga2 +author: John Snow Labs +name: mpnet_embedding_multi_qa_mpnet_base_dot_v1_legal_finetune +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_multi_qa_mpnet_base_dot_v1_legal_finetune` is a English model originally trained by oliviamga2. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_mpnet_base_dot_v1_legal_finetune_en_5.1.0_3.0_1692376503346.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_mpnet_base_dot_v1_legal_finetune_en_5.1.0_3.0_1692376503346.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_multi_qa_mpnet_base_dot_v1_legal_finetune", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_multi_qa_mpnet_base_dot_v1_legal_finetune", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_multi_qa_mpnet_base_dot_v1_legal_finetune| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.0 MB| \ No newline at end of file From e9ba1b7c936f0fccdbf7131bc0832fdfacd8f452 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:36:28 +0700 Subject: [PATCH 024/155] Add model 2023-08-18-mpnet_embedding_nps_psb_lds_en --- ...23-08-18-mpnet_embedding_nps_psb_lds_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nps_psb_lds_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nps_psb_lds_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nps_psb_lds_en.md new file mode 100644 index 00000000000000..aa69e1f8dfcf36 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nps_psb_lds_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_nps_psb_lds TFMPNetModel from nategro +author: John Snow Labs +name: mpnet_embedding_nps_psb_lds +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_nps_psb_lds` is a English model originally trained by nategro. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_nps_psb_lds_en_5.1.0_3.0_1692376546159.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_nps_psb_lds_en_5.1.0_3.0_1692376546159.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_nps_psb_lds", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_nps_psb_lds", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_nps_psb_lds| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.1 MB| \ No newline at end of file From d757be432af3e1939db7a23da5f9ff2863ab8a0a Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:37:28 +0700 Subject: [PATCH 025/155] Add model 2023-08-18-mpnet_embedding_ATTACK_BERT_en --- ...23-08-18-mpnet_embedding_ATTACK_BERT_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ATTACK_BERT_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ATTACK_BERT_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ATTACK_BERT_en.md new file mode 100644 index 00000000000000..8d7a64142160ff --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ATTACK_BERT_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_ATTACK_BERT TFMPNetModel from basel +author: John Snow Labs +name: mpnet_embedding_ATTACK_BERT +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_ATTACK_BERT` is a English model originally trained by basel. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_ATTACK_BERT_en_5.1.0_3.0_1692376584683.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_ATTACK_BERT_en_5.1.0_3.0_1692376584683.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_ATTACK_BERT", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_ATTACK_BERT", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_ATTACK_BERT| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 723aa3310707352e72c19a6360fa2503ac4c48a9 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:38:28 +0700 Subject: [PATCH 026/155] Add model 2023-08-18-mpnet_embedding_setfit_ethos_multilabel_example_en --- ...ding_setfit_ethos_multilabel_example_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ethos_multilabel_example_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ethos_multilabel_example_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ethos_multilabel_example_en.md new file mode 100644 index 00000000000000..b7cfc9b51ed4f0 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ethos_multilabel_example_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_ethos_multilabel_example TFMPNetModel from neilthematic +author: John Snow Labs +name: mpnet_embedding_setfit_ethos_multilabel_example +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_ethos_multilabel_example` is a English model originally trained by neilthematic. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ethos_multilabel_example_en_5.1.0_3.0_1692376693736.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ethos_multilabel_example_en_5.1.0_3.0_1692376693736.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_ethos_multilabel_example", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_ethos_multilabel_example", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_ethos_multilabel_example| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 77bb39a72d5cb111e5f8385548962111880524e1 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:39:28 +0700 Subject: [PATCH 027/155] Add model 2023-08-18-mpnet_embedding_mpnet_snli_en --- .../2023-08-18-mpnet_embedding_mpnet_snli_en.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_snli_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_snli_en.md index d2e332fcb2bdd7..e3ffc25679bde7 100644 --- a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_snli_en.md +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_snli_en.md @@ -24,8 +24,8 @@ Pretrained mpnet model, adapted from Hugging Face and curated to provide scalab {:.btn-box} -[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_snli_en_5.1.0_3.0_1692374834042.zip){:.button.button-orange.button-orange-trans.arr.button-icon} -[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_snli_en_5.1.0_3.0_1692374834042.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_snli_en_5.1.0_3.0_1692375412994.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_snli_en_5.1.0_3.0_1692375412994.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} ## How to use @@ -64,7 +64,7 @@ val instruction = MPNetEmbeddings .setInputCols(Array("documents")) .setOutputCol("mpnet_embeddings") -val pipeline = new Pipeline().setStages(Array(document_assembler, speechToText)) +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) val pipelineModel = pipeline.fit(data) From 67d2373f9c2965ccc49b80b6556da502ba44b889 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:40:29 +0700 Subject: [PATCH 028/155] Add model 2023-08-18-mpnet_embedding_multi_QA_v1_mpnet_asymmetric_A_en --- ...dding_multi_QA_v1_mpnet_asymmetric_A_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_QA_v1_mpnet_asymmetric_A_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_QA_v1_mpnet_asymmetric_A_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_QA_v1_mpnet_asymmetric_A_en.md new file mode 100644 index 00000000000000..4125d55cb5e7b3 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_QA_v1_mpnet_asymmetric_A_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_multi_QA_v1_mpnet_asymmetric_A TFMPNetModel from flax-sentence-embeddings +author: John Snow Labs +name: mpnet_embedding_multi_QA_v1_mpnet_asymmetric_A +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_multi_QA_v1_mpnet_asymmetric_A` is a English model originally trained by flax-sentence-embeddings. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_QA_v1_mpnet_asymmetric_A_en_5.1.0_3.0_1692376012864.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_QA_v1_mpnet_asymmetric_A_en_5.1.0_3.0_1692376012864.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_multi_QA_v1_mpnet_asymmetric_A", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_multi_QA_v1_mpnet_asymmetric_A", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_multi_QA_v1_mpnet_asymmetric_A| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.6 MB| \ No newline at end of file From a4668fbfd07c2d5df9de3d799f1582765e53c63d Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:41:29 +0700 Subject: [PATCH 029/155] Add model 2023-08-18-mpnet_embedding_contradiction_psb_en --- ...18-mpnet_embedding_contradiction_psb_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_contradiction_psb_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_contradiction_psb_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_contradiction_psb_en.md new file mode 100644 index 00000000000000..bccef97f878860 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_contradiction_psb_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_contradiction_psb TFMPNetModel from nategro +author: John Snow Labs +name: mpnet_embedding_contradiction_psb +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_contradiction_psb` is a English model originally trained by nategro. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_contradiction_psb_en_5.1.0_3.0_1692376734761.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_contradiction_psb_en_5.1.0_3.0_1692376734761.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_contradiction_psb", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_contradiction_psb", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_contradiction_psb| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.1 MB| \ No newline at end of file From 5b6f2283cc11cd8a2c745a3215a9b9676cf5b0a9 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:42:29 +0700 Subject: [PATCH 030/155] Add model 2023-08-18-mpnet_embedding_sb_temfac_en --- ...2023-08-18-mpnet_embedding_sb_temfac_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sb_temfac_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sb_temfac_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sb_temfac_en.md new file mode 100644 index 00000000000000..68f0ceefd347ff --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sb_temfac_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_sb_temfac TFMPNetModel from stealthpy +author: John Snow Labs +name: mpnet_embedding_sb_temfac +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_sb_temfac` is a English model originally trained by stealthpy. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_sb_temfac_en_5.1.0_3.0_1692375226093.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_sb_temfac_en_5.1.0_3.0_1692375226093.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_sb_temfac", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_sb_temfac", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_sb_temfac| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.3 MB| \ No newline at end of file From 2908c5e08f33ad39aed25431339ba99eb494aa17 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:43:29 +0700 Subject: [PATCH 031/155] Add model 2023-08-18-mpnet_embedding_setfit_finetuned_financial_text_en --- ...ding_setfit_finetuned_financial_text_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_finetuned_financial_text_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_finetuned_financial_text_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_finetuned_financial_text_en.md new file mode 100644 index 00000000000000..c7be99ee2518be --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_finetuned_financial_text_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_finetuned_financial_text TFMPNetModel from nickmuchi +author: John Snow Labs +name: mpnet_embedding_setfit_finetuned_financial_text +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_finetuned_financial_text` is a English model originally trained by nickmuchi. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_finetuned_financial_text_en_5.1.0_3.0_1692376937069.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_finetuned_financial_text_en_5.1.0_3.0_1692376937069.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_finetuned_financial_text", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_finetuned_financial_text", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_finetuned_financial_text| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 46707b43e9cee3b33ec24ad518ee55fcaab890d9 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:44:30 +0700 Subject: [PATCH 032/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_comm_en --- ...ero_shot_classification_pbsp_p1_comm_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_comm_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_comm_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_comm_en.md new file mode 100644 index 00000000000000..5db33b416ff729 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_comm_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_comm TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_comm +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_comm` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_comm_en_5.1.0_3.0_1692376886552.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_comm_en_5.1.0_3.0_1692376886552.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_comm", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_comm", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_comm| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From d0d6a62cdf2b35ee2d9fd3f39986f02105cd6f29 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:45:30 +0700 Subject: [PATCH 033/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_life_en --- ...ero_shot_classification_pbsp_p1_life_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_life_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_life_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_life_en.md new file mode 100644 index 00000000000000..0f0182c48d53ec --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_life_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_life TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_life +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_life` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_life_en_5.1.0_3.0_1692377068378.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_life_en_5.1.0_3.0_1692377068378.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_life", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_life", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_life| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From a3b8401edcc8cbf9e82db0e743fe6aa347d4c6fd Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:46:30 +0700 Subject: [PATCH 034/155] Add model 2023-08-18-mpnet_embedding_setfit_model_en --- ...3-08-18-mpnet_embedding_setfit_model_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_model_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_model_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_model_en.md new file mode 100644 index 00000000000000..c3727f9548b043 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_model_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_model TFMPNetModel from Pradipta11 +author: John Snow Labs +name: mpnet_embedding_setfit_model +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_model` is a English model originally trained by Pradipta11. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_model_en_5.1.0_3.0_1692377155690.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_model_en_5.1.0_3.0_1692377155690.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_model", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_model", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_model| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 85c79e48907854f2df47d3bef08b2b4594ac0da3 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:47:31 +0700 Subject: [PATCH 035/155] Add model 2023-08-18-mpnet_embedding_multi_qa_mpnet_base_cos_v1_en --- ...embedding_multi_qa_mpnet_base_cos_v1_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_cos_v1_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_cos_v1_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_cos_v1_en.md new file mode 100644 index 00000000000000..0fef444dd2bc79 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_cos_v1_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_multi_qa_mpnet_base_cos_v1 TFMPNetModel from navteca +author: John Snow Labs +name: mpnet_embedding_multi_qa_mpnet_base_cos_v1 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_multi_qa_mpnet_base_cos_v1` is a English model originally trained by navteca. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_mpnet_base_cos_v1_en_5.1.0_3.0_1692377234842.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_mpnet_base_cos_v1_en_5.1.0_3.0_1692377234842.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_multi_qa_mpnet_base_cos_v1", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_multi_qa_mpnet_base_cos_v1", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_multi_qa_mpnet_base_cos_v1| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 0e981b99ebef9edbdfd98fd7712c2efc82ab1ffc Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:48:31 +0700 Subject: [PATCH 036/155] Add model 2023-08-18-mpnet_embedding_java_deprecation_classifier_en --- ...mbedding_java_deprecation_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_deprecation_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_deprecation_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_deprecation_classifier_en.md new file mode 100644 index 00000000000000..935bccbccd1285 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_deprecation_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_java_deprecation_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_java_deprecation_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_java_deprecation_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_deprecation_classifier_en_5.1.0_3.0_1692377248542.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_deprecation_classifier_en_5.1.0_3.0_1692377248542.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_java_deprecation_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_java_deprecation_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_java_deprecation_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 75c5e78ada8d40d7f7e9129fd7877804018bdb01 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:49:31 +0700 Subject: [PATCH 037/155] Add model 2023-08-18-mpnet_embedding_my_awesome_setfit_model_98_en --- ...embedding_my_awesome_setfit_model_98_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_my_awesome_setfit_model_98_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_my_awesome_setfit_model_98_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_my_awesome_setfit_model_98_en.md new file mode 100644 index 00000000000000..7960690d037bc6 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_my_awesome_setfit_model_98_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_my_awesome_setfit_model_98 TFMPNetModel from lewtun +author: John Snow Labs +name: mpnet_embedding_my_awesome_setfit_model_98 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_my_awesome_setfit_model_98` is a English model originally trained by lewtun. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_my_awesome_setfit_model_98_en_5.1.0_3.0_1692377344662.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_my_awesome_setfit_model_98_en_5.1.0_3.0_1692377344662.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_my_awesome_setfit_model_98", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_my_awesome_setfit_model_98", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_my_awesome_setfit_model_98| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 6647bb116c75a8b678cd92651bc21072c0822004 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:50:31 +0700 Subject: [PATCH 038/155] Add model 2023-08-18-mpnet_embedding_DomainAdaptM2_en --- ...-08-18-mpnet_embedding_DomainAdaptM2_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_DomainAdaptM2_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_DomainAdaptM2_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_DomainAdaptM2_en.md new file mode 100644 index 00000000000000..9a0c261ed16d69 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_DomainAdaptM2_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_DomainAdaptM2 TFMPNetModel from dani0f +author: John Snow Labs +name: mpnet_embedding_DomainAdaptM2 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_DomainAdaptM2` is a English model originally trained by dani0f. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_DomainAdaptM2_en_5.1.0_3.0_1692376943387.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_DomainAdaptM2_en_5.1.0_3.0_1692376943387.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_DomainAdaptM2", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_DomainAdaptM2", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_DomainAdaptM2| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From fca460f66af621639ba6ede2bec42b3bfae9daac Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:51:32 +0700 Subject: [PATCH 039/155] Add model 2023-08-18-mpnet_embedding_mpnet_retriever_squad2_en --- ...net_embedding_mpnet_retriever_squad2_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_retriever_squad2_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_retriever_squad2_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_retriever_squad2_en.md new file mode 100644 index 00000000000000..0f10150d16b34d --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_retriever_squad2_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_mpnet_retriever_squad2 TFMPNetModel from pinecone +author: John Snow Labs +name: mpnet_embedding_mpnet_retriever_squad2 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_mpnet_retriever_squad2` is a English model originally trained by pinecone. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_retriever_squad2_en_5.1.0_3.0_1692377425348.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_retriever_squad2_en_5.1.0_3.0_1692377425348.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_mpnet_retriever_squad2", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_mpnet_retriever_squad2", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_mpnet_retriever_squad2| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.6 MB| \ No newline at end of file From ba663352d99358fe46dbfc5849731976f77b2daf Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:52:32 +0700 Subject: [PATCH 040/155] Add model 2023-08-18-mpnet_embedding_tiny_random_MPNetForQuestionAnswering_en --- ...iny_random_MPNetForQuestionAnswering_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForQuestionAnswering_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForQuestionAnswering_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForQuestionAnswering_en.md new file mode 100644 index 00000000000000..0eb1f3d76262ee --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForQuestionAnswering_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_tiny_random_MPNetForQuestionAnswering TFMPNetModel from hf-tiny-model-private +author: John Snow Labs +name: mpnet_embedding_tiny_random_MPNetForQuestionAnswering +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_tiny_random_MPNetForQuestionAnswering` is a English model originally trained by hf-tiny-model-private. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_tiny_random_MPNetForQuestionAnswering_en_5.1.0_3.0_1692375853395.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_tiny_random_MPNetForQuestionAnswering_en_5.1.0_3.0_1692375853395.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_tiny_random_MPNetForQuestionAnswering", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_tiny_random_MPNetForQuestionAnswering", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_tiny_random_MPNetForQuestionAnswering| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|1.2 MB| \ No newline at end of file From 942c636d611733d2399e1bb8a3102162b3a8e374 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:53:32 +0700 Subject: [PATCH 041/155] Add model 2023-08-18-mpnet_embedding_sml_ukr_word_classifier_medium_en --- ...dding_sml_ukr_word_classifier_medium_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sml_ukr_word_classifier_medium_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sml_ukr_word_classifier_medium_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sml_ukr_word_classifier_medium_en.md new file mode 100644 index 00000000000000..3e9a6c880b2e29 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sml_ukr_word_classifier_medium_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_sml_ukr_word_classifier_medium TFMPNetModel from rodekruis +author: John Snow Labs +name: mpnet_embedding_sml_ukr_word_classifier_medium +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_sml_ukr_word_classifier_medium` is a English model originally trained by rodekruis. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_sml_ukr_word_classifier_medium_en_5.1.0_3.0_1692377549620.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_sml_ukr_word_classifier_medium_en_5.1.0_3.0_1692377549620.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_sml_ukr_word_classifier_medium", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_sml_ukr_word_classifier_medium", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_sml_ukr_word_classifier_medium| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 90241bb2ded31c689d461de081e36824f81ca8e7 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:54:33 +0700 Subject: [PATCH 042/155] Add model 2023-08-18-mpnet_embedding_all_mpnet_base_v1_en --- ...18-mpnet_embedding_all_mpnet_base_v1_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v1_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v1_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v1_en.md new file mode 100644 index 00000000000000..891a88263da98e --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v1_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_all_mpnet_base_v1 TFMPNetModel from sentence-transformers +author: John Snow Labs +name: mpnet_embedding_all_mpnet_base_v1 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_all_mpnet_base_v1` is a English model originally trained by sentence-transformers. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v1_en_5.1.0_3.0_1692377622026.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v1_en_5.1.0_3.0_1692377622026.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_all_mpnet_base_v1", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_all_mpnet_base_v1", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_all_mpnet_base_v1| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.6 MB| \ No newline at end of file From acda84842394b181cb1ec41325696ae862e1c8f2 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:55:33 +0700 Subject: [PATCH 043/155] Add model 2023-08-18-mpnet_embedding_nooks_amd_detection_realtime_en --- ...bedding_nooks_amd_detection_realtime_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nooks_amd_detection_realtime_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nooks_amd_detection_realtime_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nooks_amd_detection_realtime_en.md new file mode 100644 index 00000000000000..e025c59600e3ae --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nooks_amd_detection_realtime_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_nooks_amd_detection_realtime TFMPNetModel from nikcheerla +author: John Snow Labs +name: mpnet_embedding_nooks_amd_detection_realtime +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_nooks_amd_detection_realtime` is a English model originally trained by nikcheerla. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_nooks_amd_detection_realtime_en_5.1.0_3.0_1692375770246.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_nooks_amd_detection_realtime_en_5.1.0_3.0_1692375770246.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_nooks_amd_detection_realtime", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_nooks_amd_detection_realtime", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_nooks_amd_detection_realtime| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 0c959740048db0cd3c321d062221304f8eb8b5b2 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:56:33 +0700 Subject: [PATCH 044/155] Add model 2023-08-18-mpnet_embedding_setfit_model_test_sensitve_v1_en --- ...edding_setfit_model_test_sensitve_v1_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_model_test_sensitve_v1_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_model_test_sensitve_v1_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_model_test_sensitve_v1_en.md new file mode 100644 index 00000000000000..3b85c67565ca9c --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_model_test_sensitve_v1_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_model_test_sensitve_v1 TFMPNetModel from Adipta +author: John Snow Labs +name: mpnet_embedding_setfit_model_test_sensitve_v1 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_model_test_sensitve_v1` is a English model originally trained by Adipta. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_model_test_sensitve_v1_en_5.1.0_3.0_1692377739573.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_model_test_sensitve_v1_en_5.1.0_3.0_1692377739573.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_model_test_sensitve_v1", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_model_test_sensitve_v1", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_model_test_sensitve_v1| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 60da5546d93ab6077dedda3f05006879725ee5d6 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:57:34 +0700 Subject: [PATCH 045/155] Add model 2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_en --- ...embedding_multi_qa_mpnet_base_dot_v1_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_en.md new file mode 100644 index 00000000000000..c4ff822ac63084 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_mpnet_base_dot_v1_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_multi_qa_mpnet_base_dot_v1 TFMPNetModel from sentence-transformers +author: John Snow Labs +name: mpnet_embedding_multi_qa_mpnet_base_dot_v1 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_multi_qa_mpnet_base_dot_v1` is a English model originally trained by sentence-transformers. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_mpnet_base_dot_v1_en_5.1.0_3.0_1692377809432.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_mpnet_base_dot_v1_en_5.1.0_3.0_1692377809432.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_multi_qa_mpnet_base_dot_v1", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_multi_qa_mpnet_base_dot_v1", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_multi_qa_mpnet_base_dot_v1| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 447fd2eca7ca58ee9107d8cd6bd645c6f3397028 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:58:34 +0700 Subject: [PATCH 046/155] Add model 2023-08-18-mpnet_embedding_due_eshop_21_en --- ...3-08-18-mpnet_embedding_due_eshop_21_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_due_eshop_21_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_due_eshop_21_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_due_eshop_21_en.md new file mode 100644 index 00000000000000..e083fc0f2013b3 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_due_eshop_21_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_due_eshop_21 TFMPNetModel from konverner +author: John Snow Labs +name: mpnet_embedding_due_eshop_21 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_due_eshop_21` is a English model originally trained by konverner. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_due_eshop_21_en_5.1.0_3.0_1692376954136.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_due_eshop_21_en_5.1.0_3.0_1692376954136.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_due_eshop_21", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_due_eshop_21", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_due_eshop_21| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From ca326bbf01ad26d11df2ca14f725cd44afc2c1b1 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Fri, 18 Aug 2023 23:59:34 +0700 Subject: [PATCH 047/155] Add model 2023-08-18-mpnet_embedding_setfit_ag_news_endpoint_en --- ...et_embedding_setfit_ag_news_endpoint_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ag_news_endpoint_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ag_news_endpoint_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ag_news_endpoint_en.md new file mode 100644 index 00000000000000..ca5b5626b0dff0 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ag_news_endpoint_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_ag_news_endpoint TFMPNetModel from philschmid +author: John Snow Labs +name: mpnet_embedding_setfit_ag_news_endpoint +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_ag_news_endpoint` is a English model originally trained by philschmid. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ag_news_endpoint_en_5.1.0_3.0_1692377945921.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ag_news_endpoint_en_5.1.0_3.0_1692377945921.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_ag_news_endpoint", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_ag_news_endpoint", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_ag_news_endpoint| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 05e16ebca4b1fd8632b82895f93710a1fa01738a Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:00:34 +0700 Subject: [PATCH 048/155] Add model 2023-08-18-mpnet_embedding_setfit_ds_version_0_0_4_en --- ...et_embedding_setfit_ds_version_0_0_4_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_4_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_4_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_4_en.md new file mode 100644 index 00000000000000..77367c287800e2 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_4_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_ds_version_0_0_4 TFMPNetModel from amittian +author: John Snow Labs +name: mpnet_embedding_setfit_ds_version_0_0_4 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_ds_version_0_0_4` is a English model originally trained by amittian. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ds_version_0_0_4_en_5.1.0_3.0_1692377962339.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ds_version_0_0_4_en_5.1.0_3.0_1692377962339.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_ds_version_0_0_4", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_ds_version_0_0_4", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_ds_version_0_0_4| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From c47169e298e9273d86cd8c00611c29c21a32dd8f Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:01:34 +0700 Subject: [PATCH 049/155] Add model 2023-08-18-mpnet_embedding_nli_mpnet_base_v2_en --- ...18-mpnet_embedding_nli_mpnet_base_v2_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nli_mpnet_base_v2_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nli_mpnet_base_v2_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nli_mpnet_base_v2_en.md new file mode 100644 index 00000000000000..2fa2384a26ea39 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_nli_mpnet_base_v2_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_nli_mpnet_base_v2 TFMPNetModel from sentence-transformers +author: John Snow Labs +name: mpnet_embedding_nli_mpnet_base_v2 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_nli_mpnet_base_v2` is a English model originally trained by sentence-transformers. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_nli_mpnet_base_v2_en_5.1.0_3.0_1692377999396.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_nli_mpnet_base_v2_en_5.1.0_3.0_1692377999396.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_nli_mpnet_base_v2", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_nli_mpnet_base_v2", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_nli_mpnet_base_v2| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|408.5 MB| \ No newline at end of file From cb15be19f9fa710d139dc241aa8b28fc5cb6185c Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:02:35 +0700 Subject: [PATCH 050/155] Add model 2023-08-18-mpnet_embedding_sbert_paper_en --- ...23-08-18-mpnet_embedding_sbert_paper_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sbert_paper_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sbert_paper_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sbert_paper_en.md new file mode 100644 index 00000000000000..18d08e2a8164b7 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sbert_paper_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_sbert_paper TFMPNetModel from salsabiilashifa11 +author: John Snow Labs +name: mpnet_embedding_sbert_paper +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_sbert_paper` is a English model originally trained by salsabiilashifa11. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_sbert_paper_en_5.1.0_3.0_1692377968134.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_sbert_paper_en_5.1.0_3.0_1692377968134.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_sbert_paper", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_sbert_paper", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_sbert_paper| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.6 MB| \ No newline at end of file From afde56e1e7616929edf4b9047e461897be8436cb Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:03:35 +0700 Subject: [PATCH 051/155] Add model 2023-08-18-mpnet_embedding_test_food_en --- ...2023-08-18-mpnet_embedding_test_food_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_test_food_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_test_food_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_test_food_en.md new file mode 100644 index 00000000000000..dddc714dce2a1b --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_test_food_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_test_food TFMPNetModel from Linus4Lyf +author: John Snow Labs +name: mpnet_embedding_test_food +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_test_food` is a English model originally trained by Linus4Lyf. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_test_food_en_5.1.0_3.0_1692378151159.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_test_food_en_5.1.0_3.0_1692378151159.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_test_food", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_test_food", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_test_food| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From f46740682b042071be19392eb931734637d42245 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:04:35 +0700 Subject: [PATCH 052/155] Add model 2023-08-18-mpnet_embedding_labels_per_job_title_fine_tune_en --- ...dding_labels_per_job_title_fine_tune_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_labels_per_job_title_fine_tune_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_labels_per_job_title_fine_tune_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_labels_per_job_title_fine_tune_en.md new file mode 100644 index 00000000000000..ea158a4635d45a --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_labels_per_job_title_fine_tune_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_labels_per_job_title_fine_tune TFMPNetModel from marianodo +author: John Snow Labs +name: mpnet_embedding_labels_per_job_title_fine_tune +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_labels_per_job_title_fine_tune` is a English model originally trained by marianodo. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_labels_per_job_title_fine_tune_en_5.1.0_3.0_1692377349908.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_labels_per_job_title_fine_tune_en_5.1.0_3.0_1692377349908.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_labels_per_job_title_fine_tune", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_labels_per_job_title_fine_tune", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_labels_per_job_title_fine_tune| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From ad782cbcf38e9754142c46b243f0ebec5577f2b7 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:05:36 +0700 Subject: [PATCH 053/155] Add model 2023-08-18-mpnet_embedding_keyphrase_mpnet_v1_en --- ...8-mpnet_embedding_keyphrase_mpnet_v1_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_keyphrase_mpnet_v1_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_keyphrase_mpnet_v1_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_keyphrase_mpnet_v1_en.md new file mode 100644 index 00000000000000..fbc3401b41f28d --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_keyphrase_mpnet_v1_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_keyphrase_mpnet_v1 TFMPNetModel from uclanlp +author: John Snow Labs +name: mpnet_embedding_keyphrase_mpnet_v1 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_keyphrase_mpnet_v1` is a English model originally trained by uclanlp. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_keyphrase_mpnet_v1_en_5.1.0_3.0_1692377753915.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_keyphrase_mpnet_v1_en_5.1.0_3.0_1692377753915.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_keyphrase_mpnet_v1", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_keyphrase_mpnet_v1", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_keyphrase_mpnet_v1| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.6 MB| \ No newline at end of file From de54122bd0fb501717cbf14f1d271cd83fcb480d Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:06:36 +0700 Subject: [PATCH 054/155] Add model 2023-08-18-mpnet_embedding_setfit_occupation_en --- ...18-mpnet_embedding_setfit_occupation_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_occupation_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_occupation_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_occupation_en.md new file mode 100644 index 00000000000000..2e3bd0d8051043 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_occupation_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_occupation TFMPNetModel from ivanzidov +author: John Snow Labs +name: mpnet_embedding_setfit_occupation +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_occupation` is a English model originally trained by ivanzidov. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_occupation_en_5.1.0_3.0_1692378365223.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_occupation_en_5.1.0_3.0_1692378365223.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_occupation", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_occupation", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_occupation| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From ee578424a06eec0e527be03f24e32117701c9f9d Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:07:36 +0700 Subject: [PATCH 055/155] Add model 2023-08-18-mpnet_embedding_python_developmentnotes_classifier_en --- ...g_python_developmentnotes_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_developmentnotes_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_developmentnotes_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_developmentnotes_classifier_en.md new file mode 100644 index 00000000000000..26b9499d6f8e78 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_developmentnotes_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_python_developmentnotes_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_python_developmentnotes_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_python_developmentnotes_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_python_developmentnotes_classifier_en_5.1.0_3.0_1692378408726.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_python_developmentnotes_classifier_en_5.1.0_3.0_1692378408726.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_python_developmentnotes_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_python_developmentnotes_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_python_developmentnotes_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From ac99bcfd236db813c6c6c46b08449c1105598415 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:08:37 +0700 Subject: [PATCH 056/155] Add model 2023-08-18-mpnet_embedding_tiny_random_MPNetForSequenceClassification_en --- ...andom_MPNetForSequenceClassification_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForSequenceClassification_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForSequenceClassification_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForSequenceClassification_en.md new file mode 100644 index 00000000000000..e04ecbb65945f0 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForSequenceClassification_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_tiny_random_MPNetForSequenceClassification TFMPNetModel from hf-tiny-model-private +author: John Snow Labs +name: mpnet_embedding_tiny_random_MPNetForSequenceClassification +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_tiny_random_MPNetForSequenceClassification` is a English model originally trained by hf-tiny-model-private. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_tiny_random_MPNetForSequenceClassification_en_5.1.0_3.0_1692375973996.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_tiny_random_MPNetForSequenceClassification_en_5.1.0_3.0_1692375973996.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_tiny_random_MPNetForSequenceClassification", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_tiny_random_MPNetForSequenceClassification", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_tiny_random_MPNetForSequenceClassification| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|1.2 MB| \ No newline at end of file From ccd6c31813c3d94dffc6928b073f799dda3930f9 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:09:37 +0700 Subject: [PATCH 057/155] Add model 2023-08-18-mpnet_embedding_due_eshop_21_multilabel_en --- ...et_embedding_due_eshop_21_multilabel_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_due_eshop_21_multilabel_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_due_eshop_21_multilabel_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_due_eshop_21_multilabel_en.md new file mode 100644 index 00000000000000..4958a984a85246 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_due_eshop_21_multilabel_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_due_eshop_21_multilabel TFMPNetModel from konverner +author: John Snow Labs +name: mpnet_embedding_due_eshop_21_multilabel +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_due_eshop_21_multilabel` is a English model originally trained by konverner. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_due_eshop_21_multilabel_en_5.1.0_3.0_1692378545704.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_due_eshop_21_multilabel_en_5.1.0_3.0_1692378545704.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_due_eshop_21_multilabel", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_due_eshop_21_multilabel", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_due_eshop_21_multilabel| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 98fd7b79425815540be243f367494ff0dedf2f6d Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:10:37 +0700 Subject: [PATCH 058/155] Add model 2023-08-18-mpnet_embedding_retriever_coding_guru_adapted_en --- ...edding_retriever_coding_guru_adapted_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_retriever_coding_guru_adapted_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_retriever_coding_guru_adapted_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_retriever_coding_guru_adapted_en.md new file mode 100644 index 00000000000000..89c4d76b6d7c89 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_retriever_coding_guru_adapted_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_retriever_coding_guru_adapted TFMPNetModel from AlekseyKorshuk +author: John Snow Labs +name: mpnet_embedding_retriever_coding_guru_adapted +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_retriever_coding_guru_adapted` is a English model originally trained by AlekseyKorshuk. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_retriever_coding_guru_adapted_en_5.1.0_3.0_1692378566998.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_retriever_coding_guru_adapted_en_5.1.0_3.0_1692378566998.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_retriever_coding_guru_adapted", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_retriever_coding_guru_adapted", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_retriever_coding_guru_adapted| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 933e8a463be2aed77880e3e343ac750e12666d08 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:11:37 +0700 Subject: [PATCH 059/155] Add model 2023-08-18-mpnet_embedding_negation_categories_classifier_es --- ...dding_negation_categories_classifier_es.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_negation_categories_classifier_es.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_negation_categories_classifier_es.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_negation_categories_classifier_es.md new file mode 100644 index 00000000000000..9f51d4f8c362fa --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_negation_categories_classifier_es.md @@ -0,0 +1,88 @@ +--- +layout: model +title: Castilian, Spanish mpnet_embedding_negation_categories_classifier TFMPNetModel from mhammadkhan +author: John Snow Labs +name: mpnet_embedding_negation_categories_classifier +date: 2023-08-18 +tags: [mpnet, es, open_source, tensorflow] +task: Embeddings +language: es +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_negation_categories_classifier` is a Castilian, Spanish model originally trained by mhammadkhan. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_negation_categories_classifier_es_5.1.0_3.0_1692378586323.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_negation_categories_classifier_es_5.1.0_3.0_1692378586323.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_negation_categories_classifier", "es")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_negation_categories_classifier", "es") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_negation_categories_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|es| +|Size:|409.9 MB| \ No newline at end of file From 81b7d3ae2911f907f7f206ab6ee41262bc32adcf Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:12:38 +0700 Subject: [PATCH 060/155] Add model 2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_fuzzy_matcher_en --- ...raphrase_mpnet_base_v2_fuzzy_matcher_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_fuzzy_matcher_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_fuzzy_matcher_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_fuzzy_matcher_en.md new file mode 100644 index 00000000000000..a907840a8b3cc1 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_fuzzy_matcher_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_paraphrase_mpnet_base_v2_fuzzy_matcher TFMPNetModel from shahrukhx01 +author: John Snow Labs +name: mpnet_embedding_paraphrase_mpnet_base_v2_fuzzy_matcher +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_paraphrase_mpnet_base_v2_fuzzy_matcher` is a English model originally trained by shahrukhx01. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_paraphrase_mpnet_base_v2_fuzzy_matcher_en_5.1.0_3.0_1692378572822.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_paraphrase_mpnet_base_v2_fuzzy_matcher_en_5.1.0_3.0_1692378572822.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_paraphrase_mpnet_base_v2_fuzzy_matcher", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_paraphrase_mpnet_base_v2_fuzzy_matcher", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_paraphrase_mpnet_base_v2_fuzzy_matcher| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From c48edb5fc39810a4c27b3662fef81bed46f202f4 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:13:38 +0700 Subject: [PATCH 061/155] Add model 2023-08-18-mpnet_embedding_Sentiment140_fewshot_en --- ...mpnet_embedding_Sentiment140_fewshot_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_Sentiment140_fewshot_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_Sentiment140_fewshot_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_Sentiment140_fewshot_en.md new file mode 100644 index 00000000000000..6dae79cd0c75f3 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_Sentiment140_fewshot_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_Sentiment140_fewshot TFMPNetModel from pig4431 +author: John Snow Labs +name: mpnet_embedding_Sentiment140_fewshot +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_Sentiment140_fewshot` is a English model originally trained by pig4431. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_Sentiment140_fewshot_en_5.1.0_3.0_1692378771805.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_Sentiment140_fewshot_en_5.1.0_3.0_1692378771805.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_Sentiment140_fewshot", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_Sentiment140_fewshot", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_Sentiment140_fewshot| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 87a900803c99f1491eb2d39c3159efe046f8edd4 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:14:38 +0700 Subject: [PATCH 062/155] Add model 2023-08-18-mpnet_embedding_all_mpnet_base_v2_ftlegal_v3_en --- ...bedding_all_mpnet_base_v2_ftlegal_v3_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_ftlegal_v3_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_ftlegal_v3_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_ftlegal_v3_en.md new file mode 100644 index 00000000000000..76e076e96782cc --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_ftlegal_v3_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_all_mpnet_base_v2_ftlegal_v3 TFMPNetModel from sukantan +author: John Snow Labs +name: mpnet_embedding_all_mpnet_base_v2_ftlegal_v3 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_all_mpnet_base_v2_ftlegal_v3` is a English model originally trained by sukantan. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_ftlegal_v3_en_5.1.0_3.0_1692377160570.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_ftlegal_v3_en_5.1.0_3.0_1692377160570.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_all_mpnet_base_v2_ftlegal_v3", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_all_mpnet_base_v2_ftlegal_v3", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_all_mpnet_base_v2_ftlegal_v3| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.0 MB| \ No newline at end of file From ec93ea031c0e1f683eac394e2dc9047476608662 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:15:39 +0700 Subject: [PATCH 063/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_sev_en --- ...zero_shot_classification_pbsp_p3_sev_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_sev_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_sev_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_sev_en.md new file mode 100644 index 00000000000000..a1f5ae7929e72f --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_sev_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_sev TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_sev +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_sev` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_sev_en_5.1.0_3.0_1692378917103.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_sev_en_5.1.0_3.0_1692378917103.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_sev", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_sev", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_sev| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 51778ecbb857b1437102418abb75980c8fef0a37 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:16:38 +0700 Subject: [PATCH 064/155] Add model 2023-08-18-mpnet_embedding_java_usage_classifier_en --- ...pnet_embedding_java_usage_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_usage_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_usage_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_usage_classifier_en.md new file mode 100644 index 00000000000000..c9c83299840f67 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_usage_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_java_usage_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_java_usage_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_java_usage_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_usage_classifier_en_5.1.0_3.0_1692377814202.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_usage_classifier_en_5.1.0_3.0_1692377814202.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_java_usage_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_java_usage_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_java_usage_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 13747a7c6d3c43e730ea58eb1498af8744c20055 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:17:39 +0700 Subject: [PATCH 065/155] Add model 2023-08-18-mpnet_embedding_mpnet_base_snli_mnli_en --- ...mpnet_embedding_mpnet_base_snli_mnli_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_base_snli_mnli_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_base_snli_mnli_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_base_snli_mnli_en.md new file mode 100644 index 00000000000000..a85dd22290550b --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_base_snli_mnli_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_mpnet_base_snli_mnli TFMPNetModel from symanto +author: John Snow Labs +name: mpnet_embedding_mpnet_base_snli_mnli +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_mpnet_base_snli_mnli` is a English model originally trained by symanto. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_base_snli_mnli_en_5.1.0_3.0_1692378970961.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_base_snli_mnli_en_5.1.0_3.0_1692378970961.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_mpnet_base_snli_mnli", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_mpnet_base_snli_mnli", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_mpnet_base_snli_mnli| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|407.3 MB| \ No newline at end of file From 0b39261529602b751b91c564aa4035cacbc27f39 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:18:39 +0700 Subject: [PATCH 066/155] Add model 2023-08-18-mpnet_embedding_mpnet_base_articles_ner_en --- ...et_embedding_mpnet_base_articles_ner_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_base_articles_ner_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_base_articles_ner_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_base_articles_ner_en.md new file mode 100644 index 00000000000000..7804f757513786 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_base_articles_ner_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_mpnet_base_articles_ner TFMPNetModel from evangeliazve +author: John Snow Labs +name: mpnet_embedding_mpnet_base_articles_ner +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_mpnet_base_articles_ner` is a English model originally trained by evangeliazve. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_base_articles_ner_en_5.1.0_3.0_1692378970812.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_base_articles_ner_en_5.1.0_3.0_1692378970812.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_mpnet_base_articles_ner", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_mpnet_base_articles_ner", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_mpnet_base_articles_ner| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|385.7 MB| \ No newline at end of file From 40fffb9f75a9eee3d1a5f7dae68e51ab392a2d7c Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:19:39 +0700 Subject: [PATCH 067/155] Add model 2023-08-18-mpnet_embedding_sn_mpnet_base_snli_mnli_en --- ...et_embedding_sn_mpnet_base_snli_mnli_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sn_mpnet_base_snli_mnli_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sn_mpnet_base_snli_mnli_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sn_mpnet_base_snli_mnli_en.md new file mode 100644 index 00000000000000..cf40cbaaf3d291 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sn_mpnet_base_snli_mnli_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_sn_mpnet_base_snli_mnli TFMPNetModel from symanto +author: John Snow Labs +name: mpnet_embedding_sn_mpnet_base_snli_mnli +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_sn_mpnet_base_snli_mnli` is a English model originally trained by symanto. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_sn_mpnet_base_snli_mnli_en_5.1.0_3.0_1692379153696.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_sn_mpnet_base_snli_mnli_en_5.1.0_3.0_1692379153696.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_sn_mpnet_base_snli_mnli", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_sn_mpnet_base_snli_mnli", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_sn_mpnet_base_snli_mnli| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 5073ee19e15f7199e5451430eb11fff42a017a3c Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:20:39 +0700 Subject: [PATCH 068/155] Add model 2023-08-18-mpnet_embedding_biencoder_all_mpnet_base_v2_mmarcoFR_fr --- ...biencoder_all_mpnet_base_v2_mmarcoFR_fr.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_biencoder_all_mpnet_base_v2_mmarcoFR_fr.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_biencoder_all_mpnet_base_v2_mmarcoFR_fr.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_biencoder_all_mpnet_base_v2_mmarcoFR_fr.md new file mode 100644 index 00000000000000..7fc42dea86c473 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_biencoder_all_mpnet_base_v2_mmarcoFR_fr.md @@ -0,0 +1,88 @@ +--- +layout: model +title: French mpnet_embedding_biencoder_all_mpnet_base_v2_mmarcoFR TFMPNetModel from antoinelouis +author: John Snow Labs +name: mpnet_embedding_biencoder_all_mpnet_base_v2_mmarcoFR +date: 2023-08-18 +tags: [mpnet, fr, open_source, tensorflow] +task: Embeddings +language: fr +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_biencoder_all_mpnet_base_v2_mmarcoFR` is a French model originally trained by antoinelouis. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_biencoder_all_mpnet_base_v2_mmarcoFR_fr_5.1.0_3.0_1692379200090.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_biencoder_all_mpnet_base_v2_mmarcoFR_fr_5.1.0_3.0_1692379200090.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_biencoder_all_mpnet_base_v2_mmarcoFR", "fr")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_biencoder_all_mpnet_base_v2_mmarcoFR", "fr") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_biencoder_all_mpnet_base_v2_mmarcoFR| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|fr| +|Size:|409.8 MB| \ No newline at end of file From 9a6eb992ab6b80d66e4bd24abf2f5441dfcd1c39 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:21:40 +0700 Subject: [PATCH 069/155] Add model 2023-08-18-mpnet_embedding_python_expand_classifier_en --- ...t_embedding_python_expand_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_expand_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_expand_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_expand_classifier_en.md new file mode 100644 index 00000000000000..f8cc83dd2a7bec --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_expand_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_python_expand_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_python_expand_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_python_expand_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_python_expand_classifier_en_5.1.0_3.0_1692379201526.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_python_expand_classifier_en_5.1.0_3.0_1692379201526.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_python_expand_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_python_expand_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_python_expand_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 6f3fa4da58e0d05d909da2a991d136e4b11a2ef3 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:22:40 +0700 Subject: [PATCH 070/155] Add model 2023-08-18-mpnet_embedding_all_datasets_v4_mpnet_base_en --- ...embedding_all_datasets_v4_mpnet_base_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_datasets_v4_mpnet_base_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_datasets_v4_mpnet_base_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_datasets_v4_mpnet_base_en.md new file mode 100644 index 00000000000000..18f330d36d731a --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_datasets_v4_mpnet_base_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_all_datasets_v4_mpnet_base TFMPNetModel from flax-sentence-embeddings +author: John Snow Labs +name: mpnet_embedding_all_datasets_v4_mpnet_base +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_all_datasets_v4_mpnet_base` is a English model originally trained by flax-sentence-embeddings. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_datasets_v4_mpnet_base_en_5.1.0_3.0_1692375835999.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_datasets_v4_mpnet_base_en_5.1.0_3.0_1692375835999.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_all_datasets_v4_mpnet_base", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_all_datasets_v4_mpnet_base", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_all_datasets_v4_mpnet_base| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.6 MB| \ No newline at end of file From b218326e31b5f348763b9a9c9fb5b0727b9ce2c4 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:23:40 +0700 Subject: [PATCH 071/155] Add model 2023-08-18-mpnet_embedding_pharo_collaborators_classifier_en --- ...dding_pharo_collaborators_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_collaborators_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_collaborators_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_collaborators_classifier_en.md new file mode 100644 index 00000000000000..8165d870dd14df --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_collaborators_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_pharo_collaborators_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_pharo_collaborators_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_pharo_collaborators_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_pharo_collaborators_classifier_en_5.1.0_3.0_1692379408752.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_pharo_collaborators_classifier_en_5.1.0_3.0_1692379408752.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_pharo_collaborators_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_pharo_collaborators_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_pharo_collaborators_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 6d775c7930dba981e26c6363cf75ea67ff13e03f Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:24:40 +0700 Subject: [PATCH 072/155] Add model 2023-08-18-mpnet_embedding_setfit_ds_version_0_0_5_en --- ...et_embedding_setfit_ds_version_0_0_5_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_5_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_5_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_5_en.md new file mode 100644 index 00000000000000..f16927af381d81 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_5_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_ds_version_0_0_5 TFMPNetModel from amittian +author: John Snow Labs +name: mpnet_embedding_setfit_ds_version_0_0_5 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_ds_version_0_0_5` is a English model originally trained by amittian. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ds_version_0_0_5_en_5.1.0_3.0_1692378155593.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ds_version_0_0_5_en_5.1.0_3.0_1692378155593.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_ds_version_0_0_5", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_ds_version_0_0_5", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_ds_version_0_0_5| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 7f7cff3f0e23e54b20737f7bc3af943c0fd5d873 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:25:40 +0700 Subject: [PATCH 073/155] Add model 2023-08-18-mpnet_embedding_stackoverflow_mpnet_base_en --- ...t_embedding_stackoverflow_mpnet_base_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_stackoverflow_mpnet_base_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_stackoverflow_mpnet_base_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_stackoverflow_mpnet_base_en.md new file mode 100644 index 00000000000000..5d7166fa502413 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_stackoverflow_mpnet_base_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_stackoverflow_mpnet_base TFMPNetModel from flax-sentence-embeddings +author: John Snow Labs +name: mpnet_embedding_stackoverflow_mpnet_base +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_stackoverflow_mpnet_base` is a English model originally trained by flax-sentence-embeddings. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_stackoverflow_mpnet_base_en_5.1.0_3.0_1692376811146.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_stackoverflow_mpnet_base_en_5.1.0_3.0_1692376811146.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_stackoverflow_mpnet_base", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_stackoverflow_mpnet_base", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_stackoverflow_mpnet_base| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.5 MB| \ No newline at end of file From a2cdcb8e4f73e402bbcccc3a7e026e39e39c25fe Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:26:40 +0700 Subject: [PATCH 074/155] Add model 2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_SetFit_sst2_en --- ...paraphrase_mpnet_base_v2_SetFit_sst2_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_SetFit_sst2_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_SetFit_sst2_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_SetFit_sst2_en.md new file mode 100644 index 00000000000000..cf711e7af57488 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_SetFit_sst2_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_paraphrase_mpnet_base_v2_SetFit_sst2 TFMPNetModel from moshew +author: John Snow Labs +name: mpnet_embedding_paraphrase_mpnet_base_v2_SetFit_sst2 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_paraphrase_mpnet_base_v2_SetFit_sst2` is a English model originally trained by moshew. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_paraphrase_mpnet_base_v2_SetFit_sst2_en_5.1.0_3.0_1692379417527.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_paraphrase_mpnet_base_v2_SetFit_sst2_en_5.1.0_3.0_1692379417527.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_paraphrase_mpnet_base_v2_SetFit_sst2", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_paraphrase_mpnet_base_v2_SetFit_sst2", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_paraphrase_mpnet_base_v2_SetFit_sst2| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 63e3a71f69109c045cccd36b477f900e469c7357 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:27:41 +0700 Subject: [PATCH 075/155] Add model 2023-08-18-mpnet_embedding_all_mpnet_base_v2_table_en --- ...et_embedding_all_mpnet_base_v2_table_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_table_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_table_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_table_en.md new file mode 100644 index 00000000000000..d557a3e88e9205 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_table_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_all_mpnet_base_v2_table TFMPNetModel from deepset +author: John Snow Labs +name: mpnet_embedding_all_mpnet_base_v2_table +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_all_mpnet_base_v2_table` is a English model originally trained by deepset. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_table_en_5.1.0_3.0_1692379618976.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_table_en_5.1.0_3.0_1692379618976.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_all_mpnet_base_v2_table", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_all_mpnet_base_v2_table", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_all_mpnet_base_v2_table| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 2049e3aa549e9ae1641d43871e4d041ce73a4725 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:28:41 +0700 Subject: [PATCH 076/155] Add model 2023-08-18-mpnet_embedding_InvoiceOrNot_en --- ...3-08-18-mpnet_embedding_InvoiceOrNot_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_InvoiceOrNot_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_InvoiceOrNot_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_InvoiceOrNot_en.md new file mode 100644 index 00000000000000..051f1bead450ea --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_InvoiceOrNot_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_InvoiceOrNot TFMPNetModel from HamzaFarhan +author: John Snow Labs +name: mpnet_embedding_InvoiceOrNot +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_InvoiceOrNot` is a English model originally trained by HamzaFarhan. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_InvoiceOrNot_en_5.1.0_3.0_1692379683526.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_InvoiceOrNot_en_5.1.0_3.0_1692379683526.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_InvoiceOrNot", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_InvoiceOrNot", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_InvoiceOrNot| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.6 MB| \ No newline at end of file From cdf5789655b3ce28aa47954ecc82356b01f1f777 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:29:41 +0700 Subject: [PATCH 077/155] Add model 2023-08-18-mpnet_embedding_python_usage_classifier_en --- ...et_embedding_python_usage_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_usage_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_usage_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_usage_classifier_en.md new file mode 100644 index 00000000000000..204d72ddf8deb7 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_usage_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_python_usage_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_python_usage_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_python_usage_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_python_usage_classifier_en_5.1.0_3.0_1692379761686.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_python_usage_classifier_en_5.1.0_3.0_1692379761686.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_python_usage_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_python_usage_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_python_usage_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 45beedfac85710ee5dafac09f31832881ac82e2f Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:30:42 +0700 Subject: [PATCH 078/155] Add model 2023-08-18-mpnet_embedding_all_mpnet_base_v2_feature_extraction_pipeline_en --- ..._base_v2_feature_extraction_pipeline_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_feature_extraction_pipeline_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_feature_extraction_pipeline_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_feature_extraction_pipeline_en.md new file mode 100644 index 00000000000000..bf081c166bb3db --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_feature_extraction_pipeline_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_all_mpnet_base_v2_feature_extraction_pipeline TFMPNetModel from questgen +author: John Snow Labs +name: mpnet_embedding_all_mpnet_base_v2_feature_extraction_pipeline +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_all_mpnet_base_v2_feature_extraction_pipeline` is a English model originally trained by questgen. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_feature_extraction_pipeline_en_5.1.0_3.0_1692379801719.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_feature_extraction_pipeline_en_5.1.0_3.0_1692379801719.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_all_mpnet_base_v2_feature_extraction_pipeline", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_all_mpnet_base_v2_feature_extraction_pipeline", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_all_mpnet_base_v2_feature_extraction_pipeline| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.6 MB| \ No newline at end of file From 492458eedddfcc2a2f2b3fa25c226fc6523a086c Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:31:42 +0700 Subject: [PATCH 079/155] Add model 2023-08-18-mpnet_embedding_ikitracs_mitigation_en --- ...-mpnet_embedding_ikitracs_mitigation_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ikitracs_mitigation_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ikitracs_mitigation_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ikitracs_mitigation_en.md new file mode 100644 index 00000000000000..2fe54cca2c1b46 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ikitracs_mitigation_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_ikitracs_mitigation TFMPNetModel from ilaria-oneofftech +author: John Snow Labs +name: mpnet_embedding_ikitracs_mitigation +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_ikitracs_mitigation` is a English model originally trained by ilaria-oneofftech. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_ikitracs_mitigation_en_5.1.0_3.0_1692379836311.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_ikitracs_mitigation_en_5.1.0_3.0_1692379836311.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_ikitracs_mitigation", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_ikitracs_mitigation", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_ikitracs_mitigation| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 55bd84f4b459ee2c28961ae6ae8736378a05ec07 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:32:42 +0700 Subject: [PATCH 080/155] Add model 2023-08-18-mpnet_embedding_pharo_example_classifier_en --- ...t_embedding_pharo_example_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_example_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_example_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_example_classifier_en.md new file mode 100644 index 00000000000000..bd766092e29d48 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_example_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_pharo_example_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_pharo_example_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_pharo_example_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_pharo_example_classifier_en_5.1.0_3.0_1692379945273.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_pharo_example_classifier_en_5.1.0_3.0_1692379945273.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_pharo_example_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_pharo_example_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_pharo_example_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 4d31960492a9bf58cb8fde0b0d439b3f0758ebf2 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:33:42 +0700 Subject: [PATCH 081/155] Add model 2023-08-18-mpnet_embedding_tiny_random_MPNetModel_en --- ...net_embedding_tiny_random_MPNetModel_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetModel_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetModel_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetModel_en.md new file mode 100644 index 00000000000000..e8fe17dafa897c --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetModel_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_tiny_random_MPNetModel TFMPNetModel from hf-tiny-model-private +author: John Snow Labs +name: mpnet_embedding_tiny_random_MPNetModel +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_tiny_random_MPNetModel` is a English model originally trained by hf-tiny-model-private. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_tiny_random_MPNetModel_en_5.1.0_3.0_1692376215032.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_tiny_random_MPNetModel_en_5.1.0_3.0_1692376215032.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_tiny_random_MPNetModel", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_tiny_random_MPNetModel", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_tiny_random_MPNetModel| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|1.2 MB| \ No newline at end of file From b56e80bbdd7755e7a2584e81d33f8a3f177c2afa Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:34:43 +0700 Subject: [PATCH 082/155] Add model 2023-08-18-mpnet_embedding_mpnet_base_en --- ...023-08-18-mpnet_embedding_mpnet_base_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_base_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_base_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_base_en.md new file mode 100644 index 00000000000000..e36992b8d74edd --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_base_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_mpnet_base TFMPNetModel from microsoft +author: John Snow Labs +name: mpnet_embedding_mpnet_base +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_mpnet_base` is a English model originally trained by microsoft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_base_en_5.1.0_3.0_1692377047526.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_base_en_5.1.0_3.0_1692377047526.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_mpnet_base", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_mpnet_base", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_mpnet_base| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|264.2 MB| \ No newline at end of file From accd6010a29ed908fb75f3ac3894a40be03b7d19 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:35:43 +0700 Subject: [PATCH 083/155] Add model 2023-08-18-mpnet_embedding_all_mpnet_base_v2_tasky_classification_en --- ...l_mpnet_base_v2_tasky_classification_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_tasky_classification_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_tasky_classification_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_tasky_classification_en.md new file mode 100644 index 00000000000000..6462f5deaec345 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_tasky_classification_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_all_mpnet_base_v2_tasky_classification TFMPNetModel from khalidalt +author: John Snow Labs +name: mpnet_embedding_all_mpnet_base_v2_tasky_classification +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_all_mpnet_base_v2_tasky_classification` is a English model originally trained by khalidalt. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_tasky_classification_en_5.1.0_3.0_1692379990317.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_tasky_classification_en_5.1.0_3.0_1692379990317.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_all_mpnet_base_v2_tasky_classification", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_all_mpnet_base_v2_tasky_classification", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_all_mpnet_base_v2_tasky_classification| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.3 MB| \ No newline at end of file From 1e9fd01d0ccad9e58e5db4da72830c7bd2b521b3 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:36:43 +0700 Subject: [PATCH 084/155] Add model 2023-08-18-mpnet_embedding_pharo_responsibilities_classifier_en --- ...ng_pharo_responsibilities_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_responsibilities_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_responsibilities_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_responsibilities_classifier_en.md new file mode 100644 index 00000000000000..438cf2032d7ea4 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_responsibilities_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_pharo_responsibilities_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_pharo_responsibilities_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_pharo_responsibilities_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_pharo_responsibilities_classifier_en_5.1.0_3.0_1692380147031.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_pharo_responsibilities_classifier_en_5.1.0_3.0_1692380147031.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_pharo_responsibilities_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_pharo_responsibilities_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_pharo_responsibilities_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 00a174954a64b6cbbeecf606af279a61a1e5777e Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:37:43 +0700 Subject: [PATCH 085/155] Add model 2023-08-18-mpnet_embedding_all_mpnet_base_v2_for_sb_clustering_en --- ..._all_mpnet_base_v2_for_sb_clustering_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_for_sb_clustering_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_for_sb_clustering_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_for_sb_clustering_en.md new file mode 100644 index 00000000000000..2c6860f4fe7452 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_for_sb_clustering_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_all_mpnet_base_v2_for_sb_clustering TFMPNetModel from Thabet +author: John Snow Labs +name: mpnet_embedding_all_mpnet_base_v2_for_sb_clustering +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_all_mpnet_base_v2_for_sb_clustering` is a English model originally trained by Thabet. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_for_sb_clustering_en_5.1.0_3.0_1692380236972.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_for_sb_clustering_en_5.1.0_3.0_1692380236972.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_all_mpnet_base_v2_for_sb_clustering", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_all_mpnet_base_v2_for_sb_clustering", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_all_mpnet_base_v2_for_sb_clustering| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.0 MB| \ No newline at end of file From 0273a41df5f8b0c4d780a57e18af2cf1c78895ef Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:38:44 +0700 Subject: [PATCH 086/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_trig_en --- ...ero_shot_classification_pbsp_p3_trig_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_trig_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_trig_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_trig_en.md new file mode 100644 index 00000000000000..3313b45dd2c604 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_trig_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_trig TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_trig +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_trig` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_trig_en_5.1.0_3.0_1692380258211.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_trig_en_5.1.0_3.0_1692380258211.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_trig", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_trig", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_trig| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From bdad66556e99645b462e196877a40ae28a2a560f Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:39:44 +0700 Subject: [PATCH 087/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_likes_en --- ...ro_shot_classification_pbsp_p1_likes_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_likes_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_likes_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_likes_en.md new file mode 100644 index 00000000000000..0947769eab496e --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_likes_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_likes TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_likes +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_likes` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_likes_en_5.1.0_3.0_1692380337188.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_likes_en_5.1.0_3.0_1692380337188.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_likes", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_likes", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_likes| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From dc75af96ca2f4339222d0c4c249cf5838140bcc8 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:40:44 +0700 Subject: [PATCH 088/155] Add model 2023-08-18-mpnet_embedding_python_summary_classifier_en --- ..._embedding_python_summary_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_summary_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_summary_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_summary_classifier_en.md new file mode 100644 index 00000000000000..593f49cd16d450 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_summary_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_python_summary_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_python_summary_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_python_summary_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_python_summary_classifier_en_5.1.0_3.0_1692378198075.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_python_summary_classifier_en_5.1.0_3.0_1692378198075.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_python_summary_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_python_summary_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_python_summary_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 1260aa98f11fcbe12d723c82e428351836f9ad9a Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:41:45 +0700 Subject: [PATCH 089/155] Add model 2023-08-18-mpnet_embedding_few_shot_model_en --- ...08-18-mpnet_embedding_few_shot_model_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_few_shot_model_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_few_shot_model_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_few_shot_model_en.md new file mode 100644 index 00000000000000..e7e7fd4bc3e539 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_few_shot_model_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_few_shot_model TFMPNetModel from jessietextstan +author: John Snow Labs +name: mpnet_embedding_few_shot_model +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_few_shot_model` is a English model originally trained by jessietextstan. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_few_shot_model_en_5.1.0_3.0_1692380439322.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_few_shot_model_en_5.1.0_3.0_1692380439322.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_few_shot_model", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_few_shot_model", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_few_shot_model| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.1 MB| \ No newline at end of file From 7479e97e5495b3b83417d706a28bca73f4e68405 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:42:45 +0700 Subject: [PATCH 090/155] Add model 2023-08-18-mpnet_embedding_CPU_Mitigation_Classifier_en --- ..._embedding_CPU_Mitigation_Classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Mitigation_Classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Mitigation_Classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Mitigation_Classifier_en.md new file mode 100644 index 00000000000000..a1d1ba79c4ef31 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Mitigation_Classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_CPU_Mitigation_Classifier TFMPNetModel from mtyrrell +author: John Snow Labs +name: mpnet_embedding_CPU_Mitigation_Classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_CPU_Mitigation_Classifier` is a English model originally trained by mtyrrell. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_CPU_Mitigation_Classifier_en_5.1.0_3.0_1692380447837.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_CPU_Mitigation_Classifier_en_5.1.0_3.0_1692380447837.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_CPU_Mitigation_Classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_CPU_Mitigation_Classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_CPU_Mitigation_Classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.3 MB| \ No newline at end of file From 129838659de9ce8f105034827ed3c431573101de Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:43:45 +0700 Subject: [PATCH 091/155] Add model 2023-08-18-mpnet_embedding_all_mpnet_base_v2_feature_extraction_en --- ...all_mpnet_base_v2_feature_extraction_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_feature_extraction_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_feature_extraction_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_feature_extraction_en.md new file mode 100644 index 00000000000000..80204a205b4e8b --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_mpnet_base_v2_feature_extraction_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_all_mpnet_base_v2_feature_extraction TFMPNetModel from guidecare +author: John Snow Labs +name: mpnet_embedding_all_mpnet_base_v2_feature_extraction +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_all_mpnet_base_v2_feature_extraction` is a English model originally trained by guidecare. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_feature_extraction_en_5.1.0_3.0_1692380529046.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_mpnet_base_v2_feature_extraction_en_5.1.0_3.0_1692380529046.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_all_mpnet_base_v2_feature_extraction", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_all_mpnet_base_v2_feature_extraction", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_all_mpnet_base_v2_feature_extraction| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.6 MB| \ No newline at end of file From f85a75c428ee97504e800b5aea40016c50ce9161 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:44:45 +0700 Subject: [PATCH 092/155] Add model 2023-08-18-mpnet_embedding_mpnet_adaptation_mitigation_classifier_en --- ...net_adaptation_mitigation_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_adaptation_mitigation_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_adaptation_mitigation_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_adaptation_mitigation_classifier_en.md new file mode 100644 index 00000000000000..a45b85dd2423ef --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_adaptation_mitigation_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_mpnet_adaptation_mitigation_classifier TFMPNetModel from ppsingh +author: John Snow Labs +name: mpnet_embedding_mpnet_adaptation_mitigation_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_mpnet_adaptation_mitigation_classifier` is a English model originally trained by ppsingh. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_adaptation_mitigation_classifier_en_5.1.0_3.0_1692376773609.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_adaptation_mitigation_classifier_en_5.1.0_3.0_1692376773609.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_mpnet_adaptation_mitigation_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_mpnet_adaptation_mitigation_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_mpnet_adaptation_mitigation_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.3 MB| \ No newline at end of file From f1156cad8d7f0a0cdcd1f99bc52189ba0d067c15 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:45:45 +0700 Subject: [PATCH 093/155] Add model 2023-08-18-mpnet_embedding_java_summary_classifier_en --- ...et_embedding_java_summary_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_summary_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_summary_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_summary_classifier_en.md new file mode 100644 index 00000000000000..e29e8ec4811483 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_summary_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_java_summary_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_java_summary_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_java_summary_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_summary_classifier_en_5.1.0_3.0_1692377436133.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_summary_classifier_en_5.1.0_3.0_1692377436133.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_java_summary_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_java_summary_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_java_summary_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From b0b791fdd0a95ec538f6151fa67ca2ba725f43b3 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:46:45 +0700 Subject: [PATCH 094/155] Add model 2023-08-18-mpnet_embedding_579_STmodel_product_rem_v3a_en --- ...mbedding_579_STmodel_product_rem_v3a_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_579_STmodel_product_rem_v3a_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_579_STmodel_product_rem_v3a_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_579_STmodel_product_rem_v3a_en.md new file mode 100644 index 00000000000000..2826dbe6ee9788 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_579_STmodel_product_rem_v3a_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_579_STmodel_product_rem_v3a TFMPNetModel from jamiehudson +author: John Snow Labs +name: mpnet_embedding_579_STmodel_product_rem_v3a +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_579_STmodel_product_rem_v3a` is a English model originally trained by jamiehudson. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_579_STmodel_product_rem_v3a_en_5.1.0_3.0_1692379340262.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_579_STmodel_product_rem_v3a_en_5.1.0_3.0_1692379340262.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_579_STmodel_product_rem_v3a", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_579_STmodel_product_rem_v3a", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_579_STmodel_product_rem_v3a| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 5a964c985e644abca94db3681f9768d75859e15a Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:47:45 +0700 Subject: [PATCH 095/155] Add model 2023-08-18-mpnet_embedding_SetFit_all_data_en --- ...8-18-mpnet_embedding_SetFit_all_data_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_SetFit_all_data_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_SetFit_all_data_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_SetFit_all_data_en.md new file mode 100644 index 00000000000000..4d2f95602e7881 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_SetFit_all_data_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_SetFit_all_data TFMPNetModel from scaperex +author: John Snow Labs +name: mpnet_embedding_SetFit_all_data +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_SetFit_all_data` is a English model originally trained by scaperex. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_SetFit_all_data_en_5.1.0_3.0_1692380518236.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_SetFit_all_data_en_5.1.0_3.0_1692380518236.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_SetFit_all_data", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_SetFit_all_data", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_SetFit_all_data| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From c471af3a62775aee23763227b0a10d9153bc7c7d Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:48:46 +0700 Subject: [PATCH 096/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_meas_en --- ...ero_shot_classification_pbsp_p4_meas_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_meas_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_meas_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_meas_en.md new file mode 100644 index 00000000000000..7110724a94f6a0 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_meas_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_meas TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_meas +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_meas` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_meas_en_5.1.0_3.0_1692380894097.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_meas_en_5.1.0_3.0_1692380894097.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_meas", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_meas", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_meas| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 762d45b7dabf9749bba683359ee97b8b6462b364 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:49:46 +0700 Subject: [PATCH 097/155] Add model 2023-08-18-mpnet_embedding_setfit_ostrom_en --- ...-08-18-mpnet_embedding_setfit_ostrom_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ostrom_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ostrom_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ostrom_en.md new file mode 100644 index 00000000000000..1a08532fca1bbf --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ostrom_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_ostrom TFMPNetModel from mahaswec +author: John Snow Labs +name: mpnet_embedding_setfit_ostrom +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_ostrom` is a English model originally trained by mahaswec. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ostrom_en_5.1.0_3.0_1692380969725.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ostrom_en_5.1.0_3.0_1692380969725.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_ostrom", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_ostrom", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_ostrom| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 1fddaa394f7a9b780604dee31986df74ad2b922a Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:50:46 +0700 Subject: [PATCH 098/155] Add model 2023-08-18-mpnet_embedding_sentence_transformers_bible_reference_final_en --- ...e_transformers_bible_reference_final_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sentence_transformers_bible_reference_final_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sentence_transformers_bible_reference_final_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sentence_transformers_bible_reference_final_en.md new file mode 100644 index 00000000000000..3e8320dc7970f0 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sentence_transformers_bible_reference_final_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_sentence_transformers_bible_reference_final TFMPNetModel from odunola +author: John Snow Labs +name: mpnet_embedding_sentence_transformers_bible_reference_final +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_sentence_transformers_bible_reference_final` is a English model originally trained by odunola. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_sentence_transformers_bible_reference_final_en_5.1.0_3.0_1692376207996.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_sentence_transformers_bible_reference_final_en_5.1.0_3.0_1692376207996.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_sentence_transformers_bible_reference_final", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_sentence_transformers_bible_reference_final", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_sentence_transformers_bible_reference_final| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 4f1d42c37834f26c4c70feefbe466b17602017d3 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:51:46 +0700 Subject: [PATCH 099/155] Add model 2023-08-18-mpnet_embedding_all_datasets_v3_mpnet_base_en --- ...embedding_all_datasets_v3_mpnet_base_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_datasets_v3_mpnet_base_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_datasets_v3_mpnet_base_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_datasets_v3_mpnet_base_en.md new file mode 100644 index 00000000000000..5f6b39f47d10cd --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_all_datasets_v3_mpnet_base_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_all_datasets_v3_mpnet_base TFMPNetModel from flax-sentence-embeddings +author: John Snow Labs +name: mpnet_embedding_all_datasets_v3_mpnet_base +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_all_datasets_v3_mpnet_base` is a English model originally trained by flax-sentence-embeddings. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_datasets_v3_mpnet_base_en_5.1.0_3.0_1692375660568.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_all_datasets_v3_mpnet_base_en_5.1.0_3.0_1692375660568.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_all_datasets_v3_mpnet_base", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_all_datasets_v3_mpnet_base", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_all_datasets_v3_mpnet_base| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.6 MB| \ No newline at end of file From 4d63d7789a2968553857d7d2e331dde0f35b7ce9 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:52:47 +0700 Subject: [PATCH 100/155] Add model 2023-08-18-mpnet_embedding_ikitracs_conditional_en --- ...mpnet_embedding_ikitracs_conditional_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ikitracs_conditional_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ikitracs_conditional_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ikitracs_conditional_en.md new file mode 100644 index 00000000000000..9a88a28d497222 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_ikitracs_conditional_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_ikitracs_conditional TFMPNetModel from ilaria-oneofftech +author: John Snow Labs +name: mpnet_embedding_ikitracs_conditional +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_ikitracs_conditional` is a English model originally trained by ilaria-oneofftech. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_ikitracs_conditional_en_5.1.0_3.0_1692376423487.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_ikitracs_conditional_en_5.1.0_3.0_1692376423487.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_ikitracs_conditional", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_ikitracs_conditional", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_ikitracs_conditional| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 94e2bccb23880ab888753b61b651403caef592d1 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:53:47 +0700 Subject: [PATCH 101/155] Add model 2023-08-18-mpnet_embedding_java_ownership_classifier_en --- ..._embedding_java_ownership_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_ownership_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_ownership_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_ownership_classifier_en.md new file mode 100644 index 00000000000000..30aa3bf0c2ef74 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_ownership_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_java_ownership_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_java_ownership_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_java_ownership_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_ownership_classifier_en_5.1.0_3.0_1692379581982.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_ownership_classifier_en_5.1.0_3.0_1692379581982.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_java_ownership_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_java_ownership_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_java_ownership_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From bec129e21ec4a75fef4bfec8e20cf65e37cd1a1e Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:54:47 +0700 Subject: [PATCH 102/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_rel_en --- ...zero_shot_classification_pbsp_p4_rel_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_rel_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_rel_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_rel_en.md new file mode 100644 index 00000000000000..a8557c91ea2a95 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_rel_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_rel TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_rel +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_rel` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_rel_en_5.1.0_3.0_1692381228140.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_rel_en_5.1.0_3.0_1692381228140.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_rel", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_rel", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_rel| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 22469e74e5186a743f6ee9e9f40cb500c61ebeed Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:55:47 +0700 Subject: [PATCH 103/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_q8a_azure_gpt35_en --- ..._classification_pbsp_q8a_azure_gpt35_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_q8a_azure_gpt35_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_q8a_azure_gpt35_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_q8a_azure_gpt35_en.md new file mode 100644 index 00000000000000..a7c5b427ca0860 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_q8a_azure_gpt35_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_q8a_azure_gpt35 TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_q8a_azure_gpt35 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_q8a_azure_gpt35` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_q8a_azure_gpt35_en_5.1.0_3.0_1692381310055.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_q8a_azure_gpt35_en_5.1.0_3.0_1692381310055.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_q8a_azure_gpt35", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_q8a_azure_gpt35", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_q8a_azure_gpt35| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 6855d4bdc5570b23013750d0d7fd0e0fcdbf744d Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:56:48 +0700 Subject: [PATCH 104/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_time_en --- ...ero_shot_classification_pbsp_p4_time_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_time_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_time_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_time_en.md new file mode 100644 index 00000000000000..e9217efceb9876 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_time_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_time TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_time +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_time` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_time_en_5.1.0_3.0_1692381391509.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_time_en_5.1.0_3.0_1692381391509.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_time", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_time", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_time| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 66740186148c56ba93b3ac6cb4c5b869b06edb8e Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:57:48 +0700 Subject: [PATCH 105/155] Add model 2023-08-18-mpnet_embedding_java_pointer_classifier_en --- ...et_embedding_java_pointer_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_pointer_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_pointer_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_pointer_classifier_en.md new file mode 100644 index 00000000000000..74e9f64acdd757 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_pointer_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_java_pointer_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_java_pointer_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_java_pointer_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_pointer_classifier_en_5.1.0_3.0_1692377625982.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_pointer_classifier_en_5.1.0_3.0_1692377625982.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_java_pointer_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_java_pointer_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_java_pointer_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 3fa2f0c94e613f8d0808e7c5e9a32a3e672e28ee Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:58:49 +0700 Subject: [PATCH 106/155] Add model 2023-08-18-mpnet_embedding_finetunned_sbert_en --- ...-18-mpnet_embedding_finetunned_sbert_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_finetunned_sbert_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_finetunned_sbert_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_finetunned_sbert_en.md new file mode 100644 index 00000000000000..af3135e5d9c8a2 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_finetunned_sbert_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_finetunned_sbert TFMPNetModel from Kick28 +author: John Snow Labs +name: mpnet_embedding_finetunned_sbert +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_finetunned_sbert` is a English model originally trained by Kick28. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_finetunned_sbert_en_5.1.0_3.0_1692380631621.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_finetunned_sbert_en_5.1.0_3.0_1692380631621.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_finetunned_sbert", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_finetunned_sbert", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_finetunned_sbert| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 57f71e53d9dfb96414f801f952c8a52a4c5b4ef6 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 00:59:49 +0700 Subject: [PATCH 107/155] Add model 2023-08-18-mpnet_embedding_setfit_model_Feb11_Misinformation_on_Law_en --- ...it_model_Feb11_Misinformation_on_Law_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_model_Feb11_Misinformation_on_Law_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_model_Feb11_Misinformation_on_Law_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_model_Feb11_Misinformation_on_Law_en.md new file mode 100644 index 00000000000000..974c24d3f389a1 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_model_Feb11_Misinformation_on_Law_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_model_Feb11_Misinformation_on_Law TFMPNetModel from mitra-mir +author: John Snow Labs +name: mpnet_embedding_setfit_model_Feb11_Misinformation_on_Law +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_model_Feb11_Misinformation_on_Law` is a English model originally trained by mitra-mir. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_model_Feb11_Misinformation_on_Law_en_5.1.0_3.0_1692381563585.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_model_Feb11_Misinformation_on_Law_en_5.1.0_3.0_1692381563585.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_model_Feb11_Misinformation_on_Law", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_model_Feb11_Misinformation_on_Law", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_model_Feb11_Misinformation_on_Law| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 6c69dcdb4a4d9f74577b8f7947d5c4a3719c7b4d Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:00:49 +0700 Subject: [PATCH 108/155] Add model 2023-08-18-mpnet_embedding_python_parameters_classifier_en --- ...bedding_python_parameters_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_parameters_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_parameters_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_parameters_classifier_en.md new file mode 100644 index 00000000000000..be0de6ecbc3a87 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_python_parameters_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_python_parameters_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_python_parameters_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_python_parameters_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_python_parameters_classifier_en_5.1.0_3.0_1692379002867.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_python_parameters_classifier_en_5.1.0_3.0_1692379002867.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_python_parameters_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_python_parameters_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_python_parameters_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 2f550844c3ff540106b4e7ae12ff7d7b4767fde1 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:01:49 +0700 Subject: [PATCH 109/155] Add model 2023-08-18-mpnet_embedding_pharo_keyimplementationpoints_classifier_en --- ...o_keyimplementationpoints_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_keyimplementationpoints_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_keyimplementationpoints_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_keyimplementationpoints_classifier_en.md new file mode 100644 index 00000000000000..562a9d4725c4b4 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_pharo_keyimplementationpoints_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_pharo_keyimplementationpoints_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_pharo_keyimplementationpoints_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_pharo_keyimplementationpoints_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_pharo_keyimplementationpoints_classifier_en_5.1.0_3.0_1692378611446.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_pharo_keyimplementationpoints_classifier_en_5.1.0_3.0_1692378611446.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_pharo_keyimplementationpoints_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_pharo_keyimplementationpoints_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_pharo_keyimplementationpoints_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From cab20e7be756d42a14d533b3f12a332e9518c691 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:02:50 +0700 Subject: [PATCH 110/155] Add model 2023-08-18-mpnet_embedding_setfit_ft_sentinent_eval_en --- ...t_embedding_setfit_ft_sentinent_eval_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ft_sentinent_eval_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ft_sentinent_eval_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ft_sentinent_eval_en.md new file mode 100644 index 00000000000000..307710ac9c639c --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ft_sentinent_eval_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_ft_sentinent_eval TFMPNetModel from StatsGary +author: John Snow Labs +name: mpnet_embedding_setfit_ft_sentinent_eval +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_ft_sentinent_eval` is a English model originally trained by StatsGary. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ft_sentinent_eval_en_5.1.0_3.0_1692381718097.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ft_sentinent_eval_en_5.1.0_3.0_1692381718097.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_ft_sentinent_eval", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_ft_sentinent_eval", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_ft_sentinent_eval| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 06de97e8ebd829aa006c20b0953e669a57ddee9c Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:03:50 +0700 Subject: [PATCH 111/155] Add model 2023-08-18-mpnet_embedding_reddit_single_context_mpnet_base_en --- ...ing_reddit_single_context_mpnet_base_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_reddit_single_context_mpnet_base_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_reddit_single_context_mpnet_base_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_reddit_single_context_mpnet_base_en.md new file mode 100644 index 00000000000000..855c12b30641ce --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_reddit_single_context_mpnet_base_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_reddit_single_context_mpnet_base TFMPNetModel from flax-sentence-embeddings +author: John Snow Labs +name: mpnet_embedding_reddit_single_context_mpnet_base +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_reddit_single_context_mpnet_base` is a English model originally trained by flax-sentence-embeddings. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_reddit_single_context_mpnet_base_en_5.1.0_3.0_1692376625049.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_reddit_single_context_mpnet_base_en_5.1.0_3.0_1692376625049.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_reddit_single_context_mpnet_base", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_reddit_single_context_mpnet_base", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_reddit_single_context_mpnet_base| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.5 MB| \ No newline at end of file From bce3fc380d29d61b79d54ff4613c94c8c72c522f Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:04:50 +0700 Subject: [PATCH 112/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_en --- ...fit_zero_shot_classification_pbsp_p1_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_en.md new file mode 100644 index 00000000000000..19754dca56e6e9 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p1 TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p1 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p1` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_en_5.1.0_3.0_1692379514816.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p1_en_5.1.0_3.0_1692379514816.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p1", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p1", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p1| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From cc2e240cb300506c3aad3fd327107ac2363e0801 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:05:50 +0700 Subject: [PATCH 113/155] Add model 2023-08-18-mpnet_embedding_java_expand_classifier_en --- ...net_embedding_java_expand_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_expand_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_expand_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_expand_classifier_en.md new file mode 100644 index 00000000000000..377df7c2b7e566 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_expand_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_java_expand_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_java_expand_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_java_expand_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_expand_classifier_en_5.1.0_3.0_1692378008915.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_expand_classifier_en_5.1.0_3.0_1692378008915.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_java_expand_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_java_expand_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_java_expand_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 97b0e6a564deea66c51941d386fe886d6a488698 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:06:50 +0700 Subject: [PATCH 114/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_achiev_en --- ...o_shot_classification_pbsp_p4_achiev_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_achiev_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_achiev_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_achiev_en.md new file mode 100644 index 00000000000000..53892db8c8776f --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_achiev_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_achiev TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_achiev +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_achiev` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_achiev_en_5.1.0_3.0_1692381069742.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_achiev_en_5.1.0_3.0_1692381069742.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_achiev", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_achiev", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_achiev| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 4f28b550f28f82d34f16b95ac01b671793853b12 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:07:51 +0700 Subject: [PATCH 115/155] Add model 2023-08-18-mpnet_embedding_BioLORD_STAMB2_v1_en --- ...18-mpnet_embedding_BioLORD_STAMB2_v1_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_BioLORD_STAMB2_v1_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_BioLORD_STAMB2_v1_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_BioLORD_STAMB2_v1_en.md new file mode 100644 index 00000000000000..f19db3f3446eac --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_BioLORD_STAMB2_v1_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_BioLORD_STAMB2_v1 TFMPNetModel from FremyCompany +author: John Snow Labs +name: mpnet_embedding_BioLORD_STAMB2_v1 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_BioLORD_STAMB2_v1` is a English model originally trained by FremyCompany. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_BioLORD_STAMB2_v1_en_5.1.0_3.0_1692377544444.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_BioLORD_STAMB2_v1_en_5.1.0_3.0_1692377544444.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_BioLORD_STAMB2_v1", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_BioLORD_STAMB2_v1", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_BioLORD_STAMB2_v1| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 98891c184381fd4b27f52d2da8821e6dd8d4e486 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:08:51 +0700 Subject: [PATCH 116/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_bhvr_en --- ...ero_shot_classification_pbsp_p3_bhvr_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_bhvr_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_bhvr_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_bhvr_en.md new file mode 100644 index 00000000000000..93bf47a1c3ee14 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_bhvr_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_bhvr TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_bhvr +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_bhvr` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_bhvr_en_5.1.0_3.0_1692379700885.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_bhvr_en_5.1.0_3.0_1692379700885.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_bhvr", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_bhvr", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_bhvr| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 39e62f00fb2263f738d15a265e6ffec5463bc7a6 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:09:51 +0700 Subject: [PATCH 117/155] Add model 2023-08-18-mpnet_embedding_fail_detect_en --- ...23-08-18-mpnet_embedding_fail_detect_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_fail_detect_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_fail_detect_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_fail_detect_en.md new file mode 100644 index 00000000000000..6040e046ce6d0d --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_fail_detect_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_fail_detect TFMPNetModel from Ngit +author: John Snow Labs +name: mpnet_embedding_fail_detect +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_fail_detect` is a English model originally trained by Ngit. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_fail_detect_en_5.1.0_3.0_1692379164436.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_fail_detect_en_5.1.0_3.0_1692379164436.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_fail_detect", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_fail_detect", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_fail_detect| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 7510d203f01febfc3504cb5d0ea292a37cbaf1df Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:10:51 +0700 Subject: [PATCH 118/155] Add model 2023-08-18-mpnet_embedding_CPU_Target_Classifier_en --- ...pnet_embedding_CPU_Target_Classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Target_Classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Target_Classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Target_Classifier_en.md new file mode 100644 index 00000000000000..046ce8c8c4938e --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Target_Classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_CPU_Target_Classifier TFMPNetModel from mtyrrell +author: John Snow Labs +name: mpnet_embedding_CPU_Target_Classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_CPU_Target_Classifier` is a English model originally trained by mtyrrell. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_CPU_Target_Classifier_en_5.1.0_3.0_1692381136016.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_CPU_Target_Classifier_en_5.1.0_3.0_1692381136016.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_CPU_Target_Classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_CPU_Target_Classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_CPU_Target_Classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 89f00bc1aba77a25a15804a7c79c08e3eb1c2c70 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:11:52 +0700 Subject: [PATCH 119/155] Add model 2023-08-18-mpnet_embedding_kw_classification_setfit_model_en --- ...dding_kw_classification_setfit_model_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_kw_classification_setfit_model_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_kw_classification_setfit_model_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_kw_classification_setfit_model_en.md new file mode 100644 index 00000000000000..65cef5eb694cf5 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_kw_classification_setfit_model_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_kw_classification_setfit_model TFMPNetModel from gyuri2020 +author: John Snow Labs +name: mpnet_embedding_kw_classification_setfit_model +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_kw_classification_setfit_model` is a English model originally trained by gyuri2020. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_kw_classification_setfit_model_en_5.1.0_3.0_1692379109726.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_kw_classification_setfit_model_en_5.1.0_3.0_1692379109726.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_kw_classification_setfit_model", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_kw_classification_setfit_model", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_kw_classification_setfit_model| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From bd6142a0c861b50a90e5daf19a8b700e3799ed5a Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:12:52 +0700 Subject: [PATCH 120/155] Add model 2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_en --- ...t_embedding_paraphrase_mpnet_base_v2_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_en.md new file mode 100644 index 00000000000000..1dabd46f84cf5d --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_paraphrase_mpnet_base_v2 TFMPNetModel from sentence-transformers +author: John Snow Labs +name: mpnet_embedding_paraphrase_mpnet_base_v2 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_paraphrase_mpnet_base_v2` is a English model originally trained by sentence-transformers. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_paraphrase_mpnet_base_v2_en_5.1.0_3.0_1692378194037.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_paraphrase_mpnet_base_v2_en_5.1.0_3.0_1692378194037.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_paraphrase_mpnet_base_v2", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_paraphrase_mpnet_base_v2", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_paraphrase_mpnet_base_v2| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 3b1b90edc2e9cfe3c768701d040b061bdee40424 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:13:53 +0700 Subject: [PATCH 121/155] Add model 2023-08-18-mpnet_embedding_abstract_sim_sentence_en --- ...pnet_embedding_abstract_sim_sentence_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_abstract_sim_sentence_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_abstract_sim_sentence_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_abstract_sim_sentence_en.md new file mode 100644 index 00000000000000..7a6ef57f054364 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_abstract_sim_sentence_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_abstract_sim_sentence TFMPNetModel from biu-nlp +author: John Snow Labs +name: mpnet_embedding_abstract_sim_sentence +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_abstract_sim_sentence` is a English model originally trained by biu-nlp. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_abstract_sim_sentence_en_5.1.0_3.0_1692378160891.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_abstract_sim_sentence_en_5.1.0_3.0_1692378160891.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_abstract_sim_sentence", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_abstract_sim_sentence", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_abstract_sim_sentence| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.5 MB| \ No newline at end of file From 614cfc88a8fc19d763139f1e28f56eb63ae038d5 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:14:53 +0700 Subject: [PATCH 122/155] Add model 2023-08-18-mpnet_embedding_setfit_ds_version_0_0_2_en --- ...et_embedding_setfit_ds_version_0_0_2_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_2_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_2_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_2_en.md new file mode 100644 index 00000000000000..5472a869b11882 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_2_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_ds_version_0_0_2 TFMPNetModel from amittian +author: John Snow Labs +name: mpnet_embedding_setfit_ds_version_0_0_2 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_ds_version_0_0_2` is a English model originally trained by amittian. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ds_version_0_0_2_en_5.1.0_3.0_1692377560015.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ds_version_0_0_2_en_5.1.0_3.0_1692377560015.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_ds_version_0_0_2", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_ds_version_0_0_2", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_ds_version_0_0_2| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 3f9e8848dc19daf135b95d87aeec774ce57d7a7b Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:15:53 +0700 Subject: [PATCH 123/155] Add model 2023-08-18-mpnet_embedding_mpnet_mnr_v2_fine_tuned_en --- ...et_embedding_mpnet_mnr_v2_fine_tuned_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_mnr_v2_fine_tuned_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_mnr_v2_fine_tuned_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_mnr_v2_fine_tuned_en.md new file mode 100644 index 00000000000000..0292e2bfd78e0b --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_mnr_v2_fine_tuned_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_mpnet_mnr_v2_fine_tuned TFMPNetModel from BlazingFringe +author: John Snow Labs +name: mpnet_embedding_mpnet_mnr_v2_fine_tuned +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_mpnet_mnr_v2_fine_tuned` is a English model originally trained by BlazingFringe. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_mnr_v2_fine_tuned_en_5.1.0_3.0_1692379624059.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_mnr_v2_fine_tuned_en_5.1.0_3.0_1692379624059.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_mpnet_mnr_v2_fine_tuned", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_mpnet_mnr_v2_fine_tuned", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_mpnet_mnr_v2_fine_tuned| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.1 MB| \ No newline at end of file From 2977acaf9c3440418780f8a2a5d94f3441febf5f Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:16:53 +0700 Subject: [PATCH 124/155] Add model 2023-08-18-mpnet_embedding_kw_classification_setfithead_model_en --- ...g_kw_classification_setfithead_model_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_kw_classification_setfithead_model_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_kw_classification_setfithead_model_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_kw_classification_setfithead_model_en.md new file mode 100644 index 00000000000000..a28f3cc26cf82f --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_kw_classification_setfithead_model_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_kw_classification_setfithead_model TFMPNetModel from gyuri2020 +author: John Snow Labs +name: mpnet_embedding_kw_classification_setfithead_model +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_kw_classification_setfithead_model` is a English model originally trained by gyuri2020. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_kw_classification_setfithead_model_en_5.1.0_3.0_1692379488047.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_kw_classification_setfithead_model_en_5.1.0_3.0_1692379488047.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_kw_classification_setfithead_model", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_kw_classification_setfithead_model", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_kw_classification_setfithead_model| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 76b6a45c88bbb019fba069a5e5ac1fa77cd7aa06 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:17:54 +0700 Subject: [PATCH 125/155] Add model 2023-08-18-mpnet_embedding_due_retail_25_en --- ...-08-18-mpnet_embedding_due_retail_25_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_due_retail_25_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_due_retail_25_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_due_retail_25_en.md new file mode 100644 index 00000000000000..9a2a763086bb5f --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_due_retail_25_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_due_retail_25 TFMPNetModel from konverner +author: John Snow Labs +name: mpnet_embedding_due_retail_25 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_due_retail_25` is a English model originally trained by konverner. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_due_retail_25_en_5.1.0_3.0_1692377165556.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_due_retail_25_en_5.1.0_3.0_1692377165556.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_due_retail_25", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_due_retail_25", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_due_retail_25| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 3b1dd5bdf7c12d86650f9098f6db59aee2369ba2 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:18:54 +0700 Subject: [PATCH 126/155] Add model 2023-08-18-mpnet_embedding_mpnet_multilabel_sector_classifier_en --- ...g_mpnet_multilabel_sector_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_multilabel_sector_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_multilabel_sector_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_multilabel_sector_classifier_en.md new file mode 100644 index 00000000000000..cbba07db508f41 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_multilabel_sector_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_mpnet_multilabel_sector_classifier TFMPNetModel from ppsingh +author: John Snow Labs +name: mpnet_embedding_mpnet_multilabel_sector_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_mpnet_multilabel_sector_classifier` is a English model originally trained by ppsingh. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_multilabel_sector_classifier_en_5.1.0_3.0_1692380031162.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_multilabel_sector_classifier_en_5.1.0_3.0_1692380031162.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_mpnet_multilabel_sector_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_mpnet_multilabel_sector_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_mpnet_multilabel_sector_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 753259d86ac76f03ba7b32c54cd8e581d1c7d812 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:19:54 +0700 Subject: [PATCH 127/155] Add model 2023-08-18-mpnet_embedding_vulnerable_groups_en --- ...18-mpnet_embedding_vulnerable_groups_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_vulnerable_groups_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_vulnerable_groups_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_vulnerable_groups_en.md new file mode 100644 index 00000000000000..928e15998a7590 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_vulnerable_groups_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_vulnerable_groups TFMPNetModel from leavoigt +author: John Snow Labs +name: mpnet_embedding_vulnerable_groups +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_vulnerable_groups` is a English model originally trained by leavoigt. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_vulnerable_groups_en_5.1.0_3.0_1692377757868.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_vulnerable_groups_en_5.1.0_3.0_1692377757868.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_vulnerable_groups", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_vulnerable_groups", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_vulnerable_groups| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 4d0fd08e7cd3a50eb7cc3f8960d2821bb6f2ecbc Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:20:54 +0700 Subject: [PATCH 128/155] Add model 2023-08-18-mpnet_embedding_abstract_sim_query_en --- ...8-mpnet_embedding_abstract_sim_query_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_abstract_sim_query_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_abstract_sim_query_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_abstract_sim_query_en.md new file mode 100644 index 00000000000000..02fddd33189f45 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_abstract_sim_query_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_abstract_sim_query TFMPNetModel from biu-nlp +author: John Snow Labs +name: mpnet_embedding_abstract_sim_query +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_abstract_sim_query` is a English model originally trained by biu-nlp. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_abstract_sim_query_en_5.1.0_3.0_1692378373665.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_abstract_sim_query_en_5.1.0_3.0_1692378373665.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_abstract_sim_query", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_abstract_sim_query", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_abstract_sim_query| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.5 MB| \ No newline at end of file From 37af5aa2570d35e1141e84e785d6589a208f4ee1 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:21:55 +0700 Subject: [PATCH 129/155] Add model 2023-08-18-mpnet_embedding_biencoder_multi_qa_mpnet_base_cos_v1_mmarcoFR_fr --- ..._multi_qa_mpnet_base_cos_v1_mmarcoFR_fr.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_biencoder_multi_qa_mpnet_base_cos_v1_mmarcoFR_fr.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_biencoder_multi_qa_mpnet_base_cos_v1_mmarcoFR_fr.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_biencoder_multi_qa_mpnet_base_cos_v1_mmarcoFR_fr.md new file mode 100644 index 00000000000000..e477d499a0aa32 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_biencoder_multi_qa_mpnet_base_cos_v1_mmarcoFR_fr.md @@ -0,0 +1,88 @@ +--- +layout: model +title: French mpnet_embedding_biencoder_multi_qa_mpnet_base_cos_v1_mmarcoFR TFMPNetModel from antoinelouis +author: John Snow Labs +name: mpnet_embedding_biencoder_multi_qa_mpnet_base_cos_v1_mmarcoFR +date: 2023-08-18 +tags: [mpnet, fr, open_source, tensorflow] +task: Embeddings +language: fr +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_biencoder_multi_qa_mpnet_base_cos_v1_mmarcoFR` is a French model originally trained by antoinelouis. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_biencoder_multi_qa_mpnet_base_cos_v1_mmarcoFR_fr_5.1.0_3.0_1692379417337.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_biencoder_multi_qa_mpnet_base_cos_v1_mmarcoFR_fr_5.1.0_3.0_1692379417337.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_biencoder_multi_qa_mpnet_base_cos_v1_mmarcoFR", "fr")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_biencoder_multi_qa_mpnet_base_cos_v1_mmarcoFR", "fr") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_biencoder_multi_qa_mpnet_base_cos_v1_mmarcoFR| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|fr| +|Size:|410.0 MB| \ No newline at end of file From c23c39903e3b9783c141a3adf9b62e9f4ea35eb3 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:22:56 +0700 Subject: [PATCH 130/155] Add model 2023-08-18-mpnet_embedding_stsb_mpnet_base_v2_en --- ...8-mpnet_embedding_stsb_mpnet_base_v2_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_stsb_mpnet_base_v2_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_stsb_mpnet_base_v2_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_stsb_mpnet_base_v2_en.md new file mode 100644 index 00000000000000..e3776e0d3da104 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_stsb_mpnet_base_v2_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_stsb_mpnet_base_v2 TFMPNetModel from sentence-transformers +author: John Snow Labs +name: mpnet_embedding_stsb_mpnet_base_v2 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_stsb_mpnet_base_v2` is a English model originally trained by sentence-transformers. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_stsb_mpnet_base_v2_en_5.1.0_3.0_1692378378354.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_stsb_mpnet_base_v2_en_5.1.0_3.0_1692378378354.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_stsb_mpnet_base_v2", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_stsb_mpnet_base_v2", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_stsb_mpnet_base_v2| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|408.6 MB| \ No newline at end of file From f803020bf85b1cafa13de27247e06bfb456ea49a Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:23:55 +0700 Subject: [PATCH 131/155] Add model 2023-08-18-mpnet_embedding_covid_qa_mpnet_en --- ...08-18-mpnet_embedding_covid_qa_mpnet_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_covid_qa_mpnet_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_covid_qa_mpnet_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_covid_qa_mpnet_en.md new file mode 100644 index 00000000000000..99a354419a1a37 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_covid_qa_mpnet_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_covid_qa_mpnet TFMPNetModel from shaina +author: John Snow Labs +name: mpnet_embedding_covid_qa_mpnet +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_covid_qa_mpnet` is a English model originally trained by shaina. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_covid_qa_mpnet_en_5.1.0_3.0_1692378777451.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_covid_qa_mpnet_en_5.1.0_3.0_1692378777451.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_covid_qa_mpnet", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_covid_qa_mpnet", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_covid_qa_mpnet| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|402.6 MB| \ No newline at end of file From d78aed371155901467abd1765b754ac012b0bad6 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:24:56 +0700 Subject: [PATCH 132/155] Add model 2023-08-18-mpnet_embedding_CPU_Economywide_Classifier_en --- ...embedding_CPU_Economywide_Classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Economywide_Classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Economywide_Classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Economywide_Classifier_en.md new file mode 100644 index 00000000000000..757624791b4dbd --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Economywide_Classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_CPU_Economywide_Classifier TFMPNetModel from mtyrrell +author: John Snow Labs +name: mpnet_embedding_CPU_Economywide_Classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_CPU_Economywide_Classifier` is a English model originally trained by mtyrrell. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_CPU_Economywide_Classifier_en_5.1.0_3.0_1692379293006.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_CPU_Economywide_Classifier_en_5.1.0_3.0_1692379293006.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_CPU_Economywide_Classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_CPU_Economywide_Classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_CPU_Economywide_Classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 59c5f6849a5e3f08cdb400d1e5561222a6c968a5 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:25:56 +0700 Subject: [PATCH 133/155] Add model 2023-08-18-mpnet_embedding_github_issues_mpnet_st_e10_en --- ...embedding_github_issues_mpnet_st_e10_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_github_issues_mpnet_st_e10_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_github_issues_mpnet_st_e10_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_github_issues_mpnet_st_e10_en.md new file mode 100644 index 00000000000000..9326717aa52c89 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_github_issues_mpnet_st_e10_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_github_issues_mpnet_st_e10 TFMPNetModel from Collab-uniba +author: John Snow Labs +name: mpnet_embedding_github_issues_mpnet_st_e10 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_github_issues_mpnet_st_e10` is a English model originally trained by Collab-uniba. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_github_issues_mpnet_st_e10_en_5.1.0_3.0_1692376620750.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_github_issues_mpnet_st_e10_en_5.1.0_3.0_1692376620750.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_github_issues_mpnet_st_e10", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_github_issues_mpnet_st_e10", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_github_issues_mpnet_st_e10| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.3 MB| \ No newline at end of file From 2e8ead0a9d0de7f6072cead73319dad047e92049 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:26:56 +0700 Subject: [PATCH 134/155] Add model 2023-08-18-mpnet_embedding_esci_jp_mpnet_crossencoder_en --- ...embedding_esci_jp_mpnet_crossencoder_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_esci_jp_mpnet_crossencoder_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_esci_jp_mpnet_crossencoder_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_esci_jp_mpnet_crossencoder_en.md new file mode 100644 index 00000000000000..a7d130c9112a1c --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_esci_jp_mpnet_crossencoder_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_esci_jp_mpnet_crossencoder TFMPNetModel from spacemanidol +author: John Snow Labs +name: mpnet_embedding_esci_jp_mpnet_crossencoder +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_esci_jp_mpnet_crossencoder` is a English model originally trained by spacemanidol. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_esci_jp_mpnet_crossencoder_en_5.1.0_3.0_1692380712380.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_esci_jp_mpnet_crossencoder_en_5.1.0_3.0_1692380712380.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_esci_jp_mpnet_crossencoder", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_esci_jp_mpnet_crossencoder", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_esci_jp_mpnet_crossencoder| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 77700500f917995b747852ff67ab74256270db2b Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:27:57 +0700 Subject: [PATCH 135/155] Add model 2023-08-18-mpnet_embedding_CPU_Netzero_Classifier_en --- ...net_embedding_CPU_Netzero_Classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Netzero_Classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Netzero_Classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Netzero_Classifier_en.md new file mode 100644 index 00000000000000..cb3712d55153b6 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Netzero_Classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_CPU_Netzero_Classifier TFMPNetModel from mtyrrell +author: John Snow Labs +name: mpnet_embedding_CPU_Netzero_Classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_CPU_Netzero_Classifier` is a English model originally trained by mtyrrell. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_CPU_Netzero_Classifier_en_5.1.0_3.0_1692380071810.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_CPU_Netzero_Classifier_en_5.1.0_3.0_1692380071810.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_CPU_Netzero_Classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_CPU_Netzero_Classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_CPU_Netzero_Classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.1 MB| \ No newline at end of file From 90ec2edcb52cfc038430e1cb6343dc7acf7ab56f Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:28:57 +0700 Subject: [PATCH 136/155] Add model 2023-08-18-mpnet_embedding_eth_setfit_payment_model_en --- ...t_embedding_eth_setfit_payment_model_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_eth_setfit_payment_model_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_eth_setfit_payment_model_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_eth_setfit_payment_model_en.md new file mode 100644 index 00000000000000..9b60461052880d --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_eth_setfit_payment_model_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_eth_setfit_payment_model TFMPNetModel from kainxwang +author: John Snow Labs +name: mpnet_embedding_eth_setfit_payment_model +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_eth_setfit_payment_model` is a English model originally trained by kainxwang. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_eth_setfit_payment_model_en_5.1.0_3.0_1692380076037.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_eth_setfit_payment_model_en_5.1.0_3.0_1692380076037.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_eth_setfit_payment_model", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_eth_setfit_payment_model", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_eth_setfit_payment_model| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From eb4d293a4b0129eb632269624cfde71a463c2cd0 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:29:57 +0700 Subject: [PATCH 137/155] Add model 2023-08-18-mpnet_embedding_PDFSegs_en --- .../2023-08-18-mpnet_embedding_PDFSegs_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_PDFSegs_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_PDFSegs_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_PDFSegs_en.md new file mode 100644 index 00000000000000..2cf1cf933def81 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_PDFSegs_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_PDFSegs TFMPNetModel from HamzaFarhan +author: John Snow Labs +name: mpnet_embedding_PDFSegs +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_PDFSegs` is a English model originally trained by HamzaFarhan. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_PDFSegs_en_5.1.0_3.0_1692378985073.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_PDFSegs_en_5.1.0_3.0_1692378985073.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_PDFSegs", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_PDFSegs", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_PDFSegs| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 72b72cf9881eeaf736ef6bd49f9991a383311cf7 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:30:57 +0700 Subject: [PATCH 138/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_dur_en --- ...zero_shot_classification_pbsp_p3_dur_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_dur_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_dur_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_dur_en.md new file mode 100644 index 00000000000000..3a1faacf91fbd8 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_dur_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_dur TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_dur +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_dur` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_dur_en_5.1.0_3.0_1692379876045.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_dur_en_5.1.0_3.0_1692379876045.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_dur", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_dur", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_dur| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From bb57de4ff0738eda3dbaae16c8afe2bdf6f816df Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:31:58 +0700 Subject: [PATCH 139/155] Add model 2023-08-18-mpnet_embedding_multi_qa_v1_mpnet_cls_dot_en --- ..._embedding_multi_qa_v1_mpnet_cls_dot_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_v1_mpnet_cls_dot_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_v1_mpnet_cls_dot_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_v1_mpnet_cls_dot_en.md new file mode 100644 index 00000000000000..3b9b76913b6c0c --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_qa_v1_mpnet_cls_dot_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_multi_qa_v1_mpnet_cls_dot TFMPNetModel from flax-sentence-embeddings +author: John Snow Labs +name: mpnet_embedding_multi_qa_v1_mpnet_cls_dot +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_multi_qa_v1_mpnet_cls_dot` is a English model originally trained by flax-sentence-embeddings. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_v1_mpnet_cls_dot_en_5.1.0_3.0_1692376421210.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_qa_v1_mpnet_cls_dot_en_5.1.0_3.0_1692376421210.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_multi_qa_v1_mpnet_cls_dot", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_multi_qa_v1_mpnet_cls_dot", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_multi_qa_v1_mpnet_cls_dot| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From a3f515482cd813c6aef8d782182676220b624029 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:32:59 +0700 Subject: [PATCH 140/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_specific_en --- ...shot_classification_pbsp_p4_specific_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_specific_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_specific_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_specific_en.md new file mode 100644 index 00000000000000..3d90d68f5ef02c --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_specific_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_specific TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_specific +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_specific` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_specific_en_5.1.0_3.0_1692380711935.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_specific_en_5.1.0_3.0_1692380711935.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_specific", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_specific", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p4_specific| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From cf44bf06638f1446635e3303adc958e35aa534b6 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:33:58 +0700 Subject: [PATCH 141/155] Add model 2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_cons_en --- ...ero_shot_classification_pbsp_p3_cons_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_cons_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_cons_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_cons_en.md new file mode 100644 index 00000000000000..5df40d6706c611 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_cons_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_cons TFMPNetModel from aammari +author: John Snow Labs +name: mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_cons +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_cons` is a English model originally trained by aammari. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_cons_en_5.1.0_3.0_1692380608777.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_cons_en_5.1.0_3.0_1692380608777.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_cons", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_cons", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_zero_shot_classification_pbsp_p3_cons| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From e503a0ce5799d5f6c2a9bc142f6131b039a03cf3 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:34:59 +0700 Subject: [PATCH 142/155] Add model 2023-08-18-mpnet_embedding_CPU_Conditional_Classifier_en --- ...embedding_CPU_Conditional_Classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Conditional_Classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Conditional_Classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Conditional_Classifier_en.md new file mode 100644 index 00000000000000..918b41525c1172 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Conditional_Classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_CPU_Conditional_Classifier TFMPNetModel from mtyrrell +author: John Snow Labs +name: mpnet_embedding_CPU_Conditional_Classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_CPU_Conditional_Classifier` is a English model originally trained by mtyrrell. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_CPU_Conditional_Classifier_en_5.1.0_3.0_1692380267965.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_CPU_Conditional_Classifier_en_5.1.0_3.0_1692380267965.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_CPU_Conditional_Classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_CPU_Conditional_Classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_CPU_Conditional_Classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.2 MB| \ No newline at end of file From 9da2c56057013717cac390a01161fb27562230e2 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:35:59 +0700 Subject: [PATCH 143/155] Add model 2023-08-18-mpnet_embedding_tiny_random_MPNetForMaskedLM_en --- ...bedding_tiny_random_MPNetForMaskedLM_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForMaskedLM_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForMaskedLM_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForMaskedLM_en.md new file mode 100644 index 00000000000000..cce037ebd52a24 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_tiny_random_MPNetForMaskedLM_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_tiny_random_MPNetForMaskedLM TFMPNetModel from hf-tiny-model-private +author: John Snow Labs +name: mpnet_embedding_tiny_random_MPNetForMaskedLM +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_tiny_random_MPNetForMaskedLM` is a English model originally trained by hf-tiny-model-private. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_tiny_random_MPNetForMaskedLM_en_5.1.0_3.0_1692375647237.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_tiny_random_MPNetForMaskedLM_en_5.1.0_3.0_1692375647237.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_tiny_random_MPNetForMaskedLM", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_tiny_random_MPNetForMaskedLM", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_tiny_random_MPNetForMaskedLM| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|1.2 MB| \ No newline at end of file From 08691203e2a3348686ef439aa2343853260a0320 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:36:59 +0700 Subject: [PATCH 144/155] Add model 2023-08-18-mpnet_embedding_PatentSBERTa_V2_en --- ...8-18-mpnet_embedding_PatentSBERTa_V2_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_PatentSBERTa_V2_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_PatentSBERTa_V2_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_PatentSBERTa_V2_en.md new file mode 100644 index 00000000000000..f9d4fca88cea02 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_PatentSBERTa_V2_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_PatentSBERTa_V2 TFMPNetModel from AAUBS +author: John Snow Labs +name: mpnet_embedding_PatentSBERTa_V2 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_PatentSBERTa_V2` is a English model originally trained by AAUBS. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_PatentSBERTa_V2_en_5.1.0_3.0_1692380355761.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_PatentSBERTa_V2_en_5.1.0_3.0_1692380355761.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_PatentSBERTa_V2", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_PatentSBERTa_V2", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_PatentSBERTa_V2| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.3 MB| \ No newline at end of file From 060b3c5cd121cc2cde78d38232908321358b4ed0 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:38:00 +0700 Subject: [PATCH 145/155] Add model 2023-08-18-mpnet_embedding_sml_ukr_message_classifier_en --- ...embedding_sml_ukr_message_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sml_ukr_message_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sml_ukr_message_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sml_ukr_message_classifier_en.md new file mode 100644 index 00000000000000..3b7865654bb278 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_sml_ukr_message_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_sml_ukr_message_classifier TFMPNetModel from rodekruis +author: John Snow Labs +name: mpnet_embedding_sml_ukr_message_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_sml_ukr_message_classifier` is a English model originally trained by rodekruis. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_sml_ukr_message_classifier_en_5.1.0_3.0_1692375430011.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_sml_ukr_message_classifier_en_5.1.0_3.0_1692375430011.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_sml_ukr_message_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_sml_ukr_message_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_sml_ukr_message_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.8 MB| \ No newline at end of file From 4935d5f871d52d69f79504b1f7715f698efdc679 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:39:00 +0700 Subject: [PATCH 146/155] Add model 2023-08-18-mpnet_embedding_CPU_Transport_GHG_Classifier_en --- ...bedding_CPU_Transport_GHG_Classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Transport_GHG_Classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Transport_GHG_Classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Transport_GHG_Classifier_en.md new file mode 100644 index 00000000000000..0c1cad11d00706 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_CPU_Transport_GHG_Classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_CPU_Transport_GHG_Classifier TFMPNetModel from mtyrrell +author: John Snow Labs +name: mpnet_embedding_CPU_Transport_GHG_Classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_CPU_Transport_GHG_Classifier` is a English model originally trained by mtyrrell. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_CPU_Transport_GHG_Classifier_en_5.1.0_3.0_1692379880190.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_CPU_Transport_GHG_Classifier_en_5.1.0_3.0_1692379880190.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_CPU_Transport_GHG_Classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_CPU_Transport_GHG_Classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_CPU_Transport_GHG_Classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.3 MB| \ No newline at end of file From 47deabb277728a60db5236ba740bea9777ff3a40 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:40:00 +0700 Subject: [PATCH 147/155] Add model 2023-08-18-mpnet_embedding_mpnet_nli_sts_en --- ...-08-18-mpnet_embedding_mpnet_nli_sts_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_nli_sts_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_nli_sts_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_nli_sts_en.md new file mode 100644 index 00000000000000..3bd76a40198054 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_mpnet_nli_sts_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_mpnet_nli_sts TFMPNetModel from jamescalam +author: John Snow Labs +name: mpnet_embedding_mpnet_nli_sts +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_mpnet_nli_sts` is a English model originally trained by jamescalam. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_nli_sts_en_5.1.0_3.0_1692375971185.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_mpnet_nli_sts_en_5.1.0_3.0_1692375971185.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_mpnet_nli_sts", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_mpnet_nli_sts", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_mpnet_nli_sts| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|402.0 MB| \ No newline at end of file From b933e8bee5240b9fdae912886ba075045f34eeab Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:41:00 +0700 Subject: [PATCH 148/155] Add model 2023-08-18-mpnet_embedding_initial_model_v3_en --- ...-18-mpnet_embedding_initial_model_v3_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_initial_model_v3_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_initial_model_v3_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_initial_model_v3_en.md new file mode 100644 index 00000000000000..f171a65e56c7c1 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_initial_model_v3_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_initial_model_v3 TFMPNetModel from ishan +author: John Snow Labs +name: mpnet_embedding_initial_model_v3 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_initial_model_v3` is a English model originally trained by ishan. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_initial_model_v3_en_5.1.0_3.0_1692378725094.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_initial_model_v3_en_5.1.0_3.0_1692378725094.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_initial_model_v3", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_initial_model_v3", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_initial_model_v3| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.0 MB| \ No newline at end of file From d37170728256963a6fb3a0f88efc254def5d3f2f Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:42:01 +0700 Subject: [PATCH 149/155] Add model 2023-08-18-mpnet_embedding_multi_QA_v1_mpnet_asymmetric_Q_en --- ...dding_multi_QA_v1_mpnet_asymmetric_Q_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_QA_v1_mpnet_asymmetric_Q_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_QA_v1_mpnet_asymmetric_Q_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_QA_v1_mpnet_asymmetric_Q_en.md new file mode 100644 index 00000000000000..97983765fa4b09 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_multi_QA_v1_mpnet_asymmetric_Q_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_multi_QA_v1_mpnet_asymmetric_Q TFMPNetModel from flax-sentence-embeddings +author: John Snow Labs +name: mpnet_embedding_multi_QA_v1_mpnet_asymmetric_Q +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_multi_QA_v1_mpnet_asymmetric_Q` is a English model originally trained by flax-sentence-embeddings. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_QA_v1_mpnet_asymmetric_Q_en_5.1.0_3.0_1692376212995.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_multi_QA_v1_mpnet_asymmetric_Q_en_5.1.0_3.0_1692376212995.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_multi_QA_v1_mpnet_asymmetric_Q", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_multi_QA_v1_mpnet_asymmetric_Q", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_multi_QA_v1_mpnet_asymmetric_Q| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.3 MB| \ No newline at end of file From 1ce54ac43bb9e39665baaaa3edd1616ad058805f Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:43:01 +0700 Subject: [PATCH 150/155] Add model 2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_finetuned_polifact_en --- ...ase_mpnet_base_v2_finetuned_polifact_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_finetuned_polifact_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_finetuned_polifact_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_finetuned_polifact_en.md new file mode 100644 index 00000000000000..9b2bcf70c35b7e --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_paraphrase_mpnet_base_v2_finetuned_polifact_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_paraphrase_mpnet_base_v2_finetuned_polifact TFMPNetModel from anuj55 +author: John Snow Labs +name: mpnet_embedding_paraphrase_mpnet_base_v2_finetuned_polifact +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_paraphrase_mpnet_base_v2_finetuned_polifact` is a English model originally trained by anuj55. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_paraphrase_mpnet_base_v2_finetuned_polifact_en_5.1.0_3.0_1692380188177.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_paraphrase_mpnet_base_v2_finetuned_polifact_en_5.1.0_3.0_1692380188177.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_paraphrase_mpnet_base_v2_finetuned_polifact", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_paraphrase_mpnet_base_v2_finetuned_polifact", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_paraphrase_mpnet_base_v2_finetuned_polifact| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 9d5033a7388c29854467568ea3265764a68b8052 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:44:01 +0700 Subject: [PATCH 151/155] Add model 2023-08-18-mpnet_embedding_test_model_001_en --- ...08-18-mpnet_embedding_test_model_001_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_test_model_001_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_test_model_001_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_test_model_001_en.md new file mode 100644 index 00000000000000..0507ba529141ac --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_test_model_001_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_test_model_001 TFMPNetModel from intellya22 +author: John Snow Labs +name: mpnet_embedding_test_model_001 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_test_model_001` is a English model originally trained by intellya22. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_test_model_001_en_5.1.0_3.0_1692380796219.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_test_model_001_en_5.1.0_3.0_1692380796219.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_test_model_001", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_test_model_001", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_test_model_001| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.0 MB| \ No newline at end of file From a62698ae51b0d5ad692537500618f8193cef567a Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:45:02 +0700 Subject: [PATCH 152/155] Add model 2023-08-18-mpnet_embedding_java_rational_classifier_en --- ...t_embedding_java_rational_classifier_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_rational_classifier_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_rational_classifier_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_rational_classifier_en.md new file mode 100644 index 00000000000000..7bc4b726b21720 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_java_rational_classifier_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_java_rational_classifier TFMPNetModel from AISE-TUDelft +author: John Snow Labs +name: mpnet_embedding_java_rational_classifier +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_java_rational_classifier` is a English model originally trained by AISE-TUDelft. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_rational_classifier_en_5.1.0_3.0_1692378803769.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_java_rational_classifier_en_5.1.0_3.0_1692378803769.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_java_rational_classifier", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_java_rational_classifier", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_java_rational_classifier| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From b72a19d870c8464e69a9610b20a0fc83c6b1e618 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:46:02 +0700 Subject: [PATCH 153/155] Add model 2023-08-18-mpnet_embedding_setfit_ds_version_0_0_1_en --- ...et_embedding_setfit_ds_version_0_0_1_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_1_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_1_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_1_en.md new file mode 100644 index 00000000000000..cbbc3cf09bd754 --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_setfit_ds_version_0_0_1_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_setfit_ds_version_0_0_1 TFMPNetModel from amittian +author: John Snow Labs +name: mpnet_embedding_setfit_ds_version_0_0_1 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_setfit_ds_version_0_0_1` is a English model originally trained by amittian. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ds_version_0_0_1_en_5.1.0_3.0_1692377361480.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_setfit_ds_version_0_0_1_en_5.1.0_3.0_1692377361480.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_setfit_ds_version_0_0_1", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_setfit_ds_version_0_0_1", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_setfit_ds_version_0_0_1| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.9 MB| \ No newline at end of file From 5dabb2d37697a210f9c6f7b9bda6fd09bd172322 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:47:02 +0700 Subject: [PATCH 154/155] Add model 2023-08-18-mpnet_embedding_initial_model_en --- ...-08-18-mpnet_embedding_initial_model_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_initial_model_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_initial_model_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_initial_model_en.md new file mode 100644 index 00000000000000..4664c712ad782e --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_initial_model_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_initial_model TFMPNetModel from ishan +author: John Snow Labs +name: mpnet_embedding_initial_model +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_initial_model` is a English model originally trained by ishan. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_initial_model_en_5.1.0_3.0_1692378787120.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_initial_model_en_5.1.0_3.0_1692378787120.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_initial_model", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_initial_model", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_initial_model| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|410.0 MB| \ No newline at end of file From f7d79152223021e89037d219ca942095d2291257 Mon Sep 17 00:00:00 2001 From: ahmedlone127 Date: Sat, 19 Aug 2023 01:48:03 +0700 Subject: [PATCH 155/155] Add model 2023-08-18-mpnet_embedding_github_issues_preprocessed_mpnet_st_e10_en --- ...hub_issues_preprocessed_mpnet_st_e10_en.md | 88 +++++++++++++++++++ 1 file changed, 88 insertions(+) create mode 100644 docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_github_issues_preprocessed_mpnet_st_e10_en.md diff --git a/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_github_issues_preprocessed_mpnet_st_e10_en.md b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_github_issues_preprocessed_mpnet_st_e10_en.md new file mode 100644 index 00000000000000..dd0307f194cdff --- /dev/null +++ b/docs/_posts/ahmedlone127/2023-08-18-mpnet_embedding_github_issues_preprocessed_mpnet_st_e10_en.md @@ -0,0 +1,88 @@ +--- +layout: model +title: English mpnet_embedding_github_issues_preprocessed_mpnet_st_e10 TFMPNetModel from Collab-uniba +author: John Snow Labs +name: mpnet_embedding_github_issues_preprocessed_mpnet_st_e10 +date: 2023-08-18 +tags: [mpnet, en, open_source, tensorflow] +task: Embeddings +language: en +edition: Spark NLP 5.1.0 +spark_version: 3.0 +supported: true +engine: tensorflow +annotator: MPNetEmbeddings +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Pretrained mpnet model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`mpnet_embedding_github_issues_preprocessed_mpnet_st_e10` is a English model originally trained by Collab-uniba. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/mpnet_embedding_github_issues_preprocessed_mpnet_st_e10_en_5.1.0_3.0_1692378367005.zip){:.button.button-orange.button-orange-trans.arr.button-icon} +[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/mpnet_embedding_github_issues_preprocessed_mpnet_st_e10_en_5.1.0_3.0_1692378367005.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python + +document_assembler = DocumentAssembler() \ + .setInputCol("text") \ + .setOutputCol("documents") + +instruction = MPNetEmbeddings \ + .pretrained("mpnet_embedding_github_issues_preprocessed_mpnet_st_e10", "en")\ + .setInputCols(["documents"]) \ + .setOutputCol("mpnet_embeddings") + +pipeline = Pipeline(stages=[ + document_assembler, + instruction, +]) + +pipelineModel = pipeline.fit(data) + +pipelineDF = pipelineModel.transform(data) +``` +```scala + +val document_assembler = new DocumentAssembler() + .setInputCol("text") + .setOutputCol("documents") + +val instruction = MPNetEmbeddings + .pretrained("mpnet_embedding_github_issues_preprocessed_mpnet_st_e10", "en") + .setInputCols(Array("documents")) + .setOutputCol("mpnet_embeddings") + +val pipeline = new Pipeline().setStages(Array(document_assembler, instruction)) + +val pipelineModel = pipeline.fit(data) + +val pipelineDF = pipelineModel.transform(data) + +``` +
+ +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|mpnet_embedding_github_issues_preprocessed_mpnet_st_e10| +|Compatibility:|Spark NLP 5.1.0+| +|License:|Open Source| +|Edition:|Official| +|Input Labels:|[documents]| +|Output Labels:|[mpnet_embeddings]| +|Language:|en| +|Size:|409.0 MB| \ No newline at end of file