Models hub (#13876)
* Add model 2023-04-13-CyberbullyingDetection_ClassifierDL_tfhub_en (#13757)

Co-authored-by: Naveen-004 <chinna.nk4@gmail.com>

* 2023-04-20-distilbert_base_uncased_mnli_en (#13761)

* Add model 2023-04-20-distilbert_base_uncased_mnli_en

* Add model 2023-04-20-distilbert_base_turkish_cased_allnli_tr

* Add model 2023-04-20-distilbert_base_turkish_cased_snli_tr

* Add model 2023-04-20-distilbert_base_turkish_cased_multinli_tr

* Update and rename 2023-04-20-distilbert_base_turkish_cased_allnli_tr.md to 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_allnli_tr.md

* Update and rename 2023-04-20-distilbert_base_turkish_cased_multinli_tr.md to 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_multinli_tr.md

* Update and rename 2023-04-20-distilbert_base_turkish_cased_snli_tr.md to 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_snli_tr.md

* Update and rename 2023-04-20-distilbert_base_uncased_mnli_en.md to distilbert_base_zero_shot_classifier_turkish_cased_snli

* Rename distilbert_base_zero_shot_classifier_turkish_cased_snli to distilbert_base_zero_shot_classifier_turkish_cased_snli_en.md

* Update 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_snli_tr.md

* Update 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_multinli_tr.md

* Update 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_allnli_tr.md

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

* 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_multinli_tr (#13763)

* Add model 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_multinli_tr

* Add model 2023-04-20-distilbert_base_zero_shot_classifier_uncased_mnli_en

* Add model 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_snli_tr

* Add model 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_allnli_tr

* Update 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_multinli_tr.md

* Update 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_snli_tr.md

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

* 2023-05-04-roberta_base_zero_shot_classifier_nli_en (#13781)

* Add model 2023-05-04-roberta_base_zero_shot_classifier_nli_en

* Fix Spark version to 3.0

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>
Co-authored-by: Maziyar Panahi <maziyar.panahi@iscpif.fr>

* 2023-05-09-distilbart_xsum_6_6_en (#13788)

* Add model 2023-05-09-distilbart_xsum_6_6_en

* Add model 2023-05-09-distilbart_xsum_12_6_en

* Add model 2023-05-09-distilbart_cnn_12_6_en

* Add model 2023-05-09-distilbart_cnn_6_6_en

* Add model 2023-05-09-bart_large_cnn_en

* Update 2023-05-09-bart_large_cnn_en.md

* Update 2023-05-09-distilbart_cnn_12_6_en.md

* Update 2023-05-09-distilbart_cnn_6_6_en.md

* Update 2023-05-09-distilbart_xsum_12_6_en.md

* Update 2023-05-09-distilbart_xsum_6_6_en.md

---------

Co-authored-by: prabod <prabod@rathnayaka.me>
Co-authored-by: Maziyar Panahi <maziyar.panahi@iscpif.fr>

* 2023-05-11-distilbart_cnn_12_6_en (#13795)

* Add model 2023-05-11-distilbart_cnn_12_6_en

* Add model 2023-05-11-distilbart_cnn_6_6_en

* Add model 2023-05-11-distilbart_xsum_12_6_en

* Add model 2023-05-11-distilbart_xsum_6_6_en

* Add model 2023-05-11-bart_large_cnn_en

* Update 2023-05-11-bart_large_cnn_en.md

* Update 2023-05-11-distilbart_cnn_12_6_en.md

* Update 2023-05-11-distilbart_cnn_6_6_en.md

* Update 2023-05-11-distilbart_xsum_12_6_en.md

* Update 2023-05-11-distilbart_xsum_6_6_en.md

---------

Co-authored-by: prabod <prabod@rathnayaka.me>
Co-authored-by: Maziyar Panahi <maziyar.panahi@iscpif.fr>

* 2023-05-19-match_pattern_en (#13805)

* Add model 2023-05-19-match_pattern_en

* Add model 2023-05-19-dependency_parse_en

* Add model 2023-05-20-explain_document_md_fr

* Add model 2023-05-20-dependency_parse_en

* Add model 2023-05-20-explain_document_md_it

* Add model 2023-05-20-entity_recognizer_lg_fr

* Add model 2023-05-20-entity_recognizer_md_fr

* Add model 2023-05-20-entity_recognizer_lg_it

* Add model 2023-05-20-entity_recognizer_md_it

* Add model 2023-05-20-check_spelling_en

* Add model 2023-05-20-match_datetime_en

* Add model 2023-05-20-match_pattern_en

* Add model 2023-05-20-clean_pattern_en

* Add model 2023-05-20-clean_stop_en

* Add model 2023-05-20-movies_sentiment_analysis_en

* Add model 2023-05-20-explain_document_ml_en

* Add model 2023-05-20-analyze_sentiment_en

* Add model 2023-05-20-explain_document_dl_en

* Add model 2023-05-20-recognize_entities_dl_en

* Add model 2023-05-20-recognize_entities_bert_en

* Add model 2023-05-20-explain_document_md_de

* Add model 2023-05-21-entity_recognizer_lg_de

* Add model 2023-05-21-entity_recognizer_md_de

* Add model 2023-05-21-onto_recognize_entities_sm_en

* Add model 2023-05-21-onto_recognize_entities_lg_en

* Add model 2023-05-21-match_chunks_en

* Add model 2023-05-21-explain_document_lg_es

* Add model 2023-05-21-explain_document_md_es

* Add model 2023-05-21-explain_document_sm_es

* Add model 2023-05-21-entity_recognizer_lg_es

* Add model 2023-05-21-entity_recognizer_md_es

* Add model 2023-05-21-entity_recognizer_sm_es

* Add model 2023-05-21-explain_document_lg_ru

* Add model 2023-05-21-explain_document_md_ru

* Add model 2023-05-21-explain_document_sm_ru

* Add model 2023-05-21-entity_recognizer_lg_ru

* Add model 2023-05-21-entity_recognizer_md_ru

* Add model 2023-05-21-entity_recognizer_sm_ru

* Add model 2023-05-21-text_cleaning_en

* Add model 2023-05-21-explain_document_lg_pt

* Add model 2023-05-21-explain_document_md_pt

* Add model 2023-05-21-explain_document_sm_pt

* Add model 2023-05-21-entity_recognizer_lg_pt

* Add model 2023-05-21-entity_recognizer_md_pt

* Add model 2023-05-21-entity_recognizer_sm_pt

* Add model 2023-05-21-explain_document_lg_pl

* Add model 2023-05-21-explain_document_md_pl

* Add model 2023-05-21-explain_document_sm_pl

* Add model 2023-05-21-entity_recognizer_lg_pl

* Add model 2023-05-21-entity_recognizer_md_pl

* Add model 2023-05-21-entity_recognizer_sm_pl

* Add model 2023-05-21-explain_document_lg_nl

* Add model 2023-05-21-explain_document_md_nl

* Add model 2023-05-21-explain_document_sm_nl

* Add model 2023-05-21-entity_recognizer_lg_nl

* Add model 2023-05-21-entity_recognizer_md_nl

* Add model 2023-05-21-entity_recognizer_sm_nl

* Add model 2023-05-21-analyze_sentimentdl_glove_imdb_en

* Add model 2023-05-21-explain_document_lg_no

* Add model 2023-05-21-explain_document_md_no

* Add model 2023-05-21-explain_document_sm_no

* Add model 2023-05-21-entity_recognizer_lg_no

* Add model 2023-05-21-entity_recognizer_md_no

* Add model 2023-05-21-entity_recognizer_sm_no

* Add model 2023-05-21-explain_document_lg_sv

* Add model 2023-05-21-explain_document_md_sv

* Add model 2023-05-21-explain_document_sm_sv

* Add model 2023-05-21-entity_recognizer_lg_sv

* Add model 2023-05-21-entity_recognizer_md_sv

* Add model 2023-05-21-entity_recognizer_sm_sv

* Add model 2023-05-21-explain_document_lg_da

* Add model 2023-05-21-explain_document_md_da

* Add model 2023-05-21-explain_document_sm_da

* Add model 2023-05-21-entity_recognizer_lg_da

* Add model 2023-05-21-entity_recognizer_md_da

* Add model 2023-05-21-entity_recognizer_sm_da

* Add model 2023-05-21-explain_document_lg_fi

* Add model 2023-05-21-explain_document_md_fi

* Add model 2023-05-21-explain_document_sm_fi

* Add model 2023-05-21-entity_recognizer_lg_fi

* Add model 2023-05-21-entity_recognizer_md_fi

* Add model 2023-05-21-entity_recognizer_sm_fi

* Add model 2023-05-21-onto_recognize_entities_bert_base_en

* Add model 2023-05-21-onto_recognize_entities_bert_large_en

* Add model 2023-05-21-onto_recognize_entities_bert_medium_en

* Add model 2023-05-21-onto_recognize_entities_bert_mini_en

* Add model 2023-05-21-onto_recognize_entities_bert_small_en

* Add model 2023-05-21-onto_recognize_entities_bert_tiny_en

* Add model 2023-05-21-onto_recognize_entities_electra_base_en

* Add model 2023-05-21-onto_recognize_entities_electra_small_en

* Add model 2023-05-21-onto_recognize_entities_electra_large_en

* Add model 2023-05-21-recognize_entities_dl_fa

* Add model 2023-05-21-nerdl_fewnerd_subentity_100d_pipeline_en

* Add model 2023-05-21-nerdl_fewnerd_100d_pipeline_en

* Add model 2023-05-21-pos_ud_bokmaal_nb

* Add model 2023-05-21-xlm_roberta_large_token_classifier_masakhaner_pipeline_xx

* Add model 2023-05-21-bert_token_classifier_scandi_ner_pipeline_xx

* Add model 2023-05-21-bert_sequence_classifier_trec_coarse_pipeline_en

* Add model 2023-05-21-bert_sequence_classifier_age_news_pipeline_en

* Add model 2023-05-21-distilbert_token_classifier_typo_detector_pipeline_is

* Add model 2023-05-21-distilbert_base_token_classifier_masakhaner_pipeline_xx

* Add model 2023-05-21-nerdl_restaurant_100d_pipeline_en

* Add model 2023-05-21-roberta_token_classifier_timex_semeval_pipeline_en

* Add model 2023-05-21-bert_token_classifier_hi_en_ner_pipeline_hi

* Add model 2023-05-21-xlm_roberta_large_token_classifier_hrl_pipeline_xx

* Add model 2023-05-21-spellcheck_dl_pipeline_en

* Add model 2023-05-21-bert_token_classifier_dutch_udlassy_ner_pipeline_nl

* Add model 2023-05-21-xlm_roberta_large_token_classifier_conll03_pipeline_de

* Add model 2023-05-21-roberta_token_classifier_bne_capitel_ner_pipeline_es

* Add model 2023-05-21-roberta_token_classifier_icelandic_ner_pipeline_is

* Add model 2023-05-21-longformer_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-21-longformer_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-21-xlnet_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-21-xlm_roberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-21-xlm_roberta_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-21-xlnet_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-21-albert_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-21-albert_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-21-albert_xlarge_token_classifier_conll03_pipeline_en

* Add model 2023-05-21-distilroberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-21-roberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-21-roberta_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-21-distilbert_token_classifier_typo_detector_pipeline_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

* 2023-05-22-explain_document_md_fr (#13811)

* Add model 2023-05-22-explain_document_md_fr

* Add model 2023-05-22-dependency_parse_en

* Add model 2023-05-22-explain_document_md_it

* Add model 2023-05-22-entity_recognizer_lg_fr

* Add model 2023-05-22-entity_recognizer_md_fr

* Add model 2023-05-22-entity_recognizer_lg_it

* Add model 2023-05-22-entity_recognizer_md_it

* Add model 2023-05-22-check_spelling_en

* Add model 2023-05-22-match_datetime_en

* Add model 2023-05-22-match_pattern_en

* Add model 2023-05-22-clean_pattern_en

* Add model 2023-05-22-clean_stop_en

* Add model 2023-05-22-movies_sentiment_analysis_en

* Add model 2023-05-22-explain_document_ml_en

* Add model 2023-05-22-analyze_sentiment_en

* Add model 2023-05-22-explain_document_dl_en

* Add model 2023-05-22-recognize_entities_dl_en

* Add model 2023-05-22-recognize_entities_bert_en

* Add model 2023-05-22-explain_document_md_de

* Add model 2023-05-22-entity_recognizer_lg_de

* Add model 2023-05-22-entity_recognizer_md_de

* Add model 2023-05-22-onto_recognize_entities_sm_en

* Add model 2023-05-22-onto_recognize_entities_lg_en

* Add model 2023-05-22-match_chunks_en

* Add model 2023-05-22-explain_document_lg_es

* Add model 2023-05-22-explain_document_md_es

* Add model 2023-05-22-explain_document_sm_es

* Add model 2023-05-22-entity_recognizer_lg_es

* Add model 2023-05-22-entity_recognizer_md_es

* Add model 2023-05-22-entity_recognizer_sm_es

* Add model 2023-05-22-explain_document_lg_ru

* Add model 2023-05-22-explain_document_md_ru

* Add model 2023-05-22-explain_document_sm_ru

* Add model 2023-05-22-entity_recognizer_lg_ru

* Add model 2023-05-22-entity_recognizer_md_ru

* Add model 2023-05-22-entity_recognizer_sm_ru

* Add model 2023-05-22-text_cleaning_en

* Add model 2023-05-22-explain_document_lg_pt

* Add model 2023-05-22-explain_document_md_pt

* Add model 2023-05-22-explain_document_sm_pt

* Add model 2023-05-22-entity_recognizer_lg_pt

* Add model 2023-05-22-entity_recognizer_md_pt

* Add model 2023-05-22-entity_recognizer_sm_pt

* Add model 2023-05-22-explain_document_lg_pl

* Add model 2023-05-22-explain_document_md_pl

* Add model 2023-05-22-explain_document_sm_pl

* Add model 2023-05-22-entity_recognizer_lg_pl

* Add model 2023-05-22-entity_recognizer_md_pl

* Add model 2023-05-22-entity_recognizer_sm_pl

* Add model 2023-05-22-explain_document_lg_nl

* Add model 2023-05-22-explain_document_md_nl

* Add model 2023-05-22-explain_document_sm_nl

* Add model 2023-05-22-entity_recognizer_lg_nl

* Add model 2023-05-22-entity_recognizer_md_nl

* Add model 2023-05-22-entity_recognizer_sm_nl

* Add model 2023-05-22-analyze_sentimentdl_glove_imdb_en

* Add model 2023-05-22-explain_document_lg_no

* Add model 2023-05-22-explain_document_md_no

* Add model 2023-05-22-explain_document_sm_no

* Add model 2023-05-22-entity_recognizer_md_no

* Add model 2023-05-22-entity_recognizer_sm_no

* Add model 2023-05-22-explain_document_lg_sv

* Add model 2023-05-22-explain_document_md_sv

* Add model 2023-05-22-explain_document_sm_sv

* Add model 2023-05-22-entity_recognizer_lg_sv

* Add model 2023-05-22-entity_recognizer_md_sv

* Add model 2023-05-22-entity_recognizer_sm_sv

* Add model 2023-05-22-explain_document_lg_da

* Add model 2023-05-22-explain_document_md_da

* Add model 2023-05-22-explain_document_sm_da

* Add model 2023-05-22-entity_recognizer_lg_da

* Add model 2023-05-22-entity_recognizer_md_da

* Add model 2023-05-22-entity_recognizer_sm_da

* Add model 2023-05-22-explain_document_lg_fi

* Add model 2023-05-22-explain_document_md_fi

* Add model 2023-05-22-explain_document_sm_fi

* Add model 2023-05-22-entity_recognizer_lg_fi

* Add model 2023-05-22-entity_recognizer_md_fi

* Add model 2023-05-22-entity_recognizer_sm_fi

* Add model 2023-05-22-onto_recognize_entities_bert_base_en

* Add model 2023-05-22-onto_recognize_entities_bert_large_en

* Add model 2023-05-22-onto_recognize_entities_bert_medium_en

* Add model 2023-05-22-onto_recognize_entities_bert_mini_en

* Add model 2023-05-22-onto_recognize_entities_bert_small_en

* Add model 2023-05-22-onto_recognize_entities_bert_tiny_en

* Add model 2023-05-22-onto_recognize_entities_electra_base_en

* Add model 2023-05-22-onto_recognize_entities_electra_small_en

* Add model 2023-05-22-onto_recognize_entities_electra_large_en

* Add model 2023-05-22-recognize_entities_dl_fa

* Add model 2023-05-22-nerdl_fewnerd_subentity_100d_pipeline_en

* Add model 2023-05-22-nerdl_fewnerd_100d_pipeline_en

* Add model 2023-05-22-pos_ud_bokmaal_nb

* Add model 2023-05-22-xlm_roberta_large_token_classifier_masakhaner_pipeline_xx

* Add model 2023-05-22-bert_token_classifier_scandi_ner_pipeline_xx

* Add model 2023-05-22-bert_sequence_classifier_trec_coarse_pipeline_en

* Add model 2023-05-22-bert_sequence_classifier_age_news_pipeline_en

* Add model 2023-05-22-distilbert_token_classifier_typo_detector_pipeline_is

* Add model 2023-05-22-distilbert_base_token_classifier_masakhaner_pipeline_xx

* Add model 2023-05-22-nerdl_restaurant_100d_pipeline_en

* Add model 2023-05-22-roberta_token_classifier_timex_semeval_pipeline_en

* Add model 2023-05-22-bert_token_classifier_hi_en_ner_pipeline_hi

* Add model 2023-05-22-xlm_roberta_large_token_classifier_hrl_pipeline_xx

* Add model 2023-05-22-spellcheck_dl_pipeline_en

* Add model 2023-05-22-bert_token_classifier_dutch_udlassy_ner_pipeline_nl

* Add model 2023-05-22-xlm_roberta_large_token_classifier_conll03_pipeline_de

* Add model 2023-05-22-roberta_token_classifier_bne_capitel_ner_pipeline_es

* Add model 2023-05-22-roberta_token_classifier_icelandic_ner_pipeline_is

* Add model 2023-05-22-longformer_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-22-longformer_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-22-xlnet_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-22-xlm_roberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-22-xlm_roberta_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-22-xlnet_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-22-albert_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-22-albert_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-22-albert_xlarge_token_classifier_conll03_pipeline_en

* Add model 2023-05-22-distilroberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-22-roberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-22-roberta_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-22-distilbert_token_classifier_typo_detector_pipeline_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

* 2023-05-24-explain_document_md_fr (#13821)

* Add model 2023-05-24-explain_document_md_fr

* Add model 2023-05-24-dependency_parse_en

* Add model 2023-05-24-explain_document_md_it

* Add model 2023-05-24-entity_recognizer_lg_fr

* Add model 2023-05-24-entity_recognizer_md_fr

* Add model 2023-05-24-entity_recognizer_lg_it

* Add model 2023-05-24-entity_recognizer_md_it

* Add model 2023-05-24-check_spelling_en

* Add model 2023-05-24-match_datetime_en

* Add model 2023-05-24-match_pattern_en

* Add model 2023-05-24-clean_pattern_en

* Add model 2023-05-24-clean_stop_en

* Add model 2023-05-24-movies_sentiment_analysis_en

* Add model 2023-05-24-explain_document_ml_en

* Add model 2023-05-24-analyze_sentiment_en

* Add model 2023-05-24-explain_document_dl_en

* Add model 2023-05-24-recognize_entities_dl_en

* Add model 2023-05-24-recognize_entities_bert_en

* Add model 2023-05-24-explain_document_md_de

* Add model 2023-05-24-entity_recognizer_lg_de

* Add model 2023-05-24-entity_recognizer_md_de

* Add model 2023-05-24-onto_recognize_entities_sm_en

* Add model 2023-05-24-onto_recognize_entities_lg_en

* Add model 2023-05-24-match_chunks_en

* Add model 2023-05-24-explain_document_lg_es

* Add model 2023-05-24-explain_document_md_es

* Add model 2023-05-24-explain_document_sm_es

* Add model 2023-05-24-entity_recognizer_lg_es

* Add model 2023-05-24-entity_recognizer_md_es

* Add model 2023-05-24-entity_recognizer_sm_es

* Add model 2023-05-24-explain_document_lg_ru

* Add model 2023-05-24-explain_document_md_ru

* Add model 2023-05-24-explain_document_sm_ru

* Add model 2023-05-24-entity_recognizer_lg_ru

* Add model 2023-05-24-entity_recognizer_md_ru

* Add model 2023-05-24-entity_recognizer_sm_ru

* Add model 2023-05-24-text_cleaning_en

* Add model 2023-05-24-explain_document_lg_pt

* Add model 2023-05-24-explain_document_md_pt

* Add model 2023-05-24-explain_document_sm_pt

* Add model 2023-05-24-entity_recognizer_lg_pt

* Add model 2023-05-24-entity_recognizer_md_pt

* Add model 2023-05-24-entity_recognizer_sm_pt

* Add model 2023-05-24-explain_document_lg_pl

* Add model 2023-05-24-explain_document_md_pl

* Add model 2023-05-24-explain_document_sm_pl

* Add model 2023-05-24-entity_recognizer_lg_pl

* Add model 2023-05-24-entity_recognizer_md_pl

* Add model 2023-05-24-entity_recognizer_sm_pl

* Add model 2023-05-24-explain_document_lg_nl

* Add model 2023-05-24-explain_document_md_nl

* Add model 2023-05-24-explain_document_sm_nl

* Add model 2023-05-24-entity_recognizer_lg_nl

* Add model 2023-05-24-entity_recognizer_md_nl

* Add model 2023-05-24-entity_recognizer_sm_nl

* Add model 2023-05-24-analyze_sentimentdl_glove_imdb_en

* Add model 2023-05-24-explain_document_lg_no

* Add model 2023-05-24-explain_document_md_no

* Add model 2023-05-24-explain_document_sm_no

* Add model 2023-05-24-entity_recognizer_lg_no

* Add model 2023-05-24-entity_recognizer_md_no

* Add model 2023-05-24-entity_recognizer_sm_no

* Add model 2023-05-24-explain_document_lg_sv

* Add model 2023-05-24-explain_document_md_sv

* Add model 2023-05-24-explain_document_sm_sv

* Add model 2023-05-24-entity_recognizer_lg_sv

* Add model 2023-05-24-entity_recognizer_md_sv

* Add model 2023-05-24-entity_recognizer_sm_sv

* Add model 2023-05-25-explain_document_lg_da

* Add model 2023-05-25-explain_document_md_da

* Add model 2023-05-25-explain_document_sm_da

* Add model 2023-05-25-entity_recognizer_lg_da

* Add model 2023-05-25-entity_recognizer_md_da

* Add model 2023-05-25-entity_recognizer_sm_da

* Add model 2023-05-25-explain_document_lg_fi

* Add model 2023-05-25-explain_document_md_fi

* Add model 2023-05-25-explain_document_sm_fi

* Add model 2023-05-25-entity_recognizer_lg_fi

* Add model 2023-05-25-entity_recognizer_md_fi

* Add model 2023-05-25-entity_recognizer_sm_fi

* Add model 2023-05-25-onto_recognize_entities_bert_base_en

* Add model 2023-05-25-onto_recognize_entities_bert_large_en

* Add model 2023-05-25-onto_recognize_entities_bert_medium_en

* Add model 2023-05-25-onto_recognize_entities_bert_mini_en

* Add model 2023-05-25-onto_recognize_entities_bert_small_en

* Add model 2023-05-25-onto_recognize_entities_bert_tiny_en

* Add model 2023-05-25-onto_recognize_entities_electra_base_en

* Add model 2023-05-25-onto_recognize_entities_electra_small_en

* Add model 2023-05-25-onto_recognize_entities_electra_large_en

* Add model 2023-05-25-recognize_entities_dl_fa

* Add model 2023-05-25-nerdl_fewnerd_subentity_100d_pipeline_en

* Add model 2023-05-25-nerdl_fewnerd_100d_pipeline_en

* Add model 2023-05-25-pos_ud_bokmaal_nb

* Add model 2023-05-25-xlm_roberta_large_token_classifier_masakhaner_pipeline_xx

* Add model 2023-05-25-bert_token_classifier_scandi_ner_pipeline_xx

* Add model 2023-05-25-bert_sequence_classifier_trec_coarse_pipeline_en

* Add model 2023-05-25-bert_sequence_classifier_age_news_pipeline_en

* Add model 2023-05-25-distilbert_token_classifier_typo_detector_pipeline_is

* Add model 2023-05-25-distilbert_base_token_classifier_masakhaner_pipeline_xx

* Add model 2023-05-25-nerdl_restaurant_100d_pipeline_en

* Add model 2023-05-25-roberta_token_classifier_timex_semeval_pipeline_en

* Add model 2023-05-25-bert_token_classifier_hi_en_ner_pipeline_hi

* Add model 2023-05-25-xlm_roberta_large_token_classifier_hrl_pipeline_xx

* Add model 2023-05-25-spellcheck_dl_pipeline_en

* Add model 2023-05-25-bert_token_classifier_dutch_udlassy_ner_pipeline_nl

* Add model 2023-05-25-xlm_roberta_large_token_classifier_conll03_pipeline_de

* Add model 2023-05-25-roberta_token_classifier_bne_capitel_ner_pipeline_es

* Add model 2023-05-25-roberta_token_classifier_icelandic_ner_pipeline_is

* Add model 2023-05-25-longformer_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-25-longformer_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-25-xlnet_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-25-xlm_roberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-25-xlm_roberta_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-25-xlnet_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-25-albert_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-25-albert_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-25-albert_xlarge_token_classifier_conll03_pipeline_en

* Add model 2023-05-25-distilroberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-25-roberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-25-roberta_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-25-distilbert_token_classifier_typo_detector_pipeline_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

* Add model 2023-05-25-explain_document_md_fr (#13827)

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

* 2023-05-25-dependency_parse_en (#13828)

* Add model 2023-05-25-dependency_parse_en

* Add model 2023-05-25-explain_document_md_it

* Add model 2023-05-25-entity_recognizer_lg_fr

* Add model 2023-05-25-entity_recognizer_md_fr

* Add model 2023-05-25-entity_recognizer_lg_it

* Add model 2023-05-25-entity_recognizer_md_it

* Add model 2023-05-25-check_spelling_en

* Add model 2023-05-25-match_datetime_en

* Add model 2023-05-25-match_pattern_en

* Add model 2023-05-25-clean_pattern_en

* Add model 2023-05-25-clean_stop_en

* Add model 2023-05-25-movies_sentiment_analysis_en

* Add model 2023-05-25-explain_document_ml_en

* Add model 2023-05-25-analyze_sentiment_en

* Add model 2023-05-25-explain_document_dl_en

* Add model 2023-05-25-recognize_entities_dl_en

* Add model 2023-05-25-recognize_entities_bert_en

* Add model 2023-05-25-explain_document_md_de

* Add model 2023-05-25-entity_recognizer_lg_de

* Add model 2023-05-25-entity_recognizer_md_de

* Add model 2023-05-25-onto_recognize_entities_sm_en

* Add model 2023-05-25-onto_recognize_entities_lg_en

* Add model 2023-05-25-match_chunks_en

* Add model 2023-05-25-explain_document_lg_es

* Add model 2023-05-25-explain_document_md_es

* Add model 2023-05-25-explain_document_sm_es

* Add model 2023-05-25-entity_recognizer_lg_es

* Add model 2023-05-25-entity_recognizer_md_es

* Add model 2023-05-25-entity_recognizer_sm_es

* Add model 2023-05-25-explain_document_lg_ru

* Add model 2023-05-25-explain_document_md_ru

* Add model 2023-05-25-explain_document_sm_ru

* Add model 2023-05-25-entity_recognizer_lg_ru

* Add model 2023-05-25-entity_recognizer_md_ru

* Add model 2023-05-25-entity_recognizer_sm_ru

* Add model 2023-05-25-text_cleaning_en

* Add model 2023-05-25-explain_document_lg_pt

* Add model 2023-05-25-explain_document_md_pt

* Add model 2023-05-25-explain_document_sm_pt

* Add model 2023-05-25-entity_recognizer_lg_pt

* Add model 2023-05-25-entity_recognizer_md_pt

* Add model 2023-05-25-entity_recognizer_sm_pt

* Add model 2023-05-25-explain_document_lg_pl

* Add model 2023-05-25-explain_document_md_pl

* Add model 2023-05-25-explain_document_sm_pl

* Add model 2023-05-25-entity_recognizer_lg_pl

* Add model 2023-05-25-entity_recognizer_md_pl

* Add model 2023-05-25-entity_recognizer_sm_pl

* Add model 2023-05-25-explain_document_lg_nl

* Add model 2023-05-25-explain_document_md_nl

* Add model 2023-05-25-explain_document_sm_nl

* Add model 2023-05-25-entity_recognizer_lg_nl

* Add model 2023-05-25-entity_recognizer_md_nl

* Add model 2023-05-25-entity_recognizer_sm_nl

* Add model 2023-05-25-analyze_sentimentdl_glove_imdb_en

* Add model 2023-05-25-explain_document_lg_no

* Add model 2023-05-25-explain_document_md_no

* Add model 2023-05-25-explain_document_sm_no

* Add model 2023-05-25-entity_recognizer_lg_no

* Add model 2023-05-25-entity_recognizer_md_no

* Add model 2023-05-25-entity_recognizer_sm_no

* Add model 2023-05-25-explain_document_lg_sv

* Add model 2023-05-25-explain_document_md_sv

* Add model 2023-05-25-explain_document_sm_sv

* Add model 2023-05-25-entity_recognizer_lg_sv

* Add model 2023-05-25-entity_recognizer_md_sv

* Add model 2023-05-25-entity_recognizer_sm_sv

* Add model 2023-05-25-explain_document_lg_da

* Add model 2023-05-25-explain_document_md_da

* Add model 2023-05-25-explain_document_sm_da

* Add model 2023-05-25-entity_recognizer_lg_da

* Add model 2023-05-25-entity_recognizer_md_da

* Add model 2023-05-25-entity_recognizer_sm_da

* Add model 2023-05-25-explain_document_lg_fi

* Add model 2023-05-25-explain_document_md_fi

* Add model 2023-05-25-explain_document_sm_fi

* Add model 2023-05-25-entity_recognizer_lg_fi

* Add model 2023-05-25-entity_recognizer_md_fi

* Add model 2023-05-25-entity_recognizer_sm_fi

* Add model 2023-05-25-onto_recognize_entities_bert_base_en

* Add model 2023-05-25-onto_recognize_entities_bert_large_en

* Add model 2023-05-25-onto_recognize_entities_bert_medium_en

* Add model 2023-05-25-onto_recognize_entities_bert_mini_en

* Add model 2023-05-25-onto_recognize_entities_bert_small_en

* Add model 2023-05-25-onto_recognize_entities_bert_tiny_en

* Add model 2023-05-25-onto_recognize_entities_electra_base_en

* Add model 2023-05-25-onto_recognize_entities_electra_small_en

* Add model 2023-05-25-onto_recognize_entities_electra_large_en

* Add model 2023-05-26-recognize_entities_dl_fa

* Add model 2023-05-26-nerdl_fewnerd_subentity_100d_pipeline_en

* Add model 2023-05-26-nerdl_fewnerd_100d_pipeline_en

* Add model 2023-05-26-pos_ud_bokmaal_nb

* Add model 2023-05-26-xlm_roberta_large_token_classifier_masakhaner_pipeline_xx

* Add model 2023-05-26-bert_token_classifier_scandi_ner_pipeline_xx

* Add model 2023-05-26-bert_sequence_classifier_trec_coarse_pipeline_en

* Add model 2023-05-26-bert_sequence_classifier_age_news_pipeline_en

* Add model 2023-05-26-distilbert_token_classifier_typo_detector_pipeline_is

* Add model 2023-05-26-distilbert_base_token_classifier_masakhaner_pipeline_xx

* Add model 2023-05-26-nerdl_restaurant_100d_pipeline_en

* Add model 2023-05-26-roberta_token_classifier_timex_semeval_pipeline_en

* Add model 2023-05-26-bert_token_classifier_hi_en_ner_pipeline_hi

* Add model 2023-05-26-xlm_roberta_large_token_classifier_hrl_pipeline_xx

* Add model 2023-05-26-spellcheck_dl_pipeline_en

* Add model 2023-05-26-bert_token_classifier_dutch_udlassy_ner_pipeline_nl

* Add model 2023-05-26-xlm_roberta_large_token_classifier_conll03_pipeline_de

* Add model 2023-05-26-roberta_token_classifier_bne_capitel_ner_pipeline_es

* Add model 2023-05-26-roberta_token_classifier_icelandic_ner_pipeline_is

* Add model 2023-05-26-longformer_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-26-longformer_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-26-xlnet_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-26-xlm_roberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-26-xlm_roberta_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-26-xlnet_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-26-albert_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-26-albert_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-26-albert_xlarge_token_classifier_conll03_pipeline_en

* Add model 2023-05-26-distilroberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-26-roberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-26-roberta_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-26-distilbert_token_classifier_typo_detector_pipeline_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

* 2023-05-25-distilcamembert_french_legal_fr (#13826)

* Add model 2023-05-25-distilcamembert_french_legal_fr

* Update 2023-05-25-distilcamembert_french_legal_fr.md

* Update 2023-05-25-distilcamembert_french_legal_fr.md

* Add model 2023-05-25-camembert_french_legal_fr

* Update 2023-05-25-camembert_french_legal_fr.md

* Update 2023-05-25-camembert_french_legal_fr.md

* Update 2023-05-25-distilcamembert_french_legal_fr.md

---------

Co-authored-by: Mary-Sci <meryemyildiz366@gmail.com>
Co-authored-by: Merve Ertas Uslu <67653613+Mary-Sci@users.noreply.github.com>

* Update title for 2023-05-25-distilcamembert_french_legal_fr.md (#13831)

* 2023-05-27-explain_document_md_fr (#13836)

* Add model 2023-05-27-explain_document_md_fr

* Add model 2023-05-27-dependency_parse_en

* Add model 2023-05-27-explain_document_md_it

* Add model 2023-05-27-entity_recognizer_lg_fr

* Add model 2023-05-27-entity_recognizer_md_fr

* Add model 2023-05-27-entity_recognizer_lg_it

* Add model 2023-05-27-entity_recognizer_md_it

* Add model 2023-05-27-check_spelling_en

* Add model 2023-05-27-match_datetime_en

* Add model 2023-05-27-match_pattern_en

* Add model 2023-05-27-clean_pattern_en

* Add model 2023-05-27-clean_stop_en

* Add model 2023-05-27-movies_sentiment_analysis_en

* Add model 2023-05-27-explain_document_ml_en

* Add model 2023-05-27-analyze_sentiment_en

* Add model 2023-05-27-explain_document_dl_en

* Add model 2023-05-27-recognize_entities_dl_en

* Add model 2023-05-27-recognize_entities_bert_en

* Add model 2023-05-27-explain_document_md_de

* Add model 2023-05-27-entity_recognizer_lg_de

* Add model 2023-05-27-entity_recognizer_md_de

* Add model 2023-05-27-onto_recognize_entities_sm_en

* Add model 2023-05-27-onto_recognize_entities_lg_en

* Add model 2023-05-27-match_chunks_en

* Add model 2023-05-27-explain_document_lg_es

* Add model 2023-05-27-explain_document_md_es

* Add model 2023-05-27-explain_document_sm_es

* Add model 2023-05-27-entity_recognizer_lg_es

* Add model 2023-05-27-entity_recognizer_md_es

* Add model 2023-05-27-entity_recognizer_sm_es

* Add model 2023-05-27-explain_document_lg_ru

* Add model 2023-05-27-explain_document_md_ru

* Add model 2023-05-27-explain_document_sm_ru

* Add model 2023-05-27-entity_recognizer_lg_ru

* Add model 2023-05-27-entity_recognizer_md_ru

* Add model 2023-05-27-entity_recognizer_sm_ru

* Add model 2023-05-27-text_cleaning_en

* Add model 2023-05-27-explain_document_lg_pt

* Add model 2023-05-27-explain_document_md_pt

* Add model 2023-05-27-explain_document_sm_pt

* Add model 2023-05-27-entity_recognizer_lg_pt

* Add model 2023-05-27-entity_recognizer_md_pt

* Add model 2023-05-27-entity_recognizer_sm_pt

* Add model 2023-05-27-explain_document_lg_pl

* Add model 2023-05-27-explain_document_md_pl

* Add model 2023-05-27-explain_document_sm_pl

* Add model 2023-05-27-entity_recognizer_lg_pl

* Add model 2023-05-27-entity_recognizer_md_pl

* Add model 2023-05-27-entity_recognizer_sm_pl

* Add model 2023-05-27-explain_document_lg_nl

* Add model 2023-05-27-explain_document_md_nl

* Add model 2023-05-27-explain_document_sm_nl

* Add model 2023-05-27-entity_recognizer_lg_nl

* Add model 2023-05-27-entity_recognizer_md_nl

* Add model 2023-05-27-entity_recognizer_sm_nl

* Add model 2023-05-27-analyze_sentimentdl_glove_imdb_en

* Add model 2023-05-27-explain_document_lg_no

* Add model 2023-05-27-explain_document_md_no

* Add model 2023-05-27-explain_document_sm_no

* Add model 2023-05-27-entity_recognizer_lg_no

* Add model 2023-05-27-entity_recognizer_md_no

* Add model 2023-05-27-entity_recognizer_sm_no

* Add model 2023-05-27-explain_document_lg_sv

* Add model 2023-05-27-explain_document_md_sv

* Add model 2023-05-27-explain_document_sm_sv

* Add model 2023-05-27-entity_recognizer_lg_sv

* Add model 2023-05-27-entity_recognizer_md_sv

* Add model 2023-05-27-entity_recognizer_sm_sv

* Add model 2023-05-27-explain_document_lg_da

* Add model 2023-05-27-explain_document_md_da

* Add model 2023-05-27-explain_document_sm_da

* Add model 2023-05-27-entity_recognizer_lg_da

* Add model 2023-05-27-entity_recognizer_md_da

* Add model 2023-05-27-entity_recognizer_sm_da

* Add model 2023-05-27-explain_document_lg_fi

* Add model 2023-05-27-explain_document_md_fi

* Add model 2023-05-27-explain_document_sm_fi

* Add model 2023-05-27-entity_recognizer_lg_fi

* Add model 2023-05-27-entity_recognizer_md_fi

* Add model 2023-05-27-entity_recognizer_sm_fi

* Add model 2023-05-27-onto_recognize_entities_bert_base_en

* Add model 2023-05-27-onto_recognize_entities_bert_large_en

* Add model 2023-05-27-onto_recognize_entities_bert_medium_en

* Add model 2023-05-27-onto_recognize_entities_bert_mini_en

* Add model 2023-05-27-onto_recognize_entities_bert_small_en

* Add model 2023-05-27-onto_recognize_entities_bert_tiny_en

* Add model 2023-05-27-onto_recognize_entities_electra_base_en

* Add model 2023-05-27-onto_recognize_entities_electra_small_en

* Add model 2023-05-27-onto_recognize_entities_electra_large_en

* Add model 2023-05-27-recognize_entities_dl_fa

* Add model 2023-05-27-nerdl_fewnerd_subentity_100d_pipeline_en

* Add model 2023-05-27-nerdl_fewnerd_100d_pipeline_en

* Add model 2023-05-27-pos_ud_bokmaal_nb

* Add model 2023-05-27-xlm_roberta_large_token_classifier_masakhaner_pipeline_xx

* Add model 2023-05-27-bert_token_classifier_scandi_ner_pipeline_xx

* Add model 2023-05-27-bert_sequence_classifier_trec_coarse_pipeline_en

* Add model 2023-05-27-bert_sequence_classifier_age_news_pipeline_en

* Add model 2023-05-27-distilbert_token_classifier_typo_detector_pipeline_is

* Add model 2023-05-27-distilbert_base_token_classifier_masakhaner_pipeline_xx

* Add model 2023-05-27-nerdl_restaurant_100d_pipeline_en

* Add model 2023-05-27-roberta_token_classifier_timex_semeval_pipeline_en

* Add model 2023-05-27-bert_token_classifier_hi_en_ner_pipeline_hi

* Add model 2023-05-27-xlm_roberta_large_token_classifier_hrl_pipeline_xx

* Add model 2023-05-27-spellcheck_dl_pipeline_en

* Add model 2023-05-27-bert_token_classifier_dutch_udlassy_ner_pipeline_nl

* Add model 2023-05-27-xlm_roberta_large_token_classifier_conll03_pipeline_de

* Add model 2023-05-27-roberta_token_classifier_bne_capitel_ner_pipeline_es

* Add model 2023-05-27-roberta_token_classifier_icelandic_ner_pipeline_is

* Add model 2023-05-27-longformer_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-27-longformer_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-27-xlnet_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-27-xlm_roberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-27-xlm_roberta_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-27-xlnet_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-27-albert_base_token_classifier_conll03_pipeline_en

* Add model 2023-05-27-albert_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-27-albert_xlarge_token_classifier_conll03_pipeline_en

* Add model 2023-05-27-distilroberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-27-roberta_base_token_classifier_ontonotes_pipeline_en

* Add model 2023-05-27-roberta_large_token_classifier_conll03_pipeline_en

* Add model 2023-05-27-distilbert_token_classifier_typo_detector_pipeline_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

* 2023-05-28-longformer_base_english_legal_en (#13838)

* Add model 2023-05-28-longformer_base_english_legal_en

* Update 2023-05-28-longformer_base_english_legal_en.md

---------

Co-authored-by: Mary-Sci <meryemyildiz366@gmail.com>
Co-authored-by: Merve Ertas Uslu <67653613+Mary-Sci@users.noreply.github.com>

* 2023-05-28-xlm_longformer_base_english_legal_en (#13839)

* Add model 2023-05-28-xlm_longformer_base_english_legal_en

* Update 2023-05-28-xlm_longformer_base_english_legal_en.md

* Add model 2023-05-28-longformer_large_english_legal_en

* Update 2023-05-28-longformer_large_english_legal_en.md

---------

Co-authored-by: Mary-Sci <meryemyildiz366@gmail.com>
Co-authored-by: Merve Ertas Uslu <67653613+Mary-Sci@users.noreply.github.com>

* 2023-06-21-bert_embeddings_distil_clinical_en (#13861)

* Add model 2023-06-21-bert_embeddings_distil_clinical_en

* Add model 2023-06-21-bert_embeddings_carlbert_webex_mlm_spatial_en

* Add model 2023-06-21-bert_embeddings_chemical_uncased_finetuned_cust_c2_en

* Add model 2023-06-21-bert_embeddings_lsg16k_Italian_Legal_it

* Add model 2023-06-21-bert_embeddings_chemical_uncased_finetuned_cust_c1_cust_en

* Add model 2023-06-21-bert_embeddings_legalbert_adept_en

* Add model 2023-06-21-bert_embeddings_base_uncased_issues_128_en

* Add model 2023-06-21-bert_embeddings_pretrain_ko

* Add model 2023-06-21-bert_embeddings_olm_base_uncased_oct_2022_en

* Add model 2023-06-21-legalectra_small_es

* Add model 2023-06-21-biobert_pubmed_base_cased_v1.2_en

* Add model 2023-06-21-bert_embeddings_jobbert_base_cased_en

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_700000_cased_generator_de

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_800000_cased_generator_de

* Add model 2023-06-21-legalectra_base_es

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_900000_cased_generator_de

* Add model 2023-06-21-bert_embeddings_scibert_scivocab_finetuned_cord19_en

* Add model 2023-06-21-bert_embeddings_InLegalBERT_en

* Add model 2023-06-21-bert_embeddings_InCaseLawBERT_en

* Add model 2023-06-21-bert_base_uncased_contracts_en

* Add model 2023-06-21-electra_embeddings_electra_base_turkish_mc4_uncased_generator_tr

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_500000_cased_generator_de

* Add model 2023-06-21-electra_embeddings_electra_base_generator_en

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_200000_cased_generator_de

* Add model 2023-06-21-electra_embeddings_electra_base_italian_xxl_cased_generator_it

* Add model 2023-06-21-bert_embeddings_bioclinicalbert_finetuned_covid_papers_en

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_1000000_cased_generator_de

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_600000_cased_generator_de

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_400000_cased_generator_de

* Add model 2023-06-21-electra_embeddings_finance_koelectra_base_generator_ko

* Add model 2023-06-21-electra_embeddings_koelectra_base_v2_generator_ko

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_300000_cased_generator_de

* Add model 2023-06-21-electra_embeddings_electra_base_turkish_mc4_cased_generator_tr

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_0_cased_generator_de

* Add model 2023-06-21-electra_embeddings_electra_small_generator_en

* Add model 2023-06-21-electra_embeddings_electra_large_generator_en

* Add model 2023-06-21-electra_embeddings_electricidad_base_generator_es

* Add model 2023-06-21-electra_embeddings_gelectra_large_generator_de

* Add model 2023-06-21-electra_embeddings_koelectra_base_generator_ko

* Add model 2023-06-21-electra_embeddings_koelectra_base_v3_generator_ko

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_0_cased_generator_de

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_100000_cased_generator_de

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_400000_cased_generator_de

* Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_600000_cased_generator_de

* Add model 2023-06-21-electra_embeddings_electra_tagalog_small_cased_generator_tl

* Add model 2023-06-21-electra_embeddings_gelectra_base_generator_de

* Add model 2023-06-21-electra_embeddings_electra_tagalog_base_cased_generator_tl

* Add model 2023-06-21-bert_sentence_embeddings_financial_de

* Add model 2023-06-21-electra_embeddings_electra_small_japanese_generator_ja

* Add model 2023-06-21-electra_embeddings_electra_tagalog_base_uncased_generator_tl

* Add model 2023-06-21-electra_embeddings_koelectra_small_generator_ko

* Add model 2023-06-21-electra_embeddings_finance_koelectra_small_generator_ko

* Add model 2023-06-21-bert_embeddings_sec_bert_base_en

* Add model 2023-06-21-electra_embeddings_kr_electra_generator_ko

* Add model 2023-06-21-bert_embeddings_sec_bert_sh_en

* Add model 2023-06-21-bert_embeddings_german_financial_statements_bert_de

* Add model 2023-06-21-electra_embeddings_electra_tagalog_small_uncased_generator_tl

* Add model 2023-06-21-bert_embeddings_javanese_bert_small_jv

* Add model 2023-06-21-bert_embeddings_finest_bert_en

* Add model 2023-06-21-bert_embeddings_indic_transformers_te_bert_te

* Add model 2023-06-21-bert_embeddings_gbert_base_de

* Add model 2023-06-21-bert_embeddings_indic_transformers_hi_bert_hi

* Add model 2023-06-21-bert_embeddings_hateBERT_en

* Add model 2023-06-21-bert_embeddings_false_positives_scancode_bert_base_uncased_L8_1_en

* Add model 2023-06-21-bert_embeddings_finbert_pretrain_yiyanghkust_en

* Add model 2023-06-21-bert_embeddings_indic_transformers_te_bert_te

* Add model 2023-06-21-bert_embeddings_hseBert_it_cased_it

* Add model 2023-06-21-bert_embeddings_finbert_pretrain_yiyanghkust_en

* Add model 2023-06-21-bert_embeddings_dpr_spanish_question_encoder_allqa_base_es

* Add model 2023-06-21-bert_embeddings_dziribert_ar

* Add model 2023-06-21-bert_embeddings_deberta_base_uncased_en

* Add model 2023-06-21-bert_embeddings_dbert_ko

* Add model 2023-06-21-bert_embeddings_javanese_bert_small_imdb_jv

* Add model 2023-06-21-bert_embeddings_dpr_spanish_passage_encoder_squades_base_es

* Add model 2023-06-21-bert_embeddings_dpr_spanish_question_encoder_squades_base_es

* Add model 2023-06-21-bert_embeddings_crosloengual_bert_en

* Add model 2023-06-21-bert_embeddings_clinical_pubmed_bert_base_512_en

* Add model 2023-06-21-bert_embeddings_dpr_spanish_passage_encoder_allqa_base_es

* Add model 2023-06-21-bert_embeddings_legal_bert_base_uncased_en

* Add model 2023-06-21-biobert_embeddings_all_pt

* Add model 2023-06-21-bert_embeddings_wineberto_italian_cased_it

* Add model 2023-06-21-bert_embeddings_clinical_pubmed_bert_base_128_en

* Add model 2023-06-21-biobert_embeddings_clinical_pt

* Add model 2023-06-21-bert_embeddings_telugu_bertu_te

* Add model 2023-06-21-bert_embeddings_wobert_chinese_plus_zh

* Add model 2023-06-21-bert_embeddings_wineberto_italian_cased_it

* Add model 2023-06-21-bert_embeddings_sikuroberta_zh

* Add model 2023-06-21-biobert_embeddings_biomedical_pt

* Add model 2023-06-21-bert_embeddings_sikubert_zh

* Add model 2023-06-21-bert_embeddings_psych_search_en

* Add model 2023-06-21-bert_embeddings_marathi_bert_mr

* Add model 2023-06-21-bert_embeddings_netbert_en

* Add model 2023-06-21-bert_embeddings_mbert_ar_c19_ar

* Add model 2023-06-21-bert_embeddings_multi_dialect_bert_base_arabic_ar

* Add model 2023-06-21-bert_embeddings_lic_class_scancode_bert_base_cased_L32_1_en

* Add model 2023-06-21-bert_embeddings_MARBERTv2_ar

* Add model 2023-06-21-bert_embeddings_bert_base_cased_pt_lenerbr_pt

* Add model 2023-06-21-bert_embeddings_bert_base_arabic_camelbert_msa_half_ar

* Add model 2023-06-21-bert_embeddings_bert_base_german_cased_oldvocab_de

* Add model 2023-06-21-bert_embeddings_bert_base_arabic_camelbert_msa_ar

* Add model 2023-06-21-bert_embeddings_bert_base_arabic_camelbert_msa_eighth_ar

* Add model 2023-06-21-bert_embeddings_bert_base_german_uncased_de

* Add model 2023-06-21-bert_embeddings_bert_base_arabic_camelbert_msa_quarter_ar

* Add model 2023-06-21-bert_embeddings_bert_base_historical_german_rw_cased_de

* Add model 2023-06-21-bert_embeddings_bert_base_italian_xxl_uncased_it

* Add model 2023-06-21-bert_embeddings_bert_base_arabertv2_ar

* Add model 2023-06-21-bert_embeddings_bert_base_arabic_camelbert_msa_sixteenth_ar

* Add model 2023-06-21-bert_embeddings_bert_base_arabic_camelbert_mix_ar

* Add model 2023-06-21-bert_embeddings_bert_base_italian_xxl_cased_it

* Add model 2023-06-21-bert_embeddings_bert_base_gl_cased_pt

* Add model 2023-06-21-bert_embeddings_MARBERT_ar

* Add model 2023-06-21-bert_embeddings_AraBertMo_base_V1_ar

* Add model 2023-06-21-bert_embeddings_bert_base_arabic_ar

* Add model 2023-06-21-bert_embeddings_DarijaBERT_ar

* Add model 2023-06-21-bert_embeddings_Ara_DialectBERT_ar

* Add model 2023-06-21-bert_embeddings_German_MedBERT_de

* Add model 2023-06-21-bert_embeddings_bert_base_arabertv02_twitter_ar

* Add model 2023-06-21-bert_embeddings_FinancialBERT_en

* Add model 2023-06-21-bert_embeddings_ARBERT_ar

* Add model 2023-06-21-bert_embeddings_COVID_SciBERT_en

* Add model 2023-06-21-bert_embeddings_alberti_bert_base_multilingual_cased_es

* Add model 2023-06-21-bert_embeddings_agriculture_bert_uncased_en

* Add model 2023-06-21-bert_embeddings_bangla_bert_bn

* Add model 2023-06-21-bert_embeddings_bert_kor_base_ko

* Add model 2023-06-21-bert_embeddings_bert_base_arabertv02_ar

* Add model 2023-06-21-bert_embeddings_arabert_c19_ar

* Add model 2023-06-21-bert_embeddings_bert_base_5lang_cased_es

* Add model 2023-06-21-bert_embeddings_bert_base_arabertv01_ar

* Add model 2023-06-21-bert_embeddings_bangla_bert_base_bn

* Add model 2023-06-21-bert_embeddings_bert_medium_arabic_ar

* Add model 2023-06-21-bert_embeddings_bert_political_election2020_twitter_mlm_en

* Add model 2023-06-21-bert_embeddings_bert_mini_arabic_ar

* Add model 2023-06-21-bert_embeddings_bert_base_arabert_ar

* Add model 2023-06-21-bert_embeddings_beto_gn_base_cased_es

* Add model 2023-06-21-bert_embeddings_chemical_bert_uncased_en

* Add model 2023-06-21-bert_embeddings_bert_base_ko

* Add model 2023-06-21-bert_embeddings_chefberto_italian_cased_it

* Add model 2023-06-21-bert_embeddings_childes_bert_en

* Add model 2023-06-21-bert_embeddings_bert_base_portuguese_cased_finetuned_peticoes_pt

* Add model 2023-06-21-bert_embeddings_bert_base_portuguese_cased_finetuned_tcu_acordaos_pt

* Add model 2023-06-21-bert_embeddings_bert_base_portuguese_cased_pt

* Add model 2023-06-21-bert_embeddings_bert_base_qarib60_1790k_ar

* Add model 2023-06-21-bert_embeddings_bert_base_uncased_dstc9_en

* Add model 2023-06-21-bert_embeddings_bert_base_uncased_mnli_sparse_70_unstructured_no_classifier_en

* Add model 2023-06-21-bert_embeddings_bert_base_qarib_ar

* Add model 2023-06-21-bert_embeddings_bert_base_uncased_sparse_70_unstructured_en

* Add model 2023-06-21-ms_bluebert_base_uncased_en

* Add model 2023-06-21-bert_embeddings_bert_base_qarib60_860k_ar

* fixing wrong spark version and removing tensorflow

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>
Co-authored-by: MaziyarPanahi <maziyar.panahi@iscpif.fr>

* 2023-06-26-distilbert_embeddings_finetuned_sarcasm_classification_en (#13867)

* Add model 2023-06-26-distilbert_embeddings_finetuned_sarcasm_classification_en

* Add model 2023-06-26-distilbert_embeddings_distilbert_base_indonesian_id

* Add model 2023-06-26-distilbert_embeddings_BERTino_it

* Add model 2023-06-26-distilbert_embeddings_distilbert_base_uncased_sparse_85_unstructured_pruneofa_en

* Add model 2023-06-26-distilbert_embeddings_malaysian_distilbert_small_ms

* Add model 2023-06-26-distilbert_embeddings_distilbert_fa_zwnj_base_fa

* Add model 2023-06-26-distilbert_embeddings_javanese_distilbert_small_jv

* Add model 2023-06-26-distilbert_embeddings_javanese_distilbert_small_imdb_jv

* Add model 2023-06-26-distilbert_embeddings_indic_transformers_hi_distilbert_hi

* Add model 2023-06-26-distilbert_embeddings_marathi_distilbert_mr

* Add model 2023-06-26-distilbert_embeddings_indic_transformers_bn_distilbert_bn

* Add model 2023-06-26-distilbert_embeddings_distilbert_base_uncased_sparse_90_unstructured_pruneofa_en

* Add model 2023-06-26-deberta_embeddings_xsmall_dapt_scientific_papers_pubmed_en

* Add model 2023-06-26-deberta_embeddings_spm_vie_vie

* Add model 2023-06-26-deberta_embeddings_vie_small_vie

* Add model 2023-06-26-deberta_embeddings_tapt_nbme_v3_base_en

* Add model 2023-06-26-deberta_embeddings_erlangshen_v2_chinese_sentencepiece_zh

* Add model 2023-06-26-deberta_v3_xsmall_en

* Add model 2023-06-26-deberta_embeddings_mlm_test_en

* Add model 2023-06-26-deberta_v3_small_en

* Add model 2023-06-26-roberta_base_swiss_legal_gsw

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

* 2023-06-27-roberta_embeddings_robertinh_gl (#13868)

* Add model 2023-06-27-roberta_embeddings_robertinh_gl

* Add model 2023-06-27-roberta_embeddings_roberta_base_wechsel_german_de

* Add model 2023-06-27-roberta_embeddings_roberta_base_russian_v0_ru

* Add model 2023-06-27-roberta_embeddings_ruperta_base_finetuned_spa_constitution_en

* Add model 2023-06-27-roberta_embeddings_robasqu_eu

* Add model 2023-06-27-roberta_embeddings_roberta_ko_small_ko

* Add model 2023-06-27-roberta_embeddings_hindi_hi

* Add model 2023-06-27-roberta_embeddings_sundanese_roberta_base_su

* Add model 2023-06-27-roberta_embeddings_roberta_pubmed_en

* Add model 2023-06-27-roberta_embeddings_distilroberta_base_climate_f_en

* Add model 2023-06-27-roberta_embeddings_roberta_urdu_small_ur

* Add model 2023-06-27-roberta_embeddings_BR_BERTo_pt

* Add model 2023-06-27-roberta_embeddings_distilroberta_base_climate_d_s_en

* Add model 2023-06-27-roberta_embeddings_distilroberta_base_climate_d_en

* Add model 2023-06-27-roberta_embeddings_ukr_roberta_base_uk

* Add model 2023-06-27-roberta_embeddings_roberta_base_wechsel_french_fr

* Add model 2023-06-27-roberta_embeddings_Bible_roberta_base_en

* Add model 2023-06-27-roberta_embeddings_bertin_roberta_large_spanish_es

* Add model 2023-06-27-roberta_embeddings_roberta_base_wechsel_chinese_zh

* Add model 2023-06-27-roberta_embeddings_bertin_roberta_base_spanish_es

* Add model 2023-06-27-roberta_embeddings_bertin_base_gaussian_es

* Add model 2023-06-27-roberta_embeddings_bertin_base_random_exp_512seqlen_es

* Add model 2023-06-27-roberta_embeddings_RuPERTa_base_es

* Add model 2023-06-27-roberta_embeddings_roberta_base_bne_es

* Add model 2023-06-27-roberta_embeddings_bertin_base_stepwise_exp_512seqlen_es

* Add model 2023-06-27-roberta_embeddings_MedRoBERTa.nl_nl

* Add model 2023-06-27-roberta_embeddings_bertin_base_random_es

* Add model 2023-06-27-roberta_embeddings_RoBERTalex_es

* Add model 2023-06-27-roberta_embeddings_SecRoBERTa_en

* Add model 2023-06-27-roberta_embeddings_KanBERTo_kn

* Add model 2023-06-27-roberta_embeddings_distilroberta_base_finetuned_jira_qt_issue_title_en

* Add model 2023-06-27-roberta_embeddings_MedRoBERTa.nl_nl

* Add model 2023-06-27-roberta_embeddings_distilroberta_base_finetuned_jira_qt_issue_titles_and_bodies_en

* Add model 2023-06-27-roberta_embeddings_bertin_base_stepwise_es

* Add model 2023-06-27-roberta_embeddings_KanBERTo_kn

* Add model 2023-06-27-roberta_embeddings_bertin_base_gaussian_exp_512seqlen_es

* Add model 2023-06-27-roberta_embeddings_mlm_spanish_roberta_base_es

* Add model 2023-06-27-roberta_embeddings_KNUBert_kn

* Add model 2023-06-27-roberta_embeddings_javanese_roberta_small_jv

* Add model 2023-06-27-roberta_embeddings_indonesian_roberta_base_id

* Add model 2023-06-27-roberta_embeddings_indic_transformers_hi_roberta_hi

* Add model 2023-06-27-roberta_embeddings_indo_roberta_small_id

* Add model 2023-06-27-roberta_embeddings_fairlex_scotus_minilm_en

* Add model 2023-06-27-roberta_embeddings_indic_transformers_te_roberta_te

* Add model 2023-06-27-roberta_embeddings_javanese_roberta_small_imdb_jv

* Add model 2023-06-27-roberta_embeddings_jurisbert_es

* Add model 2023-06-27-roberta_embeddings_roberta_base_indonesian_522M_id

* Add model 2023-06-27-roberta_embeddings_fairlex_ecthr_minilm_en

* Add model 2023-06-27-roberta_embeddings_muppet_roberta_base_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

* Add model 2023-06-29-xlmroberta_embeddings_paraphrase_mpnet_base_v2_xx (#13872)

Co-authored-by: Damla-Gurbaz <dml.grbz.01@gmail.com>

* 2023-06-08-instructor_base_en (#13850)

* Add model 2023-06-08-instructor_base_en

* Update 2023-06-08-instructor_base_en.md

* Add model 2023-06-21-e5_base_v2_en

* Add model 2023-06-21-e5_base_en

* Add model 2023-06-21-e5_large_v2_en

* Add model 2023-06-21-e5_large_en

* Add model 2023-06-21-e5_small_v2_en

* Add model 2023-06-21-e5_small_en

* Add model 2023-06-21-instructor_large_en

---------

Co-authored-by: prabod <prabod@rathnayaka.me>
Co-authored-by: Maziyar Panahi <maziyar.panahi@iscpif.fr>

* 2023-06-28-roberta_base_en (#13871)

* Add model 2023-06-28-roberta_base_en

* Add model 2023-06-28-roberta_base_opt_en

* Add model 2023-06-28-roberta_base_quantized_en

* Add model 2023-06-28-small_bert_L2_768_en

* Add model 2023-06-28-small_bert_L2_768_opt_en

* Add model 2023-06-28-small_bert_L2_768_quantized_en

* Add model 2023-06-28-distilbert_base_cased_en

* Add model 2023-06-28-distilbert_base_cased_opt_en

* Add model 2023-06-28-distilbert_base_cased_quantized_en

* Add model 2023-06-28-deberta_v3_base_en

* Add model 2023-06-28-deberta_v3_base_opt_en

* Add model 2023-06-28-deberta_v3_base_quantized_en

* Add model 2023-06-28-distilbert_base_uncased_en

* Add model 2023-06-28-distilbert_base_uncased_opt_en

* Add model 2023-06-28-distilbert_base_uncased_quantized_en

* Add model 2023-06-28-distilbert_base_multilingual_cased_xx

* Add model 2023-06-28-distilbert_base_multilingual_cased_xx

* Add model 2023-06-28-distilbert_base_multilingual_cased_opt_xx

* Add model 2023-06-28-distilbert_base_multilingual_cased_quantized_xx

* Add model 2023-06-28-distilbert_embeddings_distilbert_base_german_cased_de

* Add model 2023-06-28-distilbert_embeddings_distilbert_base_german_cased_opt_de

* Add model 2023-06-28-distilbert_embeddings_distilbert_base_german_cased_quantized_de

* Add model 2023-06-29-bert_base_cased_en

* Add model 2023-06-29-bert_base_cased_opt_en

* Add model 2023-06-29-bert_base_cased_quantized_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

---------

Co-authored-by: jsl-models <74001263+jsl-models@users.noreply.github.com>
Co-authored-by: Naveen-004 <chinna.nk4@gmail.com>
Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>
Co-authored-by: prabod <prabod@rathnayaka.me>
Co-authored-by: Mary-Sci <meryemyildiz366@gmail.com>
Co-authored-by: Merve Ertas Uslu <67653613+Mary-Sci@users.noreply.github.com>
Co-authored-by: Damla-Gurbaz <dml.grbz.01@gmail.com>
8 people authored Jul 3, 2023
1 parent d732eaa commit 179e4df
Showing 246 changed files with 34,769 additions and 0 deletions.
@@ -0,0 +1,100 @@
---
layout: model
title: Multilingual XLMRoBerta Embeddings Cased Model
author: John Snow Labs
name: xlmroberta_embeddings_paraphrase_mpnet_base_v2
date: 2023-06-29
tags: [xx, embeddings, xlmroberta, open_source, transformer, tensorflow]
task: Embeddings
language: xx
edition: Spark NLP 4.4.4
spark_version: 3.0
supported: true
engine: tensorflow
annotator: XlmRoBertaEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained multilingual XLM-RoBERTa embeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `paraphrase-multilingual-mpnet-base-v2` is a multilingual model originally published by `sentence-transformers`.


{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/xlmroberta_embeddings_paraphrase_mpnet_base_v2_xx_4.4.4_3.0_1688073546075.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/xlmroberta_embeddings_paraphrase_mpnet_base_v2_xx_4.4.4_3.0_1688073546075.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
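```python
import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, XlmRoBertaEmbeddings
from pyspark.ml import Pipeline

# Start (or reuse) a Spark session with Spark NLP on the classpath
spark = sparknlp.start()

# Wrap the raw text column in a document annotation
documentAssembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("document")

tokenizer = Tokenizer() \
    .setInputCols("document") \
    .setOutputCol("token")

# Download the pretrained multilingual ("xx") model
embeddings = XlmRoBertaEmbeddings.pretrained("xlmroberta_embeddings_paraphrase_mpnet_base_v2", "xx") \
    .setInputCols(["document", "token"]) \
    .setOutputCol("embeddings") \
    .setCaseSensitive(True)

pipeline = Pipeline(stages=[documentAssembler, tokenizer, embeddings])

data = spark.createDataFrame([["I love Spark NLP"]]).toDF("text")
result = pipeline.fit(data).transform(data)
```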
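```scala
// Assumes an active SparkSession named `spark` (as in spark-shell)
import spark.implicits._
import com.johnsnowlabs.nlp.base.DocumentAssembler
import com.johnsnowlabs.nlp.annotators.Tokenizer
import com.johnsnowlabs.nlp.embeddings.XlmRoBertaEmbeddings
import org.apache.spark.ml.Pipeline

val documentAssembler = new DocumentAssembler()
  .setInputCol("text")
  .setOutputCol("document")

val tokenizer = new Tokenizer()
  .setInputCols("document")
  .setOutputCol("token")

val embeddings = XlmRoBertaEmbeddings.pretrained("xlmroberta_embeddings_paraphrase_mpnet_base_v2", "xx")
  .setInputCols(Array("document", "token"))
  .setOutputCol("embeddings")

val pipeline = new Pipeline().setStages(Array(documentAssembler, tokenizer, embeddings))

val data = Seq("I love Spark NLP").toDS.toDF("text")
val result = pipeline.fit(data).transform(data)
```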
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|xlmroberta_embeddings_paraphrase_mpnet_base_v2|
|Compatibility:|Spark NLP 4.4.4+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[sentence, token]|
|Output Labels:|[embeddings]|
|Language:|xx|
|Size:|1.0 GB|
|Case sensitive:|true|
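
The archive name in the download link appears to repeat several fields from this table (model name, language, Spark NLP version, Spark version) followed by a millisecond timestamp. The sketch below splits the name under that assumption. The layout `<name>_<lang>_<Spark NLP version>_<Spark version>_<epoch ms>` is inferred from this card's URL and front matter, not from a documented contract, and `ModelArtifact` and `parse_artifact_url` are hypothetical names used only for illustration.

```python
import posixpath
from datetime import datetime, timezone
from typing import NamedTuple
from urllib.parse import urlparse


class ModelArtifact(NamedTuple):
    """Fields recovered from an archive name (hypothetical helper type)."""
    name: str
    language: str
    spark_nlp_version: str
    spark_version: str
    uploaded_at: datetime


def parse_artifact_url(url: str) -> ModelArtifact:
    """Split a model archive URL into its assumed components."""
    filename = posixpath.basename(urlparse(url).path)
    stem = filename[:-len(".zip")] if filename.endswith(".zip") else filename
    # Assumed layout: <name>_<lang>_<spark_nlp>_<spark>_<epoch_ms>
    name_and_lang, spark_nlp, spark, epoch_ms = stem.rsplit("_", 3)
    name, language = name_and_lang.rsplit("_", 1)
    # The trailing number is interpreted as milliseconds since the Unix epoch
    uploaded_at = datetime.fromtimestamp(int(epoch_ms) / 1000, tz=timezone.utc)
    return ModelArtifact(name, language, spark_nlp, spark, uploaded_at)


artifact = parse_artifact_url(
    "https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/"
    "xlmroberta_embeddings_paraphrase_mpnet_base_v2_xx_4.4.4_3.0_1688073546075.zip"
)
print(artifact.name)
print(artifact.language, artifact.spark_nlp_version, artifact.spark_version)
print(artifact.uploaded_at.date())
```

For this card the recovered name, language and versions match the table, and the timestamp falls on the `date` given in the front matter, which is what suggests the layout in the first place.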

## References

https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2
@@ -0,0 +1,97 @@
---
layout: model
title: English Legal Longformer Base Embeddings Model
author: John Snow Labs
name: longformer_base_english_legal
date: 2023-05-28
tags: [en, longformerformaskedlm, transformer, open_source, legal, tensorflow]
task: Embeddings
language: en
edition: Spark NLP 4.4.2
spark_version: 3.0
supported: true
engine: tensorflow
annotator: LongformerEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained Legal Longformer Embeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `legal-longformer-base` is an English model originally trained by `lexlms`.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/longformer_base_english_legal_en_4.4.2_3.0_1685282124579.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/longformer_base_english_legal_en_4.4.2_3.0_1685282124579.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}

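```python
import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, LongformerEmbeddings
from pyspark.ml import Pipeline

# Start (or reuse) a Spark session with Spark NLP on the classpath
spark = sparknlp.start()

# DocumentAssembler takes a single input and output column
documentAssembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("document")

tokenizer = Tokenizer() \
    .setInputCols("document") \
    .setOutputCol("token")

embeddings = LongformerEmbeddings.pretrained("longformer_base_english_legal", "en") \
    .setInputCols(["document", "token"]) \
    .setOutputCol("embeddings") \
    .setCaseSensitive(True)

pipeline = Pipeline(stages=[documentAssembler, tokenizer, embeddings])

data = spark.createDataFrame([["I love Spark NLP"]]).toDF("text")

result = pipeline.fit(data).transform(data)
```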
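```scala
// Assumes an active SparkSession named `spark` (as in spark-shell)
import spark.implicits._
import com.johnsnowlabs.nlp.base.DocumentAssembler
import com.johnsnowlabs.nlp.annotators.Tokenizer
import com.johnsnowlabs.nlp.embeddings.LongformerEmbeddings
import org.apache.spark.ml.Pipeline

// DocumentAssembler takes a single input and output column
val documentAssembler = new DocumentAssembler()
  .setInputCol("text")
  .setOutputCol("document")

val tokenizer = new Tokenizer()
  .setInputCols("document")
  .setOutputCol("token")

val embeddings = LongformerEmbeddings.pretrained("longformer_base_english_legal", "en")
  .setInputCols(Array("document", "token"))
  .setOutputCol("embeddings")
  .setCaseSensitive(true)

val pipeline = new Pipeline().setStages(Array(documentAssembler, tokenizer, embeddings))

val data = Seq("I love Spark NLP").toDS.toDF("text")

val result = pipeline.fit(data).transform(data)
```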
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|longformer_base_english_legal|
|Compatibility:|Spark NLP 4.4.2+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[sentence, token]|
|Output Labels:|[embeddings]|
|Language:|en|
|Size:|561.6 MB|
|Case sensitive:|true|
|Max sentence length:|4096|
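
The Compatibility row can be checked programmatically before downloading the model. The following is a minimal sketch, assuming that `4.4.2+` means "version 4.4.2 or later" and that versions are plain dot-separated integers. `parse_version` and `is_compatible` are hypothetical helper names, not part of Spark NLP.

```python
def parse_version(text: str) -> tuple:
    """Turn "4.4.2" or "4.4.2+" into a comparable tuple of ints."""
    return tuple(int(part) for part in text.rstrip("+").split("."))


def is_compatible(installed: str, requirement: str) -> bool:
    """Return True when `installed` is at least the version in `requirement`.

    Assumes a trailing "+" means "this version or later".
    """
    return parse_version(installed) >= parse_version(requirement)


# The requirement string is copied from the Compatibility row of this card
print(is_compatible("4.4.4", "4.4.2+"))
print(is_compatible("4.3.2", "4.4.2+"))
```

In a live session the installed version string could come from `sparknlp.version()`; pre-release suffixes such as `rc1` are not handled by this sketch.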

## References

https://huggingface.co/lexlms/legal-longformer-base
@@ -0,0 +1,97 @@
---
layout: model
title: English Legal Longformer Large Embeddings Model
author: John Snow Labs
name: longformer_large_english_legal
date: 2023-05-28
tags: [en, longformerformaskedlm, transformer, open_source, legal, tensorflow]
task: Embeddings
language: en
edition: Spark NLP 4.4.2
spark_version: 3.0
supported: true
engine: tensorflow
annotator: LongformerEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained Legal Longformer Large Embeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `legal-longformer-large` is an English model originally trained by `lexlms`.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/longformer_large_english_legal_en_4.4.2_3.0_1685289330980.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/longformer_large_english_legal_en_4.4.2_3.0_1685289330980.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}

```python
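import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, LongformerEmbeddings
from pyspark.ml import Pipeline

# Start (or reuse) a Spark session with Spark NLP available
spark = sparknlp.start()

# DocumentAssembler takes a single input column and a single output column
documentAssembler = DocumentAssembler() \
.setInputCol("text") \
.setOutputCol("document")

tokenizer = Tokenizer() \
.setInputCols(["document"]) \
.setOutputCol("token")

embeddings = LongformerEmbeddings.pretrained("longformer_large_english_legal","en") \
.setInputCols(["document", "token"]) \
.setOutputCol("embeddings") \
.setCaseSensitive(True)

pipeline = Pipeline(stages=[documentAssembler, tokenizer, embeddings])

data = spark.createDataFrame([["I love Spark NLP"]]).toDF("text")

result = pipeline.fit(data).transform(data)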
```
```scala
import spark.implicits._
import com.johnsnowlabs.nlp.base.DocumentAssembler
import com.johnsnowlabs.nlp.annotators.Tokenizer
import com.johnsnowlabs.nlp.embeddings.LongformerEmbeddings
import org.apache.spark.ml.Pipeline

// DocumentAssembler takes a single input column and a single output column
val documentAssembler = new DocumentAssembler()
.setInputCol("text")
.setOutputCol("document")

val tokenizer = new Tokenizer()
.setInputCols("document")
.setOutputCol("token")

val embeddings = LongformerEmbeddings.pretrained("longformer_large_english_legal","en")
.setInputCols(Array("document", "token"))
.setOutputCol("embeddings")
.setCaseSensitive(true)

val pipeline = new Pipeline().setStages(Array(documentAssembler, tokenizer, embeddings))

val data = Seq("I love Spark NLP").toDS.toDF("text")

val result = pipeline.fit(data).transform(data)
```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|longformer_large_english_legal|
|Compatibility:|Spark NLP 4.4.2+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[sentence, token]|
|Output Labels:|[embeddings]|
|Language:|en|
|Size:|1.6 GB|
|Case sensitive:|true|
|Max sentence length:|4096|
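
The 4096 value above caps how much of each input the model sees. A minimal sketch, assuming longer inputs need to be split into windows before they reach the embeddings stage: `chunk_tokens` is a hypothetical helper written for this illustration and is not part of Spark NLP. The model likely counts its own sub-word pieces rather than whitespace tokens, so a word-level count like this one is only a rough guide.

```python
from typing import List


def chunk_tokens(tokens: List[str], max_len: int = 4096, overlap: int = 0) -> List[List[str]]:
    """Split a token list into consecutive windows of at most max_len items.

    Hypothetical helper for illustration only; not a Spark NLP API.
    """
    if max_len <= 0:
        raise ValueError("max_len must be positive")
    if not 0 <= overlap < max_len:
        raise ValueError("overlap must satisfy 0 <= overlap < max_len")

    step = max_len - overlap  # how far each window advances
    chunks: List[List[str]] = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + max_len])
        # Stop once a window has reached the end of the input
        if start + max_len >= len(tokens):
            break
    return chunks


# A 10,000-token input split into windows that fit the 4096 limit
tokens = ["tok"] * 10000
windows = chunk_tokens(tokens, max_len=4096)
print([len(w) for w in windows])
```

Each window could then be passed through the pipeline as its own row. A non-zero `overlap` repeats the tail of one window at the head of the next, which may help when context near a boundary matters.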

## References

https://huggingface.co/lexlms/legal-longformer-large
@@ -0,0 +1,97 @@
---
layout: model
title: English Legal XLM-Longformer Base Embeddings Model
author: John Snow Labs
name: xlm_longformer_base_english_legal
date: 2023-05-28
tags: [en, longformerformaskedlm, transformer, open_source, legal, tensorflow]
task: Embeddings
language: en
edition: Spark NLP 4.4.2
spark_version: 3.0
supported: true
engine: tensorflow
annotator: LongformerEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained Legal XLM-Longformer Embeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `legal-xlm-longformer-base` is an English model originally trained by `joelito`.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/xlm_longformer_base_english_legal_en_4.4.2_3.0_1685286936656.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/xlm_longformer_base_english_legal_en_4.4.2_3.0_1685286936656.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}

```python
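import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, LongformerEmbeddings
from pyspark.ml import Pipeline

# Start (or reuse) a Spark session with Spark NLP available
spark = sparknlp.start()

# DocumentAssembler takes a single input column and a single output column
documentAssembler = DocumentAssembler() \
.setInputCol("text") \
.setOutputCol("document")

tokenizer = Tokenizer() \
.setInputCols(["document"]) \
.setOutputCol("token")

embeddings = LongformerEmbeddings.pretrained("xlm_longformer_base_english_legal","en") \
.setInputCols(["document", "token"]) \
.setOutputCol("embeddings") \
.setCaseSensitive(True)

pipeline = Pipeline(stages=[documentAssembler, tokenizer, embeddings])

data = spark.createDataFrame([["I love Spark NLP"]]).toDF("text")

result = pipeline.fit(data).transform(data)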
```
```scala
import spark.implicits._
import com.johnsnowlabs.nlp.base.DocumentAssembler
import com.johnsnowlabs.nlp.annotators.Tokenizer
import com.johnsnowlabs.nlp.embeddings.LongformerEmbeddings
import org.apache.spark.ml.Pipeline

// DocumentAssembler takes a single input column and a single output column
val documentAssembler = new DocumentAssembler()
.setInputCol("text")
.setOutputCol("document")

val tokenizer = new Tokenizer()
.setInputCols("document")
.setOutputCol("token")

val embeddings = LongformerEmbeddings.pretrained("xlm_longformer_base_english_legal","en")
.setInputCols(Array("document", "token"))
.setOutputCol("embeddings")
.setCaseSensitive(true)

val pipeline = new Pipeline().setStages(Array(documentAssembler, tokenizer, embeddings))

val data = Seq("I love Spark NLP").toDS.toDF("text")

val result = pipeline.fit(data).transform(data)
```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|xlm_longformer_base_english_legal|
|Compatibility:|Spark NLP 4.4.2+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[sentence, token]|
|Output Labels:|[embeddings]|
|Language:|en|
|Size:|788.6 MB|
|Case sensitive:|true|
|Max sentence length:|4096|

## References

https://huggingface.co/joelito/legal-xlm-longformer-base