Skip to content

Commit

Permalink
Models hub (#13972)
Browse files Browse the repository at this point in the history
---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

* 2023-08-28-asr_whisper_tiny_opt_xx (#13944)

* Add model 2023-08-28-asr_whisper_tiny_opt_xx

* Update 2023-08-28-asr_whisper_tiny_opt_xx.md

Change Spark Version

* Update 2023-08-28-asr_whisper_tiny_opt_xx.md

Spark version 3.0

* Update 2023-08-28-asr_whisper_tiny_opt_xx.md

spark version

* Update 2023-08-28-asr_whisper_tiny_opt_xx.md

---------

Co-authored-by: DevinTDHa <duc.hatrung95@gmail.com>
Co-authored-by: Devin Ha <33089471+DevinTDHa@users.noreply.github.com>

* 2023-09-07-java_pointer_classifier_en (#13968)

* Add model 2023-09-07-invoiceornot_en

* Add model 2023-09-07-biolord_stamb2_v1_en

* Add model 2023-09-07-cross_all_mpnet_base_v2_finetuned_webnlg2020_metric_average_en

* Add model 2023-09-07-tiny_random_mpnetmodel_hf_internal_testing_en

* Add model 2023-09-07-ikitracs_mitigation_en

* Add model 2023-09-07-mpnet_snli_en

* Add model 2023-09-07-sml_ukr_message_classifier_en

* Add model 2023-09-07-action_policy_plans_classifier_en

* Add model 2023-09-07-nooks_amd_detection_v2_full_en

* Add model 2023-09-07-tiny_random_mpnetformultiplechoice_en

* Add model 2023-09-07-all_datasets_v4_mpnet_base_en

* Add model 2023-09-07-review_intent_20230116_en

* Add model 2023-09-07-tiny_random_mpnetfortokenclassification_hf_tiny_model_private_en

* Add model 2023-09-07-multi_qa_v1_mpnet_asymmetric_q_en

* Add model 2023-09-07-tiny_random_mpnetmodel_hf_tiny_model_private_en

* Add model 2023-09-07-setfit_alpaca_spanish_unprocessable_sample_detection_es

* Add model 2023-09-07-nps_psb_lds_en

* Add model 2023-09-07-github_issues_mpnet_southern_sotho_e10_en

* Add model 2023-09-07-mpnet_retriever_squad2_en

* Add model 2023-09-07-mpnet_adaptation_mitigation_classifier_en

* Add model 2023-09-07-stackoverflow_mpnet_base_en

* Add model 2023-09-07-all_mpnet_base_v2_diptanuc_en

* Add model 2023-09-07-setfit_model_pradipta11_en

* Add model 2023-09-07-setfit_zero_shot_classification_pbsp_p1_life_en

* Add model 2023-09-07-due_retail_25_en

* Add model 2023-09-07-java_summary_classifier_en

* Add model 2023-09-07-tiny_random_mpnetforsequenceclassification_hf_tiny_model_private_en

* Add model 2023-09-07-multi_qa_v1_mpnet_asymmetric_a_en

* Add model 2023-09-07-all_mpnet_base_v2_sentence_transformers_en

* Add model 2023-09-07-setfit_ds_version_0_0_2_en

* Add model 2023-09-07-multi_qa_mpnet_base_cos_v1_sentence_transformers_en

* Add model 2023-09-07-setfit_ds_version_0_0_4_en

* Add model 2023-09-07-java_expand_classifier_en

* Add model 2023-09-07-python_summary_classifier_en

* Add model 2023-09-07-test_food_en

* Add model 2023-09-07-sbert_paper_en

* Add model 2023-09-07-setfit_model_rajistics_en

* Add model 2023-09-07-all_mpnet_base_v2_embedding_all_en

* Add model 2023-09-07-due_eshop_21_multilabel_en

* Add model 2023-09-07-initial_model_v3_en

* Add model 2023-09-07-retriever_coding_guru_adapted_en

* Add model 2023-09-07-paraphrase_mpnet_base_v2_fuzzy_matcher_en

* Add model 2023-09-07-setfit_ethos_multilabel_example_lewtun_en

* Add model 2023-09-07-python_expand_classifier_en

* Add model 2023-09-07-kw_classification_setfit_model_en

* Add model 2023-09-07-pharo_collaborators_classifier_en

* Add model 2023-09-07-mpnet_base_articles_ner_en

* Add model 2023-09-07-shona_mpnet_base_snli_mnli_en

* Add model 2023-09-07-fail_detect_en

* Add model 2023-09-07-python_usage_classifier_en

* Add model 2023-09-07-invoiceornot_en

* Add model 2023-09-07-tiny_random_mpnetforquestionanswering_hf_internal_testing_en

* Add model 2023-09-07-cpu_netzero_classifier_en

* Add model 2023-09-07-pharo_responsibilities_classifier_en

* Add model 2023-09-07-all_mpnet_base_v2_obrizum_en

* Add model 2023-09-07-tiny_random_mpnetmodel_hf_internal_testing_en

* Add model 2023-09-07-tiny_random_mpnetfortokenclassification_hf_internal_testing_en

* Add model 2023-09-07-tiny_random_mpnet_hf_internal_testing_en

* Add model 2023-09-07-python_developmentnotes_classifier_en

* Add model 2023-09-08-test_model_001_en

* Add model 2023-09-07-covid_qa_mpnet_en

* Add model 2023-09-08-setfit_ostrom_en

* Add model 2023-09-08-patentsberta_v2_en

* Add model 2023-09-07-pdfsegs_en

* Add model 2023-09-07-mpnet_snli_negatives_en

* Add model 2023-09-08-eth_setfit_payment_model_en

* Add model 2023-09-08-all_mpnet_base_questions_clustering_english_en

* Add model 2023-09-08-esci_jp_mpnet_crossencoder_en

* Add model 2023-09-07-kw_classification_setfithead_model_en

* Add model 2023-09-08-setfit_zero_shot_classification_pbsp_p4_time_en

* Add model 2023-09-07-579_stmodel_product_rem_v3a_en

* Add model 2023-09-07-tiny_random_mpnetformaskedlm_hf_internal_testing_en

* Add model 2023-09-07-setfit_all_data_en

* Add model 2023-09-07-review_multiclass_20230116_en

* Add model 2023-09-07-nli_mpnet_base_v2_sentence_transformers_en

* Add model 2023-09-07-reddit_single_context_mpnet_base_en

* Add model 2023-09-07-tiny_random_mpnetforsequenceclassification_hf_internal_testing_en

* Add model 2023-09-07-java_rational_classifier_en

* Add model 2023-09-08-setfit_zero_shot_classification_pbsp_p3_bhvr_en

* Add model 2023-09-07-all_mpnet_base_v2_tasky_classification_en

* Add model 2023-09-08-setfit_zero_shot_classification_pbsp_p4_meas_en

* Add model 2023-09-07-sb_temfac_en

* Add model 2023-09-07-sb_temfac_en

* Add model 2023-09-07-all_mpnet_base_v2_ftlegal_v3_en

* Add model 2023-09-07-pharo_collaborators_classifier_en

* Add model 2023-09-08-all_mpnet_base_v2_feature_extraction_en

* Add model 2023-09-08-setfit_zero_shot_classification_pbsp_p4_achiev_en

* Add model 2023-09-07-cpu_economywide_classifier_en

* Add model 2023-09-08-setfit_zero_shot_classification_pbsp_p4_rel_en

* Add model 2023-09-07-all_mpnet_base_v2_sentence_transformers_en

* Add model 2023-09-08-setfit_zero_shot_classification_pbsp_p3_cons_en

* Add model 2023-09-07-setfit_ds_version_0_0_5_en

* Add model 2023-09-07-cpu_conditional_classifier_en

* Add model 2023-09-07-all_mpnet_base_v2_table_en

* Add model 2023-09-07-ikitracs_mitigation_en

* Add model 2023-09-07-vulnerable_groups_en

* Add model 2023-09-08-setfit_zero_shot_classification_pbsp_p4_specific_en

* Add model 2023-09-07-tiny_random_mpnetforsequenceclassification_hf_tiny_model_private_en

* Add model 2023-09-07-tiny_random_mpnetforquestionanswering_hf_tiny_model_private_en

* Add model 2023-09-07-multi_qa_v1_mpnet_asymmetric_a_en

* Add model 2023-09-07-biencoder_all_mpnet_base_v2_mmarcofr_fr

* Add model 2023-09-07-initial_model_en

* Add model 2023-09-07-sentiment140_fewshot_en

* Add model 2023-09-07-python_summary_classifier_en

* Add model 2023-09-08-cpu_target_classifier_en

* Add model 2023-09-07-ikitracs_conditional_en

* Add model 2023-09-07-setfit_ag_news_endpoint_en

* Add model 2023-09-08-setfit_model_feb11_misinformation_on_law_en

* Add model 2023-09-07-multi_qa_mpnet_base_dot_v1_sentence_transformers_en

* Add model 2023-09-08-setfit_zero_shot_classification_pbsp_q8a_azure_gpt35_en

* Add model 2023-09-07-mpnet_multilabel_sector_classifier_en

* Add model 2023-09-08-paraphrase_mpnet_base_v2_finetuned_polifact_en

* Add model 2023-09-07-all_datasets_v3_mpnet_base_en

* Add model 2023-09-08-all_mpnet_base_v2_for_sb_clustering_en

* Add model 2023-09-07-negation_categories_classifier_es

* Add model 2023-09-07-python_parameters_classifier_en

* Add model 2023-09-07-due_eshop_21_en

* Add model 2023-09-07-contradiction_psb_en

* Add model 2023-09-07-mpnet_mnr_v2_fine_tuned_en

* Add model 2023-09-07-labels_per_job_title_fine_tune_en

* Add model 2023-09-07-paraphrase_mpnet_base_v2_sentence_transformers_en

* Add model 2023-09-07-all_datasets_v4_mpnet_base_en

* Add model 2023-09-07-pharo_example_classifier_en

* Add model 2023-09-07-paraphrase_mpnet_base_v2_setfit_sst2_en

* Add model 2023-09-07-all_mpnet_base_v2_ftlegal_v3_en

* Add model 2023-09-07-mpnet_adaptation_mitigation_classifier_en

* Add model 2023-09-07-abstract_sim_query_en

* Add model 2023-09-07-python_developmentnotes_classifier_en

* Add model 2023-09-08-few_shot_model_en

* Add model 2023-09-07-tiny_random_mpnetformaskedlm_hf_tiny_model_private_en

* Add model 2023-09-07-biencoder_multi_qa_mpnet_base_cos_v1_mmarcofr_fr

* Add model 2023-09-07-cpu_economywide_classifier_en

* Add model 2023-09-07-contradiction_psb_lds_en

* Add model 2023-09-07-setfit_ds_version_0_0_1_en

* Add model 2023-09-07-kw_classification_setfit_model_en

* Add model 2023-09-07-all_mpnet_base_v2_finetuned_v2_en

* Add model 2023-09-07-mpnet_base_snli_mnli_en

* Add model 2023-09-07-tiny_random_mpnetformaskedlm_hf_internal_testing_en

* Add model 2023-09-07-mpnet_base_snli_mnli_en

* Add model 2023-09-07-multi_qa_v1_mpnet_cls_dot_en

* Add model 2023-09-07-tiny_random_mpnetforquestionanswering_hf_tiny_model_private_en

* Add model 2023-09-07-setfit_zero_shot_classification_pbsp_p1_likes_en

* Add model 2023-09-07-nooks_amd_detection_realtime_en

* Add model 2023-09-07-setfit_zero_shot_classification_pbsp_p1_en

* Add model 2023-09-07-github_issues_mpnet_southern_sotho_e10_en

* Add model 2023-09-07-burmese_awesome_setfit_model_98_en

* Add model 2023-09-07-setfit_few_shot_classifier_en

* Add model 2023-09-07-pdfsegs_en

* Add model 2023-09-07-mpnet_snli_en

* Add model 2023-09-07-abstract_sim_query_en

* Add model 2023-09-07-pharo_keyimplementationpoints_classifier_en

* Add model 2023-09-07-abstract_sim_sentence_en

* Add model 2023-09-07-all_mpnet_base_v2_feature_extraction_pipeline_en

* Add model 2023-09-07-shona_mpnet_base_snli_mnli_en

* Add model 2023-09-07-nooks_amd_detection_realtime_en

* Add model 2023-09-07-pharo_keyimplementationpoints_classifier_en

* Add model 2023-09-07-domainadaptm2_en

* Add model 2023-09-07-setfit_finetuned_financial_text_en

* Add model 2023-09-07-cpu_mitigation_classifier_en

* Add model 2023-09-07-sml_ukr_word_classifier_medium_en

* Add model 2023-09-07-java_expand_classifier_en

* Add model 2023-09-07-tiny_random_mpnetforsequenceclassification_hf_internal_testing_en

* Add model 2023-09-07-setfit_zero_shot_classification_pbsp_p1_comm_en

* Add model 2023-09-07-setfit_ds_version_0_0_2_en

* Add model 2023-09-07-setfit_zero_shot_classification_pbsp_p3_func_en

* Add model 2023-09-07-mpnet_multilabel_sector_classifier_en

* Add model 2023-09-07-all_mpnet_base_v2_finetuned_v2_en

* Add model 2023-09-07-ouvrage_classif_en

* Add model 2023-09-07-setfit_zero_shot_classification_pbsp_p3_sev_en

* Add model 2023-09-07-kw_classification_setfithead_model_en

* Add model 2023-09-07-setfit_zero_shot_classification_pbsp_p1_en

* Add model 2023-09-07-tiny_random_mpnetformultiplechoice_en

* Add model 2023-09-07-setfit_model_test_sensitve_v1_en

* Add model 2023-09-07-spiced_en

* Add model 2023-09-07-mpnet_nli_sts_en

* Add model 2023-09-07-java_deprecation_classifier_en

* Add model 2023-09-07-cross_all_mpnet_base_v2_finetuned_webnlg2020_metric_average_en

* Add model 2023-09-07-test_food_en

* Add model 2023-09-07-testing_setfit_en

* Add model 2023-09-07-multi_qa_mpnet_base_dot_v1_eclass_en

* Add model 2023-09-07-ecolo_pas_ecolo_v0.1_en

* Add model 2023-09-07-mpnet_retriever_squad2_en

* Add model 2023-09-07-github_issues_preprocessed_mpnet_southern_sotho_e10_en

* Add model 2023-09-07-stsb_mpnet_base_v2_en

* Add model 2023-09-07-sentence_transformers_bible_reference_final_en

* Add model 2023-09-07-setfit_all_data_en

* Add model 2023-09-07-ouvrage_classif_en

* Add model 2023-09-07-all_mpnet_base_v1_en

* Add model 2023-09-07-all_mpnet_base_v1_en

* Add model 2023-09-07-ouvrage_classif_en

* Add model 2023-09-07-mpnet_nli_sts_en

* Add model 2023-09-07-tiny_random_mpnetforquestionanswering_hf_tiny_model_private_en

* Add model 2023-09-07-sml_ukr_word_classifier_medium_en

* Add model 2023-09-07-vulnerable_groups_en

* Add model 2023-09-07-python_expand_classifier_en

* Add model 2023-09-07-all_mpnet_base_v2_tasky_classification_en

* Add model 2023-09-07-biencoder_multi_qa_mpnet_base_cos_v1_mmarcofr_fr

* Add model 2023-09-07-java_ownership_classifier_en

* Add model 2023-09-07-multi_qa_mpnet_base_cos_v1_navteca_en

* Add model 2023-09-07-setfit_zero_shot_classification_pbsp_p3_sev_en

* Add model 2023-09-07-biolord_stamb2_v1_en

* Add model 2023-09-07-java_pointer_classifier_en

* Add model 2023-09-07-reddit_single_context_mpnet_base_en

* Add model 2023-09-07-setfit_finetuned_financial_text_en

* Add model 2023-09-07-cpu_transport_ghg_classifier_en

* Add model 2023-09-08-setfit_zero_shot_classification_pbsp_p3_trig_en

* Add model 2023-09-07-paraphrase_mpnet_base_v2_sentence_transformers_en

* Add model 2023-09-07-579_stmodel_product_rem_v3a_en

* Add model 2023-09-07-multi_qa_mpnet_base_dot_v1_eclass_en

* Add model 2023-09-07-nps_psb_lds_en

* Add model 2023-09-07-negation_categories_classifier_es

* Add model 2023-09-08-setfit_zero_shot_classification_pbsp_p3_dur_en

* Add model 2023-09-07-mpnet_mnr_v2_fine_tuned_en

* Add model 2023-09-07-keyphrase_mpnet_v1_en

* Add model 2023-09-07-cpu_mitigation_classifier_en

* Add model 2023-09-07-multi_qa_mpnet_base_cos_v1_navteca_en

* Add model 2023-09-07-setfit_model_test_sensitve_v1_en

* Add model 2023-09-07-spiced_en

* Add model 2023-09-07-ecolo_pas_ecolo_v0.1_en

* Add model 2023-09-07-paraphrase_mpnet_base_v2_setfit_sst2_en

* Add model 2023-09-07-setfit_ds_version_0_0_1_en

* Add model 2023-09-08-multi_qa_mpnet_base_dot_v1_model_embeddings_en

* Add model 2023-09-07-setfit_occupation_en

* Add model 2023-09-07-multi_qa_mpnet_base_dot_v1_legal_finetune_en

* Add model 2023-09-07-burmese_awesome_setfit_model_en

* Add model 2023-09-07-multi_qa_v1_mpnet_cls_dot_en

* Add model 2023-09-07-tiny_random_mpnetfortokenclassification_hf_internal_testing_en

* Add model 2023-09-07-setfit_ethos_multilabel_example_neilthematic_en

* Add model 2023-09-07-keyphrase_mpnet_v1_en

* Add model 2023-09-07-fewshotissueclassifier_nlbse23_en

* Add model 2023-09-07-stsb_mpnet_base_v2_en

* Add model 2023-09-07-all_mpnet_base_v2_obrizum_en

* Add model 2023-09-08-setfit_ft_sentinent_eval_en

* Add model 2023-09-07-attack_bert_en

* Add model 2023-09-07-all_datasets_v3_mpnet_base_en

* Add model 2023-09-07-cpu_transport_ghg_classifier_en

* Add model 2023-09-07-fewshotissueclassifier_nlbse23_en

* Add model 2023-09-07-java_deprecation_classifier_en

* Add model 2023-09-07-java_usage_classifier_en

* Add model 2023-09-07-sbert_paper_en

* Add model 2023-09-07-setfit_ethos_multilabel_example_neilthematic_en

* Add model 2023-09-07-patentsberta_en

* Add model 2023-09-07-setfit_occupation_en

* Add model 2023-09-07-setfit_ds_version_0_0_5_en

* Add model 2023-09-07-mpnet_snli_negatives_en

* Add model 2023-09-08-nli_mpnet_base_v2_en

* Add model 2023-09-08-multi_qa_mpnet_base_cos_v1_en

* Add model 2023-09-08-multi_qa_mpnet_base_dot_v1_en

* Add model 2023-09-08-all_mpnet_base_v2_en

* Add model 2023-09-08-paraphrase_mpnet_base_v2_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

* 2023-09-09-medium_mlm_imdb_en (#13970)

* Add model 2023-09-09-medium_mlm_imdb_en

* Add model 2023-09-09-bert_base_cased_finetuned_mrpc_en

* Add model 2023-09-09-base_mlm_imdb_en

* Add model 2023-09-09-vbert_2021_large_en

* Add model 2023-09-09-bert_base_german_dbmdz_cased_de

* Add model 2023-09-09-arabic_mbertmodel_mberttok_en

* Add model 2023-09-09-bert_base_german_dbmdz_uncased_de

* Add model 2023-09-09-arabic_mbertmodel_monotok_adapter_en

* Add model 2023-09-09-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_8_en

* Add model 2023-09-09-kw_pubmed_5000_0.0003_en

* Add model 2023-09-09-bertunam_en

* Add model 2023-09-09-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_9_en

* Add model 2023-09-09-arabic_mbertmodel_monotok_en

* Add model 2023-09-09-kw_pubmed_10000_0.00006_en

* Add model 2023-09-09-arabic_monomodel_mberttok_en

* Add model 2023-09-09-bert_base_uncased_issues_128_susnato_en

* Add model 2023-09-09-bert_large_uncased_whole_word_masking_en

* Add model 2023-09-09-finnish_mbertmodel_mberttok_en

* Add model 2023-09-09-kw_pubmed_10000_0.0003_en

* Add model 2023-09-09-kw_pubmed_5000_0.000006_en

* Add model 2023-09-09-bertimbau_large_fine_tuned_md_en

* Add model 2023-09-09-lsg16k_italian_legal_bert_it

* Add model 2023-09-09-finnish_mbertmodel_monotok_en

* Add model 2023-09-09-alephbertgimmel_20_epochs_en

* Add model 2023-09-09-kw_pubmed_5000_0.00006_en

* Add model 2023-09-09-arabic_monomodel_monotok_en

* Add model 2023-09-09-aethiqs_gembert_bertje_50k_en

* Add model 2023-09-09-bert_nlp_project_news_en

* Add model 2023-09-09-dummy_model2_en

* Add model 2023-09-09-aethiqs_base_bertje_data_rotterdam_epochs_10_en

* Add model 2023-09-09-bert_large_cased_whole_word_masking_en

* Add model 2023-09-09-alephbertgimmel_10_epochs_en

* Add model 2023-09-09-indonesian_mbertmodel_mberttok_en

* Add model 2023-09-09-kw_pubmed_10000_0.000006_en

* Add model 2023-09-09-indonesian_mbertmodel_monotok_en

* Add model 2023-09-09-s3_v1_20_epochs_en

* Add model 2023-09-09-bert_base_uncased_semeval2014_en

* Add model 2023-09-09-indonesian_monomodel_mberttok_en

* Add model 2023-09-09-bertimbau_large_fine_tuned_sindhi_en

* Add model 2023-09-09-bert_srb_base_cased_oscar_en

* Add model 2023-09-09-prompt_finetune_en

* Add model 2023-09-09-bert_large_uncased_semeval2014_en

* Add model 2023-09-09-finnish_monomodel_mberttok_en

* Add model 2023-09-09-indonesian_mbertmodel_monotok_adapter_en

* Add model 2023-09-09-bert_base_parsbert_uncased_finetuned_en

* Add model 2023-09-09-korean_mbertmodel_mberttok_en

* Add model 2023-09-09-finnish_monomodel_monotok_en

* Add model 2023-09-09-finnish_mbertmodel_monotok_adapter_en

* Add model 2023-09-09-indonesian_monomodel_monotok_en

* Add model 2023-09-09-covid_trained_bert_en

* Add model 2023-09-09-alephbertgimmel_50_epochs_en

* Add model 2023-09-09-wordpred_arabert_en

* Add model 2023-09-09-aethiqs_base_bertje_data_rotterdam_epochs_30_epoch_30_en

* Add model 2023-09-09-newsbert_en

* Add model 2023-09-09-biodivbert_en

* Add model 2023-09-09-bert_base_arabic_camelbert_catalan_ar

* Add model 2023-09-09-korean_monomodel_mberttok_en

* Add model 2023-09-09-bert_random_weights_en

* Add model 2023-09-09-bert_base_arabic_camelbert_danish_ar

* Add model 2023-09-09-bert_gb_2021_en

* Add model 2023-09-09-turkish_mbertmodel_mberttok_en

* Add model 2023-09-09-bert_large_uncased_semeval2014_restaurants_en

* Add model 2023-09-09-turkish_mbertmodel_monotok_adapter_en

* Add model 2023-09-09-pt_pol_en

* Add model 2023-09-09-bert_base_arabic_camelbert_msa_eighth_ar

* Add model 2023-09-09-bert_base_arabic_camelbert_msa_half_ar

* Add model 2023-09-09-turkish_mbertmodel_monotok_en

* Add model 2023-09-09-bert_base_arabic_camelbert_msa_quarter_ar

* Add model 2023-09-09-turkish_monomodel_mberttok_en

* Add model 2023-09-09-bert_base_arabic_camelbert_mix_ar

* Add model 2023-09-09-csci544_project_mabel_en

* Add model 2023-09-09-turkish_monomodel_monotok_en

* Add model 2023-09-09-gujbert_senti_a_en

* Add model 2023-09-09-bert_base_arabic_camelbert_msa_ar

* Add model 2023-09-09-javabert_uncased_en

* Add model 2023-09-09-bert_base_arabic_camelbert_msa_sixteenth_ar

* Add model 2023-09-09-pt_legalbert_en

* Add model 2023-09-09-bert_large_uncased_semeval2014_laptops_en

* Add model 2023-09-09-bert_large_uncased_semeval2015_restaurants_en

* Add model 2023-09-09-bert_base_uncased_finetuned_jira_hyperledger_issue_titles_and_bodies_en

* Add model 2023-09-09-spacescibert_en

* Add model 2023-09-09-bert_base_nli_ct_en

* Add model 2023-09-09-bert_base_ct_en

* Add model 2023-09-09-pt_caselawbert_en

* Add model 2023-09-09-hubert_base_cc_finetuned_forum_en

* Add model 2023-09-09-xtreme_squad_bert_base_multilingual_cased_xx

* Add model 2023-09-09-bert_racism_en

* Add model 2023-09-09-mymodel04_illvmi_en

* Add model 2023-09-09-bert_racism15_en

* Add model 2023-09-09-bert_large_nli_ct_en

* Add model 2023-09-09-jobbert_test_org_trial_26_12_2022_en

* Add model 2023-09-09-bert_large_uncased_semeval2016_restaurants_en

* Add model 2023-09-09-bert_base_uncased_finetuned_jira_jira_issue_titles_and_bodies_en

* Add model 2023-09-09-bert_large_uncased_semeval2015_laptops_en

* Add model 2023-09-09-bert_base_irish_cased_v1_en

* Add model 2023-09-09-bert_base_uncased_finetuned_jira_inteldaos_issue_titles_and_bodies_en

* Add model 2023-09-09-indobertweet_base_uncased_id

* Add model 2023-09-09-bert_large_ct_en

* Add model 2023-09-09-jobbert_org_add_words_trial_26_12_2022_en

* Add model 2023-09-09-indobert_base_uncased_id

* Add model 2023-09-09-bert_base_uncased_finetuned_jira_qt_issue_titles_and_bodies_en

* Add model 2023-09-09-bert_base_uncased_dish_descriptions_128_en

* Add model 2023-09-09-inlegalbert_cbp_lkg_triples_finetuned_en

* Add model 2023-09-09-spacebert_en

* Add model 2023-09-09-javabert_en

* Add model 2023-09-09-bert_large_uncased_semeval2016_laptops_en

* Add model 2023-09-09-bert_base_multilingual_cased_finetuned_hausa_xx

* Add model 2023-09-09-bert_base_uncased_dish_descriptions_128_0.5m_en

* Add model 2023-09-09-jobbert_org_add_words_trial_26_12_2022_en

* Add model 2023-09-09-bert_base_multilingual_cased_finetuned_hausa_xx

* Add model 2023-09-09-bert_base_uncased_fined_en

* Add model 2023-09-09-bert_ucb_3_en

* Add model 2023-09-09-bert_base_multilingual_cased_finetuned_igbo_xx

* Add model 2023-09-09-jobbert_org_add_words_v2_trial_26_12_2022_en

* Add model 2023-09-09-tod_bert_jnt_en

* Add model 2023-09-09-scholarbert_en

* Add model 2023-09-09-legal_bert_base_uncased_finetuned_ledgarscotus7_en

* Add model 2023-09-09-bert_base_multilingual_cased_finetuned_kinyarwanda_xx

* Add model 2023-09-09-bert_tagalog_base_cased_wwm_tl

* Add model 2023-09-09-bert_base_uncased_zhibinhong_en

* Add model 2023-09-09-bert_tagalog_base_cased_tl

* Add model 2023-09-09-bert__racism80000_en

* Add model 2023-09-09-bert_tagalog_base_uncased_wwm_tl

* Add model 2023-09-09-scholarbert_10_en

* Add model 2023-09-09-bert_base_multilingual_cased_finetuned_luganda_xx

* Add model 2023-09-09-bert_large_uncased_facebook_election_ads_en

* Add model 2023-09-09-kinyabert_small_en

* Add model 2023-09-09-bert_base_multilingual_cased_finetuned_swahili_xx

* Add model 2023-09-09-bert_base_multilingual_cased_finetuned_dholuo_xx

* Add model 2023-09-09-bert_hateracism90000_en

* Add model 2023-09-09-scholarbert_1_en

* Add model 2023-09-09-mtl_bert_base_uncased_en

* Add model 2023-09-09-bert_base_multilingual_cased_finetuned_wolof_xx

* Add model 2023-09-09-bert_base_multilingual_cased_finetuned_yoruba_xx

* Add model 2023-09-09-kinyabert_large_en

* Add model 2023-09-09-medbert_breastcancer_en

* Add model 2023-09-09-scholarbert_10_wb_en

* Add model 2023-09-09-kbert_base_esg_e10_en

* Add model 2023-09-09-alglegal3_bert_base_arabertv2_en

* Add model 2023-09-09-kbert_base_esg_e3_en

* Add model 2023-09-09-bert_large_portuguese_cased_legal_tsdae_pt

* Add model 2023-09-09-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_fast_0_en

* Add model 2023-09-09-kbert_base_esg_e5_en

* Add model 2023-09-09-mtl_bert_base_uncased_ww_en

* Add model 2023-09-09-mlm_20230427_indobertlarge_001_en

* Add model 2023-09-09-kaz_legal_bert_en

* Add model 2023-09-09-bert_ucb_4_en

* Add model 2023-09-09-mlm_20230427_mbert_001_en

* Add model 2023-09-09-scholarbert_100_wb_en

* Add model 2023-09-09-bert_sparql_en

* Add model 2023-09-09-kaz_legal_bert_5_en

* Add model 2023-09-09-knowbias_bert_base_uncased_gender_en

* Add model 2023-09-09-bert_tagalog_base_uncased_tl

* Add model 2023-09-09-bert_base_arabertv2_algarlegalbert_en

* Add model 2023-09-09-sae_bert_base_uncased_en

* Add model 2023-09-09-bert_base_multilingual_cased_finetuned_naija_xx

* Add model 2023-09-09-bantu_bert_xx

* Add model 2023-09-09-adaptive_lm_molecules_en

* Add model 2023-09-09-bert_base_portuguese_cased_test_server_en

* Add model 2023-09-09-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_fast_2_en

* Add model 2023-09-09-nepali_bert_npvec1_en

* Add model 2023-09-09-jzmodel01_en

* Add model 2023-09-09-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_fast_3_en

* Add model 2023-09-09-biomednlp_pubmedbert_large_uncased_abstract_en

* Add model 2023-09-09-bert_base_uncased_finetune_security_en

* Add model 2023-09-09-guidebias_bert_base_uncased_gender_en

* Add model 2023-09-09-arabert32_flickr8k_en

* Add model 2023-09-09-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_fast_4_en

* Add model 2023-09-09-metaphor_finetuned_bert_5epochs_en

* Add model 2023-09-09-indojave_codemixed_indobertweet_base_id

* Add model 2023-09-09-tiny_mlm_snli_en

* Add model 2023-09-09-bert_base_uncased_issues_128_tanviraumi_en

* Add model 2023-09-09-romanian_bert_tweet_ro

* Add model 2023-09-09-tendencias_en

* Add model 2023-09-09-bert_base_aeslc_danish_en

* Add model 2023-09-09-materialsbert_en

* Add model 2023-09-09-222_en

* Add model 2023-09-09-bert_base_aeslc_aktsvigun_en

* Add model 2023-09-09-dfm_encoder_large_v1_da

* Add model 2023-09-09-ai12_junzai_en

* Add model 2023-09-09-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_fast_5_en

* Add model 2023-09-09-bert_large_portuguese_cased_legal_mlm_pt

* Add model 2023-09-09-bert_finetuning_test1227_hug_en

* Add model 2023-09-09-german_poetry_bert_en

* Add model 2023-09-09-mlm_20230428_indobert_base_p2_001_en

* Add model 2023-09-09-bodo_bert_mlm_base_article_en

* Add model 2023-09-09-tiny_mlm_glue_cola_en

* Add model 2023-09-09-bert_test_junzai_en

* Add model 2023-09-09-bert_base_cnndm_en

* Add model 2023-09-09-gww_en

* Add model 2023-09-09-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_fast_6_en

* Add model 2023-09-09-tiny_mlm_glue_qnli_en

* Add model 2023-09-09-bert_funting_test_ai10_junzai_en

* Add model 2023-09-09-tiny_mlm_glue_mnli_en

* Add model 2023-09-09-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_fast_7_en

* Add model 2023-09-09-tiny_mlm_glue_mrpc_en

* Add model 2023-09-09-dal_bert_fa

* Add model 2023-09-09-tiny_mlm_glue_qqp_en

* Add model 2023-09-09-bert_patent_reference_extraction_en

* Add model 2023-09-09-test_ru

* Add model 2023-09-09-bert_finetuning_test_hug_en

* Add model 2023-09-09-bert_base_pubmed_en

* Add model 2023-09-09-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_fast_8_en

* Add model 2023-09-09-bert_base_aeslc_kenkaneki_en

* Add model 2023-09-09-absa_maskedlm_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>

---------

Co-authored-by: jsl-models <74001263+jsl-models@users.noreply.github.com>
Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>
  • Loading branch information
3 people authored Sep 11, 2023
1 parent 543faaf commit eec96df
Show file tree
Hide file tree
Showing 376 changed files with 34,994 additions and 0 deletions.
119 changes: 119 additions & 0 deletions docs/_posts/DevinTDHa/2023-08-28-asr_whisper_tiny_opt_xx.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,119 @@
---
layout: model
title: Official whisper-tiny Optimized
author: John Snow Labs
name: asr_whisper_tiny_opt
date: 2023-08-28
tags: [whisper, audio, open_source, asr, onnx, xx]
task: Automatic Speech Recognition
language: xx
edition: Spark NLP 5.1.1
spark_version: 3.0
supported: true
engine: onnx
annotator: WhisperForCTC
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Official pretrained Whisper model, adapted from HuggingFace transformer and curated to provide scalability and production-readiness using Spark NLP.

This is a multilingual model and supports the following languages:

Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Marathi, Maori, Nepali, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Thai, Turkish, Ukrainian, Urdu, Vietnamese, and Welsh.

## Predicted Entities



{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/asr_whisper_tiny_opt_xx_5.1.1_3.0_1693213918398.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/asr_whisper_tiny_opt_xx_5.1.1_3.0_1693213918398.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python
import sparknlp
from sparknlp.base import *
from sparknlp.annotator import *
from pyspark.ml import Pipeline

audioAssembler = AudioAssembler() \
.setInputCol("audio_content") \
.setOutputCol("audio_assembler")

speechToText = WhisperForCTC.pretrained("asr_whisper_tiny_opt", "xx") \
.setInputCols(["audio_assembler"]) \
.setOutputCol("text")

pipeline = Pipeline().setStages([audioAssembler, speechToText])
processedAudioFloats = spark.createDataFrame([[rawFloats]]).toDF("audio_content")
result = pipeline.fit(processedAudioFloats).transform(processedAudioFloats)
result.select("text.result").show(truncate = False)
```
```scala
import spark.implicits._
import com.johnsnowlabs.nlp.base._
import com.johnsnowlabs.nlp.annotators._
import com.johnsnowlabs.nlp.annotators.audio.WhisperForCTC
import org.apache.spark.ml.Pipeline

val audioAssembler: AudioAssembler = new AudioAssembler()
.setInputCol("audio_content")
.setOutputCol("audio_assembler")

val speechToText: WhisperForCTC = WhisperForCTC
.pretrained("asr_whisper_tiny_opt", "xx")
.setInputCols("audio_assembler")
.setOutputCol("text")

val pipeline: Pipeline = new Pipeline().setStages(Array(audioAssembler, speechToText))

val bufferedSource =
scala.io.Source.fromFile("src/test/resources/audio/txt/librispeech_asr_0.txt")

val rawFloats = bufferedSource
.getLines()
.map(_.split(",").head.trim.toFloat)
.toArray
bufferedSource.close

val processedAudioFloats = Seq(rawFloats).toDF("audio_content")

val result = pipeline.fit(processedAudioFloats).transform(processedAudioFloats)
result.select("text.result").show(truncate = false)
```
</div>

## Results

```bash
+------------------------------------------------------------------------------------------------------------------------------------------------+
|document |
+------------------------------------------------------------------------------------------------------------------------------------------------+
|[{document, 0, 87, Mr. Quilter is the apostle of the middle classes and we are glad to welcome his gospel., {length -> 93680, audio -> 0}, []}]|
+------------------------------------------------------------------------------------------------------------------------------------------------+
```

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|asr_whisper_tiny_opt|
|Compatibility:|Spark NLP 5.1.1+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[audio_assembler]|
|Output Labels:|[document]|
|Language:|xx|
|Size:|239.3 MB|
Original file line number Diff line number Diff line change
@@ -0,0 +1,93 @@
---
layout: model
title: English 579_stmodel_product_rem_v3a MPNetEmbeddings from jamiehudson
author: John Snow Labs
name: 579_stmodel_product_rem_v3a
date: 2023-09-07
tags: [mpnet, en, open_source, onnx]
task: Embeddings
language: en
edition: Spark NLP 5.1.1
spark_version: 3.0
supported: true
engine: onnx
annotator: MPNetEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained MPNetEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`579_stmodel_product_rem_v3a` is a English model originally trained by jamiehudson.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/579_stmodel_product_rem_v3a_en_5.1.1_3.0_1694126614739.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/579_stmodel_product_rem_v3a_en_5.1.1_3.0_1694126614739.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python


document_assembler = DocumentAssembler() \
.setInputCol("text") \
.setOutputCol("documents")


embeddings =MPNetEmbeddings.pretrained("579_stmodel_product_rem_v3a","en") \
.setInputCols(["documents"]) \
.setOutputCol("mpnet_embeddings")

pipeline = Pipeline().setStages([document_assembler, embeddings])

pipelineModel = pipeline.fit(data)

pipelineDF = pipelineModel.transform(data)

```
```scala


val document_assembler = new DocumentAssembler()
.setInputCol("text")
.setOutputCol("documents")

val embeddings = MPNetEmbeddings
.pretrained("579_stmodel_product_rem_v3a", "en")
.setInputCols(Array("documents"))
.setOutputCol("mpnet_embeddings")

val pipeline = new Pipeline().setStages(Array(document_assembler, embeddings))

val pipelineModel = pipeline.fit(data)

val pipelineDF = pipelineModel.transform(data)


```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|579_stmodel_product_rem_v3a|
|Compatibility:|Spark NLP 5.1.1+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents]|
|Output Labels:|[mpnet_embeddings]|
|Language:|en|
|Size:|407.2 MB|

## References

https://huggingface.co/jamiehudson/579-STmodel-product-rem-v3a
93 changes: 93 additions & 0 deletions docs/_posts/ahmedlone127/2023-09-07-abstract_sim_query_en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,93 @@
---
layout: model
title: English abstract_sim_query MPNetEmbeddings from biu-nlp
author: John Snow Labs
name: abstract_sim_query
date: 2023-09-07
tags: [mpnet, en, open_source, onnx]
task: Embeddings
language: en
edition: Spark NLP 5.1.1
spark_version: 3.0
supported: true
engine: onnx
annotator: MPNetEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained MPNetEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`abstract_sim_query` is a English model originally trained by biu-nlp.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/abstract_sim_query_en_5.1.1_3.0_1694125761744.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/abstract_sim_query_en_5.1.1_3.0_1694125761744.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python


document_assembler = DocumentAssembler() \
.setInputCol("text") \
.setOutputCol("documents")


embeddings =MPNetEmbeddings.pretrained("abstract_sim_query","en") \
.setInputCols(["documents"]) \
.setOutputCol("mpnet_embeddings")

pipeline = Pipeline().setStages([document_assembler, embeddings])

pipelineModel = pipeline.fit(data)

pipelineDF = pipelineModel.transform(data)

```
```scala


val document_assembler = new DocumentAssembler()
.setInputCol("text")
.setOutputCol("documents")

val embeddings = MPNetEmbeddings
.pretrained("abstract_sim_query", "en")
.setInputCols(Array("documents"))
.setOutputCol("mpnet_embeddings")

val pipeline = new Pipeline().setStages(Array(document_assembler, embeddings))

val pipelineModel = pipeline.fit(data)

val pipelineDF = pipelineModel.transform(data)


```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|abstract_sim_query|
|Compatibility:|Spark NLP 5.1.1+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents]|
|Output Labels:|[mpnet_embeddings]|
|Language:|en|
|Size:|406.8 MB|

## References

https://huggingface.co/biu-nlp/abstract-sim-query
Loading

0 comments on commit eec96df

Please sign in to comment.