2023-09-15-distilbert_base_german_cased_de (#13984)
* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_jaese_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_apatidar0_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_gg1313_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_2006_en

* Add model 2023-09-15-distilbert_mlm_best_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_edraper88_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_thangvip_en

* Add model 2023-09-15-distilbert_pubmed_mlm_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_game_accelerate_v2_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_reza93v_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_raphaelmerx_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_qianyu88_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_hemanth11_en

* Add model 2023-09-15-erwt_year_southern_sotho_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_prasanthin_en

* Add model 2023-09-15-distilabena_base_v2_asante_twi_uncased_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_juancopi81_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_physhunter_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_outop_y_en

* Add model 2023-09-15-eighteenth_century_distilbert_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_dave_sheets_en

* Add model 2023-09-15-few_mask_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_test_headline_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_peterhsu_en

* Add model 2023-09-15-film20000distilbert_base_uncased_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_rd124_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_geolearner_en

* Add model 2023-09-15-lsg_distilbert_base_uncased_4096_en

* Add model 2023-09-15-distilbert_base_multilingual_cased_bulgarian_wikipedia_xx

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_qianyu88_en

* Add model 2023-09-15-distilbert_base_uncased_mlm_tamil_local_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_jchhabra_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_dewa_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_kyle2023_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_caroline_betbeze_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_1997_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_1994_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_1966_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_recipe_accelerate_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_2008_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_jake777_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_1988_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_2021_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_feeeper_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_mchalek_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_v2_accelerate_en

* Add model 2023-09-15-500_sdb_tbb_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_ccnews_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_wjbmattingly_en

* Add model 2023-09-15-marathi_distilbert_pretrained_mr

* Add model 2023-09-15-remote_sensing_distilbert_cased_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_guidoivetta_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_solver_paul_en

* Add model 2023-09-15-inisw08_distilbert_mlm_lion_32bit_test_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_brenton_en

* Add model 2023-09-15-20split_dataset_version1_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_v2_francesco_a_en

* Add model 2023-09-15-mtl_distilbert_base_uncased_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_1985_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_rushikesh_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_1991_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_shadowtwin41_en

* Add model 2023-09-15-aave_distil_bert_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_eitanli_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_waynechiu_en

* Add model 2023-09-15-distilbert_add_pre_training_dim_96_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_rap_lyrics_v1_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_iven5880_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_vsrinivas_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_2000_en

* Add model 2023-09-15-burmese_finetuned_distilbert_portuguese_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_tweet_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_discord_en

* Add model 2023-09-15-distilbert_base_english_chinese_hindi_cased_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_junchengding_en

* Add model 2023-09-15-distilbert_base_uncased_sparse_90_unstructured_pruneofa_en

* Add model 2023-09-15-tod_distilbert_jnt_v1_en

* Add model 2023-09-15-distilbert_embeddings_clinical_en

* Add model 2023-09-15-absa_with_maskedlm_finetuned_sentihood_en

* Add model 2023-09-15-distilbert_perigon_200k_en

* Add model 2023-09-15-distilbert_base_uncased_wholewordmasking_finetuned_imdb_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_sonali_behera_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_nicolacandussi_en

* Add model 2023-09-15-distilbert_base_english_french_spanish_german_chinese_cased_en

* Add model 2023-09-15-first_try_4_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_tanvirkhan_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_golightly_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_harshseth_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_im_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_tkoyama_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_caroline_betbeze_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_vanhoan_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_talha185_en

* Add model 2023-09-15-distilbert_ravenk_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_arthuerwang_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_cnn_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_robkayinto_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_lokeshsoni2801_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_rdvdsn_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_dchung117_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_mholi_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_arunadiraju_en

* Add model 2023-09-15-distilbert_splade_en

* Add model 2023-09-15-hinglish_distilbert_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_hina_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_amazon_review_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_mascariddu8_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_mbateman_en

* Add model 2023-09-15-distilbert_base_english_french_chinese_japanese_vietnamese_cased_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_2013_en

* Add model 2023-09-15-distilbert_base_english_thai_cased_en

* Add model 2023-09-15-distilbert_base_english_japanese_cased_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_sgasparorippa_en

* Add model 2023-09-15-distilabena_base_v2_akuapem_twi_cased_en

* Add model 2023-09-15-experiment_en

* Add model 2023-09-15-flang_distilbert_en

* Add model 2023-09-15-bert_base_uncased_finetuned_imdb_accelerate_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_2002_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_outop_j_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_snousias_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_anikaai_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_suzuki0829_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_allocation_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_1998_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_himani_auto_en

* Add model 2023-09-15-bert_tuned_trial_20_12_2022_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_dieexbr_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_sophon_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_himani_auto_text_gen_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_nugget00_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_yangwooko_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_cchychen_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_gautam1989_en

* Add model 2023-09-15-distilbert_embeddings_base_uncased_finetuned_imdb_accelerate_en

* Add model 2023-09-15-bertino_lsg_en

* Add model 2023-09-15-distilbert_base_spanish_uncased_model_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_cssupport_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_terps_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_vanhoan_en

* Add model 2023-09-15-fine_tuned_distilbert_nosql_injection_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_elggman_en

* Add model 2023-09-15-sae_distilbert_base_uncased_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_iotengtr_en

* Add model 2023-09-15-distilbert_base_uncased_malayalam_arxiv_papers_en

* Add model 2023-09-15-clr_pretrained_distilbert_base_uncased_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_2020_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_2011_en

* Add model 2023-09-15-distilbert_base_uncased_imdb_distilbert_en

* Add model 2023-09-15-inisw08_distilbert_mlm_adamw_torch_fused_en

* Add model 2023-09-15-distilbert_mlm_500k_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_eusojk_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_crypto_en

* Add model 2023-09-15-spladex_zs_en

* Add model 2023-09-15-hf_distilbert_imdb_mlm_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_fadliaulawi_en

* Add model 2023-09-15-distilabena_base_asante_twi_uncased_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_hilariooliveira_en

* Add model 2023-09-15-distilbert_classification_eplorer_en

* Add model 2023-09-15-distilbert_hemingway_sar_en

* Add model 2023-09-15-distilbert_base_english_lithuanian_cased_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_himani_m_en

* Add model 2023-09-15-splade_v2_max_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_sgr23_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_peterhsu_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_shadowtwin41_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_evincent18_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_fetch_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_cleandata_en

* Add model 2023-09-15-distilbert_base_multilingual_cased_finetuned_kintweetse_xx

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_tlapusan_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_tsobolev_en

* Add model 2023-09-15-dummy_model_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_himani_auto_gen_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_surjray_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_2022_en

* Add model 2023-09-15-clinical_bert_finetuned_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_thangvip_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_coreyabs_db_en

* Add model 2023-09-15-distilbert_base_uncased_mask_finetuned_imdb_v1_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_spasis_en

* Add model 2023-09-15-erwt_year_en

* Add model 2023-09-15-bertino_it

* Add model 2023-09-15-distilbert_base_uncased_finetuned_himani_auto_textgeneration_en

* Add model 2023-09-15-distilbert_base_uncased_imdb_accelerate_en

* Add model 2023-09-15-film20000film20000distilbert_base_uncased_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_fadliaulawi_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_ttmusic_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_lakecrimsonn_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_huggingface_course_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_coreyabs_db_en

* Add model 2023-09-15-dbert_finetuned_g_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_cl_wood_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_raulgdp_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_jwchung_en

* Add model 2023-09-15-distilbert_base_uncased_sparse_80_1x4_block_pruneofa_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_averageandyyy_en

* Add model 2023-09-15-test_text_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_averageandyyy_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_francesc_en

* Add model 2023-09-15-distilbert_base_turkish_cased_offensive_mlm_tr

* Add model 2023-09-15-distilbert_base_english_french_spanish_portuguese_italian_cased_xx

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_techtank_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_cvent_2019_2022_en

* Add model 2023-09-15-distilbert_base_uncased_mlm_scirepeval_fos_chemistry_en

* Add model 2023-09-15-100_sdb_tbb_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_2009_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_yuto01_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_thetaphipsi_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_rugo_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_guoguo_en

* Add model 2023-09-15-yolochess_mlm_azure_cloud_35_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_2019_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_ryanlai_en

* Add model 2023-09-15-distilbert_embeddings_base_uncased_continued_training_medqa_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_luzimu_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_thutrang_en

* Add model 2023-09-15-javanese_distilbert_small_jv

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_johnyyhk_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_finetuned_imdb_chenyanjin_en

* Add model 2023-09-15-spladex_tt_spanish_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_gtxygyzb_en

* Add model 2023-09-15-erwt_year_masked_75_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_andrewr_en

* Add model 2023-09-15-distilbert_base_english_urdu_cased_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_sungchun71_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_1992_en

* Add model 2023-09-15-pt_distilbert_base_en

* Add model 2023-09-15-distilbert_base_english_french_german_cased_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_mlm_accelerate_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_nugget00_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_tux_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_squad_d5716d28_gostrive_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_1965_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_renyulin_en

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_2016_en

* Add model 2023-09-15-distilbert_finetuned_spmlm_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_dipika09_en

* Add model 2023-09-15-distilbert_mlm_750k_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_golightly_en

* Add model 2023-09-15-distilbert_base_english_french_spanish_cased_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_minye819_en

* Add model 2023-09-15-distilbert_base_english_russian_cased_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_ryanlai_en

* Add model 2023-09-15-kaz_legal_distilbert_legal_corpus_312818008_words_4.945454545454545_en

* Add model 2023-09-15-distilbert_base_indonesian_id

* Add model 2023-09-15-we4lkd_aml_distilbert_1921_1977_en

* Add model 2023-09-15-inisw08_distilbert_mlm_adamw_torch_0608_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_jjinbbangman_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_imdb_accelerate_dmlea_en

* Add model 2023-09-15-distilbert_base_uncased_finetuned_himani_auto_text_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>
jsl-models and ahmedlone127 authored Sep 16, 2023
1 parent 0f86237 commit d9fdbf0
Showing 600 changed files with 54,566 additions and 28 deletions.
93 changes: 93 additions & 0 deletions docs/_posts/ahmedlone127/2023-09-15-100_sdb_tbb_en.md
@@ -0,0 +1,93 @@
---
layout: model
title: English 100_sdb_tbb DistilBertEmbeddings from sripadhstudy
author: John Snow Labs
name: 100_sdb_tbb
date: 2023-09-15
tags: [distilbert, en, open_source, fill_mask, onnx]
task: Embeddings
language: en
edition: Spark NLP 5.1.2
spark_version: 3.0
supported: true
engine: onnx
annotator: DistilBertEmbeddings
article_header:
  type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained DistilBertEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `100_sdb_tbb` is an English model originally trained by sripadhstudy.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/100_sdb_tbb_en_5.1.2_3.0_1694784123220.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/100_sdb_tbb_en_5.1.2_3.0_1694784123220.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}
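
The archive file name in the links above appears to follow the pattern `<name>_<language>_<Spark NLP version>_<Spark version>_<timestamp>.zip`, which lines up with the `name`, `language`, `edition`, and `spark_version` fields in the front matter. The sketch below splits a file name along that pattern. The helper `parse_archive_name` and the `ModelArchive` tuple are hypothetical names introduced here for illustration, and reading the last field as a Unix timestamp in milliseconds is an assumption, not something this page states.

```python
from datetime import datetime, timezone
from typing import NamedTuple


class ModelArchive(NamedTuple):
    """Fields assumed to be encoded in a model archive file name."""
    name: str
    language: str
    spark_nlp_version: str
    spark_version: str
    uploaded_at: datetime


def parse_archive_name(file_name: str) -> ModelArchive:
    """Split '<name>_<lang>_<spark-nlp>_<spark>_<epoch-ms>.zip' into its parts."""
    stem = file_name[: -len(".zip")] if file_name.endswith(".zip") else file_name
    # The model name may itself contain underscores, so split from the right.
    name, language, spark_nlp_version, spark_version, epoch_ms = stem.rsplit("_", 4)
    # Assumption: the trailing number is a Unix timestamp in milliseconds.
    uploaded_at = datetime.fromtimestamp(int(epoch_ms) / 1000, tz=timezone.utc)
    return ModelArchive(name, language, spark_nlp_version, spark_version, uploaded_at)


archive = parse_archive_name("100_sdb_tbb_en_5.1.2_3.0_1694784123220.zip")
print(archive.name, archive.language, archive.spark_nlp_version, archive.spark_version)
print(archive.uploaded_at.date())
```

Under that assumption the timestamp decodes to 2023-09-15 (UTC), which agrees with the `date` field of this model card.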

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
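```python
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, DistilBertEmbeddings
from pyspark.ml import Pipeline

document_assembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("documents")

# DistilBertEmbeddings reads a "token" column, so a Tokenizer stage is required.
tokenizer = Tokenizer().setInputCols(["documents"]).setOutputCol("token")
embeddings = DistilBertEmbeddings.pretrained("100_sdb_tbb", "en") \
    .setInputCols(["documents", "token"]) \
    .setOutputCol("embeddings")

pipeline = Pipeline().setStages([document_assembler, tokenizer, embeddings])
pipelineModel = pipeline.fit(data)  # `data`: a Spark DataFrame with a "text" column
pipelineDF = pipelineModel.transform(data)
```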
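```scala
import com.johnsnowlabs.nlp.DocumentAssembler
import com.johnsnowlabs.nlp.annotators.Tokenizer
import com.johnsnowlabs.nlp.embeddings.DistilBertEmbeddings
import org.apache.spark.ml.Pipeline

val document_assembler = new DocumentAssembler()
    .setInputCol("text")
    .setOutputCol("documents")
// DistilBertEmbeddings reads a "token" column, so a Tokenizer stage is required.
val tokenizer = new Tokenizer().setInputCols(Array("documents")).setOutputCol("token")
val embeddings = DistilBertEmbeddings
    .pretrained("100_sdb_tbb", "en")
    .setInputCols(Array("documents", "token"))
    .setOutputCol("embeddings")

val pipeline = new Pipeline().setStages(Array(document_assembler, tokenizer, embeddings))
val pipelineModel = pipeline.fit(data) // `data`: a DataFrame with a "text" column
val pipelineDF = pipelineModel.transform(data)
```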
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|100_sdb_tbb|
|Compatibility:|Spark NLP 5.1.2+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents, token]|
|Output Labels:|[embeddings]|
|Language:|en|
|Size:|249.0 MB|

## References

https://huggingface.co/sripadhstudy/100_SDB_TBB
93 changes: 93 additions & 0 deletions docs/_posts/ahmedlone127/2023-09-15-20split_dataset_version1_en.md
@@ -0,0 +1,93 @@
---
layout: model
title: English 20split_dataset_version1 DistilBertEmbeddings from Billwzl
author: John Snow Labs
name: 20split_dataset_version1
date: 2023-09-15
tags: [distilbert, en, open_source, fill_mask, onnx]
task: Embeddings
language: en
edition: Spark NLP 5.1.2
spark_version: 3.0
supported: true
engine: onnx
annotator: DistilBertEmbeddings
article_header:
  type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained DistilBertEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `20split_dataset_version1` is an English model originally trained by Billwzl.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/20split_dataset_version1_en_5.1.2_3.0_1694780611703.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/20split_dataset_version1_en_5.1.2_3.0_1694780611703.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
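```python
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, DistilBertEmbeddings
from pyspark.ml import Pipeline

document_assembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("documents")

# DistilBertEmbeddings reads a "token" column, so a Tokenizer stage is required.
tokenizer = Tokenizer().setInputCols(["documents"]).setOutputCol("token")
embeddings = DistilBertEmbeddings.pretrained("20split_dataset_version1", "en") \
    .setInputCols(["documents", "token"]) \
    .setOutputCol("embeddings")

pipeline = Pipeline().setStages([document_assembler, tokenizer, embeddings])
pipelineModel = pipeline.fit(data)  # `data`: a Spark DataFrame with a "text" column
pipelineDF = pipelineModel.transform(data)
```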
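```scala
import com.johnsnowlabs.nlp.DocumentAssembler
import com.johnsnowlabs.nlp.annotators.Tokenizer
import com.johnsnowlabs.nlp.embeddings.DistilBertEmbeddings
import org.apache.spark.ml.Pipeline

val document_assembler = new DocumentAssembler()
    .setInputCol("text")
    .setOutputCol("documents")
// DistilBertEmbeddings reads a "token" column, so a Tokenizer stage is required.
val tokenizer = new Tokenizer().setInputCols(Array("documents")).setOutputCol("token")
val embeddings = DistilBertEmbeddings
    .pretrained("20split_dataset_version1", "en")
    .setInputCols(Array("documents", "token"))
    .setOutputCol("embeddings")

val pipeline = new Pipeline().setStages(Array(document_assembler, tokenizer, embeddings))
val pipelineModel = pipeline.fit(data) // `data`: a DataFrame with a "text" column
val pipelineDF = pipelineModel.transform(data)
```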
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|20split_dataset_version1|
|Compatibility:|Spark NLP 5.1.2+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents, token]|
|Output Labels:|[embeddings]|
|Language:|en|
|Size:|247.3 MB|

## References

https://huggingface.co/Billwzl/20split_dataset_version1
93 changes: 93 additions & 0 deletions docs/_posts/ahmedlone127/2023-09-15-20split_dataset_version2_en.md
@@ -0,0 +1,93 @@
---
layout: model
title: English 20split_dataset_version2 DistilBertEmbeddings from Billwzl
author: John Snow Labs
name: 20split_dataset_version2
date: 2023-09-15
tags: [distilbert, en, open_source, fill_mask, onnx]
task: Embeddings
language: en
edition: Spark NLP 5.1.2
spark_version: 3.0
supported: true
engine: onnx
annotator: DistilBertEmbeddings
article_header:
  type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained DistilBertEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `20split_dataset_version2` is an English model originally trained by Billwzl.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/20split_dataset_version2_en_5.1.2_3.0_1694781304116.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/20split_dataset_version2_en_5.1.2_3.0_1694781304116.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
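```python
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, DistilBertEmbeddings
from pyspark.ml import Pipeline

document_assembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("documents")

# DistilBertEmbeddings reads a "token" column, so a Tokenizer stage is required.
tokenizer = Tokenizer().setInputCols(["documents"]).setOutputCol("token")
embeddings = DistilBertEmbeddings.pretrained("20split_dataset_version2", "en") \
    .setInputCols(["documents", "token"]) \
    .setOutputCol("embeddings")

pipeline = Pipeline().setStages([document_assembler, tokenizer, embeddings])
pipelineModel = pipeline.fit(data)  # `data`: a Spark DataFrame with a "text" column
pipelineDF = pipelineModel.transform(data)
```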
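```scala
import com.johnsnowlabs.nlp.DocumentAssembler
import com.johnsnowlabs.nlp.annotators.Tokenizer
import com.johnsnowlabs.nlp.embeddings.DistilBertEmbeddings
import org.apache.spark.ml.Pipeline

val document_assembler = new DocumentAssembler()
    .setInputCol("text")
    .setOutputCol("documents")
// DistilBertEmbeddings reads a "token" column, so a Tokenizer stage is required.
val tokenizer = new Tokenizer().setInputCols(Array("documents")).setOutputCol("token")
val embeddings = DistilBertEmbeddings
    .pretrained("20split_dataset_version2", "en")
    .setInputCols(Array("documents", "token"))
    .setOutputCol("embeddings")

val pipeline = new Pipeline().setStages(Array(document_assembler, tokenizer, embeddings))
val pipelineModel = pipeline.fit(data) // `data`: a DataFrame with a "text" column
val pipelineDF = pipelineModel.transform(data)
```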
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|20split_dataset_version2|
|Compatibility:|Spark NLP 5.1.2+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents, token]|
|Output Labels:|[embeddings]|
|Language:|en|
|Size:|247.0 MB|

## References

https://huggingface.co/Billwzl/20split_dataset_version2
93 changes: 93 additions & 0 deletions docs/_posts/ahmedlone127/2023-09-15-20split_dataset_version3_en.md
@@ -0,0 +1,93 @@
---
layout: model
title: English 20split_dataset_version3 DistilBertEmbeddings from Billwzl
author: John Snow Labs
name: 20split_dataset_version3
date: 2023-09-15
tags: [distilbert, en, open_source, fill_mask, onnx]
task: Embeddings
language: en
edition: Spark NLP 5.1.2
spark_version: 3.0
supported: true
engine: onnx
annotator: DistilBertEmbeddings
article_header:
  type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained DistilBertEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `20split_dataset_version3` is an English model originally trained by Billwzl.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/20split_dataset_version3_en_5.1.2_3.0_1694781556318.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/20split_dataset_version3_en_5.1.2_3.0_1694781556318.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
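```python
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, DistilBertEmbeddings
from pyspark.ml import Pipeline

document_assembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("documents")

# DistilBertEmbeddings reads a "token" column, so a Tokenizer stage is required.
tokenizer = Tokenizer().setInputCols(["documents"]).setOutputCol("token")
embeddings = DistilBertEmbeddings.pretrained("20split_dataset_version3", "en") \
    .setInputCols(["documents", "token"]) \
    .setOutputCol("embeddings")

pipeline = Pipeline().setStages([document_assembler, tokenizer, embeddings])
pipelineModel = pipeline.fit(data)  # `data`: a Spark DataFrame with a "text" column
pipelineDF = pipelineModel.transform(data)
```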
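```scala
import com.johnsnowlabs.nlp.DocumentAssembler
import com.johnsnowlabs.nlp.annotators.Tokenizer
import com.johnsnowlabs.nlp.embeddings.DistilBertEmbeddings
import org.apache.spark.ml.Pipeline

val document_assembler = new DocumentAssembler()
    .setInputCol("text")
    .setOutputCol("documents")
// DistilBertEmbeddings reads a "token" column, so a Tokenizer stage is required.
val tokenizer = new Tokenizer().setInputCols(Array("documents")).setOutputCol("token")
val embeddings = DistilBertEmbeddings
    .pretrained("20split_dataset_version3", "en")
    .setInputCols(Array("documents", "token"))
    .setOutputCol("embeddings")

val pipeline = new Pipeline().setStages(Array(document_assembler, tokenizer, embeddings))
val pipelineModel = pipeline.fit(data) // `data`: a DataFrame with a "text" column
val pipelineDF = pipelineModel.transform(data)
```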
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|20split_dataset_version3|
|Compatibility:|Spark NLP 5.1.2+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents, token]|
|Output Labels:|[embeddings]|
|Language:|en|
|Size:|247.2 MB|

## References

https://huggingface.co/Billwzl/20split_dataset_version3