SecBERT
is a BERT model trained on cyber security text; it has learned cyber security domain knowledge.
SecBERT is trained on papers from a cyber security text corpus.
SecBERT has its own vocabulary (secvocab) that is built to best match the training corpus. We trained both SecBERT and SecRoBERTa versions.
SecBERT models are now available directly within Hugging Face's transformers framework:
from transformers import AutoTokenizer, AutoModelForMaskedLM

# SecBERT
tokenizer = AutoTokenizer.from_pretrained("jackaduma/SecBERT")
model = AutoModelForMaskedLM.from_pretrained("jackaduma/SecBERT")

# SecRoBERTa
tokenizer = AutoTokenizer.from_pretrained("jackaduma/SecRoBERTa")
model = AutoModelForMaskedLM.from_pretrained("jackaduma/SecRoBERTa")
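As a quick sanity check of the custom secvocab vocabulary, you can compare how the SecBERT tokenizer and the original BERT tokenizer split the same sentence. This is only an illustrative sketch; the example sentence below is made up and is not part of the training corpus:

from transformers import AutoTokenizer

# Load the SecBERT tokenizer (custom secvocab) and the original BERT tokenizer.
sec_tokenizer = AutoTokenizer.from_pretrained("jackaduma/SecBERT")
base_tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

text = "The malware established persistence through a registry run key."

# Compare how each vocabulary splits the same cyber security sentence.
print("SecBERT:", sec_tokenizer.tokenize(text))
print("BERT:   ", base_tokenizer.tokenize(text))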
We release the PyTorch version of the trained models. The PyTorch version is created using the Hugging Face library, and this repo shows how to use it.
SecBERT models include all the necessary files to be plugged into your own model and are in the same format as BERT.
If you use PyTorch, refer to Hugging Face's repo where detailed instructions on using BERT models are provided.
We propose building a language model that works on cyber security text; as a result, it can improve downstream tasks (NER, text classification, semantic understanding, Q&A) in the cyber security domain.
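As a minimal sketch of reusing SecBERT for such a downstream task, you can load the weights behind a standard Hugging Face classification head and fine-tune it on your own labeled data. The two-label setup below is a placeholder for illustration, not something shipped with this repo:

from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("jackaduma/SecBERT")
# The classification head is randomly initialized; num_labels=2 is only an example.
model = AutoModelForSequenceClassification.from_pretrained("jackaduma/SecBERT", num_labels=2)

inputs = tokenizer("Attackers exploited a remote code execution vulnerability.", return_tensors="pt")
logits = model(**inputs).logits  # fine-tune on labeled data before relying on these scores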
First, the example below shows the fill-mask pipeline in Google's BERT, AllenAI's SciBERT, and our SecBERT.
cd lm
python eval_fillmask_lm.py
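A rough, standalone equivalent of that comparison using the Hugging Face fill-mask pipeline might look like the following. The masked sentence is only an illustration and its predictions are not taken from the script above:

from transformers import pipeline

# Models to compare on the same masked sentence.
models = {
    "BERT": "bert-base-uncased",
    "SciBERT": "allenai/scibert_scivocab_uncased",
    "SecBERT": "jackaduma/SecBERT",
}

text = "The attacker used a phishing [MASK] to deliver the malicious payload."

for name, model_id in models.items():
    fill = pipeline("fill-mask", model=model_id)
    # Each tokenizer defines its own mask token ([MASK] for BERT-style models).
    masked = text.replace("[MASK]", fill.tokenizer.mask_token)
    predictions = fill(masked, top_k=3)
    print(name, [p["token_str"] for p in predictions])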
If this project helps you reduce development time, you can buy me a cup of coffee :)
AliPay (支付宝)
WeChat Pay (微信)
MIT © Kun