GitHub - Mint-hfut/One2MultiSeq

This repository contains the code for our paper: Training with One2MultiSeq: CopyBART for Social Media Keyphrase Generation.

Dataset

The datasets can be downloaded from here

For more details about the Twitter dataset, please reference here or contact us at gaochunyang@mail.hfut.edu.cn

Prepocessing

To preprocess the source data, run: python One2MultiSeq_dataprocess.py

Training

To preprocess the source data, run: python train_One2MultiSeq.py After the training, you can change model_name in line 707 to the path of the trained model(for example, model_name = 'models/temp_model/CMKP/CopyBART_One2MultiSeq_base_epochs-10_learning_rate-5e-05_batch_size-32_seed-100') and set is_train = False in train_One2MultiSeq.py.

Note:

Please download and unzip the datasets in the ./data directory first.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
data		data
models		models
On2MultiSeq_dataprocess.py		On2MultiSeq_dataprocess.py
One2MultiSeq.py		One2MultiSeq.py
One2Set.py		One2Set.py
One2Set_dataprocess.py		One2Set_dataprocess.py
README.md		README.md
seq2seq_trainer_.py		seq2seq_trainer_.py
train_One2MultiSeq.py		train_One2MultiSeq.py
train_one2set.py		train_one2set.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dataset

Prepocessing

Training

About

Releases

Packages

Languages

Mint-hfut/One2MultiSeq

Folders and files

Latest commit

History

Repository files navigation

Dataset

Prepocessing

Training

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages