A Python tool for automated conversion of BibTeX entries to DBLP format. The tool uses the DBLP API to search for entries and match them with the input BibTeX file. This is particularly useful when you have references from various sources (e.g., arXiv, Google Scholar, conference websites) and want to standardize them using DBLP's detailed up-to-date metadata.
- Levenshtein distance-based similarity scoring for titles (threshold: 0.7) and authors (threshold: 0.4)
- Support for various author name formats and normalizations
- It handles nested DBLP response structures
- Checkpoint-based saving with resume capability to avoid rate-limiting. It saves the progress after each entry, so it can be resumed from the same point (if using the same log file).
- Rate limiting and exponential backoff for API requests
git clone https://github.com/bilgehanertan/bibtex2dblp.git
cd bibtex2dblp
pip install -r requirements.txt
python bibtex2dblp.py input.bib [output.bib] [log.csv]
Arguments:
input.bib
: Input BibTeX file (required)output.bib
: Output file for DBLP entries (default: 'output.bib')log.csv
: Conversion log file (default: 'log.csv')
The tool generates:
- A BibTeX file containing DBLP-formatted entries
- A CSV log file tracking:
- Original Key
- Title
- Authors
- DBLP Found
- DBLP Key
- DBLP Title
- Python 3.6+
- bibtexparser
- requests
- python-Levenshtein
- DBLP for providing their API