2.0
Pydriller 2.0 finally arrived! Biggest updates are:
- Multithreading is now supported! With the flag
num_workers
you can specify how many threads you want to use (default = 1). With threads you can be much faster: when analysing the entire history of apache hadoop, I went from 11.04 minutes to 1.46 minutes! - It was time to rename and shorten the main classes:
- RepositoryMining -> Repository
- GitRepository -> Git
- Renamed
Modification
to a more meaningful class nameModifiedFile