Recent Releases of pydriller

pydriller - v2.9

Update lizard to latest version.

- Python
Published by ishepard 6 months ago

pydriller - 2.8

What's Changed

  • Cache the _stats function by @IP1102 in https://github.com/ishepard/pydriller/pull/305
  • Fix some small formatting faults in the documentation by @BaumiCoder in https://github.com/ishepard/pydriller/pull/306
  • fix: add missed files to package by @n-bes in https://github.com/ishepard/pydriller/pull/313

New Contributors

  • @BaumiCoder made their first contribution in https://github.com/ishepard/pydriller/pull/306
  • @n-bes made their first contribution in https://github.com/ishepard/pydriller/pull/313

Full Changelog: https://github.com/ishepard/pydriller/compare/2.7...2.8

- Python
Published by ishepard 8 months ago

pydriller - 2.7

What's Changed

  • Check for Git > 2.38 by @alaws-scottlogic in https://github.com/ishepard/pydriller/pull/294
  • Update repository.rst by @abdulxs in https://github.com/ishepard/pydriller/pull/297
  • Handle exception when analyzing an empty repository by @IP1102 in https://github.com/ishepard/pydriller/pull/300
  • FEAT(CodeChurn): Add method to return lines added and removed per file by @BrenoFariasdaSilva in https://github.com/ishepard/pydriller/pull/299
  • Add support of Git mailmap files by @HelgeCPH in https://github.com/ishepard/pydriller/pull/303
  • Add description and help for local development by @HelgeCPH in https://github.com/ishepard/pydriller/pull/304

New Contributors

  • @alaws-scottlogic made their first contribution in https://github.com/ishepard/pydriller/pull/294
  • @abdulxs made their first contribution in https://github.com/ishepard/pydriller/pull/297
  • @IP1102 made their first contribution in https://github.com/ishepard/pydriller/pull/300
  • @BrenoFariasdaSilva made their first contribution in https://github.com/ishepard/pydriller/pull/299

Full Changelog: https://github.com/ishepard/pydriller/compare/2.6...2.7

- Python
Published by ishepard over 1 year ago

pydriller - 2.6

What's Changed

  • Add support for co-author retrieval by @HelgeCPH in https://github.com/ishepard/pydriller/pull/284
  • Add diff function in git.py by @VelaYF in https://github.com/ishepard/pydriller/pull/286
  • recognize "git://" prefix as remote repository by @babenek in https://github.com/ishepard/pydriller/pull/287
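Co-authors are conventionally recorded as "Co-authored-by:" trailers in the commit message. As a rough illustration of what co-author retrieval involves, the sketch below pulls those trailers out with a regular expression. The function name parse_co_authors is hypothetical, and this is not necessarily how PyDriller implements it.

```python
import re

# Matches trailer lines such as "Co-authored-by: Jane Doe <jane@example.com>".
CO_AUTHOR = re.compile(
    r"^co-authored-by:[ \t]*(?P<name>[^<\n]+?)[ \t]*<(?P<email>[^>\n]+)>[ \t]*$",
    re.IGNORECASE | re.MULTILINE,
)


def parse_co_authors(message: str) -> list[tuple[str, str]]:
    """Return (name, email) pairs for every co-author trailer in a commit message."""
    return [(m.group("name"), m.group("email")) for m in CO_AUTHOR.finditer(message)]
```

Calling parse_co_authors on a message without trailers simply returns an empty list.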

New Contributors

  • @VelaYF made their first contribution in https://github.com/ishepard/pydriller/pull/286
  • @babenek made their first contribution in https://github.com/ishepard/pydriller/pull/287

Full Changelog: https://github.com/ishepard/pydriller/compare/2.5.1...2.6

- Python
Published by ishepard over 2 years ago

pydriller - 2.5.1

What's Changed

  • Stop using GitPython to parse git stats by @ishepard in https://github.com/ishepard/pydriller/pull/281

Full Changelog: https://github.com/ishepard/pydriller/compare/2.5...2.5.1

- Python
Published by ishepard over 2 years ago

pydriller - 2.5

What's Changed

  • Added since_as_filter by @cmtg in https://github.com/ishepard/pydriller/pull/256
  • Rephrased Paragraph and fixed small typo in documentation by @Knniff in https://github.com/ishepard/pydriller/pull/257
  • Improve modified files by @ishepard in https://github.com/ishepard/pydriller/pull/263
  • #266: Fix from commit filter by @ishepard in https://github.com/ishepard/pydriller/pull/270

New Contributors

  • @cmtg made their first contribution in https://github.com/ishepard/pydriller/pull/256
  • @Knniff made their first contribution in https://github.com/ishepard/pydriller/pull/257

Full Changelog: https://github.com/ishepard/pydriller/compare/2.4.1...2.5

- Python
Published by ishepard over 2 years ago

pydriller - 2.4.1

What's Changed

  • Fix calculation of Method.hash by @otethal in https://github.com/ishepard/pydriller/pull/253

Full Changelog: https://github.com/ishepard/pydriller/compare/2.4...2.4.1

- Python
Published by ishepard about 3 years ago

pydriller - 2.4

What's Changed

  • Fix multithread memory consumption by @ishepard in https://github.com/ishepard/pydriller/pull/245
  • Use public datetime module instead of _datetime by @otethal in https://github.com/ishepard/pydriller/pull/251

New Contributors

  • @otethal made their first contribution in https://github.com/ishepard/pydriller/pull/251

Full Changelog: https://github.com/ishepard/pydriller/compare/2.3...2.4

- Python
Published by ishepard about 3 years ago

pydriller - 2.3

What's Changed

  • update document type for clone_repo_to param by @hatrg in https://github.com/ishepard/pydriller/pull/239
  • enhance method _get_repo_name_from_url to handle case when .git is part of the repo name by @hatrg in https://github.com/ishepard/pydriller/pull/240
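The ".git inside the repo name" case can be illustrated with a small sketch: only a trailing ".git" is removed, so a name such as my.github.io survives intact. The helper name repo_name_from_url is hypothetical and the logic is an assumption about the intended behaviour, not a copy of PyDriller's code.

```python
def repo_name_from_url(url: str) -> str:
    """Derive the repository folder name from a clone URL."""
    last_segment = url.rstrip("/").rsplit("/", 1)[-1]
    # Strip only a trailing ".git"; a ".git" inside the name is part of the name.
    return last_segment.removesuffix(".git")
```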

New Contributors

  • @hatrg made their first contribution in https://github.com/ishepard/pydriller/pull/239

Full Changelog: https://github.com/ishepard/pydriller/compare/2.2...2.3

- Python
Published by ishepard about 3 years ago

pydriller - 2.2

What's Changed

  • Update tutorial.rst by @stefanodallapalma in https://github.com/ishepard/pydriller/pull/209
  • Update repository.py by @stefanodallapalma in https://github.com/ishepard/pydriller/pull/212
  • Workaround Windows Temp File Cleanup Error by @totoroyyb in https://github.com/ishepard/pydriller/pull/214
  • Adding ModifiedFile.content and deprecating ModifiedFile.source_code by @stefanodallapalma in https://github.com/ishepard/pydriller/pull/220
  • Cleaning unused property setters by @stefanodallapalma in https://github.com/ishepard/pydriller/pull/221
  • Return undecoded content by @stefanodallapalma in https://github.com/ishepard/pydriller/pull/231
  • Stop packaging tests by @gremat in https://github.com/ishepard/pydriller/pull/233
  • Add CodeQL workflow for GitHub code scanning by @lgtm-com in https://github.com/ishepard/pydriller/pull/234

New Contributors

  • @totoroyyb made their first contribution in https://github.com/ishepard/pydriller/pull/214
  • @gremat made their first contribution in https://github.com/ishepard/pydriller/pull/233
  • @lgtm-com made their first contribution in https://github.com/ishepard/pydriller/pull/234

Full Changelog: https://github.com/ishepard/pydriller/compare/2.1...2.2

- Python
Published by ishepard over 3 years ago

pydriller - 2.1

What's Changed

  • Choose calculation variant to calculate the code churn metric by @glato in https://github.com/ishepard/pydriller/pull/176
  • Fixed exception on repeated run of clone_repo_to by @HelgeCPH in https://github.com/ishepard/pydriller/pull/180
  • Update README.md by @shantanuo in https://github.com/ishepard/pydriller/pull/182
  • Update repository.py docs by @stefanodallapalma in https://github.com/ishepard/pydriller/pull/185
  • Catch specific exception with malformed repo url by @andodet in https://github.com/ishepard/pydriller/pull/189
  • Fix mypy issues by @ishepard in https://github.com/ishepard/pydriller/pull/192
  • Changed documentation for commit and modifiedfile object by @k----n in https://github.com/ishepard/pydriller/pull/191
  • Add Python 3.10 to GHA workflow by @gliptak in https://github.com/ishepard/pydriller/pull/193
  • Treat URLs starting with http:// as remote by @thtrummer in https://github.com/ishepard/pydriller/pull/200
  • Add the option to include deleted files in repository mining by @12f23eddde in https://github.com/ishepard/pydriller/pull/206
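Treating http:// URLs as remote amounts to a prefix check on the repository string. A minimal sketch of such a check follows; the name is_remote and the exact prefix tuple are assumptions made for illustration (only http:// is taken from the list above).

```python
# Prefixes assumed to mark a repository as remote rather than a local path.
REMOTE_PREFIXES = ("git@", "https://", "http://")


def is_remote(repo: str) -> bool:
    """Return True when the repository string looks like a remote URL."""
    return repo.startswith(REMOTE_PREFIXES)
```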

New Contributors

  • @glato made their first contribution in https://github.com/ishepard/pydriller/pull/176
  • @shantanuo made their first contribution in https://github.com/ishepard/pydriller/pull/182
  • @andodet made their first contribution in https://github.com/ishepard/pydriller/pull/189
  • @k----n made their first contribution in https://github.com/ishepard/pydriller/pull/191
  • @gliptak made their first contribution in https://github.com/ishepard/pydriller/pull/193
  • @thtrummer made their first contribution in https://github.com/ishepard/pydriller/pull/200
  • @12f23eddde made their first contribution in https://github.com/ishepard/pydriller/pull/206

Full Changelog: https://github.com/ishepard/pydriller/compare/2.0...2.1

- Python
Published by ishepard about 4 years ago

pydriller - 2.0

Pydriller 2.0 has finally arrived! The biggest updates are:

  • Multithreading is now supported! With the flag num_workers you can specify how many threads you want to use (default = 1). With threads you can be much faster: when analysing the entire history of Apache Hadoop, I went from 11.04 minutes to 1.46 minutes!
  • It was time to rename and shorten the main classes:
      • RepositoryMining -> Repository
      • GitRepository -> Git
  • Renamed Modification to a more meaningful class name, ModifiedFile
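A minimal sketch of the new class name and the num_workers flag in use. The helper count_commits is hypothetical, and any speed-up will depend on the repository and machine.

```python
def count_commits(path_to_repo: str, num_workers: int = 1) -> int:
    """Count the commits PyDriller yields for a repository, using num_workers threads."""
    # Imported here so the sketch can be loaded even where PyDriller is not installed.
    from pydriller import Repository  # 2.0 name; formerly RepositoryMining

    repository = Repository(path_to_repo, num_workers=num_workers)
    return sum(1 for _ in repository.traverse_commits())
```

For example, count_commits("path/to/repo", num_workers=4) walks the history with four threads.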

- Python
Published by ishepard almost 5 years ago

pydriller - 1.15.5

The commit object now carries information from gitstats, namely:

  • deletions: number of deleted lines in the commit
  • insertions: number of added lines in the commit
  • lines: total number of added + deleted lines in the commit
  • files: number of files changed in the commit
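To show how these four figures relate, here is a small sketch that derives them from "git show --numstat" style output (one "added<TAB>deleted<TAB>path" row per file). The names CommitStats and stats_from_numstat are hypothetical, and PyDriller may obtain the numbers differently.

```python
from dataclasses import dataclass


@dataclass
class CommitStats:
    insertions: int  # number of added lines
    deletions: int   # number of deleted lines
    lines: int       # added + deleted lines
    files: int       # number of files changed


def stats_from_numstat(numstat: str) -> CommitStats:
    """Aggregate numstat rows of the form 'added<TAB>deleted<TAB>path'."""
    insertions = deletions = files = 0
    for row in numstat.splitlines():
        parts = row.split("\t")
        if len(parts) < 3:
            continue  # skip blank or non-numstat lines
        added, removed = parts[0], parts[1]
        files += 1
        # Binary files are reported as "-" and contribute no line counts.
        insertions += int(added) if added.isdigit() else 0
        deletions += int(removed) if removed.isdigit() else 0
    return CommitStats(insertions, deletions, insertions + deletions, files)
```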

- Python
Published by ishepard about 5 years ago

pydriller - 1.15.4

  • Remove git property from GitRepository.
  • Better tests for get_commits_last_modified_lines

- Python
Published by ishepard about 5 years ago

pydriller - 1.15.3

Bug fixes:

  • when using include_refs, the commit now has information about remote branches as well

Features:

  • support for relative paths
  • git_repo available in repository mining

Remove Python 3.5 support.

- Python
Published by ishepard over 5 years ago

pydriller - 1.15.2

Fix:

  • error that prevented directories from being deleted when cloned in a temporary directory
  • properly handle zero-change modifications in DMM

Features:

  • pydriller.__version__

- Python
Published by ishepard over 5 years ago

pydriller - 1.15

Fix:

  • fixed a bug that prevented users from passing tag filters when analysing multiple repos

- Python
Published by ishepard almost 6 years ago

pydriller - 1.13

Features:

  • new parameter order, that can accept "date-order", "author-date-order", "topo-order" and "reverse"
  • process metrics can now accept dates

Deprecated:

  • "reversed_order": now the user should use order='reverse'

Bug fix:

  • fix on the ordering of the commits, which was a bit different from what Git actually presents. We now use git filters to order the commits.
  • fix on "from_commit" and "to_commit" if they had the same commit date

- Python
Published by ishepard almost 6 years ago

pydriller - 1.12

Features:

  • a modified file now contains the methods that changed in the commit.

Thanks to @NullConvergence for the PR!

- Python
Published by ishepard almost 6 years ago

pydriller - 1.11.2

Fix:

  • fix on the log message

- Python
Published by ishepard almost 6 years ago

pydriller - 1.11.1

New features:

  • 'HEAD' can now be used as a filter in 'single', 'from_commit' or 'to_commit'
  • if the user specifies a commit that doesn't exist in a commit filter, Pydriller raises an exception
  • GitRepository doesn't contain the function "parse_diff()" anymore. It is now available in the Modification object

Bug fix:

  • 'from_commit' and 'to_commit' are automatically swapped when used with 'reversed_order'

- Python
Published by ishepard almost 6 years ago

pydriller - 1.11

Many updates for this release!

  1. new process metrics: pydriller can now calculate commits count, contributors count, contributors experience, history complexity, hunks count, lines count and minor contributors. Everything in just one line! Thank you @stefanodallapalma
  2. fixed a bug in the filters from_commit and to_commit: in some cases they returned more commits than they should have
  3. fixed a bug that was causing problems on Windows machines when too many runs were launched at the same time

Deprecated

  1. parse_diff: now you can find this information directly in the modification object

- Python
Published by ishepard about 6 years ago

pydriller - 1.10.1

  • we stopped using git hyper blame, and instead we use --ignore-revs (git > 2.23)
  • new "clone_repo_to" option to clone the repository in a predefined folder

Thanks to @marco-c and @jphgoodwin for the PRs.

- Python
Published by ishepard about 6 years ago

pydriller - 1.10

Thank you to @yusufsn, @s0nata, @stefanodallapalma for the awesome ideas (and/or PRs).

This release includes:

  • new filter to skip whitespace-only changes (see git diff -w)
  • new algorithm to diff files (see git diff --histogram)
  • a new class ProcessMetric, which can be used to obtain process metrics at file level
  • bug fix on getting the main branch of a detached repository
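To make the whitespace filter concrete, the sketch below approximates what counts as a whitespace-only change: two versions that differ, but whose lines are identical once all whitespace is removed. The helper names are hypothetical; this only mimics the effect of git diff -w and is not PyDriller's implementation.

```python
def _lines_without_whitespace(text: str) -> list[str]:
    """Return the lines of text with every whitespace character removed."""
    return ["".join(line.split()) for line in text.splitlines()]


def is_whitespace_only_change(before: str, after: str) -> bool:
    """True when the two versions differ, but only in whitespace within lines."""
    if before == after:
        return False  # nothing changed at all
    return _lines_without_whitespace(before) == _lines_without_whitespace(after)
```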

- Python
Published by ishepard about 6 years ago

pydriller - 1.9.2

Small improvements and bug fixes for git hyper-blame

- Python
Published by ishepard over 6 years ago

pydriller - 1.9.1

  • removed leftover prints from git_repository

- Python
Published by ishepard over 6 years ago

pydriller - 1.9

The function "get_commits_last_modified_lines" now uses git hyper-blame if available. Thanks to @marco-c for the PR!

- Python
Published by ishepard over 6 years ago

pydriller - 1.8

  • better handling of merge commits
  • fix bug in decoding diffs and source code that was causing many diffs to be skipped
  • include new filter only_releases
  • the modification now has the source code from before and after the change

- Python
Published by ishepard almost 7 years ago

pydriller - 1.7

Add new configuration "filepath" in RepositoryMining to obtain the list of commits that modified a file.

Thanks to @kanghj for the new feature!

- Python
Published by ishepard about 7 years ago

pydriller - 1.6.3

Fixed a bug in the parse_diff() function, which returned wrong results when the modified line was the last line of a file that did not end with a newline.
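The edge case is easier to see in code. The sketch below is a simplified, hypothetical hunk parser (parse_hunks is not the library's function) that assumes its input contains only hunks, without the "---"/"+++" file headers. The git marker "\ No newline at end of file" must be skipped rather than counted as a line, otherwise the line numbers drift.

```python
import re

# Hunk header, e.g. "@@ -10,4 +10,6 @@": captures the old and new start lines.
HUNK_HEADER = re.compile(r"^@@ -(\d+)(?:,\d+)? \+(\d+)(?:,\d+)? @@")


def parse_hunks(diff: str) -> dict:
    """Return {'added': [(line_no, text)], 'deleted': [(line_no, text)]}."""
    result = {"added": [], "deleted": []}
    old_no = new_no = 0
    for line in diff.split("\n"):
        header = HUNK_HEADER.match(line)
        if header:
            old_no, new_no = int(header.group(1)), int(header.group(2))
        elif line.startswith("\\"):
            continue  # "\ No newline at end of file" is a marker, not a file line
        elif line.startswith("+"):
            result["added"].append((new_no, line[1:]))
            new_no += 1
        elif line.startswith("-"):
            result["deleted"].append((old_no, line[1:]))
            old_no += 1
        else:
            # Context line: present in both versions.
            old_no += 1
            new_no += 1
    return result
```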

Thank you to @gotec for his contribution!

- Python
Published by ishepard about 7 years ago

pydriller - 1.6.2

Fixed an issue introduced in the last update that was causing memory swapping.

- Python
Published by ishepard about 7 years ago

pydriller - 1.6.1

Performance improvement. Modifications and Branches are now calculated only the first time they are accessed: consecutive calls on these 2 fields do not call Git anymore.

- Python
Published by ishepard about 7 years ago

pydriller - 1.6

  • 2 new filters: only_authors and only_commits.
  • bug fix: the modification type can be unknown now. Thanks to @lucapascarella for the PR!

- Python
Published by ishepard about 7 years ago

pydriller - 1.5.1

  • Bug fix: now when filtering the commits, the order is the same as git log (ordered by commit date, not author date)

- Python
Published by ishepard over 7 years ago

pydriller - 1.5

  • Bug fixes
  • only_in_main_branch doesn't exist anymore, since it was buggy
  • only_in_branches became only_in_branch: getting commits from multiple branches did not work, and it won't. PyDriller can not decide the order of commits in different branches, this decision is left to the user

- Python
Published by ishepard over 7 years ago

pydriller - 1.4.3

  • Fix bug #13: the filename was not displayed correctly on Windows machines.

- Python
Published by ishepard over 7 years ago

pydriller - 1.4.2

  • Now the Commit object has the project_path field

- Python
Published by ishepard over 7 years ago

pydriller - 1.4.1

  • Small fix of variables and functions scope in RepositoryMining

- Python
Published by ishepard over 7 years ago

pydriller - 1.4

  1. Support the analysis of multiple repos
  2. Support remote repos

- Python
Published by ishepard over 7 years ago

pydriller - 1.3

PyDriller now calculates structural metrics of every file changed in a commit: the modification object now has this info.

- Python
Published by ishepard over 7 years ago

pydriller - 1.2.3

Fix issue on resetting the repository.

Contributor: @Ledenel

- Python
Published by ishepard over 7 years ago

pydriller - 1.2.2

Speed improvement in calculating the bug inducing commits.

- Python
Published by ishepard over 7 years ago

pydriller - 1.2.1

Speed improvement

- Python
Published by ishepard over 7 years ago

pydriller - 1.2

New function "get_commits_last_modified_lines" that applies SZZ to retrieve the list of the commits that last modified some lines.

Plus, it is much faster than before since I fixed a bug in the logging of RepositoryMining. Now it analyses ~60 commits per second.

- Python
Published by ishepard over 7 years ago

pydriller - 1.1

Pydriller is now lazy!

All the attributes of a Commit object are lazily evaluated. This allows Pydriller to be up to 10x faster than before!
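Lazy evaluation of this kind can be sketched with functools.cached_property: the expensive lookup runs only when the attribute is first read, and the result is reused afterwards. LazyCommit and its load_message callable are hypothetical stand-ins, not PyDriller's Commit class.

```python
from functools import cached_property


class LazyCommit:
    """Hypothetical stand-in for a commit whose attributes are computed on first access."""

    def __init__(self, sha: str, load_message):
        self.sha = sha
        self._load_message = load_message  # callable that does the expensive work
        self.loads = 0  # counts how often the expensive work actually ran

    @cached_property
    def msg(self) -> str:
        self.loads += 1
        return self._load_message(self.sha)
```

Creating a LazyCommit costs nothing up front; attributes that are never read are never computed.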

- Python
Published by ishepard almost 8 years ago

pydriller - 1.0

initial release

- Python
Published by ishepard almost 8 years ago