I am getting ready to do some timeseries analysis on a git repo with python, my first step is to figure out a way to list all of the git commits so that I can analyze each one however I want. The GitPython library made this almost trivial once I realized how.
from git import Repo repo = Repo('.') commits = repo.iter_commits()
This returns a generator, if you are iterating over them this is likely what you want.
commits # <generator object Commit._iter_from_process_or_stream at 0x7f3307584510>
The generator will return
git.Commit objects with lots of information about each commit such as
next(commits) # <git.Commit "d125317892d0fab10a36638a2d23356ba25c5621">