Re: CVS diff statistics generation?

From: Mark D. Baushke
Subject: Re: CVS diff statistics generation?
Date: Tue, 05 Apr 2005 22:08:01 -0700

Chris Lewis <address@hidden> writes:

> As part of an evaluation I am undertaking, it is necessary for me to
> analyse the entire diff output generated by a CVS repository. I wish to
> look at statistics such as who is responsible for the percentage of
> changes to a file, to see what roles people have played in development
> of the project; whether they are specialising on one file, one module,
> or working more broadly.

You may wish to start here:

Some of the stuff that viewcvs can add to a MySQL database will be of
interest to you.

> I'd also like some more advanced features, like how many lines people
> are adding, so I can try and see who was bug fixing and who was
> actually adding new content and such. One step at a time, however!

Feel free to contribute if you add things like this.
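
For the lines-added question, one possible starting point is the
"lines: +N -M" field that 'cvs log' prints for each revision after the
first. Below is a minimal Python sketch, not an existing tool: the
function name tally_by_author and the regular expression are made up
for illustration, and it assumes the usual
'date: ...;  author: ...;  state: ...;  lines: +N -M' layout of the
log output.

```python
import re
from collections import defaultdict

# Matches the per-revision line of `cvs log`, for example:
#   date: 2005/04/05 22:08:01;  author: mdb;  state: Exp;  lines: +3 -1
# Assumption: this layout; the "lines:" field is optional because the
# first revision of a file does not carry one.
REVISION_LINE = re.compile(
    r"^date: [^;]+;\s+author: (?P<author>[^;]+);\s+state: [^;]+;"
    r"(?:\s+lines: \+(?P<added>\d+) -(?P<removed>\d+))?"
)


def tally_by_author(log_text):
    """Return {author: [lines_added, lines_removed]} from `cvs log` text."""
    totals = defaultdict(lambda: [0, 0])
    for line in log_text.splitlines():
        match = REVISION_LINE.match(line)
        if match is None:
            continue  # not a per-revision "date: ..." line
        author = match.group("author")
        # A missing "lines:" field is counted as +0 -0.
        totals[author][0] += int(match.group("added") or 0)
        totals[author][1] += int(match.group("removed") or 0)
    return dict(totals)
```

Feed it the captured output of 'cvs log' for a module. Because the first
revision of each file has no "lines:" field, it is counted as 0/0 here,
which undercounts whoever did the initial check-in.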

> My Googling for any sort of solution has been less than fruitless, the
> closest I found was 'diffstat'. 

Well, diffstat can get lost in diffs of files whose comment lines begin
with '-- ' (removing such a line gives a diff line that starts with
'--- ', which looks like a file header), and it does not understand the
'-p' annotations, so you may want to normalize your diffs first using
something like one of the patchutils programs.
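
If you would rather count lines yourself than rely on diffstat, the
sketch below shows one way to sidestep both problems in unified diffs.
It trusts the line counts in each '@@' hunk header, so a removed '-- '
comment (which shows up as '--- ...') is counted as a change instead of
being mistaken for a file header, and any '-p' text after the second
'@@' is ignored. The names are hypothetical, and it handles only
unified ('diff -u') output.

```python
import re

# Unified hunk header, e.g. "@@ -10,7 +10,8 @@ optional -p text".
# A missing count means 1, as in "@@ -3 +3 @@".
HUNK_HEADER = re.compile(r"^@@ -\d+(?:,(\d+))? \+\d+(?:,(\d+))? @@")


def count_changes(diff_text):
    """Return (added, removed) line counts for a unified diff."""
    added = removed = 0
    old_left = new_left = 0  # lines still expected in the current hunk
    for line in diff_text.splitlines():
        if old_left > 0 or new_left > 0:
            # Inside a hunk: only the first character matters, so
            # "--- old comment" is a removed line, not a file header.
            if line.startswith("+"):
                added += 1
                new_left -= 1
            elif line.startswith("-"):
                removed += 1
                old_left -= 1
            elif line.startswith("\\"):
                pass  # "\ No newline at end of file" marker
            else:
                # Context line: present in both old and new file.
                old_left -= 1
                new_left -= 1
            continue
        header = HUNK_HEADER.match(line)
        if header:
            old_left = int(header.group(1) or 1)
            new_left = int(header.group(2) or 1)
    return added, removed
```

Splitting the input per file (on the '---'/'+++' header pairs that
appear outside hunks) and combining the counts with the author of each
revision would be the next step.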

> I don't believe I am the first to have
> wanted to do this, and I don't believe I'm the only one that would
> release any programs or scripts I write to extract the data. Does
> anyone know of anything that could be helpful to me? Whatever platform
> the program is on, I'll find a way to run it!
> Otherwise, I guess it's time to open the Camel Book again and brush up
> on my regular expressions!
> Thanks ever so much for any help, guys,
> Chris Lewis

        Good luck,
        -- Mark
