[cocom] Add repository level analysis option for CoCom backend #38
There can be two scenarios: a deleted file, or a renamed file (including one that has moved to a different directory). These changes handle both cases. Alter test data to accommodate the results for deleted files.
Signed-off-by: inishchith <inishchith@gmail.com>
Thank you @inishchith for sharing the idea of an analysis at repository level. Overall it looks interesting, however I have some doubts about some implementation choices.
Instead of passing a flag to trigger the analysis at repo level, wouldn't it be better to introduce a new category that performs this kind of analysis? What do you think?
The variable self.history collects the cocom analysis for each file. The solution seems to work well when performing the initial fetch, but I'm not sure it would work for incremental fetches. Could you explain the logic in case I'm missing something?
A different approach could be to execute the analyzer over the full repo for each commit and then sum up the results obtained. Maybe the param -t could speed up the analysis. What do you think?
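The "sum up the results" alternative above can be sketched roughly as follows. This is only an illustration, not Graal's actual code: the function name `summarize_snapshot` and the metric keys (`loc`, `ccn`, `num_funs`) are assumptions, and the real per-file result shape may differ. It assumes the analyzer has already produced one dictionary of numeric metrics per file present at a given commit.

```python
from typing import Dict, List

# Hypothetical per-file result shape; the real CoCom fields may differ.
FileMetrics = Dict[str, int]


def summarize_snapshot(per_file: List[FileMetrics]) -> FileMetrics:
    """Sum each numeric metric over every file present at one commit."""
    totals: FileMetrics = {}
    for metrics in per_file:
        for name, value in metrics.items():
            totals[name] = totals.get(name, 0) + value
    return totals


# Two made-up per-file results for a single commit snapshot.
snapshot = [
    {"loc": 120, "ccn": 14, "num_funs": 6},
    {"loc": 45, "ccn": 3, "num_funs": 2},
]
repo_level = summarize_snapshot(snapshot)
```

Because each commit is summarized from a full snapshot, this variant would not depend on state carried over from earlier commits, which is the concern raised for incremental fetches; the trade-off is re-analyzing every file at every commit.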
@valeriocos Sorry for the delayed response.
I'll update you once I've worked on the evaluation. Let me know what you think!
No worries @inishchith, thank you for answering.
If the data has a different shape, it's probably better to use a different category. However, we can proceed without adding a new category and change the code afterwards if needed :)
The idea is to produce repository-level analysis for the CoCom backend using a history of files and picking the latest values of the available results.
Signed-off-by: inishchith <inishchith@gmail.com>
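A minimal sketch of the "history of files, latest values" idea, including the deleted and renamed cases mentioned in the earlier commit. The class `RepoHistory`, the change tuple layout, the action codes, and the metric keys are all hypothetical and chosen for illustration; the PR's actual `self.history` structure may look different.

```python
from typing import Dict, List, Optional, Tuple

FileMetrics = Dict[str, int]
# Hypothetical change record: (action, path, new_path, metrics).
# action is "A" (added), "M" (modified), "D" (deleted) or "R" (renamed/moved).
Change = Tuple[str, str, Optional[str], Optional[FileMetrics]]


class RepoHistory:
    """Keeps the latest known metrics per file path across commits."""

    def __init__(self) -> None:
        self.latest: Dict[str, FileMetrics] = {}

    def apply_commit(self, changes: List[Change]) -> FileMetrics:
        for action, path, new_path, metrics in changes:
            if action == "D":
                # A deleted file no longer contributes to the totals.
                self.latest.pop(path, None)
            elif action == "R" and new_path is not None:
                # A renamed or moved file keeps its old values unless new ones exist.
                previous = self.latest.pop(path, None)
                self.latest[new_path] = metrics if metrics is not None else (previous or {})
            else:
                self.latest[path] = metrics or {}
        return self.totals()

    def totals(self) -> FileMetrics:
        totals: FileMetrics = {}
        for metrics in self.latest.values():
            for name, value in metrics.items():
                totals[name] = totals.get(name, 0) + value
        return totals
```

Note that this sketch keeps the history in memory only, so a run that starts from a later commit would begin with an empty history and under-count files it has not seen; that is the incremental-fetch question raised in the review.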
1d04905 to 159cf41
@valeriocos I had a thought over the As pointed by you above (about lizard's worker thread for repository-level analysis):
Here the
(there's a divergence because Perceval has a lot more files than the other repositories under consideration). Let me know what you think. Thanks :)
Closing in reference to #39
@valeriocos As discussed, this adds repository-level analysis as an option to the CoCom backend.
For now, this is being added to make repository-level analysis and visualization easier to carry out in Kibana.
We can discuss the limitations this might cause in the future and any edge cases that I might have missed in the implementation.
Results of comparison can be found in #36
Edit: This is just a rough idea (implementation)
Some things that need to be worked on: