Clone Digger

 discovers duplicate code in Python and Java

Clone Examples

We have tested Clone Digger on sources of several open-source projects. There are following drawbacks of the current report format:

  • Differences are highlighted using diff algorithm. This way of highlighting has nothing common with the abstract syntax tree based algorithm of comparing sequences of statements for similarity.
  • Class and function comments are taken into account during the computation of similarity, but they are not presented in the output. Therefore some clone pairs can look equal but marked as convergent.

BioPython project
NLTK project