Sorry if this is off topic, but here is your chance to reduce the number of "homework" on this site :-)
I teach a C programming class, where students work with a small library of numerical routines in C. This year, source files from several student groups had a significant amount of code duplication.
(Up until the debug printf reports are identically deferred. I mean, how can you not be.)
I know that Git can detect that two source files are similar to each other beyond a certain threshold, but I am never a manager to get them to work with two source files that are not in the Git repository.
Keep in mind that these are not particularly difficult students. They are unlikely to encounter the problem of changing variable / function names.
Is there a way that I can use Git to detect significant and literal duplication of aka plagiarism code? Or is there some other tool that you could recommend for this
source share