I am analyzing some old code. This is about 80,000 lines of old plsql code. At first glance, there is quite some duplication in the source that needs to be removed. Instead of doing a diff manual and looking at each file, there should be some tool / command line to detect duplicate lines of source code.
My goal is to give a reasonable assumption about the minimum size of the source rewriting and how much factual knowledge was recorded in this program. I wrote some basic static code analyzer to find the number of IF IFSE FOR etc and Functions control statements in each file. But duplicate code still needs to be removed from my statistics.
source share