We have many |(pipe) split flat files that we process daily in SQL Server using the SSIS package. Each flat file is divided into a header section, a content section, and a footer section. We regularly get a newer version of the same files. We are trying to implement file comparison functionality between two versions of the same file in order to reduce the processing load.
Which method will be more efficient?
Saving both versions of the same file in separate SQL Server tables with a checksum column and filtering rows for which the checksum values do not match.
Implementing similar checksum logic in C # or any other comparison algorithm available in C #.
You can offer any other new algorithm to achieve the same.
source
share