Trimming old backups in stages

Question

Trimming old backups in stages

I am looking for a way to reduce old backups. Backups are performed daily, and I want to increase the interval when backups get older.

After a couple of days, I want to delete the daily backups, leaving only a “Sunday” backup. After a couple of weeks, only the first backup of the month that is available should be deleted.

Since I am dealing with historical backups, I cannot just change the naming scheme.

I tried using "find" for it, but could not find the parameters I needed.

Has anyone got anything that could help?

+4

linux backup

Flyhard Mar 07 '11 at 10:36

source share

3 answers

yup for example

 find -type f -mtime 30

details - http://www.gnu.org/software/findutils/manual/html_mono/find.html#Age-Ranges

+1

ajreal Mar 07 '11 at 10:48

source share

I developed a solution for my similar needs on top of the @ajreal starting point. My backups are called "backup-2015-06-01T01: 00: 01" (using date "+%Y-%m-%dT%H:%M:%S" ).

Two simple steps: touch the files to use the glob shell template for the first day of every month, and use find and xargs to delete anything more than 30 days.

 cd $BACKUPS_DIR # touch backups from the first of each month touch *-01T* # delete backups more than 30 days old echo "Deleting these backups:" find -maxdepth 1 -mtime +30 find -maxdepth 1 -mtime +30 -print0 | xargs -0 rm -r

0

Dave burt Jun 01 '15 at 6:54

source share

sarnold · Accepted Answer · 2011-03-07T11:25:21+0000

I know this is historical data, but you may prefer a naming scheme to help solve this problem. It can be much easier to solve this problem in two passes: first rename the directories based on the date, then select the directories to save them in the future.

You can make a quick approximation if all the dates in the directory in ls -l are displayed fairly well:

 ls -l | awk '{print "mv " $8 " " $6;}' > /tmp/runme

Take a look at /tmp/runme , and if it looks good, you can run it with sh /tmp/runme . You might want to trim records or something like that before you.

If all backups are stored in named directories, for example:

 2011-01-01/ 2011-01-02/ 2011-01-03/ ... 2011-02-01/ 2011-02-02/ ... 2011-03-07/

then your problem will be reduced to calculating the names to save and delete. This problem is much easier to solve than searching all your files and trying to choose which ones to save and delete based on when they were made. (See Output date "+%Y-%m-%d" for a quick way to create such a name).

Once they are named conveniently, you can save the first backup of each month using a script as follows:

 for y in `seq 2008 2010` do for m in `seq -w 1 12` do for d in `seq -w 2 31` do echo "rm $y-$m-$d" done done done

Save its output, check it :) and then run the output, similar to renaming the script.

After you have kept the previous backups under control, you can generate 2010 from date --date="Last Year" "+%Y" and other improvements, therefore it processes "once a week" for the current month and saves itself forever in the future.

Trimming old backups in stages

More articles: