Is there a way to run DistCp, but with the option of renaming to filenames? Maybe the easiest way to explain with an example.
Let's say I copy hdfs: /// foo to hdfs: /// bar, and foo contains these files:
hdfs:
hdfs:
hdfs:
and bar contains the following values:
hdfs:
hdfs:
Then after the copy, I would like for bar to contain something like:
hdfs:
hdfs:
hdfs:
hdfs:
hdfs:
If there is no such option, what could be the most reliable / efficient way to do this? My own adult version of distcp could certainly do this, but it looks like it could be a lot of work and quite error prone. Basically, I'm not interested in file names at all, only their directory, and I want to periodically copy large amounts of data into the "consolidation" directory.