Why not create some datasets yourself?
A very simple task is to fill the file with millions of random numbers, and then use Hadoop to find duplicates, triples, primes, numbers that have duplicates in their factors, etc.
Of course, it's not as fun as making mutual friends on Facebook, but this is enough to get some Hadoop practice.
rolve source share