This is certainly doable. Best to work with Hadoop Streaming . As stated on this website:
Hadoop streaming is a utility that comes with the Hadoop distribution. The utility allows you to create and run map / reduce jobs with any executable file or script as a converter and / or reducer.
I usually start with external code inside Hadoop Streaming. Depending on your language, there are many good examples of how to use it in streaming; as soon as you get to your language of choice, you can usually transfer data to another program, if necessary. I had several levels of programs in different languages โโthat played beautifully without any extra effort than if I ran them on a regular Linux box, except that the outer layer worked with Hadoop Streaming.
source share