shanthanu, your first question
Q) what scripting language is good for hadoop?
A) Most scripting languages ββlike php, python, perl, ruby ββbash are good. Any language that can be read using stdin, written on sdtout and on the parsing tab, and on new string characters will work: Hadoop Streaming simply passes the string representations of key value pairs in the form of concatenation with the tab to any program that should be executed on each node task tracker.
Most linux distributions used to configure hadoop, python, bash, ruby, perl ... clusters are already installed, but nothing will stop you from minimizing your own runtime for your favorite script or compiled programming language.
Q) PHP is not multithreaded?
A) yes, but there are ways we can make PHP multithreaded. For example, use: pnctl_fork () (but this does not work in windows)
The question you should always keep in mind before moving on to scripting languages ββwith hadoop is not "what scripting language?" because everything is in order.
But the difference between java and the scripting language is "Heart Beat of the child nodes will not be sent to the parent nodes when we use the scripting languages."
source share