I am using the hortonworks sandbox.
topic creation :
./kafka-topics.sh
apache access log directory lag :
tail -f /var/log/httpd/access_log |./kafka-console-producer.sh --broker-list 10.25.3.207:6667 --topic lognew
On another terminal (from kafka bin), run the user :
./kafka-console-consumer.sh --zookeeper 10.25.3.207:2181 --topic lognew --from-beginning
Apache access logs are sent to kafka's " lognew " topic.
I need to save them in HDFS.
Any ideas or suggestions on how to do this.
Thanks in advance. Deepthy
source share