I am using Amazon Stretch Card.
I have log files that look something like this.
random text foo="1" more random text foo="2"
more text notamatch="5" noise foo="1"
blah blah blah foo="1" blah blah foo="3" blah blah foo="4" ...
How can I write a pig expression to highlight all the numbers in "foo" expressions?
I prefer tuples that look something like this:
(1,2)
(1)
(1,3,4)
I tried the following:
TUPLES = foreach LINES generate FLATTEN(EXTRACT(line,'foo="([0-9]+)"'));
But this gives only the first match on each line:
(1)
(1)
(1)
source
share