Apache Hive RegexSerDe: data types

For log processing, I want to use the Apache Hive RegexSerDe, but every example I have found uses STRING as the data type for all table columns.

Now my question is: are other data types such as dates, integers, and arrays supported, or can the columns only be strings?

In this example (and others), only strings are used:

CREATE TABLE access_log (
  remote_ip STRING,
  request_date STRING,
  method STRING,
  request STRING,
  protocol STRING
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
WITH SERDEPROPERTIES (
"input.regex" = "([^ ]*) .* .* \\[([^\\]]+)\\] \"([^ ]*) ([^ ]*) ([^ \"]*)\" *",
"output.format.string" = "%1$s %2$s %3$s %4$s %5$s"
)
STORED AS TEXTFILE;

2 answers
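
The contrib RegexSerDe named in the question accepts only STRING columns and rejects any other column type when the table is initialized. One common workaround is to keep the raw table as strings and convert the values in a view on top of it. The following is a minimal sketch, not a definitive solution: the view name access_log_typed and the aliases request_ts and request_parts are hypothetical, and the date pattern assumes Apache-style timestamps such as 10/Oct/2000:13:55:36 -0700.

```sql
-- Sketch only: access_log_typed, request_ts and request_parts are
-- hypothetical names; access_log and its columns come from the question.
CREATE VIEW access_log_typed AS
SELECT
  remote_ip,
  -- Parse the string date into a TIMESTAMP.
  -- Assumes the format dd/MMM/yyyy:HH:mm:ss Z (e.g. 10/Oct/2000:13:55:36 -0700).
  CAST(from_unixtime(unix_timestamp(request_date, 'dd/MMM/yyyy:HH:mm:ss Z')) AS TIMESTAMP) AS request_ts,
  method,
  request,
  -- split() returns an ARRAY<STRING>, here the path segments of the request.
  split(request, '/') AS request_parts,
  protocol
FROM access_log;
```

Queries can then read from the view and work with typed values, while the underlying table stays compatible with the SerDe. Rows whose date does not match the assumed pattern yield NULL in request_ts, so it is worth checking a sample of the data first.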


Source: https://habr.com/ru/post/1523619/

