It Is About Apache Hive, but What Is a SerDe?
The original title of the article is “How-to: Use a SerDe in Apache Hive“, so I knew it was something about Hive, but still had no idea what SerDe is:
The SerDe interface allows you to instruct Hive as to how a record should be processed. A SerDe is a combination of a Serializer and a Deserializer (hence, Ser-De). The Deserializer interface takes a string or binary representation of a record, and translates it into a Java object that Hive can manipulate. The Serializer, however, will take a Java object that Hive has been working with, and turn it into something that Hive can write to HDFS or another supported system. Commonly, Deserializers are used at query time to execute SELECT statements, and Serializers are used when writing data, such as through an INSERT-SELECT statement.
On one side we have the Spring frameworks with names like PreAuthenticatedGrantedAuthoritiesWebAuthenticationDetails1, then we have YouAreDeadException and end with SerDe. No middle ground in the Java world.
-
Jacek found this Spring class name which has 59 characters. His post is from 2011, so who knows if there isn’t a longer one since then. ↩
Original title and link: It Is About Apache Hive, but What Is a SerDe? (©myNoSQL)
via: http://blog.cloudera.com/blog/2012/12/how-to-use-a-serde-in-apache-hive/