Hive has built-in support for gzipped text files so you don't need to do
anything special with them (meaning you just load them like regular files).
However, this approach has one important limitation: gzipped files are not
splittable, so each file is read in full by a single map task. Depending on
the size of your log files and how many of them you have per day, that may
not be a problem. The other, slightly more
complicated approach is to use sequence files, which support splittable
compression. You can find more details here:
https://cwiki.apache.org/confluence/display/Hive/CompressedStorage
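
For reference, here is a rough sketch of both approaches, along the lines of
the example on that wiki page. The table names (raw, raw_sequence) and the
file path are placeholders for illustration only, and a single STRING column
is assumed, so adjust them to match your actual log layout.

```sql
-- Approach 1: load the gzipped text file directly into a plain text table.
-- Hive decompresses it on read, but the file cannot be split across mappers.
CREATE TABLE raw (line STRING)
  ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' LINES TERMINATED BY '\n';

LOAD DATA LOCAL INPATH '/tmp/weblogs/access.log.gz' INTO TABLE raw;  -- placeholder path

-- Approach 2: rewrite the data into a block-compressed sequence file,
-- which remains splittable.
CREATE TABLE raw_sequence (line STRING)
  STORED AS SEQUENCEFILE;

SET hive.exec.compress.output=true;
SET io.seqfile.compression.type=BLOCK;  -- one of NONE, RECORD, BLOCK
INSERT OVERWRITE TABLE raw_sequence SELECT * FROM raw;
```

With this setup the text table only acts as a staging area: queries would
then run against raw_sequence, where large files can be divided among
several map tasks.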
On Mon, Nov 11, 2013 at 5:09 PM, Raj Hadoop <[EMAIL PROTECTED]> wrote: