clear query| facets| time Search criteria: .   Results from 31 to 40 from 348 (0.0s).
Loading phrases to help you
refine your search...
why my test result on dfs short circuit read is slower? - Hadoop - [mail # user]
...On Feb 17, 2013, at 7:09 PM, "Liu, Raymond"  wrote:  > io.file.buffer.size   Drop this down to 64KB not 128KB.   You have 16 cpu which really means 8 cores and 4 disks...
   Author: Michael Segel , 2013-02-18, 10:16
[expand - 1 more] - Sorting huge text files in Hadoop - Hadoop - [mail # user]
...Why do you need a 1TB block?   On Feb 15, 2013, at 1:29 PM, Jay Vyas  wrote:  > well.. ok... i guess you could have a 1TB block do an in place sort on the file, write it to...
   Author: Michael Segel , 2013-02-15, 20:09
How to handle sensitive data - Hadoop - [mail # user]
...Simple, have your app encrypt the field prior to writing to HDFS.   Also consider HBase.  On Feb 14, 2013, at 10:35 AM, abhishek  wrote:  >  >> Hi all, >...
   Author: Michael Segel , 2013-02-15, 13:47
Mainframe to ASCII conversion - Hadoop - [mail # user]
...Depends.   If the data is straight EBCDIC you have somewhat splittable data, however its really better to do this in a single stream.  If the data is COMP-3 (Zoned and Packed Data)...
   Author: Michael Segel , 2013-02-11, 17:01
How can I limit reducers to one-per-node? - Hadoop - [mail # user]
...Adding a combiner step first then reduce?    On Feb 8, 2013, at 11:18 PM, Harsh J  wrote:  > Hey David, >  > There's no readily available way to do this tod...
   Author: Michael Segel , 2013-02-11, 01:30
Generic output key class - Hadoop - [mail # user]
...Why not just write out the int as a numeric string?   On Feb 10, 2013, at 1:07 PM, Sandy Ryza  wrote:  > Hi Amit, >  > One way to accomplish this would be to cre...
   Author: Michael Segel , 2013-02-11, 01:18
Select Linux Distro for Hbase - Hadoop - [mail # general]
...RedHat, or Centos is the best.  (Its the same thing... well sort of... ;-)   You can use other distros but YMMV and you need to make sure that you're not using the Open Source JDK ...
   Author: Michael Segel , 2013-01-23, 14:14
On a lighter note - Hadoop - [mail # user]
...I'm thinking 'Downfall'  But I could be wrong.  On Jan 17, 2013, at 6:56 PM, Yongzhi Wang  wrote:  > Who can tell me what is the name of the original film? Thanks! >...
   Author: Michael Segel , 2013-01-18, 01:03
[expand - 1 more] - How does hadoop decide how many reducers to run? - Hadoop - [mail # user]
...Since you are using EMR,  AWS pre configures the number of slots per node.  So you are already getting the optimum number of slots that their 'machines' can handle.   So when ...
   Author: Michael Segel , 2013-01-12, 14:05
queues in haddop - Hadoop - [mail # user]
...He's got two different queues.   1) queue in capacity scheduler so he can have a set or M/R tasks running in the background to pull data off of...  2) a durable queue that receives...
   Author: Michael Segel , 2013-01-11, 15:06