clear query| facets| time Search criteria: .   Results from 1 to 10 from 41 (0.0s).
Loading phrases to help you
refine your search...
Hadoop Data Modeling Tutorial? - Hadoop - [mail # user]
...Checkout my book, Agile Data Science: http://shop.oreilly.com/product/0636920025054.doIn Chapter 3, we get you up and going from "I know how to program but knownothing about Hadoop" to "I ju...
   Author: Russell Jurney , 2015-08-06, 00:14
Joins in Hadoop - Hadoop - [mail # user]
...You are insane to do this with mapreduce. Use Pig or Hive, or Spark andperform a join. This will take you less than ten minutes, including thetime to download and install pig or hive and run...
   Author: Russell Jurney , 2015-06-25, 07:02
Interview Questions asked - Hadoop - [mail # user]
...Diagram/code a mapreduce join.On Thursday, February 12, 2015, Krish Donald  wrote:> Hi,>> Does anybody has interview questions which was asked during their> interview on had...
   Author: Russell Jurney , 2015-02-12, 17:49
Spark vs Tez - Hadoop - [mail # user]
...Check out PySpark. No Scala required.On Friday, October 17, 2014, Adaryl "Bob" Wakefield, MBA <[EMAIL PROTECTED]> wrote:>   “The only problem with Spark adoption is the steep l...
   Author: Russell Jurney , 2014-10-18, 12:38
Reading json format input - Hadoop - [mail # user]
...Seriously consider Pig (free answer, 4 LOC):  my_data = LOAD 'my_data.json' USING com.twitter.elephantbird.pig.load.JsonLoader() AS json:map[]; words = FOREACH my_data GENERATE $0#'auth...
   Author: Russell Jurney , 2013-05-29, 22:13
[expand - 1 more] - Accumulo and Mapreduce - Hadoop - [mail # user]
...You can chain MR jobs with Oozie, but would suggest using Cascading, Pig or Hive. You can do this is a couple lines of code, I suspect. Two map reduce jobs should not pose any kind of challe...
   Author: Russell Jurney , 2013-03-04, 18:52
[expand - 1 more] - how to find top N values using map-reduce ? - Hadoop - [mail # user]
...Maybe look at the pig source to see how it does it?  Russell Jurney http://datasyndrome.com  On Feb 1, 2013, at 11:37 PM, praveenesh kumar  wrote:  > Thanks for that R...
   Author: Russell Jurney , 2013-02-02, 08:10
building a department GPU cluster - Hadoop - [mail # user]
...Hadoop streaming can do this, and there's been some discussion in the past, but it's not a core use case. Check the list archives.  Russell Jurney http://datasyndrome.com  On Jan 1...
   Author: Russell Jurney , 2013-01-18, 00:24
Map-Reduce V/S Hadoop Ecosystem - Hadoop - [mail # user]
...Hourly consultants may prefer MapReduce. Everyone else should be using Pig, Hive, Cascading, etc.  Russell Jurney twitter.com/rjurney   On Nov 7, 2012, at 8:08 PM, yogesh dhari &nb...
   Author: Russell Jurney , 2012-11-07, 20:48
[expand - 2 more] - reference architecture - Hadoop - [mail # user]
...You just made my year. Let me know how I can make it better (off list).  Russell Jurney twitter.com/rjurney   On Oct 29, 2012, at 2:17 PM, "Daniel Käfer"  wrote:  > Th...
   Author: Russell Jurney , 2012-10-29, 23:26