clear query| facets| time Search criteria: .   Results from 1 to 10 from 1064 (0.0s).
Loading phrases to help you
refine your search...
Joining more than two PTables in a single MR job - Crunch - [mail # user]
...That is correct, the Cogroup will load all of the values for the key intomemory-- is this not a situation where a combination of aMapSideJoinStrategy plus another JoinStrategy will do what y...
   Author: Josh Wills , 2018-08-31, 06:46
CombineFn not run as Combiner - Crunch - [mail # user]
...It feels to me that the answer to that is yes-- there is some reason why Icannot do a combiner with a map-side output; but for the life of me Icannot remember what it was at the moment.On Th...
   Author: Josh Wills , 2018-06-01, 01:19
[expand - 6 more] - AvroParquetPathPerKeyTarget with Spark - Crunch - [mail # user]
...CRUNCH-670 is the issue, FWIWOn Fri, May 25, 2018 at 1:39 PM, Josh Wills  wrote:> Ah, of course-- nice detective work! Can you send me the code so I can> patch it in?>> On F...
   Author: Josh Wills , 2018-05-25, 20:40
[expand - 1 more] - Support for reading 4mc compressed text input files in apache crunch - Crunch - [mail # user]
...Ah, okay-- yeah, I could see where it would be a problem if it's notextending FileInputFormat.On Thu, May 24, 2018 at 12:52 PM, Suyash Agarwal wrote:> Hi,>> On trying something like...
   Author: Josh Wills , 2018-05-24, 21:40
[CRUNCH-670] Make the AvroPathPerKeyTarget work with the SparkRuntime - Crunch - [issue]
...There is an issue where the AvroPathPerKeyTarget won't properly copy the output of a Spark pipeline from the temp directory to the target directory because it assumes it will always get a va...    Author: Josh Wills , 2018-05-24, 05:49
[CRUNCH-669] Add an option to preserve Crunch temp directories - Crunch - [issue]
...I have a problem where a Crunch client can potentially get killed through no fault of its own (e.g., an Airflow task failing a heartbeat check), which will kill the client, but leave the MR ...    Author: Josh Wills , 2018-04-30, 19:59
IncompatibleClassChangeError trying to run spark pipeline - Crunch - [mail # user]
...It means that a hadoop1 dependency is getting into the jar somehow,although it's not obvious to me you have a dependency tree you cantease apart?On Thu, Apr 26, 2018 at 12:17 PM, Da...
   Author: Josh Wills , 2018-04-26, 22:08
[expand - 1 more] - Reading from HCat as Avro - Crunch - [mail # dev]
...Yeah, that makes a ton of sense; I don't know how much a non-generic,Avro-only solution would be used outside of this specific context, where ateam has all of their data in Avro already. con...
   Author: Josh Wills , 2018-03-22, 04:07
[CRUNCH-661] Make DataBaseSource.Builder methods public - Crunch - [issue]
...Looks like the methods on the DataBaseSource.Builder class were left as package protected by default due to an oversight; this is a quick fix to make them public....    Author: Josh Wills , 2018-01-18, 21:52
[expand - 1 more] - DataBaseSource.Builder methods are not public - Crunch - [mail # user]
... for this; I'm going to runtests and then merge to master. Thanks for the heads up Michael!On Thu, Jan 18, 2018 at 1:08 PM, Josh Wills  w...
   Author: Josh Wills , 2018-01-18, 21:14