Results 21 to 30 of 7902.
Figuring out to which CombineFileSplit the input record of DoFn process each record belongs to - Crunch - [mail # user]
...We recently came into a situation where we had this same sort of need. I logged with a potential solution. I'm curious for any feed...
   Author: Ben Roling, Marcin Michalski, ..., 2018-01-30, 20:22
[CRUNCH-662] Improve KafkaSource error handling - Crunch - [issue]
...The KafkaSource and specific KafkaRecordReader does not handle errors and empty reads as well as it could. The code could be improved to better handle these errors and appropriately retry ...
   Author: Bryan Baugher, 2018-01-25, 16:11
[CRUNCH-661] Make DataBaseSource.Builder methods public - Crunch - [issue]
...Looks like the methods on the DataBaseSource.Builder class were left as package protected by default due to an oversight; this is a quick fix to make them public....
   Author: Josh Wills, 2018-01-18, 21:52
DataBaseSource.Builder methods are not public - Crunch - [mail # user]
... for this; I'm going to run tests and then merge to master. Thanks for the heads up Michael! On Thu, Jan 18, 2018 at 1:08 PM, Josh Wills w...
   Author: Josh Wills, Michael Linthicum, ..., 2018-01-18, 21:14
[CRUNCH-654] KafkaSource should use new Kafka Consumer API instead of Simple Consumer - Crunch - [issue]
...We should update the KafkaSource to use the modern Consumer API. The old Consumer used by KafkaUtils#getBrokerOffsets(...) is considered deprecated as of Kafka 0.11 (https://issues.apache.or...
   Author: Andrew Olson, 2017-12-11, 18:14
[CRUNCH-660] FileTargetImpl uses Distcp vs FileUtils.copy - Crunch - [issue]
...So for handling multiple runtimes I'm not sure there is a way to solve this but documenting as a JIRA regardless. If you are running in a multi-cluster environment where you might want to rea...
   Author: Micah Whitacre, 2017-12-11, 15:02
[CRUNCH-340] Create HCatSource and HCatTarget - Crunch - [issue]
...This patch adds HCatSource, which enables crunch pipeline to read from Hive tables. This is the very first version, leaving a few TODOs in code. It adds new dependency from crunch-core to hca...
   Author: Chao Shi, 2017-12-10, 16:57
[CRUNCH-659] Upgrade to Hive 2.x - Crunch - [issue]
...I've been working on CRUNCH-340 to finish implementing the HCatSource and HCatTarget. It seems to be in a better place now that crunch only supports hadoop 2. I was looking to target as high...
   Author: Stephen Durfey, 2017-12-08, 16:27
[CRUNCH-657] Can't activate Snappy compression with AvroPathPerKeyTarget - Crunch - [issue]
...Compress.snappy(AvroPathPerKeyTarget) doesn't work. I needed to create my own class that extends AvroPathPerKeyTarget. public class CompressedAvroPathPerKeyTarget extends AvroPathPerKeyTarget ...
   Author: Jihed JOOBEUR, 2017-10-28, 04:19
[CRUNCH-658] Add a way to skip the getSize checks for Sources from object stores - Crunch - [issue]
...Ran into a problem when using Crunch to process a lot of data from S3: the getSize checks can be very slow to run and don't materially add much to the overall processing of a pipeline when t...
   Author: Josh Wills, 2017-10-28, 04:13