Subject: Parsing huge PDF (400Mb, 2700 pages)


CC'ing colleagues on PDFBox.  Any recommendations?

Sergey's recommendation is great for documents that can be parsed via
streaming.  However, PDFBox does not currently parse PDFs in a streaming
mode.  It builds the full document tree.  PDFBox colleagues, please let me
know if I'm wrong.

On Thu, Nov 14, 2019 at 5:51 AM Sergey Beryozkin <[EMAIL PROTECTED]>
wrote: