We are currently processing the PDF document and extracting the content to use them for our analysis. Data scientist use the data as their input put model and rest of them are business directed output. This way it helps us efficient document processing.
This seems to be better options at the moment