Hi,
I'm currently evaluating Alfresco Community.
My main goal is to use Alfresco as a document management system for paperless work.
So I integrated Tesseract to OCR my uploaded TIFF, JPG, and PNG files. It works fine, and all the text ends up in the search index.
What I still need is for Tesseract to also process my uploaded scanned PDF files, since we have a lot of them.
How can I achieve this with Tesseract?
Thank you,
Rene