Automatically extract data from PDF and export to Excel or SQL

Status
Not open for further replies.

Zazism

Distinguished
Nov 28, 2011
3
0
18,510
This is a tall order, the policy statements for the company I work for are received in PDF format. I am looking for a way to extract specific blocks of data ( in the form of tables but they poorly structured ) from within a PDF and export them to a excel spreadsheet or sql. Are there any document management systems or tools able to do this on a large scale? Maybe a program that can scan a page and convert it to a spreadsheet?

Any help would be appreciated
 
Solution
Most of OCR software can be configured to "scan" from a picture or PDF (that's one way PDF documents are converted into editable text). It's another story what you would do with that text. You can start looking at VBA, and try to find out where your data is based on tokens within the scanned documents.
Most of OCR software can be configured to "scan" from a picture or PDF (that's one way PDF documents are converted into editable text). It's another story what you would do with that text. You can start looking at VBA, and try to find out where your data is based on tokens within the scanned documents.
 
Solution
Status
Not open for further replies.