Tools for extracting structured data from a PDF file

These are tools that have been suggested to me for extracting structured data from PDF files:

- pdftotext from xpdf
- CometDocs
- Cogniview PDF2Excel
- Tabula
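As a rough illustration of how the first tool on the list might be used, here is a minimal Python sketch that calls the pdftotext command-line program with its `-layout` option and then splits each output line into cells wherever two or more spaces appear. The helper names (`pdftotext_command`, `split_columns`, `extract_rows`) and the file name `report.pdf` are hypothetical, and the two-space rule is only an assumed heuristic; real tables often need more careful handling.

```python
import re
import shutil
import subprocess


def pdftotext_command(pdf_path):
    """Build the pdftotext invocation (hypothetical helper).

    "-layout" asks pdftotext to preserve the physical page layout,
    and "-" as the output file sends the text to standard output.
    """
    return ["pdftotext", "-layout", pdf_path, "-"]


def split_columns(line):
    """Split one line of layout-preserved text into cells.

    Assumption: columns are separated by runs of two or more spaces.
    """
    return [cell for cell in re.split(r"\s{2,}", line.strip()) if cell]


def extract_rows(pdf_path):
    """Run pdftotext on a PDF and return its non-empty lines as lists of cells."""
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext was not found on PATH")
    result = subprocess.run(
        pdftotext_command(pdf_path),
        capture_output=True,
        text=True,
        check=True,
    )
    return [split_columns(line) for line in result.stdout.splitlines() if line.strip()]


if __name__ == "__main__":
    # "report.pdf" is a placeholder file name for illustration only.
    for row in extract_rows("report.pdf"):
        print(row)
```

Keeping the column splitting separate from the subprocess call makes the heuristic easy to adjust or replace without touching the extraction step.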