These are tools that have been suggested to me to extract structured data from a PDF files:
making a 4up PDF
This is the command that I used to make a 4up (that would be four equally sized images on one page) layout on letter sized paper of a tabloid sized poster. I used the excellent Multivalent tool’s Impose feature.
C:\Documents and Settings\surplus\Desktop>java -Xms32m -Xmx256m -classpath “C:\Documents and Settings\surplus\Desktop\Multivalent20060102.jar” tool.pdf.Impose -nup 4 -dim 2×2 -paper letter -page 1,1,1,1 promposter.pdf
This is the command that I use on my notebook running Ubuntu Gutsy Gibbon:
/usr/lib/jvm/java-6-sun/bin/java -classpath /home/ghing/java/Multivalent.jar tool.pdf.Impose -nup 4 -dim 2x2 -paper letter -page 1,1,1,1 describe_yourself.pdf