pdftorr

NAME
       pdftorr - Portable Document Format (PDF) to RichReader Doc format
       converter

SYNOPSIS
       pdftorr [options] PDF-file ["title"]

DESCRIPTION
       Pdftorr converts Portable Document Format (PDF) files to a
       compressed .pdb file in Doc format with RichReader format codes.

       NOTE: If the PDF file contains compressed text, you will need to
       have the "gzip" program installed somewhere on your PATH.

OPTIONS
       -f number
              Specifies the first page to convert.

       -l number
              Specifies the last page to convert.

       -ascii7
              Convert the text to 7-bit ASCII; the default is to use the
              8-bit ISO Latin-1 character set.

       -raw   Keep the text in content stream order. This is a hack which
              often "undoes" column formatting, etc. This option will
              likely be replaced with something more sophisticated when
              pdftotext is rewritten to use a smarter text placement
              algorithm.

       -t     Keep the intermediate text file.

       -h     Print usage information. (-help is equivalent.)

BUGS
       Some PDF files contain fonts whose encodings have been mangled
       beyond recognition. There is no way (short of OCR) to extract text
       from these files.

AUTHOR
       Pdftorr was created by Michael Arena and is based on the pdftotext
       software and documentation, which are copyright 1996-1999 Derek B.
       Noonburg (derekn@foolabs.com). Source code changes to xpdf are
       available upon request in accordance with the GPL.
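
EXAMPLE
       The following is a minimal sketch of a typical invocation, assuming
       a POSIX shell. The input file name (manual.pdf) and the title
       ("User Manual") are hypothetical; only the flags are taken from
       OPTIONS. The script prints the command it would run, and runs it
       only if pdftorr is found on the PATH.

```shell
#!/bin/sh
# Hypothetical input file and title; only the flags come from OPTIONS.
pdf="manual.pdf"
title="User Manual"

# PDFs with compressed text need gzip on the PATH (see the NOTE in DESCRIPTION).
if ! command -v gzip >/dev/null 2>&1; then
    echo "warning: gzip not found; PDFs with compressed text may fail" >&2
fi

# Convert pages 3 through 10 as 7-bit ASCII and keep the intermediate
# text file. The quoted title is passed as a single argument.
set -- pdftorr -f 3 -l 10 -ascii7 -t "$pdf" "$title"

# Show the command, then run it only if pdftorr is actually installed.
echo "$*"
if command -v pdftorr >/dev/null 2>&1; then
    "$@"
fi
```

       Quoting the title matters: without the quotes, the shell would
       split "User Manual" into two separate arguments.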