PDF -> Text or PDF -> HTML?
I've looked through CPAN for modules that will extract the text from PDF files
and output it to either plain text or HTML. I've found plenty of stuff to
go the other way (PDF::Parse), and plenty of stuff to tell me the properties
of a PDF file (Text::PDF::Utils), but nothing so far that I've seen will
do what I need. The ONLY thing I need from the PDF file is the text:
preferrably in HTML, but I can swing it with plain text also.