By kalanir
via kalanir.blogspot.com
Published: Aug 08 2008 / 13:45
This describes how to extract the text from a PDF document. (In this particular post the text is extracted for indexing purposes).
Comments
giffo replied ago:
or Document doc = LucenePDFDocument.getDocument(new File(filename)); ?
kalanir replied ago:
Thanks. This didn't work for me ( for Lucene 2.3.2 ). Seems it is a problem with the version compatibility.
Voters For This Link (7)
Voters Against This Link (0)