Link Details

By kalanir
via kalanir.blogspot.com
Published: Aug 08 2008 / 13:45

This post describes how to extract the text from a PDF document. In the post, the text is extracted for indexing purposes.

Comments

giffo replied:

or Document doc = LucenePDFDocument.getDocument(new File(filename)); ?

kalanir replied:

Thanks. This didn't work for me (with Lucene 2.3.2). It seems to be a version compatibility problem.
