Full text search within PDFs

Permalink
I've been doing a bit of searching, and there don't seem to be any suitable solutions.

What I'm trying to do is index the content of all of my PDF documents. Since C5 does not do this, what I was thinking of doing was pdftotext'ing all my PDFs, then loading them up into a new table in my C5 DB with MySQL full-text. Then a cronjob to check for any new or updated files on a regular basis to reindex.

So before I embark, a couple of questions:
1. Has anyone done this before?
2. If not, is there any other interest?
3. Would this be achievable in C5 using only add-on packages, or would I likely need to hack into the core?

 
Salixlol replied on at Permalink Reply
Hello there,

a little late, but did you and if how did you manage to achieve a suitable solution? I am kinda facing the same Problem, so thanks for any respond :)

Greetings