[ic] Search content of a PDF file

Kevin Walsh kevin at cursor.biz
Sun May 28 20:33:31 EDT 2006


maillists <lists at gmnet.net> wrote:
> I have a client who has about 150 pdf files, and each file contains
> about 20 - 50 pages of text!
> 
> They want to have a search engine that can search the content of these
> files, and display at least a link to the pdf files for download. 
> 
> Is there a good way for IC to do this kind of search, or should I
> integrate some other system for this into the IC site.
> 
> My first thought was to copy/paste all the content of each file into a
> database. But I was wondering if there was a way to search inside the
> content of the pdf file itself. Speed is not too much an issue.
> 
Swish-e can be used to index the content of PDF files.  Interchange
searches can make use of Swish-e indexes (st=swish).

-- 
   _/   _/  _/_/_/_/  _/    _/  _/_/_/  _/    _/
  _/_/_/   _/_/      _/    _/    _/    _/_/  _/   K e v i n   W a l s h
 _/ _/    _/          _/ _/     _/    _/  _/_/    kevin at cursor.biz
_/   _/  _/_/_/_/      _/    _/_/_/  _/    _/


More information about the interchange-users mailing list