text indexing on Linux?

Jamon Camisso jamon.camisso-H217xnMUJC0sA/PxXw9srA at public.gmane.org
Thu Jul 5 17:23:14 UTC 2012


On 05/07/12 12:31 PM, William Park wrote:
> Hi all,
> 
> Suppose all your files are text files and contain 10 words max.  What
> program would you use to index them based on contents?  That is, given a
> set of words, it has to return the name of files that contain those
> words.
> 
> I know of "updatedb" and "locate", but they index only filenames, not
> the content.  For my need, "grep" is still faster than any SQL solution,
> but I'm curious as to what is the correct approach.

Seems like a document store like mongodb would be a good tool for this
if you have a lot of files.

Jamon
--
The Toronto Linux Users Group.      Meetings: http://gtalug.org/
TLUG requests: Linux topics, No HTML, wrap text below 80 columns
How to UNSUBSCRIBE: http://gtalug.org/wiki/Mailing_lists





More information about the Legacy mailing list