Building cross reference -- how?

D. Hugh Redelmeier hugh-pmF8o41NoarQT0dZR+AlfA at public.gmane.org
Sun Oct 13 17:03:46 UTC 2013


| From: William Park <opengeometry-FFYn/CNdgSA at public.gmane.org>

There are surely many ways to do what you want.  But your problem
description isn't very clear.

Are the terms you want to index a small fixed set?  A relational
database can do that very well, with a unary relation for each term,
and a join for forming the conjunction.  Likely overkill.
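
To make the relational idea concrete, here is a minimal sketch using
Python's sqlite3 module.  The table names (t0, t1, ...), the column name
"doc", and the function files_with_all_terms are made up for
illustration; it assumes you already know which files contain each term.

```python
import sqlite3

def files_with_all_terms(occurrences, terms):
    """occurrences maps term -> iterable of file names containing it.

    Returns the sorted list of files that contain every term in `terms`.
    """
    if not terms:
        return []
    conn = sqlite3.connect(":memory:")
    # One unary relation (single-column table) per term.
    for i, term in enumerate(terms):
        conn.execute(f"CREATE TABLE t{i} (doc TEXT PRIMARY KEY)")
        conn.executemany(
            f"INSERT INTO t{i} VALUES (?)",
            [(name,) for name in set(occurrences.get(term, ()))],
        )
    # Joining the term tables on the shared column forms the conjunction.
    joins = " ".join(f"JOIN t{i} USING (doc)" for i in range(1, len(terms)))
    rows = conn.execute(f"SELECT doc FROM t0 {joins} ORDER BY doc").fetchall()
    conn.close()
    return [row[0] for row in rows]
```

For a handful of terms this is much more machinery than intersecting a
few sets, which is why I call it overkill.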

Is what you want a concordance or KWIC (Key Word In Context) index?
In the old days (1970s) the paper UNIX manuals came with a KWIC index
(the "permuted index").  So there were tools to generate it, such as
ptx.  Now we use man -k instead.
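
In case the term is unfamiliar, a KWIC index lists every occurrence of
every keyword along with the text on either side of it.  A rough sketch
in Python follows; the function names, the stopword list, and the
30-column context width are my own arbitrary choices, not how the old
tools did it.

```python
import re

STOPWORDS = frozenset({"a", "an", "the", "of", "to", "in", "and"})

def kwic(lines, width=30, stopwords=STOPWORDS):
    """Return (key, left, word, right) tuples sorted by lowercased keyword."""
    entries = []
    for line in lines:
        for m in re.finditer(r"[A-Za-z0-9_]+", line):
            word = m.group(0)
            if word.lower() in stopwords:
                continue
            left = line[:m.start()][-width:]   # context before the keyword
            right = line[m.end():][:width]     # context after the keyword
            entries.append((word.lower(), left, word, right))
    entries.sort()
    return entries

def format_kwic(entries, width=30):
    """Right-align the left context so the keywords line up in a column."""
    return [f"{left:>{width}}{word}{right}" for _, left, word, right in entries]
```

Printing the lines from format_kwic() gives the familiar layout with
the keywords in a column down the middle of the page.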

Here's a link that might be relevant:
  <http://www.billposer.org/Linguistics/Computation/LectureNotes/Concordances.html>

What's wrong with the multiple fgrep solution that you came up with?
The answer might help us understand your problem better.

| Is there a program, similar to "updatedb" and "slocate"?

You mean to preprocess your data?

Why do you want one?  Is the processing so slow that preprocessing is
worthwhile?
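
If preprocessing does turn out to be worthwhile, the usual structure is
an inverted index: build a map from each word to the files containing
it once, then answer queries by intersecting sets.  A minimal sketch,
with build_index and lookup as hypothetical names and the documents
assumed to be already read into a dict:

```python
import re
from collections import defaultdict

def build_index(docs):
    """docs maps file name -> text.  Returns word -> set of file names."""
    index = defaultdict(set)
    for name, text in docs.items():
        for word in re.findall(r"[a-z0-9_]+", text.lower()):
            index[word].add(name)
    return index

def lookup(index, terms):
    """Return the sorted file names that contain every term."""
    sets = [index.get(term.lower(), set()) for term in terms]
    if not sets:
        return []
    return sorted(set.intersection(*sets))
```

Note that this matches whole words, case-insensitively, whereas fgrep
matches substrings, so the two will not always agree.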
--
The Toronto Linux Users Group.      Meetings: http://gtalug.org/
TLUG requests: Linux topics, No HTML, wrap text below 80 columns
How to UNSUBSCRIBE: http://gtalug.org/wiki/Mailing_lists




