Open Source Document Retrieval System

Ilya Palagin tux-4CS0UopE6WdBDgjK7y7TUQ at public.gmane.org
Wed Oct 20 05:29:53 UTC 2004


Christopher Browne wrote:
>>If you need some kind of on-line document repository, Owl Intranet
>>Knowledgebase is a great choice:
>>
>>http://sourceforge.net/projects/owl
> 
> 
> Another approach would be to install Wiki software such as TWiki, and
> upload documents into that.  
> 
> Indexing would be handled two ways:
>  - Wikis generally have some "search" scheme;
>  - You can build Wiki documents that provide indexes pointing to
>    documents of interest
> 
> At the office, we have a Zope install; it provides the two indexing
> mechanisms:
>  a) You can do a "text search" throughout the set of documents
>     checked in;
>  b) Each document has to be placed _somewhere_ in a "tree" that tries
>     to organize documents.
Isn't it the same approach?

> 
> I would like to see an "open source Documentum," where the idea would be
> thus:
>  -> You "check in" documents perhaps including some metadata...
> 
>  -> A "spooler/archiver" process takes the documents, extracts
>     whatever metadata is possible ("file" can give some useful data!)
>     and then does two things:
> 
>      1. Stashes the document somewhere, perhaps under an arbitrarily
>         cryptic name;
> 
>      2. Stores the metadata in a database of some sort to assist in
>         searching/selecting documents.

This is almost what Owl is :-).  It creates indexes for uploaded ASCII,
Word and PDF documents, saving them either in a MySQL database or as
files.  It also lets you check documents in and out while you are
updating them.
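
For what it's worth, the "spooler/archiver" idea quoted above can be
sketched in a few lines of Python.  This is only an illustration of the
two steps (stash the file under a cryptic name, record metadata in a
database), not how Owl or Documentum actually work.  The names
check_in, find_by_name and the "documents" table are made up for the
example; SQLite stands in for MySQL, and the mimetypes module is a
rough substitute for what file(1) would report.

```python
import hashlib
import mimetypes
import sqlite3
import time
from pathlib import Path

# Hypothetical schema for the metadata store (an assumption for this sketch).
SCHEMA = """
CREATE TABLE IF NOT EXISTS documents (
    stored_name   TEXT PRIMARY KEY,
    original_name TEXT,
    mime_type     TEXT,
    size_bytes    INTEGER,
    checked_in    REAL,
    notes         TEXT
)
"""


def check_in(src, store_dir, db, notes=""):
    """Archive one document and record its metadata; return the stored name."""
    src = Path(src)
    store_dir = Path(store_dir)
    store_dir.mkdir(parents=True, exist_ok=True)

    data = src.read_bytes()
    # The "arbitrarily cryptic name": here, the SHA-256 digest of the content.
    stored_name = hashlib.sha256(data).hexdigest()
    # Stand-in for file(1): guess the type from the file extension only.
    mime_type, _encoding = mimetypes.guess_type(src.name)

    # Step 1: stash the document somewhere.
    (store_dir / stored_name).write_bytes(data)

    # Step 2: store the metadata to assist in searching/selecting documents.
    db.execute(SCHEMA)
    db.execute(
        "INSERT OR REPLACE INTO documents VALUES (?, ?, ?, ?, ?, ?)",
        (stored_name, src.name, mime_type, len(data), time.time(), notes),
    )
    db.commit()
    return stored_name


def find_by_name(db, fragment):
    """Return (original_name, stored_name) pairs whose name contains fragment."""
    rows = db.execute(
        "SELECT original_name, stored_name FROM documents "
        "WHERE original_name LIKE ?",
        (f"%{fragment}%",),
    )
    return rows.fetchall()
```

Something like check_in("report.txt", "store", sqlite3.connect("meta.db"))
would then copy the file into store/ and add one row of metadata.
Naming the stored copy after a content hash means that checking in the
same bytes twice overwrites rather than duplicates; a real system would
also want versioning and full-text indexing, which this leaves out.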
--
The Toronto Linux Users Group.      Meetings: http://tlug.ss.org
TLUG requests: Linux topics, No HTML, wrap text below 80 columns
How to UNSUBSCRIBE: http://tlug.ss.org/subscribe.shtml




