home *** CD-ROM | disk | FTP | other *** search
- Path: sparky!uunet!mcsun!hp4at.eunet.co.at!actrix!paul
- From: paul@actrix.co.at (Paul Gillingwater)
- Newsgroups: comp.sys.novell
- Subject: Re: Searching documwnts for key words
- Message-ID: <0TcTwB1w165w@actrix.co.at>
- Date: Sun, 03 Jan 93 16:21:20 ECT
- References: <1993Jan2.184211.19489@trentu.ca>
- Distribution: world
- Organization: Home Office in Vienna, Austria
- Lines: 26
-
- ccksb@trentu.ca (Ken Brown) writes:
-
- > In article <Z7FkwB1w165w@actrix.co.at> paul@actrix.co.at (Paul Gillingwater)
- > >jc@i5120h.nrl.navy.mil (J. Clark) writes:
-
- > >> on a LAN, looking for key words. Suppose users need to find
- > >> all docs that contained the words 'NOVELL NETWARE'. I could
- > >> enter it in and let the software go looking. It would be ideal
- > >> to be able to setup predefined paths, etc.
- > >
- > >Two products spring to mind: high end, expensive TOPIC or networkable
- > >DOS-based ISYS. Sorry, don't know vendor addresses.
- >
- > What about the Norton Utilities? The utility ts (text search) will do
- > this for you. There's a variety of other great utilities with it. I
- > think Norton is now produced by Symantec (spelling?).
-
- I guess one reason why I didn't recommend ts or grep is the approach
- they take to searching -- simply a "brute force" linear search. In
- large textbases, this can unduly burden a server. A far better approach
- is to build inverted indexes of all words (eliminating common words like
- "a", "the", "and" etc) and use the indexes only to search.
-
- --
- paul@actrix.co.at (Paul Gillingwater)
- Home Office in Vienna, Austria
-