Search Overview The search engine is bundled with the Sambar Server. All files being indexed must reside under the Sambar Server document directory and be available to the HTTP server. URLs are created by the index server for all files found as part of the indexing task. Should files be removed or new files added, the index must be regenerated. Search indexes can be scheduled to be automatically re-indexes using the System Administration GUI; indexes can be re-generated daily, weekly, or monthly. The indexing process is initiated from the System Administration console of the Sambar Server (WWW interface).
Search Indexer The Search Indexer provides the ability to specify the files to be indexed. The WWW Server must have read access to all the files being indexed. Files may be filtered by file extension, individual files, directory, or by a directory and all its sub-directories. All index files are placed in the search sub-directory located in the installation directory of the Sambar Server. Documents are indexed by file name, file size and last modified date. In addition, in the case of HTML files, the TITLE is parsed and used as the description of the file. In this release, the only weighting used is a count of the number of times a word appears in a document, as well as additional weighting for words appearing in the title or heading. Multiple indexes may be built and individually searched. Additional indexes are defined by editing the search.ini (via the system administration GUI) adding additional search indexes. Each [section] entry in the search.ini file results in an index of that name being made available. The System Administration GUI should be used to manage these entries. Indexes are restricted to files found within the default directory identified in the config.ini file. Directories associated with virtual-hosts cannot presently be indexed. In a future release the ability to search across multiple indexes will be supported as will the ability to index files associated with virtual-host directories.
Stop Word List
Query String
Wildcard Searches Wildcard search patterns are:
* The star (*) character performs an expansive pattern match. Ranking Simple QueriesThe Sambar Search engine ranks the results based on a scoring algorithm; documents with a higher score appear at the head of the ranking list. A document has a higher score if the following hold:
Multiple Indexes Searches can be performed across multiple indexes by providing a space separated list of the indexes to be searched with the indexname parameter to the /session/find search request. |
© 1998 Sambar Technologies. All rights reserved. Terms of Use.