Morten Hulden
Nordic Gene Bank, P.O.
Box 41, 23053 Alnarp, Sweden
WAIS (Wide Area Information Servers) is an information retrieval system for on-line databases on the Internet. WAIS was originally developed by Apple Computers, Thinking Machine Inc. and Dow Jones. Since the release of the first software package WAIS has become a standard for connecting individual databases on the Internet into a network of servers. WAIS has its own protocol in the TCP/IP (Transport Control Protocol/Internet Protocol) stack. Computers running WAIS form a network on Internet similar to the WWW (World Wide Web).
WAIS is optimised for fast searches based on specific search criteria, and thus is very well suited as a complement to downloadable databases. A user does not have to go through the process of downloading a large file, extracting the database from the archive, running local database software and using a decoding table to find information, but can find the answers on-line within a few seconds by using a WAIS client (or a WWW-search form).
WAIS is not a programme. WAIS is a standard for indexing, storing and retrieving information from source documents that can be located anywhere on the Internet. Several programmes that support WAIS are available, both commercial and free software. Some programmes cover a wide range of source file formats such as text files, HTML, BibTex, Medline, as well as user-defined database structures.
A complete WAIS software package contains three pieces of software: the indexer, the server and the client. WAIS clients are available separately for people who only want to retrieve data. The indexer and (optionally) the server are needed by those who want to provide on-line WAIS databases. In some software packages the WAIS server programme can be replaced or enhanced with a CGI (Common Gateway Interface) script that provides a WWW server running on the same machine with direct access to the WAIS index files and source documents.
Pros and Cons
Advantages
Example of a working WAIS system linked with the WWW
Requirements
Hardware: PC, Pentium
200MHz, 64MB RAM, 2GB disk space (but any 386+ will do)
Operating system: Linux
2.0.x, free software (GNU General Public License).
C-compiler: GCC 2.7.2,
free software (GNU General Public License)
Perl interpreter: Perl
5.003, free software (GNU General Public License)
WAIS package: freeWAIS-sf
2.1.2, free software (GNU General Public License)
WWW server: Cern 3.0,
free software (GNU General Public License)
Search engine: SFgate
5.x (enhanced by NGB), free software (GNU General Public License)
The freeWAIS-sf package
FreeWAIS-sf is available
as C source code or as precompiled for several UNIX-like operating systems.
Compilation on the local machine is recommended, since many more configuration
options can be selected before compiling.
The Indexer
The indexer creates index
files with entries for each word appearing within the source documents,
including entries for every word (or number) within database fields. The
indexer is run on the server at least once, or regularly after each update.
Linking WWW and WAIS through SFgate provides a very user-friendly way of searching the databases, since the user does not have to know the field names or the syntax of the query language in order to make a search; it is sufficient to fill in a HTML search form.
With a CGI script like SFgate the result of the searches can be presented as preformatted HTML pages, which in addition to the data extracted from the WAIS databases of course also can include pictures, maps and other graphics. Descriptor values that are coded in the source files can be decoded by the script before the data are displayed to the user.