Department of Mathematics and Computing ScienceWorld Wide Web OrganizationPaul De Bra

Index Databases

Building an index-database is often considered a matter of "inverting" the document database. Instead of a database of documents containing words one creates a database of words, linked to documents.


home red tour

An index-database which contains all information from the indexed documents will be at least as large as the documents themselves. In the World Wide Web of over 100 gigabytes it is not feasible to generate and maintain databases this large. Hence all databases omit some information, and hope this will not be the information users wish to search for.