Introduction

This tutorial is for people who are to make an information retrieval project. It will help you to use ht://Dig to retrieve documents and store them in an inverted index. Furthermore, it will show how you extract information from this inverted index using your own C++ code.

It is expected that people know C++ and a little about information retrieval. Autoconf and Automake is used, but no previous knowledge of these tools are expected.

Firstly, the downloading and compilation of ht://Dig is described. Secondly, how to fill the index. Thirdly, how to extract information from the index. The last is shown via a programming example.

If you have questions regarding this tutorial, the author can be contacted at mads_lindstroem@yahoo.dk.

Beware, that this tutorial is still a beta version and it may therefore still have a lot of errors in it.



Mads Lindstrøm