CSc 571 - Reverse Index Project
The course project for CSc 571 involved the development and use of a reverse index for searching. The dataset recommended for use was a set of articles from Routers. Our team decided to be a bit more ambitious and instead used a 24GB dump of the Wikipedia article content.
The articles were read into an Oracle database using the import component available here. They were then searchable through a GWT web interface provided by the Client side code available here.
For further reading please see the project report (pdf).