December 6, 2019

229 words 2 mins read

pisa-engine/pisa

pisa-engine/pisa

PISA: Performant Indexes and Search for Academia

repo name pisa-engine/pisa
repo link https://github.com/pisa-engine/pisa
homepage https://pisa.readthedocs.io/en/latest/
language C++
size (curr.) 34851 kB
stars (curr.) 232
created 2018-09-26
license Apache License 2.0

PISA: Performant Indexes and Search for Academia v0.8.0

Build Status codecov Documentation Status Codacy Badge GitHub issues GitHub forks GitHub stars PRs Welcome DOI

Join us on Slack

Get in touch via Slack: Slack

Description

PISA is a text search engine able to run on large-scale collections of documents. It allows researchers to experiment with state-of-the-art techniques, allowing an ideal environment for rapid development.

Some features of PISA are listed below:

  • Written in C++ for performance;
  • Indexing & Parsing & Sharding capabilities;
  • Many index compression methods implemented;
  • Many query processing algorithms implemented;
  • Implementation of document reordering;
  • Free and open-source with permissive license;

The best way to get started is by reading the official documentation.

If you want to get involved with PISA, please check out our Contributing page.

Reference

Reference to cite when you use PISA in a research paper:

@inproceedings{MSMS2019,
  author    = {Antonio Mallia and Michal Siedlaczek and Joel Mackenzie and Torsten Suel},
  title     = {{PISA:} Performant Indexes and Search for Academia},
  booktitle = {Proceedings of the Open-Source {IR} Replicability Challenge co-located
               with 42nd International {ACM} {SIGIR} Conference on Research and Development
               in Information Retrieval, OSIRRC@SIGIR 2019, Paris, France, July 25,
               2019.},
  pages     = {50--56},
  year      = {2019},
  url       = {http://ceur-ws.org/Vol-2409/docker08.pdf}
}

Credits

PISA is a fork of the ds2i project started by Giuseppe Ottaviano.

comments powered by Disqus