Content-based Document Routing and Index Partitioning for Scalable Similarity-based Searches in a Large Corpus

Appeared in Proceedings of the 13th ACM SIGKDD international conference on Knowledge Discovery and Data Mining (KDD '07).

Publication date:
August 2007

Authors:
Deepavali Bhagwat
Kave Eshghi
Pankaj Mehra

Projects:
Archival Storage
Scalable File System Indexing
Deduplication

Available for download:

Full text:
Download as PDF

Bibtex entry

@inproceedings{bhagwat07-kdd,
  author       = {Deepavali Bhagwat and Kave Eshghi and Pankaj Mehra},
  title        = {Content-based Document Routing and Index Partitioning for
Scalable Similarity-based Searches in a Large Corpus},
  booktitle = {Proceedings of the 13th ACM SIGKDD international conference on
Knowledge Discovery and Data Mining (KDD '07)},
  pages        = {105-112},
  month        = aug,
  year         = {2007},
}
Last modified 27 May 2009