From 92db664b2f37fa16e9c6733cee76953e1e88cfa8 Mon Sep 17 00:00:00 2001 From: Doug Turnbull Date: Mon, 8 Jul 2024 07:46:39 -0400 Subject: [PATCH] Update README.md --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 88a8f28..7fa3243 100644 --- a/README.md +++ b/README.md @@ -36,6 +36,7 @@ pip install searcharray * Search w/ a [phrase w/ edit-distance](https://lucene.apache.org/core/9_6_0/core/org/apache/lucene/search/PhraseQuery.html) by passing slop=N. * Access raw stats arrays in termfreqs / docfreqs methods on the array * Bring your own tokenizer. Pass any (`def tokenize(value: str) -> List[str]`) when indexing. +* Memory map by passing `data_dir` to index for memory mapped index * Accepts any python function to compute similarity. Here's [one similarity](https://github.com/softwaredoug/searcharray/blob/main/searcharray/similarity.py#L103) * Scores the entire dataframe, allowing combination w/ other ranking attributes (recency, popularity, etc) or scores from other fields (ie boolean queries) * Implement's Solr's [edismax query parser](https://github.com/softwaredoug/searcharray/blob/main/searcharray/solr.py) for efficient prototyping