-
-
Notifications
You must be signed in to change notification settings - Fork 708
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
benchmark questions #816
Comments
Hello! The benchmark is on a different repository which has its own set of issues. Anyway, let me answer you here, inline with your questions.
Thank you
The benchmark does not use shingles at all and they would definitely have a huge impact on phrase queries. It is very difficult to decide where to draw the line on this kind of issue.
So about the README bit. It is just not up to date. I wrote that before lucene 8, but lucene 8 introduced an optimization called block wand. The story is more complicated now. Of course, for users, actual throughput (more or less homogenous to average timings under load) or Another thing is that I was a bit scared that in the case of Lucene, the GC may have a very strong effect on the latency figure. This could have lead to endless comments on VM settings and I would prefer to avoid those. In reality, the variance is higher for Lucene but not bad at all. For the hardware, I will open a ticket to capture and display relevant hardware information. Feel free to take that ticket if you want to contribute! |
Closing. |
I love your benchmark site!
But since I noticed all phrase queries seem slower in Lucene, I wondered whether shingles had been used or not? I am not sure what would make the comparison more "fair" but it would be helpful in making the choice to know.
In general, is there a page describing the index configuration, machine used, etc.? And could percentiles for latency be produced instead of just the average? (With a reported query per second?) The readme says Tantivy is "usually faster" than Lucene but the average latency is actually slower as listed (9,963 μs vs 5,849 μs).
Please let me know if I can answer my questions myself somewhere in the code.
The text was updated successfully, but these errors were encountered: