-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add grok or regex support. #1
Comments
Hi @wuranbo. For several reason I think, Log analytics could be a great use case for tantivy. A "grok" preprocessor for our document would be a great addition... in the future. Right now tantivy's document processing is close to inexistant, so it will This does not necessarily mean you should not go on with your project. I don't have a clear idea of what the analysis pipeline API should look like. |
@fulmicoton Totally agree that the pipline is a huge thing needing us to think and design carefully before doing it. And I'm interesting in it. I think my bigger concerne is about the friendly usage of trantivy-cli to more use case(like log) now. About attracting more people to this project at first glance. For now, I think people can only use the two json files which downloaded in the README easily, is not sexy. Besides log, another use case I think should be HTML parser. I think a quick demo usage of log and HTML just implenmented in trantivy-cli (for now) may be enough to together people at first glance. In future, we can fix it. Imageing that, the man first notice trantivy can use trantivy-cli to a 'real' log file in his daily work or some 'real' HTML pages on Internet, then searching it. This is what I image that the thing should be happened when I first saw lucene serval years ago. If it is, may I had been better on lucene. ^_^ |
Like elastic 5.0 have done. https://www.elastic.co/guide/en/elasticsearch/reference/master/grok-processor.html.
With regex support, the tantivy-cli should be more practical, eg. use the Nginx or Apache log directly as input file.
@fulmicoton what about you thought? This is what I want to do with the https://github.com/BurntSushi/fst in my owner project.
So I will take it.
But as a very Rust newbie I may take some time. If you think it is a bad idea, actually I will still do it in my fork to familiar the code base of tantivy. ^_^
The text was updated successfully, but these errors were encountered: