Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(functions): add jaro_winkler string similarity function #16993

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

maxjustus
Copy link
Contributor

@maxjustus maxjustus commented Dec 4, 2024

I hereby agree to the terms of the CLA available at: https://docs.databend.com/dev/policies/cla/

Summary

Adds a jaro_winkler string edit distance function taken from this project https://github.com/joshuaclayton/jaro_winkler

Tests

  • Unit Test
  • Logic Test
  • Benchmark Test
  • No Test - Explain why

Type of change

  • Bug Fix (non-breaking change which fixes an issue)
  • New Feature (non-breaking change which adds functionality)
  • Breaking Change (fix or feature that could cause existing functionality not to work as expected)
  • Documentation Update
  • Refactoring
  • Performance Improvement
  • Other (please describe):

This change is Reviewable

@github-actions github-actions bot added the pr-feature this PR introduces a new feature to the codebase label Dec 4, 2024
@BohuTANG BohuTANG requested a review from TCeason December 4, 2024 01:07
@@ -841,6 +854,190 @@ pub(crate) mod soundex {
}
}

// this implementation comes from https://github.com/joshuaclayton/jaro_winkler
pub(crate) mod jaro_winkler {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The function jaro_winkler and mod jaro_winkler can refactor to file other.rs.


#[test]
fn different_is_zero() {
assert_eq!(jaro_winkler("foo", "bar"), 0.0);
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@maxjustus
Copy link
Contributor Author

ok @TCeason - I've moved the definition to other.rs

@TCeason
Copy link
Collaborator

TCeason commented Dec 12, 2024

ok @TCeason - I've moved the definition to other.rs

Please fmt code

@maxjustus
Copy link
Contributor Author

ok done @TCeason !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
pr-feature this PR introduces a new feature to the codebase
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants