Feature request: date normalizer & format parser #10

Closed
percevalw opened this issue Jan 8, 2024 · 1 comment

Comments

@percevalw
Member

Feature type

Once a date has been extracted by the ML NER module (e.g., eds.ner_crf), it still needs to be normalized, and its format must be extracted so that the date can be shifted before being substituted back into the pseudonymized report.

The normalizer should assign a date attribute (edsnlp's AbsoluteDate) and a date format (either strftime's %d %m %Y or java's dd-mm-yyyy, compatible with pendulum). The choice of format syntax is not trivial: the C standard date format (strftime) does not support case modifiers (a full-letter month will always be rendered in titlecase), nor numbers written out in full letters.
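To make the request concrete, here is a minimal sketch of the normalize, shift, and re-render round trip using only the Python standard library. The names `normalize_date` and `shift_date` are hypothetical and are not part of edsnlp; the sketch assumes numeric day-month-year strings with a single repeated separator, and returns a plain `datetime.date` instead of edsnlp's AbsoluteDate.

```python
import re
from datetime import date, timedelta

# Hypothetical helpers for illustration only (not edsnlp's API).
# Assumption: the date is numeric, in day-month-year order, with a 4-digit year
# and the same separator used twice (space, slash, dot or hyphen).
NUMERIC_DATE = re.compile(r"^(\d{1,2})([ /.\-])(\d{1,2})\2(\d{4})$")


def normalize_date(text):
    """Return (date, strftime_format) for a numeric day-month-year string."""
    match = NUMERIC_DATE.match(text.strip())
    if match is None:
        raise ValueError(f"unsupported date string: {text!r}")
    day, sep, month, year = match.groups()
    value = date(int(year), int(month), int(day))
    # The format keeps the original separator so the shifted date looks alike.
    fmt = f"%d{sep}%m{sep}%Y"
    return value, fmt


def shift_date(text, days):
    """Shift a date string by `days` and render it with its extracted format."""
    value, fmt = normalize_date(text)
    return (value + timedelta(days=days)).strftime(fmt)


print(shift_date("08/01/2024", 10))
print(shift_date("25 12 2023", 10))
```

Because `%d` and `%m` always zero-pad, an input such as `8/1/2024` would come back padded, and full-letter months or spelled-out numbers are not handled at all. These gaps are the kind of limitation of the strftime syntax that the request points out.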

@percevalw
Member Author

Completed in #11
