Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for Unicode normalization output iterators #20

Open
tahonermann opened this issue Mar 3, 2016 · 4 comments
Open

Add support for Unicode normalization output iterators #20

tahonermann opened this issue Mar 3, 2016 · 4 comments
Assignees

Comments

@tahonermann
Copy link
Owner

No description provided.

@tahonermann tahonermann self-assigned this Mar 3, 2016
@ruoso
Copy link

ruoso commented Sep 23, 2016

I would argue that normalization shouldn't be an output iterator, but rather a transform algorithm.

@tahonermann
Copy link
Owner Author

I tend to agree that, in most cases, treating normalization as a transcoding operation is probably what is desirable. Such an interface can be specialized for particular iterators (pointers) to provide higher performance as well. The benefit of an output iterator (or a proxy input iterator) is that the transformation can be done lazily. I think both interfaces have their uses.

@ruoso
Copy link

ruoso commented Sep 25, 2016

I think that is an academical use case. In practice the reason why you need
to normalize the text is because you will perform an operation that
requires the text to be normalized.

Making the normalizing into an output iterator would be technically valid,
but semantically confusing.

Em sáb, 24 de set de 2016 18:14, Tom Honermann [email protected]
escreveu:

I tend to agree that, in most cases, treating normalization as a
transcoding operation is probably what is desirable. Such an interface can
be specialized for particular iterators (pointers) to provide higher
performance as well. The benefit of an output iterator (or a proxy input
iterator) is that the transformation can be done lazily. I think both
interfaces have their uses.


You are receiving this because you commented.
Reply to this email directly, view it on GitHub
#20 (comment),
or mute the thread
https://github.com/notifications/unsubscribe-auth/AAE9K53M6Ig3W4DnWU9QRItO7KXiHy39ks5qtaCmgaJpZM4Hoxuy
.

@tahonermann
Copy link
Owner Author

I can see use cases for wanting to perform an operation on normalized text in a lazy fashion.

Regardless, I agree that a transcoding interface has more potential uses. That falls under issue #4.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants