Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Replace backslashes in URIs #5

Open
jindrichmynarz opened this issue Sep 9, 2014 · 3 comments
Open

Replace backslashes in URIs #5

jindrichmynarz opened this issue Sep 9, 2014 · 3 comments
Assignees
Labels

Comments

@jindrichmynarz
Copy link
Member

Some URLs in the extracted TED dataset that are reused from the source XML data contain backslash characters (i.e. \), which are not allowed to be a part of URI unescaped. For example, backslashes appear in URL values of pc:buyerProfile (BTW, this should be changed to pc:profile). In most cases, this is a typo of replacing the regular forward slash with the backslash.

The solution to this issue could be to replace any backslashes in source URLs by forward slashes. A more conservative option would be to escape the backslashes.

@jindrichmynarz
Copy link
Member Author

In fact, it would be better to just escape the backslash or treat the whole value as literal instead of a URI, because there are values such as www\lovech.bg, where the straight-forward replacement with forward slash cannot be applied.

@michalpomykacz
Copy link
Contributor

Contracs has a value of pc:buyerProfile formatted as literal for now. About older Contracts I suggest change URIs to literals using SPARQL UPDATE query. May I do it?

@jindrichmynarz
Copy link
Member Author

OK. Please update the older data tomorrow or later (the lod2.vse.cz server should be kept highly available due to the LOD2 review).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants