Indication of vowel length interfering with xml parsing #1

wmshort · 2018-12-08T09:44:56Z

The coding scheme for vowel length -- following " for long-vowels and following ' for short vowels, where this indicates etymological an difference -- is interfering with XML parsing. I would suggest a different coding scheme entirely (e.g., following : for long vowels) or, where possible, relying on already-included morphological information for differentiation?

gfranzini · 2018-12-10T16:01:32Z

Which data are you trying to parse? We'll be converting them to long and short accent marks at some point.

wmshort · 2018-12-10T16:20:48Z

This was in the 'formario'. Distinctions of length are marked with single (breve) or double (longum) quotes, which was interfering with attribute strings. As a quick fix i replaced with another symbol, but unicode character might be best? William Michael Short Lecturer in Classics Department of Classics & Ancient History University of Exeter

…

________________________________ From: Greta Franzini <[email protected]> Sent: Monday, December 10, 2018 4:01:32 PM To: CIRCSE/WFL Cc: William Michael Short; Author Subject: Re: [CIRCSE/WFL] Indication of vowel length interfering with xml parsing (#1) Which data are you trying to parse? We'll be converting them to long and short accent marks at some point. — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub<https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FCIRCSE%2FWFL%2Fissues%2F1%23issuecomment-445868139&data=02%7C01%7C%7C000da9609c3248032bbf08d65eb8c1bf%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C636800544944104061&sdata=phIQPiHjHv9pe1XkxWAWJvdccDI90VEulN6CIodxIxg%3D&reserved=0>, or mute the thread<https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAizEouRmZPr4OA9qAI_W4nbErR1lLpWPks5u3oVcgaJpZM4ZJjvm&data=02%7C01%7C%7C000da9609c3248032bbf08d65eb8c1bf%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C636800544944104061&sdata=L1S7GcMfDjZSa3LTxoUXtk80QYSGYbA7CJFQ5RZ6qdk%3D&reserved=0>.

passarom · 2018-12-10T16:40:36Z

Sure. Unicode is best. We will use it. Il giorno 10 dic 2018, alle ore 17:20, William Michael Short <[email protected]<mailto:[email protected]>> ha scritto: This was in the 'formario'. Distinctions of length are marked with single (breve) or double (longum) quotes, which was interfering with attribute strings. As a quick fix i replaced with another symbol, but unicode character might be best? William Michael Short Lecturer in Classics Department of Classics & Ancient History University of Exeter

…

________________________________ From: Greta Franzini <[email protected]<mailto:[email protected]>> Sent: Monday, December 10, 2018 4:01:32 PM To: CIRCSE/WFL Cc: William Michael Short; Author Subject: Re: [CIRCSE/WFL] Indication of vowel length interfering with xml parsing (#1) Which data are you trying to parse? We'll be converting them to long and short accent marks at some point. — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub<https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FCIRCSE%2FWFL%2Fissues%2F1%23issuecomment-445868139&data=02%7C01%7C%7C000da9609c3248032bbf08d65eb8c1bf%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C636800544944104061&sdata=phIQPiHjHv9pe1XkxWAWJvdccDI90VEulN6CIodxIxg%3D&reserved=0>, or mute the thread<https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAizEouRmZPr4OA9qAI_W4nbErR1lLpWPks5u3oVcgaJpZM4ZJjvm&data=02%7C01%7C%7C000da9609c3248032bbf08d65eb8c1bf%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C636800544944104061&sdata=L1S7GcMfDjZSa3LTxoUXtk80QYSGYbA7CJFQ5RZ6qdk%3D&reserved=0>. — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub<https://emea01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FCIRCSE%2FWFL%2Fissues%2F1%23issuecomment-445875620&data=02%7C01%7Cmarco.passarotti%40unicatt.it%7Cbb6172e4568c4ac5671a08d65ebb7393%7Cb94f7d7481ff44a9b5886682acc85779%7C0%7C0%7C636800556527370261&sdata=9VMfi4T8wHtyya3f04qKQAQnWR8YxEmk7diwE0Hu9ok%3D&reserved=0>, or mute the thread<https://emea01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FALwRv3RELnxkyC7TnlCQzuU7Kyr8PekQks5u3onggaJpZM4ZJjvm&data=02%7C01%7Cmarco.passarotti%40unicatt.it%7Cbb6172e4568c4ac5671a08d65ebb7393%7Cb94f7d7481ff44a9b5886682acc85779%7C0%7C0%7C636800556527370261&sdata=qhG8ob3FAao%2BASEKcoMB%2BAKRTfk1DhqVDK%2BWxmCGJMg%3D&reserved=0>. [http://Static.unicatt.it/layout/img/layout/5x1000.gif] Destina il tuo 5 per mille all’Università Cattolica CF 02133120150 www.unicatt.it/5permille<http://www.unicatt.it/5permille/>

gersh0m · 2018-12-10T17:21:57Z

The problem is caused by a sloppy/buggy output of LemLat.
You don't need any UNICODE in here!
You just need to use the corresponding codes for (double)quote as basic XML syntax states...

wmshort · 2018-12-10T17:37:35Z

Agreed, HTML double quote: " William Michael Short Lecturer in Classics Department of Classics & Ancient History University of Exeter

…

________________________________ From: gersh0m <[email protected]> Sent: Monday, December 10, 2018 5:21:57 PM To: CIRCSE/WFL Cc: William Michael Short; Author Subject: Re: [CIRCSE/WFL] Indication of vowel length interfering with xml parsing (#1) The problem is caused by a sloppy/buggy output of LemLat. You don't need any UNICODE in here! You just need to use the corresponding codes for (double)quote as basic XML syntax states... — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub<https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FCIRCSE%2FWFL%2Fissues%2F1%23issuecomment-445898145&data=02%7C01%7C%7Cdfc187cc91d847b8b20208d65ec3fe4a%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C636800593201568755&sdata=Xp8G0tKCl6vph6bmFdeE0HY%2F%2FG3LAfNIQWGFT3sMA88%3D&reserved=0>, or mute the thread<https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAizEoqa2BmYEZpakW5C80BWEIu-LWaruks5u3pg1gaJpZM4ZJjvm&data=02%7C01%7C%7Cdfc187cc91d847b8b20208d65ec3fe4a%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C636800593201568755&sdata=qLoDIiSXxXADWfTanciszt7k0RAyXPgeBQsGEMrErFY%3D&reserved=0>.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Indication of vowel length interfering with xml parsing #1

Indication of vowel length interfering with xml parsing #1

wmshort commented Dec 8, 2018

gfranzini commented Dec 10, 2018

wmshort commented Dec 10, 2018 via email

passarom commented Dec 10, 2018 via email

gersh0m commented Dec 10, 2018

wmshort commented Dec 10, 2018 via email

Indication of vowel length interfering with xml parsing #1

Indication of vowel length interfering with xml parsing #1

Comments

wmshort commented Dec 8, 2018

gfranzini commented Dec 10, 2018

wmshort commented Dec 10, 2018 via email

passarom commented Dec 10, 2018 via email

gersh0m commented Dec 10, 2018

wmshort commented Dec 10, 2018 via email