ISO-20022 validation not possible with new xsd schema
The payment schema for ISO-20022 fails to parse:

lxml.etree.XMLSchemaParseError: Element '{http://www.w3.org/2001/XMLSchema}pattern': The value '[\p{IsBasicLatin}\p{IsLatin-1Supplement}\p{IsLatinExtended-A}€ȘșȚț-[\p{C}]]+' of the facet 'pattern' is not a valid regular expression., line 1434

Here is the XSD part:

<xs:simpleType name="SPSText">
  <xs:restriction base="xs:string">
    <xs:pattern value="[\p{IsBasicLatin}\p{IsLatin-1Supplement}\p{IsLatinExtended-A}€ȘșȚț-[\p{C}]]+"/>
  </xs:restriction>
</xs:simpleType>
Hi Jean!

On December 15, 2021 9:46:25 PM UTC, Jean de Bosset wrote:
The payment schema for ISO-20022

lxml.etree.XMLSchemaParseError: Element '{http://www.w3.org/2001/XMLSchema}pattern': The value '[\p{IsBasicLatin}\p{IsLatin-1Supplement}\p{IsLatinExtended-A}€ȘșȚț-[\p{C}]]+' of the facet 'pattern' is not a valid regular expression., line 1434

Here is the XSD part

<xs:simpleType name="SPSText">
  <xs:restriction base="xs:string">
    <xs:pattern value="[\p{IsBasicLatin}\p{IsLatin-1Supplement}\p{IsLatinExtended-A}€ȘșȚț-[\p{C}]]+"/>
  </xs:restriction>
</xs:simpleType>
Am I interpreting your message (and specifically the word "new" in the subject) correctly as meaning that the latest lxml release, 4.7, behaves differently from the previous one, 4.6.5, for you with respect to the specific XML Schema construct that you provided?

lxml 4.7 uses the latest version 2.9.12 of libxml2 (which implements the XML Schema support; lxml does not do that itself). 4.6 still uses the older 2.9.10 (and actually 2.9.4 on Windows). You may be able to stick with that.

Sadly, libxml2 is currently not maintained, so a fix (assuming that this really is a bug) is unlikely to happen on their side any time soon.

Stefan
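To answer the version question above, it can help to check which libxml2 a given lxml installation is actually using. The following is a minimal sketch; the helper name xml_stack_versions is made up for illustration, while the three version constants are module attributes of lxml.etree.

```python
from lxml import etree


def xml_stack_versions():
    """Return the lxml and libxml2 versions as tuples of integers.

    The helper name is hypothetical; the constants come from lxml.etree.
    """
    return {
        "lxml": etree.LXML_VERSION,
        # libxml2 version that is loaded at runtime
        "libxml2 (runtime)": etree.LIBXML_VERSION,
        # libxml2 version that lxml was compiled against
        "libxml2 (compiled)": etree.LIBXML_COMPILED_VERSION,
    }


for name, version in xml_stack_versions().items():
    print(f"{name}: {'.'.join(str(part) for part in version)}")
```

If the runtime libxml2 turns out to be 2.9.12 and the schema only loads with an older one, staying on the 4.6 series (for example a requirement such as lxml<4.7, assuming the project is installed with pip) would be one way to follow the suggestion above.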
This pattern validates in xmllint from libxml2:

<xs:pattern value="[€ȘșȚț\p{IsBasicLatin}\p{IsLatin-1Supplement}\p{IsLatinExtended-A}-[\p{C}]]+"/>

It seems to perform the same evaluation, although it does not 'fix' the bug.

Paul
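The two pattern variants from this thread can be compared on a given installation with a small script. This is only a sketch: the helper pattern_compiles and the stripped-down schema template are assumptions made for illustration, not part of the ISO-20022 payment schema, and the outcome depends on the libxml2 version in use.

```python
from xml.sax.saxutils import quoteattr

from lxml import etree

# Minimal schema wrapper around the SPSText type quoted in the thread.
XSD_TEMPLATE = """<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:simpleType name="SPSText">
    <xs:restriction base="xs:string">
      <xs:pattern value={pattern}/>
    </xs:restriction>
  </xs:simpleType>
</xs:schema>"""

# Pattern as shipped in the schema (literal characters after the \p blocks).
ORIGINAL = r"[\p{IsBasicLatin}\p{IsLatin-1Supplement}\p{IsLatinExtended-A}€ȘșȚț-[\p{C}]]+"
# Reordered variant (literal characters first).
REORDERED = r"[€ȘșȚț\p{IsBasicLatin}\p{IsLatin-1Supplement}\p{IsLatinExtended-A}-[\p{C}]]+"


def pattern_compiles(pattern):
    """Return True if libxml2 accepts the pattern facet, False otherwise."""
    xsd = XSD_TEMPLATE.format(pattern=quoteattr(pattern))
    doc = etree.fromstring(xsd.encode("utf-8"))
    try:
        etree.XMLSchema(doc)
    except etree.XMLSchemaParseError:
        return False
    return True


for label, pattern in (("original", ORIGINAL), ("reordered", REORDERED)):
    print(label, "compiles:", pattern_compiles(pattern))
```

Running it under both lxml 4.6.x and 4.7 would show whether the behaviour really changed between the bundled libxml2 versions.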
Bob Kline wrote on 16.12.21 at 16:28:
On Thu, Dec 16, 2021 at 1:37 AM Stefan Behnel wrote:
Sadly, libxml2 is currently not maintained, ...
Was there an announcement to that effect since the most recent release (which was in May of this year)?

https://mail.gnome.org/archives/xml/2021-July/msg00000.html

Stefan
On Thu, Dec 16, 2021 at 5:37 PM Stefan Behnel <stefan_ml@behnel.de> wrote:
Bob Kline wrote on 16.12.21 at 16:28:
Was there an announcement to that effect since the most recent release (which was in May of this year)?

https://mail.gnome.org/archives/xml/2021-July/msg00000.html

Bummer, but thanks for the link.

Bob
participants (4)
- Bob Kline
- Jean de Bosset
- Paul Higgs
- Stefan Behnel