multi-saxon
swiftly converts large amounts of XML TEI files into text. Harnessing the power of Saxonica's SaxonC-HE processor (XSLT 2.0+), it handles XSLT 2.0 and 3.0 transformations in parallel. This approach enables users to circumvent some of the limitations of lxml
, which in spite of its speed, operates exclusively within the XSLT 1.0 framework.
- Fast Transformations: Utilize the multiprocessing capabilities of your machine for simultaneous XML transformations.
- Saxon Integration: Seamlessly process XML files using the renowned Saxon processor.
- CSV Output: Generate comprehensive CSV reports containing relevant metadata about the processed XML TEI files.
- Limited Logging Capabilities:
multi-saxon
is optimized for TEI P5 files. I do not plan on extending it to other frameworks.
-
Ensure you have Python 3.x installed on your machine. If not, download and install Python.
-
Clone this repository:
git clone https://github.com/Pantagrueliste/multi-saxon.git