v2.54.0 #1096
Replies: 1 comment
-
Announcement in the OCR-D/Lobby: METS ServerThis solves the problem of concurrent access to the METS file by providing a server that handles all the METS operations so that an arbitrary number of processors can add files at the same time. With the METS server running, pages can now be processed in parallel, no need to split and merge workspaces anymore. In practice, a METS server can be started with the
With the server running, provide
OcrdFile now have separate URL and filename fields.Until now, when downloading files to the workspace, e.g. with With the newest release, we do not change the URL of For developers, this means that the distinction between Web API getting more robustWe've been working on the implementation of the processing parts of the Web API in core. In the processing server, requests are now cached internally, with an optional callback when the jobs is finished. The processing server has a locking mechanism to prevent concurrent access - making full use of the METS server is in the works. Jobs in the processing server can now depend on, i.e. be blocked by, another job. This allows us to model a complete workflow as a set of jobs, also on page-level, that can be pushed as a whole into the processing server - a workflow endpoint that does that and supports the We also changed the way processing workers and processor servers are started. We replaced the For example to start ocrd-tesserocr-recognize worker --queue ... --database ... Cleaned up loggingWe simplified our logging setup in ocrd_utils.logging to make it easier to use OCR-D/core as a library and not interfere with other logging conventions as much. With the newest version, it is no longer necessary to call In practice, this means that we strongly recommend you to use a logging namespace starting with ocrd_utils.configWe've developed a config system for handling the environment variables we support in various places. This should be a transparent change for users, so no changes necessary but makes life easier for core developers, e.g. we generate the list of environment variables supported listed in Other changesThere were also some smaller bug fixes and additions, see the release notes for the details We will release a new ocrd_all with updated core and some related processor updates soon, we will post here when that is available. Thanks to @MehmedGIT, @joschrew, @tdoan2010, @bertsky and all other contributors! |
Beta Was this translation helpful? Give feedback.
-
Added:
OCRD_NETWORK_WORKER_QUEUE_CONNECT_ATTEMPTS
, Make ProcessingWorker create its queue by default #1093ocrd_utils.config
to collect all configuration based on environment variables in one place, ocrd_utils.config to handle all environment variables #1081--page-id
evaluates to an empty set #1088, raise ValueError if --page-id is provided but leads to empty result #1089Changed:
worker
andserver
subcommands, with separate --help, for starting processing worker/processor server, replace --agent-type with subcommand, #1032, #1085 #1087tf_disable_interactive_logs
(to silence keras/tensorflow print statements) toocrd_utils.logging
and do not call on module-level, Can we properly solve the reason fortf1_disable_interactive_logs
existence? #1090, move tf_disable_interactive_logs to utils/logging and remove calls fr… #1091This discussion was created from the release v2.54.0.
Beta Was this translation helpful? Give feedback.
All reactions