
Fix tiseg #79

Merged (23 commits) on May 19, 2021

Conversation

@bertsky (Contributor) commented on Feb 1, 2021

This brings this processor up to bare-minimum functionality.

Results are not that good, but that's another story. (Legacy text/non-text segmentation does not detect enough non-text, so recall is bad while precision is good. Deep ML segmentation, however, fails miserably: it does not even preserve most of the foreground, and its precision is as bad as its recall.)

@bertsky (Contributor, Author) commented on Feb 1, 2021

> Results are not that good, but that's another story. (Legacy text/non-text segmentation does not detect enough non-text, so recall is bad while precision is good. Deep ML segmentation, however, fails miserably: it does not even preserve most of the foreground, and its precision is as bad as its recall.)

See #80 for a discussion about that.

- drop unused and dysfunctional code against overlaps
- drop wrong reading order algorithm
- improve mask post-processing (closing instead of dilation)
- make mask-polygon conversion optional
- add optional post-processing to reduce overlaps
  (bbox-only or mask-based):
  - non-maximum suppression across classes (min_iou_drop; see the sketch after this list)
  - non-maximum merging across classes (min_iou_merge)
  - within-other suppression across classes (min_share_drop)
  - within-other merging across classes (min_share_merge)
- implement correct reading order algorithm
  (bbox-only or mask-based):
  - partial order constraints under lr-tb assumption
  - topological sort
- annotate confidence along with coordinate results
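
To illustrate the cross-class suppression above, here is a minimal sketch of greedy bounding-box non-maximum suppression with a min_iou_drop threshold. It is a hypothetical illustration under assumed region/score data structures, not the processor's actual implementation:

def bbox_iou(a, b):
    """Intersection over union of two bounding boxes (x0, y0, x1, y1)."""
    x0, y0 = max(a[0], b[0]), max(a[1], b[1])
    x1, y1 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, x1 - x0) * max(0, y1 - y0)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union else 0.0

def suppress_overlaps(regions, min_iou_drop=0.9):
    """Greedy NMS across classes: keep the higher-scored of any two regions
    (regardless of class) whose boxes overlap by at least min_iou_drop IoU.
    Each region is a dict with 'bbox' (x0, y0, x1, y1) and 'score'."""
    kept = []
    for region in sorted(regions, key=lambda r: r['score'], reverse=True):
        if all(bbox_iou(region['bbox'], other['bbox']) < min_iou_drop
               for other in kept):
            kept.append(region)
    return kept

The merging variant (min_iou_merge) would presumably combine the two regions instead of dropping one, and the within-other variants (min_share_*) would presumably compare the intersection against the smaller region's area rather than the union; the mask-based mode would compute these measures on the pixel masks instead of the boxes.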
@bertsky (Contributor, Author) commented on Feb 4, 2021

Here's more, bringing some sanity to block segmentation. (Changes to the mrcnn part loosely derive from my Mask_RCNN fork.)

Again, this is not about model quality itself, but about making the best of what the model gives us. For concerns about training/model quality, cf. the last comments in #82.

@bertsky (Contributor, Author) commented on Feb 4, 2021

Regarding tiseg again:

> But does any consuming processor actually make use of the alpha channel? I highly doubt it.

Since the model was obviously trained on raw images, we have to apply it to raw images. But we can still take binarized images (from a binarization step in the workflow) and apply our resulting mask to them – by filling with white.

That seems like the better OCR-D interface to me. (Of course, contour-finding and annotation via coordinates would still be better than a clipped derived image.) What do you think, @kba?

I already have a commit lined up for this, i.e. merely comparing the text score with the image score (ignoring the background score), and then white-filling and alpha-masking the image parts in the clipped raw result. Should I add that?
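
A minimal sketch of that idea, assuming per-pixel class scores (background, text, image) from the model and a PIL binarized page image; the function name and array layout are assumptions, not the actual commit:

import numpy as np
from PIL import Image

def whiten_image_regions(bin_image, scores):
    """Compare text vs image score per pixel (ignoring background) and
    white-fill the pixels classified as image in the binarized page."""
    # scores: float array of shape (height, width, 3) = (background, text, image)
    image_mask = scores[..., 2] > scores[..., 1]
    page = np.array(bin_image.convert('L'))
    page[image_mask] = 255  # fill non-text (image) foreground with white
    # the same mask could be used to alpha-mask the clipped raw image
    return Image.fromarray(page)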

@bertsky (Contributor, Author) commented on Feb 4, 2021

BTW, we still have a problem with loggers here. In 5a4d874 I had to remove the initLogging from tensorflow_importer, as it would cause a large and unfriendly error message about calling initLogging twice. But now the TF logger does not adhere to OCR-D conventions, and what's worse, it spoils the -h and -J output again.

So how do we do this properly, @kba? Use our getLogger, but only import tensorflow_importer in process?

@kba (Member) commented on Feb 4, 2021

> I already have a commit lined up for this, i.e. merely comparing the text score with the image score (ignoring the background score), and then white-filling and alpha-masking the image parts in the clipped raw result. Should I add that?

IIUC (and that's a big if) then yes, please.

> BTW, we still have a problem with loggers here. In 5a4d874 I had to remove the initLogging from tensorflow_importer, as it would cause a large and unfriendly error message about calling initLogging twice. But now the TF logger does not adhere to OCR-D conventions, and what's worse, it spoils the -h and -J output again.

> So how do we do this properly, @kba? Use our getLogger, but only import tensorflow_importer in process?

The longer I have to deal with the "not initialized" or "initialized twice" logging errors, the less I am convinced that it was a good idea :( But in this case, dropping the initLogging() from tensorflow_importer and importing it only in process seems reasonable. It might cause runtime errors due to unavailable dependencies, but it also saves time for non-processing calls.
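
A sketch of that deferred-import approach, assuming the usual OCR-D Processor layout; the class name, logger name, and the tensorflow_importer import path are illustrative assumptions:

from ocrd import Processor
from ocrd_utils import getLogger

class OcrdAnybaseocrTiseg(Processor):

    def process(self):
        log = getLogger('processor.AnybaseocrTiseg')
        # Defer the TensorFlow import to processing time, so that
        # -h and -J (--dump-json) never trigger TF's logging setup.
        from ocrd_anybaseocr.tensorflow_importer import tf  # assumed path
        log.info('Using TensorFlow %s', tf.__version__)
        # ... actual text/image segmentation would follow here ...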

@bertsky (Contributor, Author) commented on Feb 4, 2021

> The longer I have to deal with the "not initialized" or "initialized twice" logging errors, the less I am convinced that it was a good idea :( But in this case, dropping the initLogging() from tensorflow_importer and importing it only in process seems reasonable. It might cause runtime errors due to unavailable dependencies, but it also saves time for non-processing calls.

I get the same feeling. Also, regarding processing vs. non-processing contexts and module-level imports: implementing process() almost always drags in the full dependencies, and splitting a (Processor) class across modules is impossible. We are even lucky that -J works at all for now: every Keras processor will invariably print("Using TensorFlow backend"). We cannot ever rule such things out. – No idea how to address this.

@kba (Member) commented on Feb 4, 2021

> Every Keras processor will invariably print("Using TensorFlow backend")

I have been actively defending against that with hacks like tensorflow_importer or

import os
import sys
import warnings

# silence TensorFlow's C++ logging before anything TF-related is imported
os.environ["TF_CPP_MIN_LOG_LEVEL"] = "3"
# temporarily swallow stderr so Keras' import-time chatter disappears
stderr = sys.stderr
sys.stderr = open(os.devnull, "w")
from keras import backend as K
from keras.models import load_model
sys.stderr = stderr
import tensorflow as tf
tf.get_logger().setLevel("ERROR")
warnings.filterwarnings("ignore")

It's quite a mouthful to stop those import-time print statements but I see no way around it for --dump-json etc.

@kba merged commit 0fde740 into OCR-D:master on May 19, 2021