tags: EAST Text Detection Deep Learning Balanced Cross Entropy
Algorithm flow:
Algorithm details:

Loss function?
For score: We are using balanced cross-entropy. This can balance the impact of the imbalance of positive and negative samples. Its definition is shown below. The implementation code is as follows:
def cross_entropy(y_true_cls, y_pred_cls, training_mask):
'''
:param y_true_cls: numpy array
:param y_pred_cls: numpy array
:param training_mask: numpy array
:return:
'''
# eps = 1e-10
# y_pred_cls = y_pred_cls * training_mask + eps
# y_true_cls = y_true_cls * training_mask + eps
# shape = list(np.shape(y_true_cls))
# beta = 1 - (np.sum(np.reshape(y_true_cls, [shape[0], -1]), axis=1) / (1.0 * shape[1] * shape[2]))
# cross_entropy_loss = -beta * y_true_cls * np.log(y_pred_cls) - (1 - beta) * (1 - y_true_cls) * np.log(
# 1 - y_pred_cls)
# return np.mean(cross_entropy_loss)
eps = 1e-10
y_pred_cls = y_pred_cls * training_mask + eps
y_true_cls = y_true_cls * training_mask + eps
each_y_true_sample = tf.split(y_true_cls, num_or_size_splits=FLAGS.batch_size_per_gpu, axis=0)
each_y_pred_sample = tf.split(y_pred_cls, num_or_size_splits=FLAGS.batch_size_per_gpu, axis=0)
loss = None
for i in range(FLAGS.batch_size_per_gpu):
cur_true = each_y_true_sample[i]
cur_pred = each_y_pred_sample[i]
beta = 1 - (tf.reduce_sum(cur_true) / (FLAGS.input_size * FLAGS.input_size))
cur_loss = -beta * cur_true * tf.log(cur_pred) - (1-beta) * (1-cur_true) * tf.log((1-cur_pred))
if loss is None:
loss = cur_loss
else:
loss = loss + cur_loss
return tf.reduce_mean(loss)For geometry: we calculate its IoU loss, which is defined as follows:
doubt?
Introduction to the text detector paper --- Scene Text Detector Overview Overview Methods Summary Pixel based (segmentation) Anchor based Fusion I have recently begun to explore the algorithm of text ...
EAST text detector application Introduction Primary introduction Model application Environmental preparation Basic parameters Text detection function application effect Introduction Recently, a projec...
Feature Pyramid Based Scene Text Detector Publications: ICDAR 2017 Author: MengYi En, Beijing University of Technology Content: OCR, text detection in multi-scale scene Abstract Question: CNN network ...
1 Overview The method given in this article is to solve the problem of rotating text detection. Therefore, the method TextBoxes++ of the article can detect slanted text. The method of detecting text i...
Article Directory 1 Introduction and Related Work 2 Method 2.1 Network Architectures 2.2 Flow Alignment Module(FAM) 3 Experiments 3.1 on Cityscapes 4 Conclusion Paper address Code address 1 Introducti...
Preface: I am an ordinary fourth-year engineering student at the Zhanwu slag level. The reading of this article originated from my graduation project, because I found it very difficult during the read...
Scene Text Detection with Scribble Lines Scene text test with doodle lines 2021 AAAI Papers analysis Scene Text Detection with Scribble Lines Core thinking Training and reasoning details Result analys...
Perface Recently, I was curious about the detection problem of large and long text lines in the text detection of the scene. So I investigated the detection results of the ICDAR2017MLT data set and fo...
Foreword The first blog, found himself after reading the papers do not always understand the deep and over a period of time some of the concepts will be blurred, so I plan to in the future for somethi...
originalhttps://arxiv.org/pdf/1801.01315.pdf Abstract The most advanced scene text detection algorithm is based on deep learning, which relies on bounding box regression and performs at least two pred...