WordFences: Text localization and recognition

Inici | Què és? | Contacte

English | Castellano

Consultar RECERCAT

Per comunitats i
col·leccions Per data Per autors Per títols Per matèries

Consultar col·lecció

Per data Per autors Per títols Per matèries

Estadístiques

Del document Tot RECERCAT

El meu RECERCAT

Entrar Alertes per correu-e

Directori d’altres repositoris

Pàgina inicial del RECERCAT > Universitat Politècnica de Catalunya > Tesines i projectes i treballs de final de carrera > Visualitza document

Per accedir als documents amb el text complet, si us plau, seguiu el següent enllaç: http://hdl.handle.net/2117/101911

Títol:	WordFences: Text localization and recognition
Autor/a:	Polzounov, Andrei
Altres autors:	Institute for Infocomm Research; Escalera, Sergio; Lu, Shijian
Abstract:	En col·laboració amb la Universitat de Barcelona (UB) i la Universitat Rovira i Virgili (URV)
Abstract:	In recent years, text recognition has achieved remarkable success in recognizing scanned document text. However, word recognition in natural images is still an open problem, which generally requires time consuming post-processing steps. We present a novel architecture for individual word detection in scene images based on semantic segmentation. Our contributions are twofold: the concept of WordFence, which detects border areas surrounding each individual word and a unique pixelwise weighted softmax loss function which penalizes background and emphasizes small text regions. WordFence ensures that each word is detected individually, and the new loss function provides a strong training signal to both text and word border localization. The proposed technique avoids intensive post-processing by combining semantic word segmentation with a voting scheme for merging segmentations of multiple scales, producing an end-to-end word detection system. We achieve superior localization recall on common benchmark datasets - 92% recall on ICDAR11 and ICDAR13 and 63% recall on SVT. Furthermore, end-to-end word recognition achieves state-of-the-art 86% F-Score on ICDAR13.
Matèries:	-Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial -Machine learning -Neural networks (Computer science) -artificial intelligence -object detection -text detection -text recognition -Aprenentatge automàtic -Xarxes neuronals (Informàtica)
Drets:
Tipus de document:	Treballs d'investigació/Fi de màster
Publicat per:	Universitat Politècnica de Catalunya
Compartir:

Mostra el registre complet del document

Accessibilitat | Avís legal | Política de Cookies | Documents d'ús intern

Coordinació

Patrocini