Beyond visual semantics: Exploring the role of scene text in image understanding

E-viri

PDF

Celotno besedilo

Recenzirano Odprti dostop

Beyond visual semantics: Exploring the role of scene text in image understanding

Dey, Arka Ujjal; Ghosh, Suman K.; Valveny, Ernest; Harit, Gaurav

Pattern recognition letters, September 2021, 2021-09-00, 20210901, Letnik: 149

Journal Article

•Images use visual and scene text to convey ideas.•Jointly leveraging scene text and visual cues leads to robust semantic interpretation.•Contextual encoding capture dynamics between co-occurring visual and text elements.•Text visual semantics can be applied to retrieval and classification tasks alike. Images with visual and scene text content are ubiquitous in everyday life. However, current image interpretation systems are mostly limited to using only the visual features, neglecting to leverage the scene text content. In this paper, we propose to jointly use scene text and visual channels for robust semantic interpretation of images. We not only extract and encode visual and scene text cues but also model their interplay to generate a contextual joint embedding with richer semantics. The contextual embedding thus generated is applied to retrieval and classification tasks on multimedia images with scene text content to demonstrate its effectiveness. In the retrieval framework, we augment the contextual semantic representation with scene text cues to mitigate vocabulary misses that may have occurred during the semantic embedding. To deal with irrelevant or erroneous scene text recognition, we also apply query-based attention to the text channel. We show that our multi-channel approach, involving contextual semantics and scene text, improves upon the absolute accuracy of the current state-of-the-art methods on Advertisement Images Dataset by 8.9% in the relevant statement retrieval task and by 5% in the topic classification task.

Išči dalje

Avtor

Dey, Arka Ujjal | Ghosh, Suman K. | Valveny, Ernest | Harit, Gaurav

Dostop do baze podatkov JCR je dovoljen samo uporabnikom iz Slovenije. Vaš trenutni IP-naslov ni na seznamu dovoljenih za dostop, zato je potrebna avtentikacija z ustreznim računom AAI.

Leto	Faktor vpliva		Izdaja		Kategorija		Razvrstitev
Leto	JCR	SNIP	JCR	SNIP	JCR	SNIP	JCR	SNIP

Povezave do osebnih bibliografij avtorjev	Povezave do podatkov o raziskovalcih v sistemu SICRIS

Vir: Osebne bibliografije in: SICRIS

Naloži sliko

Vnos na polico

Dodajanje gradiva na polico je uspelo.

Dodajanje gradiva na polico je spodletelo.

Dodajanje gradiva na polico ni bilo potrebno.

Trajna povezava

E-pošta

Faktor vpliva

Izberite knjižnično izkaznico:

Baze podatkov, v katerih je revija indeksirana

Citiranje

Tema