It is challenging to detect arbitrary-shape text accurately and effectively in natural scenes. While many methods have been implemented for arbitrary-shape text detection, most cannot achieve ...real-time detection or meet practical needs. In this work, we propose a YOLOv6-based detector that can effectively implement arbitrary-shape text detection and achieve real-time detection. We include two additional branches in the neck part of the YOLOv6 network to adapt the network to text detection, and the output side uses the pixel aggregation (PA) algorithm to decouple the PA output to use it as the detection head of the model. Experiments on benchmark Total-Text, CTW1500, ICDAR2015, and MSRA-TD500 showed that the proposed method outperformed competing methods in terms of detection accuracy and running time. Specifically, our method achieved an F-measure of 84.1% at 291.8 FPS for 640 × 640 Total-Text images and an F-measure of 81.5% at 199.6 FPS for 896 × 896 ICDAR2015 incidental text images.
Effective collaboration in computer-mediated settings among spatially distributed people is a precondition for success in many new learning and working contexts but it is hard to achieve. We have ...developed two instructional approaches to improve collaboration in such settings by promoting people's capabilities to collaborate in a fruitful way and furthering their understanding of what characterizes good collaboration. The rationale is that strategies necessary for a good and effective computer-mediated collaboration may be conveyed to people by exposing them to an elaborated worked-out collaboration example (observational learning) or by giving them the opportunity to learn from scripted collaborative problem-solving. An experimental study was conducted that compared learning from observing a worked-out collaboration example with the learning effects of scripted collaborative problem-solving, the effects of unscripted collaborative problem-solving, and a control condition without a learning phase. The experimental design provided clearly separated phases for the instructional treatments (learning phase) and for applying and testing the acquired skills (application phase). Both observing a worked-out collaboration example and collaborating with a script during the learning phase showed positive effects on process and outcome of the second collaboration in the application phase.
The encoder–decoder architecture is a well-established, effective and widely used approach in many tasks of natural language processing (NLP), among other domains. It consists of two ...closely-collaborating components: An encoder that transforms the input into an intermediate form, and a decoder producing the output. This paper proposes a new method for the encoder, named Causal Feature Extractor (CFE), based on three main ideas: Causal convolutions, dilatations and bidirectionality. We apply this method to text normalization, which is a ubiquitous problem that appears as the first step of many text-to-speech (TTS) systems. Given a text with symbols, the problem consists in writing the text exactly as it should be read by the TTS system. We make use of an attention-based encoder–decoder architecture using a fine-grained character-level approach rather than the usual word-level one. The proposed CFE is compared to other common encoders, such as convolutional neural networks (CNN) and long-short term memories (LSTM). Experimental results show the feasibility of CFE, achieving better results in terms of accuracy, number of parameters, convergence time, and use of an attention mechanism based on attention matrices. The obtained accuracy ranges from 83.5% to 96.8% correctly normalized sentences, depending on the dataset. Moreover, the proposed method is generic and can be applied to different types of input such as text, audio and images.
The Polish economic transformation of the 1990s created an appetite for software that was only partially satisfied by piracy. This market was yet to be taken seriously by Western companies, so local ...developers stepped up to fill the void. They translated obscure foreign applications, created weird character encoding standards, and built complex business software from scratch, shaping the local IT market for years to come.
Mastering Vim will introduce you to the wonderful world of Vim by example of working with Python code and tools in a project-based fashion. This book will prompt you to make Vim your primary IDE as ...you will learn to use it for any programming language.
Track Changes Kirschenbaum, Matthew G
2016, 2016-05-02, 2016-01-01
eBook
Writing in the digital age has been as messy as the inky rags in Gutenberg's shop or the molten lead of a Linotype machine. Matthew Kirschenbaum examines how creative authorship came to coexist with ...the computer revolution. Who were the early adopters, and what made others anxious? Was word processing just a better typewriter, or something more?.