Quantum-Theoretic Modeling in Computer Science Aerts, Diederik; Beltran, Lester; Geriente, Suzette ...
International Journal of Theoretical Physics,
02/2021, Volume 60, Issue 2
Journal Article
Peer reviewed
We work out a quantum-theoretic model in complex Hilbert space of a recently performed test on co-occurrences of two concepts and their combination in retrieval processes on specific corpuses of documents. The test violated the Clauser-Horne-Shimony-Holt version of Bell’s inequalities (‘CHSH inequality’), thus indicating the presence of entanglement between the combined concepts. We make use of a recently elaborated ‘entanglement scheme’ and represent the collected data in the tensor product of Hilbert spaces of the individual concepts, showing that the identified violation is due to the presence of a strong form of entanglement, involving both states and measurements and reflecting the meaning connection between the component concepts. These results provide a significant confirmation of the presence of quantum structures in corpuses of documents, as is the case for the entanglement identified in human cognition.
We show that data collected from corpuses of documents violate the Clauser-Horne-Shimony-Holt version of Bell’s inequality (CHSH inequality) and therefore indicate the presence of quantum entanglement in their structure. We obtain this result by considering two concepts and their combination and coincidence operations consisting of searches of co-occurrences of exemplars of these concepts in specific corpuses of documents. Measuring the frequencies of these co-occurrences and calculating the relative frequencies as approximate probabilities entering in the CHSH inequality, we obtain manifest violations of the latter for all considered corpuses of documents. In comparing these violations with those analogously obtained in an earlier work for the same combined concepts in psychological coincidence experiments with human participants, also violating the CHSH inequality, we identify the entanglement as being carried by the meaning connection between the two considered concepts within the combination they form. We explain the stronger violation for the corpuses of documents, as compared to the violation in the psychology experiments, as being due to the superior meaning domain of the human mind and, on the other hand, to the latter reaching a broader domain of meaning and being possibly also actively influenced during the experimentation. We mention some of the issues to be analyzed in future work, such as the violations of the CHSH inequality being larger than the ‘Cirel’son bound’ for all of the considered corpuses of documents.
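The calculation this abstract describes (relative co-occurrence frequencies used as approximate probabilities in the CHSH inequality) can be sketched roughly as follows. This is a minimal illustration, not the authors' code: the function names, the `'++'`/`'+-'`/`'-+'`/`'--'` outcome labels, and the counts are hypothetical, and the sign convention (one expectation value subtracted from the other three) is one common form of the CHSH expression that the abstract itself does not spell out. The classical bound of 2 and the Cirel'son bound of 2√2 are standard.

```python
import math

TSIRELSON_BOUND = 2 * math.sqrt(2)  # maximum CHSH value in standard quantum theory


def expectation(counts):
    """Expectation value of a joint two-outcome measurement.

    `counts` maps the outcome labels '++', '+-', '-+', '--' (hypothetical
    labels) to co-occurrence frequencies; relative frequencies are used as
    approximate probabilities.
    """
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no co-occurrences counted")
    p = {k: v / total for k, v in counts.items()}
    return p["++"] - p["+-"] - p["-+"] + p["--"]


def chsh_value(e_ab, e_ab2, e_a2b, e_a2b2):
    """CHSH expression with E(A,B) carrying the minus sign (assumed convention)."""
    return e_a2b2 + e_a2b + e_ab2 - e_ab


def classify(s):
    """Compare |S| with the classical bound 2 and the Cirel'son bound."""
    if abs(s) <= 2:
        return "no violation"
    if abs(s) <= TSIRELSON_BOUND:
        return "violation within the Cirel'son bound"
    return "violation beyond the Cirel'son bound"


# Made-up counts, for illustration only (not data from the paper).
correlated = {"++": 50, "+-": 0, "-+": 0, "--": 50}
anticorrelated = {"++": 0, "+-": 50, "-+": 50, "--": 0}

s = chsh_value(
    expectation(anticorrelated),  # E(A, B)
    expectation(correlated),      # E(A, B')
    expectation(correlated),      # E(A', B)
    expectation(correlated),      # E(A', B')
)
print(s, classify(s))
```

With these extreme made-up counts the expression reaches its algebraic maximum, which lies above the Cirel'son bound; real corpus counts would give intermediate values.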
The HathiTrust Digital Library (HTDL) is a digital library containing about 14 million volumes which comprise billions of pages of content. The HathiTrust Research Center (HTRC) is a collaborative research initiative jointly led by Indiana University and the University of Illinois at Urbana-Champaign. This paper describes the development of a collections data model by the Workset Creation for Scholarly Analysis project, an HTRC research initiative funded by the Andrew W. Mellon Foundation. The resulting HTRC Workset data model is designed to aid humanities scholars by helping them to describe selected portions of the HTDL corpus that serve as the objects of their research. The resulting worksets are persistent, citable, and can be assessed by other scholars for reuse in additional research processes.
This work aims to discuss transitivity choices by comparing the short story “Amor”, published by the writer Clarice Lispector in the book “Laços de Família” in Portuguese, with its translation into English. The analysis used the software “UAM CorpusTool”, a quantitative/qualitative tool that enables the mapping of linguistic systems from a functional perspective (O’Donnell 2016), and an online corpus alignment tool (YouAlign 2016). The results point to some differences in the use and frequency of processes, with equivalent choices in most cases. The examples that diverge appear to serve the construction of specific textual systems, motivated by cultural issues of the target language.
Channel variability is one of the largest challenges for speaker verification (SV) techniques. Techniques in the feature, model and score domains have been applied to mitigate the channel impact. In this paper, we study robust deep feature learning with the deep belief network (DBN) using traditional spectral features such as MFCC or PLP. In detail, during the training phase, a DBN is trained to map spectral features to the corresponding speaker identity; deep features extracted at the k-th hidden layer are then selected, where k is determined by maximizing the ratio between within-class distance and between-class distance. In the enrollment phase, the trained DBN is used to extract deep features at the k-th hidden layer, and a k-th DBN-vector is formed by averaging these features. In the test phase, a k-th DBN-vector is extracted for the test utterance and compared to the enrolled k-th DBN-vector to make a verification decision. To validate the effectiveness of the learned DBN-vectors for speaker verification, extensive experiments have been conducted on Mandarin corpuses. It is encouraging to see that our proposed DBN-vector based SV system is superior to the state-of-the-art i-vector based SV system under channel mismatch conditions in terms of equal error rate (EER) and minimum detection cost function (minDCF).
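The enrollment and test steps in this abstract (averaging frame-level deep features into one vector per utterance, then comparing two such vectors) might look roughly like the sketch below. Several things here are assumptions rather than details from the paper: the abstract does not say how the vectors are compared, so cosine similarity with a fixed threshold is used as a stand-in, the DBN feature extractor itself is omitted, and the names `mean_vector`, `cosine_score`, and `verify` are hypothetical.

```python
import math


def mean_vector(frames):
    """Average equal-length frame-level feature vectors into one utterance vector."""
    if not frames:
        raise ValueError("need at least one frame")
    dim = len(frames[0])
    return [sum(frame[i] for frame in frames) / len(frames) for i in range(dim)]


def cosine_score(u, v):
    """Cosine similarity; an assumed scoring rule, not taken from the abstract."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    if norm == 0:
        raise ValueError("zero-length vector")
    return dot / norm


def verify(enroll_frames, test_frames, threshold=0.5):
    """Accept the claimed identity if the score reaches the (assumed) threshold."""
    score = cosine_score(mean_vector(enroll_frames), mean_vector(test_frames))
    return score >= threshold, score


# Toy two-dimensional "deep features", for illustration only.
enrolled = [[1.0, 0.0], [1.0, 0.0]]
same_speaker = [[2.0, 0.0]]
other_speaker = [[0.0, 3.0]]

print(verify(enrolled, same_speaker))
print(verify(enrolled, other_speaker))
```

In practice the threshold would be tuned on held-out trials, for example at the operating point that defines the equal error rate.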
Introduction Fromont, Cécile
Art of Conversion,
11/2014
Book Chapter
This introductory chapter presents textual, visual, and archeological evidence for the creation, use, and meaning of Christian visual forms in the Kongo between the 16th and 19th centuries. The images, objects, and practices show how Christianity and its visual manifestations remained significant in the shaping of political and religious life in the kingdom of Kongo during civil and foreign wars and the transatlantic slave trade. It establishes dates and offers interpretations on the sources and historical significance of the iconography of two main visual corpuses: Capuchin didactic watercolors and Kongo Christian art.
The essence of paraphrasing lies in retrieving correct paraphrases. Word-level paraphrasing is sensitive to context, and its critical indicator is interchangeability. This paper presents a two-stage multi-feature word-level Chinese paraphrase extracting method. In stage one, using data mining technology, the target word and its candidate paraphrases are extracted from large-size corpuses and the Internet. In stage two, a stratified probability statistical model is established, and seven similarity feature values, which are later used to train a binary classifier, are calculated. Finally, candidate paraphrases with high similarity values are filtered out. Experimental results show that (1) retrieving candidate paraphrases from large-size corpuses through data mining has practical value, with on average 3.1 correct paraphrases obtained per word; (2) the binary classifier is effective in filtering out the correct paraphrases, with an accuracy of 0.676; (3) 34.32% of the retrieved paraphrases cannot be found in the Chinese Expanded Synonym Dictionary, which proves that the paraphrase retrieving method presented in this paper is an expansion of the traditional paraphrase extracting methods.