This study focuses the expression recognition of facial images using higher-order local autocorrelation features and discriminant analysis. Higher-order local autocorrelation features are the ...higher-order extension of the autocorrelation function which is shift-invariant, and the range of the displacements is restricted within a 3×3 local region, the center of which is the reference point. Discriminant analysis linearly maps the primitive leaning data classified into some classes into new discriminant space, which maximizes the inner-class covariance while minimizing the between-class covariance. In this experiment, we photographs facial images of some test subjects with live basic expressions and calculates the recognition rate of expression using higher-order local autocorrelation features and discriminant analysis. We also consider the application to facial animation using the locus of expression changes in discriminant space.
“Sapporo IT Carrozzeria” is one of the 15 intelligent cluster projects organized by MEXT(Ministry of Education, Culture, Sports, Science and Technology, Japan).This project focuses on the ...reinforcement of “Sapporo Valley” which consists of a variety of IT companies. In this project, the author has been developing some IT products with such IT companies. This article describes some of the achievement in the project.
Speech quality of VoIP is degraded by transmission errors such as packet loss that is inevitable in best-effort communications. We have studied how to decrease such degradation. This study ...investigated PWR (Pitch Waveform Replication) method that is employed as an error hiding technique. The previous version of PWR employs an interpolation where past speech data is only used. For the decrease of some artifacts such as echoic quality in the begging and end of phonation, this study has proposed a new version that takes account of an interpolation from future speech data. In order to shorten the delay of the pitch extraction from future speech data, the proposed method employs a template matching. From the experimental results of objective and subjective evaluation, it is indicated that the proposed method is potentially useful for the improvement of speech quality.
Today the internet uses a lot of text, pictures, videos, animations, etc. to communicate eachother. In this article, we expose the Communicating System using Avator that is easy to make, easy to ...move, and easy to communicate for everyone. We made two types of 3D models based on MPEG-4 facial animation parameters. One is real face model, the other is a ANIME-model. We will make a database of human emotions that will contribute to easy making facial 3D animation.
This research objective is to investigate about the fundamentals on the possibilities of embedded ubiquitous computing and its application development. In this research, hardware platform is ...developed to construct the environment for ubiquitous computing.
A rule-based speech synthesis system for the Japanese language, which employs a MELP (Mixed Excitation Linear Prediction) vocoder as its speech synthesizer, was implemented. This paper especially ...describes some speech synthesis techniques utilized in our system. Since the MELP vocoder developed in this study could effectively gain the naturalness of voiced consonants as well as purely voiced speech, the implemented system could succeed in enhancing the voice quality of synthesized speech more than a system that employed a conventional normal LPC (Linear Predictive Coding) vocoder.
Speech quality of VoIP (voice over Internet protocol) may potentially be degraded by transmission errors such as packet loss and delay, which is basically inevitable in best-effort communications. ...This study investigates an error concealment technique for such degradation by using a receiver-based technique called pitch waveform replication. For enhancing the conventional technique, this study proposes a waveform reconstruction technique that also takes account of the pitch variation between the backward and forward frames of gap frames. From experimental results of objective evaluation, it is indicated that the proposed technique may potentially be useful for improving the speech quality, compared with the conventional technique.