ISCSLP 2006


Accepted Papers

1) Papers accepted in ISCSLP Proceedings (Springer LNAI Book, SCI Indexed):

The Implementation of Service Enabling with Spoken Language of a Multi-modal System Ozone
Sen Zhang

Signal Trajectory Based Noise Compensation for Robust Speech Recognition
Zhi-Jie Yan, Jian-Lai Zhou, Frank K. Soong and Ren-Hua Wang

Towards Automatic Tone Correction in Non-native Mandarin
Mitchell Peabody and Stephanie Seneff

Interactive Computer Aids for Acquiring Proficiency in Mandarin (Invited Keynote)
Stephanie Seneff

Discriminative Transformation for Sufficient Adaptation in Text-Independent Speaker Verification
Hao Yang, Yuan Dong, Xianyu Zhao, Jian Zhao and Haila Wang

Vector Autoregressive Model for Missing Feature Reconstruction
Xiong Xiao, Haizhou Li and Eng Siong Chng

All-Path Decoding Algorithm for Segmental based Speech Recognition
Yun Tang, Wen-Ju Liu and Bo Xu

Spoken Correction for Chinese Text Entry
Bo-June (Paul) Hsu and James Glass

State-Dependent Phoneme-Based Model Merging for Dialectal Chinese Speech Recognition
Linquan Liu, Thomas Fang Zheng and Wenhu Wu

Pitch Mean Based Frequency Warping
Jian Liu, Thomas Fang Zheng and Wenhu Wu

Linguistic Markings of Units in Spontaneous Mandarin
Shu-Chuan Tseng

Prosodic Word Prediction using a Maximum Entropy Approach
Honghui Dong, Jianhua Tao and Bo Xu

Phonetic and Phonological Analysis of Focal Accents of Disyllabic Words in Standard Chinese
Yuan Jia, Ziyu Xiong and Aijun Li

Effect of Speech and Noise Cross Correlation on AMFCC Speech Recognition Features
Benjamin Shannon and Kuldip Paliwal

Speech Synthesis Based on a Physiological Articulatory Model
Qiang FANG and Jianwu DANG

A Minimum Boundary Error Framework for Automatic Phonetic Segmentation
Jen-Wei Kuo and Hsin-Min Wang

A Study of Knowledge-based Features for Obstruent Detection and Classification in Continuous Mandarin Speech
Kuang-Ting Sung and Hsiao-Chuan Wang

The IIR NIST 2006 Speaker Recognition System: Fusion of Acoustic and Tokenization Features
Rong Tong, Bin Ma, Kong Aik Lee, Chang Huai You, Dong Lai Zhou, Tomi Kinnunen, Han wu Sun, Ming hui Dong, Eng Siong Chng and Hai zhou Li

Auditory Contrast Spectrum for Robust Speech Recognition
Xugang Lu

Unsupervised Speaker Adaptation using Reference Speaker Weighting
Tsz-Chung Lai and Brian Mak

Prosodic Structure Prediction based on Maximum Entropy Model with Error-Driven Modification
Xiaonan Zhang, Jun Xu and Lianhong Cai

Non-uniform Kernel Allocation based Parsimonious HMM
Peng Liu

Speaker-and-environment Change Detection in Broadcast News using Maximum Divergence Common Component GMM
Yih-Ru Wang

Distributed Speech Recognition of Mandarin Digits String
Yih-Ru Wang, Bo-Xuan Lu and Bo-Xuan Lu Chen

Automatic Construction of Regression Class Tree for MLLR via Model-based Hierarchical Clustering
Shih-Sian Cheng, Yeong-Yuh Xu, Hsin-Min Wang and Hsin-Chia Fu

Multi-channel Noise Reduction in Noisy Environments
Junfeng Li, Masato Akagi and Yoiti Suzuki

Predicting Prosody From Text
Keh-Jiann Chen, Chiu-yu Tseng and Chia-hung Tai

HMM-Based Emotional Speech Synthesis using Average Emotion Model
Long Qin, Zhen-Hua Ling, Yi-Jian Wu, Bu-Fan Zhang and Ren-Hua Wang

Consistent modeling of the static and time-derivative cepstrums for speech recognition using HSPTM
Yiu-Pong Lai and Man-Hung Siu

Adaptive Null-Forming Algorithm with Auditory Sub-bands
Heng Zhang, Qiang Fu and Yonghong Yan

A Unified Framework for Text Analysis in Chinese TTS
Guohong Fu

On the Use of Entropy Information for Improving Posterior Probability based Confidence Measures
Tzan-Hwei Chen, Berlin Chen and Hsin-Min Wang

Integrating Complementary Features with a Confidence Measure for Speaker Identification
Nengheng Zheng, P. C. Ching, Ning Wang and Tan Lee

A Cantonese Speech-Driven Talking Face using Translingual Audio-to-Visual Conversion
Lei Xie, Helen Meng and Zhi-Qiang Liu

Some Improvements in Phrase-Based Statistical Machine Translation
Zhendong Yang, Wei Pang, Jinhua Du, Wei Wei and Bo Xu

Prosodic Words Prediction from Lexicon Words with CRF and TBL Joint Method
Heng Kang and Wenju Liu

A Robust Voice Activity Detection based on Noise Eigenspace Projection
Dongwen Ying, Yu Shi, Frank Soong, Jianwu Dang and Xugang Lu

Focus, Lexical Stress and Boundary Tone: Interaction of Three Prosodic Features
Lu Zhang, Yi-qing Zu and Run-qiang Yan

Automatic Detection of Tone Mispronunciation in Mandarin
Li Zhang, Chao Huang, Min Chu, Frank Soong, Xianda Zhang and Yudong Chen

An HMM Compensation Approach Using Unscented Transformation For Noisy Speech Recognition
Yu Hu and Qiang Huo

Automatic Spoken Language Translation Template Acquisition Based on Boosting Structure Extraction and Alignment
Rile Hu and Xia Wang

Vietnamese Automatic Speech Recognition: the FLaVoR Approach
Quan Vu

Noisy Speech Recognition Performance of Discriminative HMMs
Jun Du, Peng Liu, Frank K. Soong, Jian-Lai Zhou and Ren-Hua Wang

A Hakka Text-to-Speech System
Hsiu-Min Yu, Hsin-Te Hwang, Dong-Yi Lin and Sin-Horng Chen

Minimum Phone Error (MPE) Model and Feature Training on Mandarin Broadcast News Task
Jia-yu Chen, Chia-yu Wan, Yi Chen, Berlin Chen and Lin-shan Lee

A Corpus-based Approach for Cooperative Response Generation in a Dialog System
Zhiyong Wu, Helen M. Meng, Hui Ning and Sam C. Tse

Design of Cubic Spline Wavelet for Open Set Speaker Classification in Marathi
Hemant Patil and Tapan Basu

HKUST/MTS: A Very Large Scale Mandarin Telephone Speech Corpus
Yi Liu, Pascale Fung, Yongsheng Yang, Christopher Cieri, Shudong Huang and David Graff

Mechanisms of Question Intonation in Mandarin
Jiahong Yuan

Improved Mandarin Speech Recognition by Lattice Rescoring with Enhanced Tone Models
Huanliang Wang, Yao Qian, Frank K. Soong, Jian-Lai Zhou and Jiqing Han

An HMM-Based Mandarin Chinese Text-to-Speech System
Yao Qian, Frank Soong, Yining Chen and Min Chu

Advances in Mandarin Broadcast Speech Transcription at IBM under the DARPA GALE Program
Yong Qin, Qin Shi, Yi Y. Liu, Hagai Aronowitz, Stephen M. Chu, Hong-kwang Kuo and Geoffrey Zweig

Language Identification by Using Syllable-based Duration Classification on Code-switching Speech
Dau-Cheng Lyu, Ren-Yuan Lyu, Yuang-Chin Chiang and Chun-Nan Hsu

Comparison of Perceived Prosodic Boundaries and Global Characteristics of Voice Fundamental Frequency Contours in Mandarin Speech
Wentao Gu, Keikichi Hirose and Hiroya Fujisaki

Improved Large Vocabulary Continuous Chinese Speech Recognition by Character-based Consensus Network
Yi-Sheng Fu, Yi-Cheng Pan and Lin-Shan Lee

The IIR Submission to CSLP 2006 Speaker Recognition Evaluation
K.-A. Lee

A Novel Alternative Hypothesis Characterization Using Kernel Classifiers for LLR-based Speaker Verification
Yi-Hsiang Chao, Hsin-Min Wang and Ruei-Chuan Chang

UBM based Speaker Segmentation and Clustering for 2-Speaker Detection
Jing Deng, Thomas Fang Zheng and Wenhu Wu

CCC Speaker Recognition Evaluation 2006: Overview, Methods, Data, Results and Perspective
Thomas Fang Zheng, Zhanjiang Song, Lihong Zhang, Michael Brasser, Shuifa Sun, Wei Wu and Jing Deng

Speaker Verification Using Complementary Information from Vocal Source and Vocal Tract
Nengheng Zheng, Ning Wang, Tan Lee and P. C. Ching

ISCSLP SR Evaluation, UVA-CS_es System Description. A System Based on ANNs
Carlos E. Vivaracho

Evaluation of EMD-based Speaker Recognition using ISCSLP2006 Chinese Speaker Recognition Evaluation Corpus
Shingo Kuroiwa, Satoru Tsuge, Masahiko Kita and Fuji Ren

The Contribution of Lexical Resources to Natural Language Processing of CJK
Jack Halpern

The Paradigm for Creating Multi-lingual Text-to-Speech Voice Databases
Min Chu, Yong Zhao, Yining Chen, Lijuan Wang and Frank Soong

Construct Trilingual Parallel Corpus on Demand
Muyun Yang, Hongfei Jiang, Tiejun Zhao and Sheng Li

Development of Multi-lingual Spoken Corpora of Indian Languages
K. Samudravijaya

Multilingual Speech Corpora for TTS System Development
Hsi-Chun Hsiao, Hsiu-Min Yu, Yih-Ru Wang and Sin-Horng Chen

Multilingual Spoken Language Corpus Development for Communication Research
Toshiyuki Takezawa

Extractive Chinese Spoken Document Summarization Using Probabilistic Ranking Models
Yi-Ting Chen, Suhan Yu, Hsin-min Wang and Berlin Chen

Meeting Segmentation Using Two-Layer Cascaded Subband Filters
Manuel Giuliani, Tin Lay Nwe and Haizhou Li

A Multi-layered Summarization System for Multi-media Archives by Understanding and Structuring of Chinese Spoken Documents
Lin-shan Lee, Sheng-yi Kong, Yi-cheng Pan, Yi-sheng Fu, Yu-tsun Huang and Chien-Chih Wa

Initial Experiments on Automatic Story Segmentation in Chinese Spoken Documents using Lexical Cohesion of Extracted Named Entities
Devon Li, Wai-Kit Lo and Helen Meng

Contextual Maximum Entropy Model for Edit Disfluency Detection of Spontaneous Speech
Jui-Feng Yeh, Chung-Hsien Wu and Wei-Yen Wu

Rhythmic Organization of Mandarin Utterances - a Two-Stage Process
Min Chu and Yunjia Wang

Nonlinear Emotional Prosody Annotation and Generation
Jianhua Tao, Jian Yu and Yongguo Kang

2) Papers accepted in ISCSLP Proceedings (Companion Volume, Published by COLIPS):

A Speaker Adaptation Method Using Projection to Latent Structure method
Wang Jingying and Wang Zuoying

Feature Extraction and Test Algorithm for Speaker Verification
Wu Guo

Automatic Chinese Dialogue Text Summarization Based On LSA and Segmentation
Chuanhan Liu, Yongcheng Wang, Fei Zheng and Derong Liu

A Robust Acoustic Echo Canceller for Noisy Environment
QIN Shenghao, MENG Sha and LIU Jia

A Closed-loop Multimode Variable Bit Rate Characteristic Waveform Interpolation Coder
Jing Wang, Jing-ming Kuang and Sheng-hui Zhao

Prosodic Word Grouping in Mandarin TTS System
Qing Guo, Endong Xun and Nobuyuki Katae

A Low-complexity Improved WI Speech Coding at 2kbps
Fengyan QI and Changchun BAO

A Low-Cost Robust Front-end for Embedded ASR System
Lihui Guo, Xin He, Yue Lu and Yaxin Zhang

English Alphabet Recognition Based on Chinese Acoustic Modeling
Linquan Liu, Thomas Fang Zheng and Wenhu Wu

SpeechQoogle: An Open-Domain Question Answering System with Speech Interface
Guoping Hu, Dan Liu, Qingfeng Liu and Ren-Hua Wang

A Unified Totally-Data-Driven Framework for Duration and Intonation Modeling
Lifu Yi, Jian Li, Xiaoyan Lou and Jie Hao

On the Use of Long-Term Average Spectrum in Automatic Speaker Recognition
Tomi Kinnunen, Ville Hautamäki and Pasi Fränti

An Efficient and Robust Approach to Audio ID Identification
Ming Li, Jian Liu and Yonghong Yan

Frame-level Nonlinearity for Robust DTW-based Speaker Verification
Jian Luan, Jie Hao, Tomonari Kakino and Tomonori Ikumi

Evaluation of Aspiration Sounds of Chinese Labial and Alveolar Diphthong Uttered by Japanese Students Using Voice Onset Time and Breathing Power
Akemi Hoshino and Akio Yasuda

Robust Speech-Annotated Photo Retrieval Using Syllable-Transformed Patterns
Chien-Lin Huang, Wei-Chuan Lee and Chung-Hsien Wu

The application of phone weight in Putonghua pronunciation quality assessment
QingSheng Liu, Si Wei, Yu Hu, Wu Guo and RenHua Wang

Optimizing the Implementation of MMSE Enhancement for Robust Speech Recognition
Pei Ding, Lei He, Xiang Yan, Rui Zhao and Jie Hao

Universal TRAP-TANDEM ASR System: Recognition of Noisy Speech and Random Processes During System Training
Petr Svojanovsky

An Efficient Rate-Distortion Control Algorithm for MPEG-4 AAC Based on JNLD Bit Allocation Estimation Psychoacoustic Model
Sheng Wu and Xiaojun Qiu

Automatic Tonal and Non-Tonal Language Classification and Language Identification Using Prosodic Information
Liang Wang, Eliathamby Ambikairajah and Eric H.C. Choi

Word Intelligibility Evaluation for Diagnosing High Quality Small Footprint TTS Engine Based on Variable-length Unit Concatenation
Zhen-Li Yu, Yi-Qing Zu, Dong-Jian Yue and Gui-Lin Chen

Pitch Prediction for Mandarin TTS with Mutual Prosodic Constraint
Jian Yu, Jianhua Tao and Xia Wang

A Diphone Sharing Method towards Scalable Unit-training-based TTS
Jian Li, Xiaoyan Lou, Jie Hao and Lifu Yi

Full Utilization of Closed-captions in Broadcast News Recognition
Meng Meng, Wang Shijin, Liang Jiaen, Ding Peng and Xu Bo

Minimum Classification Error Based Optimal Linear Combination for Spoken Language Identification
Donglai Zhu, Rong Tong, Bin Ma and Haizhou Li

Concatenation Speech Synthesis for Low-Tier Device
Dongjian Yue

Spectral Continuity Measures at Mandarin Syllable Boundaries
Jun Xu and Lianhong Cai

Acoustic Analysis of Emotional Speech in Mandarin Chinese
Sheng Zhang, Pak-Chung Ching and Fan-rang Kong

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition
Hua Zhang, Yun Tang, Wen Ju Liu and Bo Xu

An Initial System for Integrated Synthesis of Mandarin, Min-nan, and Hakka Speech
Hung-Yan Gu, Yan-Zuo Zhou and Huang-Liang Liau

Multi-Pitch Detection for Co-Channel Speech Utilizing Frequency Channel Piecewise Integration and Morphological Feedback Verification Tracking
Yong Guan, Peng Li, Wen Ju Liu and Bo Xu

Multi-accented Mandarin Database Construction and Benchmark Evaluations
Xiang Yan, Lei He, Pei Ding, Rui Zhao and Jie Hao

A Feasibility Study for Chinese-Spanish Statistical Machine Translation
Rafael E. Banchs, Josep M. Crego, Patrik Lambert and José B. Mariño

Research and Analysis of Fast Training in SVM-based Audio Classification
Shilei Zhang, Hongchen Jiang, Shuwu Zhang and Bo Xu

A Top-down Approach to Melody Match in Pitch Contour for Query by Humming
Xiao Wu and Ming Li

State-Correlated Duration model for HMM-Based Speech Synthesis System
Xiao-Cui Li, Heng Kang and Wen-Ju Liu

Speaker, vocabulary and context independent word spotting system for continuous speech
Radu Timofte, Ville Hautamäki and Pasi Fränti

Improvements in Tone Pronunciation Scoring for Strongly Accented Mandarin Speech
Fuping Pan, Qingwei Zhao and Yonghong Yan

Recognition of Emotional Speech and Speech Emotion in Farsi
Davood Gharavian and S. Mohammad Ahadi

DOE and ANOVA based Performance Influencing Factor Analysis for Evaluation of Speech Recognition Systems
Xiangdong Wang, Feng Xie, Shouxun Lin, Yueliang Qian and Qun Liu

Training Discriminative HMM by Optimal Allocation of Gaussian Kernels
Zhi-Jie Yan, Peng Liu, Jun Du, Frank K. Soong and Ren-Hua Wang

A Comparative Study on Confidence Measure in Mandarin Command Word Recognition
Cong Liu, Zhi-Jie Yan, Yu Hu and Ren-Hua Wang

Experimental Investigation into Alignment-based Acoustic Confidence Measures in Keyword Verification for Mandarin Speech
Yiyan Liu, Yingchun Yang and Zhaohui Wu

A Multi-stage Method for Text-To-Pronunciation Conversion
CHING-HSIEN LEE, REN-Jr WANG and CHUNG-JEN CHIU

Bandwidth Scalable Wideband Codec using Hybrid Matching Pursuit Harmonic/CELP Scheme
Gyu-Hyeok Jeong, Yeong-Uk Ahn, Jong-Hark Kim, Gyu-Jin Kim and In-Sung Lee

F0 Analysis of Chinese Accented German Speech
Hongwei Ding, Oliver Jokisch and Ruediger Hoffmann

Affect-insensitive Speaker Recognition via Feature Transformation
Dongdong Li, Yingchun Yang and Zhaohui Wu

Short-time ICA for Blind Separation of Noisy Speech
Jing Zhang and Pak Chung Ching

Chinese Character-based Segmentation & POS-tagging and Named Entity Identification With a CRF Chunker
Hu Xinhui and Kashioka Hideki

Decision Tree Classification Approach for Model Selection in Segmenting Mandarin TTS Corpus
Yuan Xiaoliang, Dong Yuan, Huang Dezhi, Guo Jun and Wang Haila

Comparison of News Announcing and Talking Styles in Broadcast Speech
Yu Zou, Xiaohua Li, Min Hou and Na An

Investigation on pleasure related acoustic features of affective speech
Dandan Cui, Lianhong Cai, Yongxin Wang and Xiaozhou Zhang

EM Algorithm with Split and Merge in Trajectory Clustering for Automatic Speech Recognition
Yan Han and Lou Boves

Keyword Spotting Based on Confusion Matrix
Pengyuan Zhang

A New Approach for Speech/Music Discrimination Based on Cepstral Distance
Mu-Yeol Choi, Seul-Han Park, Hwa Jeon Song and Hyung Soon Kim

Performance Evaluation of Non-Keyword Modeling for Vocabulary-Independent Keyword Spotting
Young Kuk Kim, Hwa Jeon Song and Hyung Soon Kim

Two-layer Distance Scheme in Matching Engine for Query by Humming System
Feng Zhang, Yan Song, Li-Rong Dai and Ren-Hua Wang

Monte Carlo Noisy HMM Estimation and Segmental Differential Features on the Aurora2 Clean Training Evaluation
Jing-Teng Zeng, Cheng-Chang Lee, Jeng-Shien Lin, Yuan-Fu Liao and Sen-Chia Chang

Automatic Scoring of Flat Tongue and Raised Tongue in Computer-assisted Mandarin Learning
Bin Dong, Qingwei Zhao and Yonghong Yan

Sausage-net-based Minimum Phone Error Training for Continuous Phone Recognition
Jiang-Chun Chen, Chun-Jen Lee, Shuo-Pin Hsu and J.-S. Roger Jang

Temporal Discrete Cosine Transform: Towards Longer Term Temporal Features for Speaker Verification
Tomi Kinnunen, C. W. Eugene Koh, Lei Wang, Haizhou Li and Eng Siong Chng

Contrastive study on tonal patterns between accented and standard Chinese
Aijun LI, Ziyu XIONG and Xia WANG

Mismatch Negativity Elicited by Non-cluster and Cluster Consonants Changes in Thai Words in Humans
Wichian Sittiprapaporn, Usanee Sothiwat, Chittin Chindaduangratn and Naiphinich Kotchabhakdi

Improving the Robustness of LPCC Feature Against Impulsive Noise by Applying the FOP Method
Pei Ding

A Novel Fast Real-Time Audio Mixing Algorithm
Wenlin Wang, Jianxin Liao, Xiaomin Zhu and Qiwei Shen

Robust Speech Endpoint Detection in Noisy Environments
Yanmeng Guo and Qiang Fu

Embedded Implementation and Optimization for HMM-Based Continuous Speech Recognition System
Lingyun Xie

Integrating Hypotheses of Multiple Recognizers for Improving Mandarin LVCSR Performance
Yu SHI, Frank SOONG and Jian-Lai ZHOU

Modular Text-to-Speech Synthesis Evaluation for Mandarin Chinese
Jilei Tian, Jani Nurminen and Imre Kiss

Exploiting GMM-based Quality Measure for SVM Speaker Verification
Rong Zheng, Hongchen Jiang, Shuwu Zhang and Bo Xu

Speaker Diarization System Based on GMM and BIC
Tantan Liu, Xiaoxing Liu and Yonghong Yan

Compensations for SVM in Text-Independent Speaker Verification
Xiang-Feng Lu and Jia Liu

A New Data Fusion Technique and Performance Measure for Identification of Twins in Marathi
Hemant A. Patil and T. K. Basu

Incorporating Prosodic with Acoustic information for ISCSLP'2006 Speaker Recognition Evaluation - Robust Cross-Channel Speaker Verification
Wen-Chieh Chang, Ding-Yun Chen, Zi-He Chen, Zhi-Ren Zeng, Yuan-Fu Liao and Yau-Tarng Juang

Multilingual Text - Speech Corpus of Mongolian
Idomuco Dawa

Design of Vietnamese Speech Corpus and Current Status
Luong Chi Mai

Design of Cross-lingual and Multilingual Corpora for Speaker Recognition
Hemant Patil, S. Ghosh, A. Si and T. K. Basu

Recent Advances of Speech Databases development activity for Indian Languages
S. Agrawal, K. Samudravijaya and Karunesh Arora

Multi-lingual TTS Speech Corpus Development
Yiqing Zu, Zhenhai Cao, Guilin Chen, Kesong Han, Peng Lu, Runqiang Yan, Kaizhi Wang, Zhenli Yu and Dongjian Yue

Syllable Based Audio Search Using Confusion Network Arc as Indexing Unit
Jian Shao, Pengyuan Zhang, Jiang Han, Jun Yang and Yonghong Yan

Applying SFC Model for Chinese Expressive Speech Synthesis
Bu-fan Zhang, Zhen-hua Ling, Long Qin and Ren-hua Wang

The Breath Segment in Expressive Speech
Chu YUAN and Aijun LI

Automatic Gender Annotation from Chinese Person Name
Ke-Song Han and Gui-Lin Chen

Note: The Springer LNAI Book is published by Springer and is indexed by SCI Expanded; the Companion Volume is published by COLIPS with an ISBN and is not indexed by SCI.

©2005-2006 Chinese and Oriental Languages Information Processing Society, Singapore | Last updated on December 21, 2006.