Program of the 25th Conference of the Oriental COCOSDA

Local Time (GMT+7)

Session

Session Chair

Details

DAY 1: November 24, 2022

8:30-9:00

   

Opening Ceremony

9:00-10:00

Keynote 1

Prof. Satoshi Nakamura

Seeing to Hear Better

Professor Haizhou Li

Chinese University of Hong Kong and National University of Singapore

10:00-10:15

   

Break

10:15-11:45

Session 1: Speech Recognition & Speech Synthesis

Dr. Chenglin Xu

#603. Development of a High Quality Text to Speech System for Lao, Ngoc-Anh Nguyen Thi, Tien-Thanh Nguyen and Nhat-Minh Le

#4193. NICT-Tib1: A Public Speech Corpus of Lhasa Dialect for Benchmarking Tibetan Language Speech Recognition Systems, Kak Soky, Zhuo Gong and Sheng Li

#7252. The Speech Labeling and Modeling Toolkit (SLMTK) Version 1.0, Chen-Yu Chiang, Wu-Hao Li, Yen-Ting Lin, Jia-Jyu Su, Wei-Cheng Chen, Cheng-Che Kao, Shu-Lei Lin, Pin-Han Lin, Shao-Wei Hong, Guan-Ting Liou, Wen-Yang Chang, Jen-Chieh Chiang, Yen-Ting Lin, Yih-Ru Wang and Sin-Horng Chen

#8531. Toward Automatic Generation of Transcript from Spoken Lectures: The “Dream of The Red Chamber” Series, Tzu-Han Lin, Kuan-Lin Lee, Hsin-Yun Chung, Fu-Hai Frank Wu, Jui-Chu Li, Tung-Lung Li, Shih-Lung Lo, Yi-Wen Liu and Jason S. Chang

#8910. End-to-End Named Entity Recognition for Vietnamese Speech, Thu-Hien Nguyen, Thai-Binh Nguyen, Quoc-Truong Do and Tuan-Linh Nguyen

#9634. MNASR: a Free Speech Corpus for Mongolian Speech Recognition and Accompanied Baselines, Yihao Wu, Yonghe Wang, Hui Zhang, Feilong Bao and Guanglai Gao

11:45-14:00

   

Lunch Break

14:00-15:30

Session 2: Speech Prosody

Dr. Minghui Dong

#896. Patterns of Vowel Production in The Speakers of Sanskrit Language, Pooja Gambhir, Amita Dev and Poonam Bansal

#1937. The Interaction Pattern of Focal Accent and Declarative Intonation in Mongolian, Min Ao and Aijun Li

#2910. Neural Network Models for User Attribute Extraction from Dialogues, Son Dang Ngoc and Quang Nhat Minh Pham

#4525. Multilingual Analysis of Intelligibility Classification using English, Korean, and Tamil Dysarthric Speech Datasets, Eun Jung Yeo, Sunhee Kim and Minhwa Chung

#6681. Nasality in Zhangzhou: Distribution and Constraint, Yishan Huang

#8904. A Corpus-Based Analysis of Age-Related Changes in the Acoustic Features of Elderly to Super Elderly Speech, Meiko Fukuda, Masakazu Sugiyama and Ryota Nishimura

15:30-15:45

   

Break

15:45-17:15

Session 3: Poster

Prof. Hsin-Min Wang

#542. Towards the Development of Accent Conversion Model For (L1) Bengali Speaker Using Cycle Consistent Adversarial Network (Cyclegan), Sabyasachi Chandra, Puja Bharati and Shyamal Kumar Das Mandal

#2053. Speaking-Rate Effect on Prosodic Grammar of Mandarin Read Speech, Yu-Siang Hong, Chen-Yu Chiang, Yih-Ru Wang and Sin-Horng Chen

#2284. An Automated Speech Recognition System for Phonological Awareness of Kindergarten Students in Filipino, Jazzmin Maranan

#2580. Experimentation of Various Preprocessing Pipelines for Sentiment Analysis on Twitter Data About New Indonesia’s Capital City using SVM and CNN, Siska Pebiana, Nuraisa Novia Hidayati, Dian Isnaeni Nurul Afra, Elvira Nurfadhilah, Harnum Annisa, Junanto Prihantoro, Radhiyatul Fajri, M. Teduh Uliniansyah, Agung Santosa, Lyla Ruslana Aini, Yosi Sahreza, Aulia Haritsuddin Karisma Muhammad Subekti, Josua Geovani Pinem, Muhammad Reza Alfin, Agung Septiadi, Siti Shaleha, Gembong Satrio Wibowanto, Asril Jarin, Gunarso, Andi Djalal Latief and Hammam Riza

#2928. Harvard-NGSL Sentences for English Learner Speech Corpora, Kakeru Yazawa

#3619. Acoustical Analysis of Speech of ASD Children and Typically Developing Children, Babita Saxena, Sunita Arora, Karunesh Arora and Hemant Keshwal

#3711. Designing a Speaking Assessment Task using EI to Build a Korean Learner Corpus, Chang Kyung Song, Hye Yun Jeong and Ho Jung Kim

#3844. Analysis on the Interference of Chinese /n/-/l/ Confusion to Japanese /n/-/r/ Discrimination by Chinese JFL Learners, Cenyu Xiang, Tianxiang Cao and Yanlong Zhang

#4157. Creakiness Judgments by Burmese and Vietnamese Speakers, Julián Villegas and Seunghun Lee

#4258. Text to Speech System for Lambani - a Zero Resource, Tribal Language of India, Ashwini Dasare, Deepak K T, Samudravijaya K and Mahadeva Prasanna

#4493. Production and Perception of Intonation Features by Cantonese EFL Learners, Chenyang Zhao, Ziyu Xiong and Aijun Li

#5944. Analysis of Layer-wise Training in Direct Speech to Speech Translation using Bi-LSTM, Lalaram Arya, Ayush Agarwal, Jagabandhu Mishra and S. R. Mahadeva Prasanna

#8967. The Influence of Working Memory on Intonation Production of Chinese EFL Learners, Ronghao Gu and Xiaoli Ji

DAY 2: November 25, 2022

8:30-9:45

Session 4: Multimodal Databases

Dr. Siqi Cai

#2975. UIT-VLFC: Vietnamese Lipstick Feedbacks Corpus, Binh Van Duong, An Trong Nguyen, Chien Nhu Ha, Hong-Hanh Thi Duong, My-Linh Thi Tran and Trong-Hop Do

#4801. ESAA: an EEG-Speech Auditory Attention Detection Database, Peiwen Li, Enze Su, Jia Li, Siqi Cai, Longhan Xie and Haizhou Li

#9020. A Speech Corpus for Chronic Kidney Disease, Jihyun Mun, Sunhee Kim, Myeong Ju Kim, Jiwon Ryu, Sejoong Kim and Minhwa Chung

#9232. Building a Speech Corpus of Children with Cochlear Implants via an Enhanced Metadata Structure, Seonwoo Lee, Sunhee Kim and Minhwa Chung

#9830. Taiwanese Across Taiwan Corpus and Its Applications, Yuan-Fu Liao, Hui-Lu Khoo, Un-Gian Iunn, Tsun-Guan Thiann, Jane S. Tsay, Le-Kun Tan, Huang-Lan Su, Hak-Khiam Tiun, Peter Kang, Li-Chen Chang, Su-Lian Liao, Hong-Hūi Tân, Siok-Hong Liau and Chhun-Sui Na

9:45-10:00

   

Break

10:00-11:00

Keynote 2

Prof. Luong Chi Mai

Language Technology for All: From the technology and indigenous community perspectives

Professor Sakriani Sakti

Japan Advanced Institute of Science and Technology

11:00-11:15

   

Break

11:15-12:15

   

O-COCOSDA Steering Committee Meeting (Committee members only)

12:15-14:00

   

Lunch Break

14:00-15:30

Session 5: Language Learning

Prof. Aijun Li

#622. The Effect of Acoustic Features on Chinese EFL Learners' Perception of English Accentual Prominence, Weizhong Zhang, Jian Gong, Xiaoli Ji, Yuhong Sun and Kai Sheng

#1348. Designing a Korean French-Learners' Speech Corpus (KFLSC) for Spoken Language Assessment, Soeun Park, Jihye Chun, Mihyun Kim, Hyunjoo Lee, Seongheon Lee and Sunhee Kim

#1966. Syntactic Complexity in Narrative Speech Produced by Prelingually Deaf Mandarin-Speaking Children with Cochlear Implants, Jue Yu, Han Sun and Zhaoqi Dong

#2256. voisTUTOR 2.0: A Speech Corpus with Phonetic Transcription for Pronunciation Evaluation of Indian L2 English Learners, Priyanshi Pal, Chiranjeevi Yarra and Prasanta Ghosh

#4834. Spanish Stops and Their Allophones Produced by Proficient Mandarin Learners of Spanish, Xiaotong Xi and Peng Li

#5279. NASAM 2.0: Cleft-Palate Speech Assessment Application, Nattapong Kurpukdee, Kwanchiva Thangthai, Vataya Chunwijitra, Patcharika Chootrakool and Sawit Kasuriya

15:30-15:45

   

Break

15:45-17:15

Country/Region Reports and Discussion

Prof. Satoshi Nakamura

China, Aijun Li and Dong Wang

Hong Kong, Tan Lee

India, S. S. Agrawal

Indonesia, Hammam Riza

Japan, Satoshi Nakamura

Korea, Yong-Ju Lee

Malaysia, Zuraidah Mohd Don

Myanmar, Win Pa Pa

Philippines, Nathaniel Oco

Singapore, Haizhou Li

Taiwan, Hsin-Min Wang, Yuan-Fu Liao

Thailand, Kwanchiva Thangthai

Vietnam, Luong Chi Mai

DAY 3: November 26, 2022

8:30-9:30

Session 6: Dialects and Accents

Prof. Sin-Horng Chen

#490. Effects of the Syntactic Structure on the Productivity of Tone Sandhi Rules: In the Case of Xiamen Dialect, Yiying Hu and Hui Feng

#1234. Improving Vietnamese Accent Recognition using ASR Transfer Learning, Bao Thang Ta, Xuan Vuong Dang, Quang Tien Duong, Nhat Minh Le and Van Hai Do

#4539. Transliteration of Foreign Words in Burmese: Descriptions by a Mortise-and-Tenon Notation, Chenchen Ding, Win Pa Pa, Masao Utiyama and Eiichiro Sumita

#9464. Korean Dialect Identification Based on an Ensemble of Prosodic and Segmental Feature Learning for Forensic Speaker Profiling, Jooyoung Lee, Kyungwha Kim and Minhwa Chung

9:30-9:35

   

Break

9:35-11:20

Speaker Verification Challenge

Dr. Nguyen Thi Thu Trang

COCOSDA & VLSP 2022 Challenges: Multilingual Speaker Verification, Nguyen Thi Thu Trang

COCOSDA & VLSP 2022 Challenge: Multilingual Speaker Verification for Asian languages, Pham Viet Thanh, Nguyen Thi Thu Trang

The Smartcall - ITS Systems for VLSP2022 Speaker Verification Tasks, Mai Van Tuan

Underfitt Systems for Cross-lingual Speaker Verification at VLSP 2022, Nguyen Xuan Thai Hoa

COCOSDA 2022 Challenge: Indic-Multilingual Speaker Verification, Jagabandhu Mishra

Popcorn System For Indian Multilingual Speaker Verification VLSP Challenge 2022, Nguyen Van Tue

Exploring Invariant Embedding with TDNN Features for MSV Challenge, Mohit Bansal

11:20-11:25

   

Break

11:25-11:55

   

Closing Ceremony