Local Time (GMT+7)
|
Session
|
Session Chair
|
Details
|
DAY 1: November 24, 2022
|
8:30-9:00
|
|
|
Opening Ceremony
|
9:00-10:00
|
Keynote 1
|
Prof. Satoshi Nakamura
|
Seeing to Hear Better
Professor Haizhou Li
Chinese University of Hong Kong and National University of Singapore
|
10:00-10:15
|
|
|
Break
|
10:15-11:45
|
Session 1:
Speech Recognition & Speech Synthesis
|
Dr. Chenglin Xu
|
#603. Development of a High Quality Text to Speech System for Lao, Ngoc-Anh Nguyen Thi, Tien-Thanh Nguyen and Nhat-Minh Le
#4193. NICT-Tib1: A Public Speech Corpus of Lhasa Dialect for Benchmarking Tibetan Language Speech Recognition Systems, Kak Soky, Zhuo Gong and Sheng Li
#7252. The Speech Labeling and Modeling Toolkit (SLMTK) Version 1.0, Chen-Yu Chiang, Wu-Hao Li, Yen-Ting Lin, Jia-Jyu Su, Wei-Cheng Chen, Cheng-Che Kao, Shu-Lei Lin, Pin-Han Lin, Shao-Wei Hong, Guan-Ting Liou, Wen-Yang Chang, Jen-Chieh Chiang, Yen-Ting Lin, Yih-Ru Wang and Sin-Horng Chen
#8531. Toward Automatic Generation of Transcript from Spoken Lectures: The “Dream of The Red Chamber” Series, Tzu-Han Lin, Kuan-Lin Lee, Hsin-Yun Chung, Fu-Hai Frank Wu, Jui-Chu Li, Tung-Lung Li, Shih-Lung Lo, Yi-Wen Liu and Jason S. Chang
#8910. End-to-End Named Entity Recognition for Vietnamese Speech, Thu-Hien Nguyen, Thai-Binh Nguyen, Quoc-Truong Do and Tuan-Linh Nguyen
#9634. MNASR: a Free Speech Corpus for Mongolian Speech Recognition and Accompanied Baselines, Yihao Wu, Yonghe Wang, Hui Zhang, Feilong Bao and Guanglai Gao
|
11:45-14:00
|
|
|
Lunch Break
|
14:00-15:30
|
Session 2:
Speech Prosody
|
Dr. Minghui Dong
|
#896. Patterns of Vowel Production in The Speakers of Sanskrit Language, Pooja Gambhir, Amita Dev and Poonam Bansal
#1937. The Interaction Pattern of Focal Accent and Declarative Intonation in Mongolian, Min Ao and Aijun Li
#2910. Neural Network Models for User Attribute Extraction from Dialogues, Son Dang Ngoc and Quang Nhat Minh Pham
#4525. Multilingual Analysis of Intelligibility Classification using English, Korean, and Tamil Dysarthric Speech Datasets, Eun Jung Yeo, Sunhee Kim and Minhwa Chung
#6681. Nasality in Zhangzhou: Distribution and Constraint, Yishan Huang
#8904. A Corpus-Based Analysis of Age-Related Changes in the Acoustic Features of Elderly to Super Elderly Speech, Meiko Fukuda, Masakazu Sugiyama and Ryota Nishimura
|
15:30-15:45
|
|
|
Break
|
15:45-17:15
|
Session 3:
Poster
|
Prof. Hsin-Min Wang
|
#542. Towards the Development of Accent Conversion Model For (L1) Bengali Speaker Using Cycle Consistent Adversarial Network (CycleGAN), Sabyasachi Chandra, Puja Bharati and Shyamal Kumar Das Mandal
#2053. Speaking-Rate Effect on Prosodic Grammar of Mandarin Read Speech, Yu-Siang Hong, Chen-Yu Chiang, Yih-Ru Wang and Sin-Horng Chen
#2284. An Automated Speech Recognition System for Phonological Awareness of Kindergarten Students in Filipino, Jazzmin Maranan
#2580. Experimentation of Various Preprocessing Pipelines for Sentiment Analysis on Twitter Data About New Indonesia’s Capital City using SVM and CNN, Siska Pebiana, Nuraisa Novia Hidayati, Dian Isnaeni Nurul Afra, Elvira Nurfadhilah, Harnum Annisa, Junanto Prihantoro, Radhiyatul Fajri, M. Teduh Uliniansyah, Agung Santosa, Lyla Ruslana Aini, Yosi Sahreza, Aulia Haritsuddin Karisma Muhammad Subekti, Josua Geovani Pinem, Muhammad Reza Alfin, Agung Septiadi, Siti Shaleha, Gembong Satrio Wibowanto, Asril Jarin, Gunarso, Andi Djalal Latief and Hammam Riza
#2928. Harvard-NGSL Sentences for English Learner Speech Corpora, Kakeru Yazawa
#3619. Acoustical Analysis of Speech of ASD Children and Typically Developing Children, Babita Saxena, Sunita Arora, Karunesh Arora and Hemant Keshwal
#3711. Designing a Speaking Assessment Task using EI to Build a Korean Learner Corpus, Chang Kyung Song, Hye Yun Jeong and Ho Jung Kim
#3844. Analysis on the Interference of Chinese /n/-/l/ Confusion to Japanese /n/-/r/ Discrimination by Chinese JFL Learners, Cenyu Xiang, Tianxiang Cao and Yanlong Zhang
#4157. Creakiness Judgments by Burmese and Vietnamese Speakers, Julián Villegas and Seunghun Lee
#4258. Text to Speech System for Lambani - a Zero Resource, Tribal Language of India, Ashwini Dasare, Deepak K T, Samudravijaya K and Mahadeva Prasanna
#4493. Production and Perception of Intonation Features by Cantonese EFL Learners, Chenyang Zhao, Ziyu Xiong and Aijun Li
#5944. Analysis of Layer-wise Training in Direct Speech to Speech Translation using Bi-LSTM, Lalaram Arya, Ayush Agarwal, Jagabandhu Mishra and S. R. Mahadeva Prasanna
#8967. The Influence of Working Memory on Intonation Production of Chinese EFL Learners, Ronghao Gu and Xiaoli Ji
|
DAY 2: November 25, 2022
|
8:30-9:45
|
Session 4:
Multimodal Databases
|
Dr. Siqi Cai
|
#2975. UIT-VLFC: Vietnamese Lipstick Feedbacks Corpus, Binh Van Duong, An Trong Nguyen, Chien Nhu Ha, Hong-Hanh Thi Duong, My-Linh Thi Tran and Trong-Hop Do
#4801. ESAA: an EEG-Speech Auditory Attention Detection Database, Peiwen Li, Enze Su, Jia Li, Siqi Cai, Longhan Xie and Haizhou Li
#9020. A Speech Corpus for Chronic Kidney Disease, Jihyun Mun, Sunhee Kim, Myeong Ju Kim, Jiwon Ryu, Sejoong Kim and Minhwa Chung
#9232. Building a Speech Corpus of Children with Cochlear Implants via an Enhanced Metadata Structure, Seonwoo Lee, Sunhee Kim and Minhwa Chung
#9830. Taiwanese Across Taiwan Corpus and Its Applications, Yuan-Fu Liao, Hui-Lu Khoo, Un-Gian Iunn, Tsun-Guan Thiann, Jane S. Tsay, Le-Kun Tan, Huang-Lan Su, Hak-Khiam Tiun, Peter Kang, Li-Chen Chang, Su-Lian Liao, Hong-Hūi Tân, Siok-Hong Liau and Chhun-Sui Na
|
9:45-10:00
|
|
|
Break
|
10:00-11:00
|
Keynote 2
|
Prof. Luong Chi Mai
|
Language Technology for All: From the technology and indigenous community perspectives
Professor Sakriani Sakti
Japan Advanced Institute of Science and Technology
|
11:00-11:15
|
|
|
Break
|
11:15-12:15
|
|
|
O-COCOSDA Steering Committee Meeting (Committee members only)
|
12:15-14:00
|
|
|
Lunch Break
|
14:00-15:30
|
Session 5:
Language Learning
|
Prof. Aijun Li
|
#622. The Effect of Acoustic Features on Chinese EFL Learners' Perception of English Accentual Prominence, Weizhong Zhang, Jian Gong, Xiaoli Ji, Yuhong Sun and Kai Sheng
#1348. Designing a Korean French-Learners' Speech Corpus (KFLSC) for Spoken Language Assessment, Soeun Park, Jihye Chun, Mihyun Kim, Hyunjoo Lee, Seongheon Lee and Sunhee Kim
#1966. Syntactic Complexity in Narrative Speech Produced by Prelingually Deaf Mandarin-Speaking Children with Cochlear Implants, Jue Yu, Han Sun and Zhaoqi Dong
#2256. voisTUTOR 2.0: A Speech Corpus with Phonetic Transcription for Pronunciation Evaluation of Indian L2 English Learners, Priyanshi Pal, Chiranjeevi Yarra and Prasanta Ghosh
#4834. Spanish Stops and Their Allophones Produced by Proficient Mandarin Learners of Spanish, Xiaotong Xi and Peng Li
#5279. NASAM 2.0: Cleft-Palate Speech Assessment Application, Nattapong Kurpukdee, Kwanchiva Thangthai, Vataya Chunwijitra, Patcharika Chootrakool and Sawit Kasuriya
|
15:30-15:45
|
|
|
Break
|
15:45-17:15
|
Country/Region Reports and Discussion
|
Prof. Satoshi Nakamura
|
China, Aijun Li and Dong Wang
Hong Kong, Tan Lee
India, S. S. Agrawal
Indonesia, Hammam Riza
Japan, Satoshi Nakamura
Korea, Yong-Ju Lee
Malaysia, Zuraidah Mohd Don
Myanmar, Win Pa Pa
Philippines, Nathaniel Oco
Singapore, Haizhou Li
Taiwan, Hsin-Min Wang, Yuan-Fu Liao
Thailand, Kwanchiva Thangthai
Vietnam, Luong Chi Mai
|
DAY 3: November 26, 2022
|
8:30-9:30
|
Session 6:
Dialects and Accents
|
Prof. Sin-Horng Chen
|
#490. Effects of the Syntactic Structure on the Productivity of Tone Sandhi Rules: In the Case of Xiamen Dialect, Yiying Hu and Hui Feng
#1234. Improving Vietnamese Accent Recognition using ASR Transfer Learning, Bao Thang Ta, Xuan Vuong Dang, Quang Tien Duong, Nhat Minh Le and Van Hai Do
#4539. Transliteration of Foreign Words in Burmese: Descriptions by a Mortise-and-Tenon Notation, Chenchen Ding, Win Pa Pa, Masao Utiyama and Eiichiro Sumita
#9464. Korean Dialect Identification Based on an Ensemble of Prosodic and Segmental Feature Learning for Forensic Speaker Profiling, Jooyoung Lee, Kyungwha Kim and Minhwa Chung
|
9:30-9:35
|
|
|
Break
|
9:35-11:20
|
Speaker Verification Challenge
|
Dr. Nguyen Thi Thu Trang
|
COCOSDA & VLSP 2022 Challenges: Multilingual Speaker Verification, Nguyen Thi Thu Trang
COCOSDA & VLSP 2022 Challenge: Multilingual Speaker Verification for Asian languages, Pham Viet Thanh, Nguyen Thi Thu Trang
The Smartcall - ITS Systems for VLSP2022 Speaker Verification Tasks, Mai Van Tuan
Underfitt Systems for Cross-lingual Speaker Verification at VLSP 2022, Nguyen Xuan Thai Hoa
COCOSDA 2022 Challenge: Indic-Multilingual Speaker Verification, Jagabandhu Mishra
Popcorn System For Indian Multilingual Speaker Verification VLSP Challenge 2022, Nguyen Van Tue
Exploring Invariant Embedding with TDNN Features for MSV Challenge, Mohit Bansal
|
11:20-11:25
|
|
|
Break
|
11:25-11:55
|
|
|
Closing Ceremony
|