Workshop objectives:
The goal of this workshop series is to attempt a synthesis of research in Vietnamese language and speech processing and to bring together Vietnamese linguists and computer scientists working in this domain. All contributions to methods for developing tools and linguistic resources for Natural Language Processing (NLP) and their applications are particularly welcome. In addition to the usual presentation of papers about the latest breakthroughs in Vietnamese NLP, the workshop will feature a session dedicated to the first evaluation campaign, bearing both on tools for Vietnamese segmentation and POS tagging, and on machine translation.
Workshop topics:
The workshop topics include (but are not limited to):
- Linguistic resources
- Language modeling
- Word segmentation and POS tagging
- Syntactic structures and grammars
- Semantic analysis
- Speech processing
- Machine translation
- Other VLSP applications
Vietnamese word segmentation and POS tagging Task
Word segmentation and POS tagging are basic and difficult tasks in NLP, especially for isolating languages like Vietnamese in which compound words belong to the core of the language and the parts-of-speech are not well defined in the linguistic literature. A national project on Vietnamese Language and Speech Processing finished successfully in 2009 has brought to the researchers fundamental NLP tools and resources, thus initiating the appropriate setting to go further in researching, developing and deploying useful software applications in the field.
This campaign aims at automatically evaluating Vietnamese word segmentation and POS tagging systems, in order to encourage scientists to use and evaluate resources and tools from the VLSP project and permit to promote the most efficient methods for these basic tasks for Vietnamese processing.
The details about this campaign are provided on the Evaluation Campaign page.
Translation Task
Machine translation is one of the traditional and difficult tasks in the field of NLP. Recently, due to the rapidly increasing amount of data and computing power in ubiquitous environments, the development of Vietnamese-related translation systems has attracted not only academic institutes but also R&D units from companies, both from inland Vietnam and overseas, even at the global scope.
This campaign aims to create an authentic environment to automatically and manually evaluate translation system so that it helps to boost the research on Machine Translation of the Vietnamese Language Technology community. We also welcome research groups improve the methods and bring their systems into the real-world scenario. Further domestic and international collaborations on the field are included in our goals.
The details about this campaign are provided on the Evaluation Campaign page.
Paper submission:
Authors are invited to submit papers of up to 8 pages, in English. All papers must be original contributions and not previously published. All papers will be submitted on-line, in PDF format using the IEEE standard template, to the workshop management website.
Important dates:
August 3, 2013: Registration to the campaign
August 5, 2013: Distribution of training data for translation task
August 16, 2013: Distribution of training data for word segmentation and POS tagging task
September 15, 2013: Registration to the campaign closed
October 1, 2013: Test data release
October 5, 2013: System result submission
October 11, 2013: Scientific and system paper submission deadline
October 27, 2013: Acceptance notification
October 28, 2013: Author registration to the workshop
November 3, 2013: Camera-ready paper submission
November 10, 2013: Workshop date
Workshop Co-chairs:
Le Anh Cuong, VNU University of Engineering and Technology, Hanoi, VN
Vu Hai Quan, VNU University of Science, HCMC, VN