
Association for Vietnamese Language and Speech Processing

A chapter of VAIP - Vietnam Association for Information Processing

VLSP 2016 Datasets

Two datasets were distributed for the evaluation campaign at the VLSP 2016 workshop:

  • Named Entity Recognition: 16,858 tagged sentences containing 14,918 named entities (training set).
  • Sentiment Analysis: 6,450 sentences (training and test sets).

To get access to the VLSP 2016 datasets, please fill out the form below, sign it, and send it back to us at the following email address: vlsp dot resources at gmail dot com.

Publications:

  1. Huyen T M Nguyen, Quyen T Ngo, Luong X Vu, Vu M Tran, Hien T T Nguyen, VLSP Shared Task: Named Entity Recognition, Journal of Computer Science and Cybernetics, Vol 34, No 4, pp. 295-310, 2018.
  2. Huyen T M Nguyen, Hung V Nguyen, Quyen T Ngo, Luong X Vu, Vu Mai Tran, Bach X Ngo, Cuong A Le, VLSP Shared Task: Sentiment Analysis, Journal of Computer Science and Cybernetics, Vol 34, No 4, pp. 283-294, 2018. 
