VLSP 2016 Datasets

Two datasets have been distributed for the evaluation campaign at VLSP 2016 workshop:

  • Named Entity Recognition: 16,858 tagged sentences containing 14,918 named entities (training set)
  • Sentiment Analysis: 6450 sentences (training set and test set).  

In order to get access to VLSP 2016 datasets, please fill out the form below, sign it and send it back to us via the following email address: vlsp dot resources at gmail dot com. 

Publications:

  1. Huyen T M Nguyen, Quyen T Ngo, Luong X Vu, Vu M Tran, Hien T T Nguyen, VLSP Shared Task: Named Entity Recognition, Journal of Computer Science and Cybernetics, Vol 34, No 4, pp. 295-310, 2018.
  2. Huyen T M Nguyen, Hung V Nguyen, Quyen T Ngo, Luong X Vu, Vu Mai Tran, Bach X Ngo, Cuong A Le, VLSP Shared Task: Sentiment Analysis, Journal of Computer Science and Cybernetics, Vol 34, No 4, pp. 283-294, 2018.