• Chỉ mục bởi
  • Năm xuất bản
LIÊN KẾT WEBSITE

VNDS: A vietnamese dataset for summarization

Nguyen V.-H. Hung Yen University of Technology and Education, Hung Yen, Viet Nam|
Hoai N.X. | Nguyen M.-T. Ho Chinh Minh University of Technolohgy (HUTECH), Ho Chi Minh, Viet Nam| Nguyen T.-C. Ai Academy Vietnam, 489 Hoang Quoc Viet, Hanoi, Viet Nam|

Proceedings - 2019 6th NAFOSTED Conference on Information and Computer Science, NICS 2019 Số , năm 2019 (Tập , trang 375-380)

ISSN: 158383

ISSN: 158383

DOI: 10.1109/NICS48868.2019.9023886

Tài liệu thuộc danh mục: Scopus

Proc. - NAFOSTED Conf. Inf. Comput. Sci., NICS

English

Từ khóa: Extraction; Text processing; Abstraction; Benchmark datasets; Dataset; Document summarization; Extractive and abstractive summarizations; State of the art; Text summarization; Vietnamese; Large dataset
Tóm tắt tiếng anh
We have seen a lot of interesting developments and research in text summarization. While numerous approaches for summarization have been widely studied and applied in various domains in English, it is still an early stage in Vietnamese due to a few number of papers, systems, and the lack of benchmark datasets. Inspired to contribute to make a progress in Vietnamese language research, firstly in this paper we create a standard dataset for document summarization. To the best our knowledge, we are the first to formally publish the large benchmark dataset of summarization. Secondly, we make a comparison of traditional and state-of-the-art extractive and abstractive summarization on our dataset. We strongly believe that the results of our work will facilitate studies of text summarization in Vietnamese for the future. � 2019 IEEE.

Xem chi tiết