An Innovative Approach of Bangla Text Summarization by Introducing Pronoun Replacement and Improved Sentence Ranking


Md. Majharul Haque, Suraiya Pervin, Zerina Begum, Journal of Information Processing Systems
Vol. 13, No. 4, pp. 752-777, Aug. 2017
10.3745/JIPS.04.0038
Keywords: Bangla News Document, Cosine Similarity, Dangling Pronoun, Pronoun Replacement, Sentence Frequency

Abstract

This paper proposes an automatic method for summarizing Bangla news documents. In the proposed approach, pronoun replacement is introduced for the first time to minimize dangling pronouns in the summary. After pronouns are replaced, sentences are ranked using term frequency, sentence frequency, numerical figures, and title words. If two sentences have at least 60% cosine similarity, the frequency of the larger sentence is increased and the smaller sentence is removed to eliminate redundancy. Moreover, the first sentence is always included in the summary if it contains any title word. In Bangla text, numerical figures can be written both in words and in digits, in a variety of forms. All of these forms are identified to assess the importance of sentences. The approach uses a rule-based system together with a hidden Markov model and a Markov chain model. To derive the rules, we analyzed 3,000 Bangla news documents and studied several Bangla grammar books. A series of experiments was performed on 200 Bangla news documents and 600 summaries (three summaries for each document). The evaluation results demonstrate the effectiveness of the proposed technique over the four latest methods.
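
The redundancy step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the tokenizer, the term weighting, how sentence size is measured, or by how much the frequency is increased. The sketch assumes whitespace tokenization, raw term counts, size measured in tokens, and a starting frequency of 1 that absorbs the frequency of each removed sentence. The names `cosine_similarity` and `merge_redundant` are hypothetical.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity between two token lists, using raw term counts."""
    va, vb = Counter(a), Counter(b)
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty sentence is treated as similar to nothing
    return dot / (norm_a * norm_b)


def merge_redundant(sentences, threshold=0.6):
    """Remove the smaller sentence of each similar pair and credit the larger.

    Returns (sentence, frequency) pairs in original order.
    Assumptions: whitespace tokens, size = token count, frequency starts at 1.
    """
    tokens = [s.split() for s in sentences]
    freq = [1] * len(sentences)
    removed = set()
    for i in range(len(sentences)):
        if i in removed:
            continue
        for j in range(i + 1, len(sentences)):
            if j in removed:
                continue
            if cosine_similarity(tokens[i], tokens[j]) >= threshold:
                # Keep the sentence with more tokens; ties keep the earlier one.
                if len(tokens[i]) >= len(tokens[j]):
                    keep, drop = i, j
                else:
                    keep, drop = j, i
                freq[keep] += freq[drop]
                removed.add(drop)
                if drop == i:
                    break  # sentence i is gone, stop comparing it
    return [(sentences[k], freq[k])
            for k in range(len(sentences)) if k not in removed]
```

The resulting frequency values could then feed into the sentence-ranking score alongside term frequency, numerical figures, and title words; how the paper combines these signals is not stated in the abstract.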



Cite this article
[APA Style]
Md. Majharul Haque, Suraiya Pervin, & Zerina Begum (2017). An Innovative Approach of Bangla Text Summarization by Introducing Pronoun Replacement and Improved Sentence Ranking. Journal of Information Processing Systems, 13(4), 752-777. DOI: 10.3745/JIPS.04.0038.

[IEEE Style]
M. M. Haque, S. Pervin and Z. Begum, "An Innovative Approach of Bangla Text Summarization by Introducing Pronoun Replacement and Improved Sentence Ranking," Journal of Information Processing Systems, vol. 13, no. 4, pp. 752-777, 2017. DOI: 10.3745/JIPS.04.0038.

[ACM Style]
Md. Majharul Haque, Suraiya Pervin, and Zerina Begum. 2017. An Innovative Approach of Bangla Text Summarization by Introducing Pronoun Replacement and Improved Sentence Ranking. Journal of Information Processing Systems, 13, 4, (2017), 752-777. DOI: 10.3745/JIPS.04.0038.