Published July 30, 2020 | Version v1
Journal article Open

LSA Based Text Summarization

  • 1. Department of Computer Science & Applications, Dr. Harisingh Gour Central University, Sagar, MP, India.
  • 2. Department of Computer Science & Applications, Dr. Harisingh Gour Central University, Sagar, MP, India.
  • 1. Publisher

Description

In this study we propose an automatic single-document text summarization technique that combines Latent Semantic Analysis (LSA) with a diversity constraint. The technique uses query-based sentence ranking. Because we do not consider an Information Retrieval (IR) setting, we generate the query ourselves using TF-IDF (Term Frequency-Inverse Document Frequency): to build the query vector, we identify the terms with high IDF. LSA uses vectorial semantics to analyze the relationships between documents in a corpus, or between sentences within a document, and the key terms they contain, by producing a set of concepts related to the documents and terms. In this way LSA represents the latent structure of documents. Latent Semantic Indexing (LSI) is used to select sentences from the document by ranking them according to their scores. Traditionally, the highest-scoring sentences are chosen for the summary; here we also calculate the diversity between the chosen sentences before producing the final summary, since a good summary should have a maximum level of diversity. The proposed technique is evaluated on the OpinosisDataset1.0 dataset.
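The pipeline described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`summarize`, `select_diverse`, `cosine`), the regex-based sentence splitter and tokenizer, the default parameter values, and the particular diversity rule (greedily skipping a sentence whose cosine similarity to an already chosen sentence reaches `max_sim`) are all assumptions made for illustration. Only the overall steps come from the description: a query built from high-IDF terms, projection into an LSA concept space, score-based ranking, and a diversity check on the chosen sentences.

```python
import re
import numpy as np


def cosine(u, v):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    denom = np.linalg.norm(u) * np.linalg.norm(v)
    return float(u @ v / denom) if denom > 0 else 0.0


def select_diverse(scores, vectors, n_select, max_sim):
    """Greedy pick: best score first, skipping near-duplicates of earlier picks.

    The threshold rule is an assumed stand-in for the diversity constraint.
    """
    vectors = np.asarray(vectors, dtype=float)
    chosen = []
    for i in np.argsort(-np.asarray(scores, dtype=float), kind="stable"):
        if all(cosine(vectors[i], vectors[j]) < max_sim for j in chosen):
            chosen.append(int(i))
        if len(chosen) == n_select:
            break
    return sorted(chosen)  # restore document order


def summarize(text, n_select=2, n_concepts=2, n_query_terms=5, max_sim=0.5):
    # Naive sentence splitting and tokenization (assumption, for brevity).
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    tokens = [re.findall(r"[a-z]+", s.lower()) for s in sentences]
    vocab = sorted({t for toks in tokens for t in toks})
    if not vocab:
        return []
    row = {t: i for i, t in enumerate(vocab)}

    # Term-by-sentence count matrix; each sentence is treated as a "document".
    A = np.zeros((len(vocab), len(sentences)))
    for j, toks in enumerate(tokens):
        for t in toks:
            A[row[t], j] += 1.0

    # TF-IDF weighting.
    idf = np.log(len(sentences) / np.count_nonzero(A, axis=1))
    A = A * idf[:, None]

    # Query vector from the highest-IDF terms (ties broken alphabetically).
    top = np.argsort(-idf, kind="stable")[:n_query_terms]
    query = np.zeros(len(vocab))
    query[top] = idf[top]

    # LSA: project sentences and query onto the top-k left singular vectors.
    U, _, _ = np.linalg.svd(A, full_matrices=False)
    U_k = U[:, :n_concepts]
    sent_vecs = (U_k.T @ A).T
    query_vec = U_k.T @ query

    # Rank by similarity to the query, then apply the diversity filter.
    scores = [cosine(query_vec, v) for v in sent_vecs]
    picked = select_diverse(scores, sent_vecs, n_select, max_sim)
    return [sentences[i] for i in picked]
```

Calling `summarize(text, n_select=2)` on a short review returns at most two of its sentences in their original order. Lowering `max_sim` makes the filter stricter, trading query relevance for diversity.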

Files

B3288079220.pdf (530.8 kB)
md5:c6726e1679e9d1cd3930a684c429c1e5

Additional details

Related works

Is cited by
Journal article: 2277-3878 (ISSN)

Subjects

ISSN
2277-3878
Retrieval Number
B3288079220/2020©BEIESP