Pravin Jee,

"""An abstract or a condensed presentation of the substance of a body of material."""
--- from http://en.wiktionary.org/wiki/summary

"""The goal of summarization is to identify the most important information in a document, and term-weighting is a useful tool for identifying important or significant words in a document. One of the simplest summarization approaches, then, would be to extract the sentences with the highest tf.idf scores (e.g., by summing or averaging over each sentence)."""
--- Methods for Mining and Summarizing Text Conversations, 2011, Giuseppe Carenini, Gabriel Murray, Raymond Ng, Morgan & Claypool Publishers.

Compare the two quotes above. Are the sentences you retrieved with the TF-IDF measure condensed, or an abstract? No. They are only relevant, or topic-related, sentences. For example, if you change the word order or the sentence order, it makes no difference to what TF-IDF retrieves. In summarization, order does matter: the reader needs the output to be comprehensible, cohesive, and coherent, so measuring or identifying those properties is a necessary requirement.

So I do not agree that the system you developed is a text summarizer. We can call it a relevant-sentence identifier based on topic or document.

Santa

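P.S. To make the point about order concrete, here is a minimal sketch in Python. It is only an illustration under my own assumptions, not your system and not code from the book: each sentence is treated as its own "document" for the idf count, tf is the relative frequency of a term within the sentence, and a sentence's score is the sum of its terms' tf.idf weights. The names tokenize, sentence_scores, and top_sentences are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(sentence):
    """Lowercase a sentence and split it into word tokens."""
    return re.findall(r"[a-z0-9]+", sentence.lower())


def sentence_scores(sentences):
    """Return one tf.idf score per sentence, in input order.

    Assumption: every sentence counts as one 'document' for idf.
    """
    token_lists = [tokenize(s) for s in sentences]
    n = len(token_lists)
    # Document frequency: in how many sentences each term occurs.
    df = Counter(term for tokens in token_lists for term in set(tokens))
    scores = []
    for tokens in token_lists:
        if not tokens:
            scores.append(0.0)
            continue
        counts = Counter(tokens)
        # Sum in sorted term order so the result cannot depend on word order.
        total = math.fsum(
            count * math.log(n / df[term])
            for term, count in sorted(counts.items())
        )
        scores.append(total / len(tokens))
    return scores


def top_sentences(sentences, k=1):
    """Pick the k highest-scoring sentences (ties broken by sentence text)."""
    ranked = sorted(zip(sentence_scores(sentences), sentences), reverse=True)
    return [sentence for _, sentence in ranked[:k]]


if __name__ == "__main__":
    original = [
        "The cat sat on the mat.",
        "The dog chased the cat.",
        "Term weighting finds significant words.",
    ]
    # Same words in each sentence, but in a scrambled order.
    scrambled = [
        "mat the on sat cat the",
        "cat the chased dog the",
        "words significant finds weighting term",
    ]
    print(sentence_scores(original) == sentence_scores(scrambled))
    print(sentence_scores(original[::-1]) == sentence_scores(original)[::-1])
    print(top_sentences(original, k=1))
```

Under these assumptions the scores depend only on which words a sentence contains and how often, so scrambling the words or reordering the sentences leaves every score unchanged. A measure with that property cannot, by itself, tell a coherent text from an incoherent one.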