0% found this document useful (0 votes)
35 views

Topic Summarization: Features

The document proposes a system to detect summarized points from documents by extracting keywords, using clustering algorithms, and considering term co-occurrence. The system divides content into summarized content and points, extracts frequent keywords, clusters them to discover topics, and detects summaries. It is economically and operationally feasible using common hardware and software requirements.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
35 views

Topic Summarization: Features

The document proposes a system to detect summarized points from documents by extracting keywords, using clustering algorithms, and considering term co-occurrence. The system divides content into summarized content and points, extracts frequent keywords, clusters them to discover topics, and detects summaries. It is economically and operationally feasible using common hardware and software requirements.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 4

Topic Summarization

To find prominent summarized points in a collection of documents. We


here propose a system to detect summarized points from a huge or multiple
paragraph. We use an efficient method to discover summarized points from the
provided content. The provided content is divided into two parts as Summarized
Content and Summarized Poin. One would expect particular words to appear in
the content more or less frequently: "dog" and "bone" will appear more often in
documents about dogs, "cat" and "meow" will appear in documents about cats,
and "the" and "is" will appear equally in both. A document typically concerns
multiple topics in different proportions; thus, in a document that is 10% about
cats and 90% about dogs, there would probably be about 9 times more dog words
than cat words. Our proposed system captures this intuition in a mathematical
framework and will examine the content of particular set of documents. Here the
system will extract keywords and will use clustering algorithm in order to discover
topic from particular set of documents. System will extract keywords which occur
often and will cluster this keywords using clustering algorithm and will detect
summarized point from a collection of documents. This system takes co-
occurrence of terms into account which gives best result.

Features:
 The system will provide summarized content and summarized point from
the provided content
 The system will extract keywords and will use clustering algorithm in order
to discover topic summarization for particular set of documents/content.
 This system takes co occurrence of terms into account which gives best
result.
 System will extract keywords which occur often and will cluster this
keywords using clustering algorithm and will detect topic summarization
from a collection of documents.
Feasibility Study

This system will extract keywords which occur often from collection of
documents and will cluster the words using clustering algorithm and system will
detect topic from a collection of documents.

 Economic Feasibility
This system will help the web users to easily search information for
particular topic. This system will be useful for web crawlers. This
system will provide economic benefits for many websites. It includes
quantification and identification of all the benefits expected.

 Operational Feasibility
This system is more reliable, maintainable, affordable and producible.
These are the parameters which are considered during design and
development of this project. During design and development phase
of this project there was appropriate and timely application of
engineering and management efforts to meet the previously
mentioned parameters.

 Technical Feasibility
The back end of this project is SQL server which stores parameters
related to this project. There are basic requirement of hardware to
run this application. This system is developed in .Net Framework
using C#. This application will be online so this application can be
accessed by using any device like (Personal Computers, Laptop and
with some hand held devices).
Software Requirements:

 Windows 7 or higher
 SQL Server 2008
 Visual studio 2010

Hardware Components:

 Processor – i3 Processor
 Hard Disk – 50 GB
 Memory – 1GB RAM
 Internet Connection

Advantages

 User can specify how much % the content should be summarized.


 The algorithm provides quick result with the summarized data.
 Selects the best suitable points for summarization.

Disadvantages:

 This system extracts words rather than phrases.


 The provided content must be more than 100-150 characters.

Application:

This application can be used by many web users.


IEEE Reference:

 http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6921769
 http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6327415
 http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=5234971

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy