Web Content Mining Research Papers

Bookmark
Download
- by Razvan Dobrea
- •
- 7
  Web Mining, Knowledge Discovery, Web Content Mining, Software Agent

In this paper, we present an overview of research issues in web mining. We discuss mining with respect to web data referred here as web data mining. In particular, our focus is on web data mining research in context of our web warehousing... more

Bookmark
Download
- by Sanjay Madria
- •
- 9
  Computer Science, Data Mining, Web Mining, Cloud Computing

Recently, the web is becoming an important part of people’s life. The web is a very good place to run successful businesses. Selling products or services online plays an important role in the success of businesses that have a physical... more

Bookmark
- by Istrate Mihai
- •
- 7
  Economics, Data Mining, Web Mining, Web Usage Mining

Recently, the web is becoming an important part of people’s life. The web is a very good place to run successful businesses. Selling products or services online plays an important role in the success of businesses that have a physical... more

Bookmark
- by Istrate Mihai
- •
- 7
  Economics, Data Mining, Web Mining, Web Usage Mining

The International Journal of Database Management Systems (IJDMS) is a bi monthly open access peer-reviewed journal that publishes articles which contribute new results in all areas of the database management systems & its applications.... more

In the current development, millions of clients are accessing daily the internet and World Wide Web (WWW) to search the information and achieve their necessities. Web mining is a technique to automatic discovers and Extract information... more

In this paper we presents study about how to extract the useful information on the web and also give the superficial knowledge and comparison about data mining. This paper describes the current, past and future of web mining. Here we... more

Bookmark
Download
- by Dhara Dave
- •
- 8
  Information Retrieval, Data Mining, Web Mining, Databases

The World Wide Web, or simply the web, is the most dynamic environment.The web has grown steadly in recent years and his content is changing every day. Today, they are several billions of HTML documents, pictures and another multimedia... more

Bookmark
Download
- by Dumitru Ciobanu and +1
  Claudia Dinuca
- •
- Web Content Mining

Bookmark
Download
- by István T. Nagy
- •
- 2
  Social Web, Web Content Mining

With the advent of the World Wide Web and the emergence of e-commerce applications and social networks, organizations across the Web generate a large amount of data day-by-day. The abundant unstructured or semi-structured information on... more

As the use of web is increasing more day by day, the web users get easily lost in the web’s rich hyper structure. The main aim of the owner of the website is to give the relevant information according their needs to the users. We... more

In recent years, the emergence of WWW (World Wide Web) led to the accumulation of huge amount of information and data. Hence the web is found to consist of unstructured and structured information that impacts the day to day life of the... more

Bookmark
Download
- by Manjunath Pujar
- •
- 3
  Machine Learning, Web Content Mining, Web Data Mining

—Dimensionality reduction of feature vector size plays a vital role in enhancing the text processing capabilities; it aims in reducing the size of the feature vector used in the mining tasks (classification, clustering... etc.). This... more

Bookmark
Download
- by Mohamed K . Elhadad
- •
- 8
  Ontology, Web Mining, Semantic Web, Dimensionality Reduction

Application of data mining techniques to the World Wide Web, referred to as Web mining, has been the focus of several recent research projects and papers. However, there is no established vocabulary, leading to confusion when comparing... more

Bookmark
Download
- by Venkata Ramana
- •
- 17
  Computer Science, Information Retrieval, Data Mining, Human Factors

E-commerce site grows rapidly since it allows someone to shop online quickly and easily without having to meet seller directly. This saves time, effort, and cost in transaction although it doesn't always provide what the customers need.... more

Typically, search engines are low precision in response to a query, retrieving lots of useless web pages, and missing some other important ones. In this paper, we study the problem of the hierarchical clustering of web pages search... more

Bookmark
Download
- by Célia Nunes
- •
- 11
  Information Retrieval, Data Mining, Graph Theory, Web Intelligence

As the use of Web is increasing more day by day, the web users get easily lost in the web’s rich hyper structure. The main aim of the owner of the website is to provide the relevant information to the users to fulfill their needs. Web... more

Bookmark
Download
- by IJSRP Journal
- •
- 2
  Web Mining, Web Content Mining

E-commerce site grows rapidly since it allows someone to shop online quickly and easily without having to meet seller directly. This saves time, effort, and cost in transaction although it doesn’t always provide what the customers need.... more

Web content extraction is a key technology for enabling an array of applications aimed at understanding the web. While automated web extraction has been studied extensively, they often focus on extracting structured data that appear... more

Bookmark
Download
- by Ziyan Zhou
- •
- Web Content Mining

In this paper, we propose an approach to automatically mine event evolution graphs from newswires on the Web. Event evolution graph is a directed graph in which the vertices and edges denote news events and the evolutions between events... more

Bookmark
Download
- by Xiaodong Shi
- •
- 3
  Knowledge Management, Web Content Mining, Directed Graph

Nowadays the World Wide Web (commonly called as Web) is used widely and it has impacted on almost every facet of our lives. To search and retrieve the information from the web requires an effective and efficient technique as it has become... more

Bookmark
Download
- by Mohd Shoaib and +1
  Ashish Kumar Maurya
- •
- 21
  Computer Science, Computer Engineering, Data Mining, Web Mining

World Wide Web is a repository of massive amount of data related to various fields. It is difficult to obtain the necessary and relevant information from this vast collection. Many researchers have proposed different methods for fetching... more

Bookmark
Download
- by Editor IJRET
- •
- 6
  Machine Learning, Data Mining, Web Mining, Web Usage Mining

With the ever-growing variety of information, the retrieval demands of different users are so multifarious that the traditional search engine cannot afford such heterogeneous retrieval results of huge magnitudes. Harnessing the... more

ABSTRACT: World Wide Web is enormous compilation of multi-variant data. For better knowledge management it is important to retrieve accurate and complete data. The hidden Web, also known as the invisible Web or deep Web, has given rise to... more

Bookmark
- by Muhammad Hassan Naeem
- •
- 10
  Engineering, Knowledge Management, Web Mining, Hidden Web

This paper introduces the concept of product identity-clustering based on new similarity metrics and new performance metrics for web-crawled products. Product identity-clustering is defined here as the clustering of identical products,... more

Bookmark
Download
- by Furkan Gözükara
- •
- 7
  Web Mining, Performance metrics, Web Content Mining, Web Data Mining

Due to the huge amount of information available on the web, the World Wide Web has becoming one of the most important resources for extracting the information and knowledge discoveries. Many Organizations rely on these websites to attract... more

One of the popular trends in computer science has been development of intelligent web-based systems. Demand for such systems forces designers to make use of knowledge discovery techniques on web server logs. Web usage mining has become a... more

Bookmark
Download
- by Murat Ali Bayir
- •
- 19
  Information Retrieval, Data Mining, Web Mining, Pattern Mining

Most web content classification methods are based on the vectorspace model of information retrieval. One of the important advantages of this representation model is that it can be used by both instance-based and model-based classifiers... more

Bookmark
Download
- by Mark Last
- •
- 13
  Information Retrieval, Graph Theory, Web Content Mining, Decision Tree

In today’s world of internet, with whole lot of e-documents such, as html pages, digital libraries etc. occupying considerable amount of cyber space, organizing these documents has become a practical need. Clustering is an important... more

Abstract Society is increasingly dependent on digital information. Much of this is available online free of charge but metadata is at a premium. This has encouraged the emergence of a new online phenomenon known as social (or... more

Bookmark
- by Andrew Kehoe
- •
- 13
  Information Science, Web Technologies, Folksonomies, E Government

Application of data mining techniques to the World Wide Web, referred to as Web mining, has been the focus of several recent research projects and papers. However, there is no established vocabulary, leading to confusion when comparing... more

Bookmark
Download
- by luca virli
- •
- 17
  Computer Science, Information Retrieval, Data Mining, Human Factors

نشریات علمی به عنوان یکی از منابع دسترسی به اطلاعات علمی نقش بزرگی در پیشبرد و گسترش پژوهش های دانشگاهی ایفا کرده اند اما باید به روش‏ های بهبود این دسترسی به محتوای نشریات نیز توجه کرد تا هم کاربران آنها با سهولت بیشتری از این محتوا... more

ABSTRACT

Bookmark
- by Mohammad Shoaib
- •
- 18
  Computer Science, Computer Engineering, Data Mining, Web Mining

Bookmark
- by Ashok SHARMA
- •
- 19
  Computer Science, Information Retrieval, Data Mining, Web Mining

One of the popular trends in computer science has been development of intelligent web-based systems. Demand for such systems forces designers to make use of knowledge discovery techniques on web server logs. Web usage mining has become a... more

Bookmark
Download
- by Ismail Toroslu
- •
- 20
  Computer Science, Information Retrieval, Data Mining, Web Mining

We are interested in replacing human processing of web resources by automated processing. Based on an experimental system we identify uncertainty issues which make this process difficult for automated processing. We show these uncertainty... more

Bookmark
Download
- by Peter Vojtas
- •
- 3
  Web Content Mining, ISWC, User preferences

In recent years, the emergence of WWW (World Wide Web) led to the accumulation of huge amount of information and data. Hence the web is found to consist of unstructured and structured information that impacts the day to day life of the... more

Bookmark
Download
- by Manjunath S Pujar
- •
- 3
  Machine Learning, Web Content Mining, Web Data Mining

In recent years, the emergence of WWW (World Wide Web) led to the accumulation of huge amount of information and data. Hence the web is found to consist of unstructured and structured information that impacts the day to day life of the... more

Bookmark
Download
- by Manjunath S Pujar
- •
- 3
  Machine Learning, Web Content Mining, Web Data Mining

Purpose -To explore the use of LexiURL as a Web intelligence tool for collecting and analysing links to digital libraries, focusing specifically on the National electronic Library for Health (NeLH). Design/methodology/approach -The Web... more

Purpose -To explore the use of LexiURL as a Web intelligence tool for collecting and analysing links to digital libraries, focusing specifically on the National electronic Library for Health (NeLH). Design/methodology/approach -The Web intelligence techniques in this study; a combination of link analysis (web structure mining), web server log file analysis (web usage mining), and text analysis (web content mining), utilize the power of commercial search engines and draw upon the information science fields of bibliometrics and webometrics. LexiURL is a computer program designed to calculate summary statistics for lists of links or URLs. Its output is a series of standard reports, for example listing and counting all of the different domain names in the data. Findings -Link data, when analysed together with user transaction log files (i.e., Web referring domains) can provide insights into who is using a digital library and when, and who could be using the digital library if they are "surfing" a particular part of the Web; in this case any site that is linked to or colinked with the NeLH. This study found that the NeLH was embedded in a multifaceted Web context, including many governmental, educational, commercial and organisational sites, with the most interesting being sites from the .edu domain, representing American Universities. Not many links directed to the NeLH were followed on September 25, 2005 (the date of the log file analysis and link extraction analysis), which means that users who access the digital library have been arriving at the site via only a few select links, bookmarks and search engine searches, or non-electronic sources. Research limitations/implications -LexiURL uses the Yahoo! API for its link extraction, but the use of commercial search engine data has several limitations. First, no search engine covers the entire web and so all are likely to return incomplete results. This problem is exacerbated by the typical limitation of 1,000 results per query and for Google and Yahoo! and their automatic search services report only a fraction of the results known by the parent search engine. Hence for a large digital library, LexiURL could be expected to find perhaps only 10% or less of the links to the site. A second limitation is that the method by which each search engine finds pages is unknown, as is the method for ranking results. Originality/value -A few studies focusing on digital library users have been carried out using log file analysis as a research tool. Log files focus on real-time user transactions; while LexiURL can be used to extract links and colinks associated with a digital library's "organic" Web network. This Web network is often not recognized enough, and can be a valuable indication of where potential users are surfing, even if they have not yet specifically visited the NeLH site.

With the massive rise in the volume of information available on the World Wide Web these days, and the emergence requirements for a superior technique to access this information, there has been a strong resurgence of interest in web... more

Bookmark
Download
- by Zakaria Zubi
- •
- 6
  Computer Science, Data Mining, Web Mining, Text Mining

Bookmark
- by Sanjay Madria
- •
- 9
  Computer Science, Data Mining, Web Mining, Cloud Computing

Syllabus Based Web Content Extractor (SBWCE) introduces a new technique of Syllabus Based Web Content Mining. It makes the Syllabus Based Web Content Extraction easy and creates an instant online book view based on the links relevant to... more

Bookmark
Download
- by Saba Hilal
- •
- 2
  Web Content Mining, Content Extraction

Web is a wide, various and dynamic environment in which different users publish their documents. Web-mining is one of data mining applications in which web patterns are explored. Studies on web mining can be categorized into three... more

Application of data mining techniques to the World Wide Web, referred to as Web mining, has been the focus of several recent research projects and papers. However, there is no established v o cabulary, leading to confusion when comparing... more

Bookmark
Download
- by JayaSudha Yuvaraj
- •
- 17
  Computer Science, Information Retrieval, Data Mining, Human Factors

Large websites pose the following challenges for comprehension of user behavior: users' behaviors are complex and diverse, the web log data is very noisy, and the quantity of the web log data is of a magnitude that defies direct... more

Today, the notion of Semantic Web has emerged as a prominent solution to the problem of organizing the immense information provided by World Wide Web, and its focus on supporting a better co-operation between humans and machines is... more

There are billions of Web pages on World Wide Web which can be accessed via internet. All of us rely on usage of internet for source of information. This source of information is available on web in various forms such as Websites,... more

Web Content Mining

Log In

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier! Saves Data!