Published February 1, 2019 | Version v1
Journal article Open

Focused crawling from the basic approach to context aware notification architecture

  • 1. Vellore Institute of Technology, Chennai Campus, India

Description

The large and wide range of information has become a tough time for crawlers and search engines to extract related information. This paper discusses about focused crawlers also called as topic specific crawler and variations of focused crawlers leading to distributed architecture, i.e., context aware notification architecture. To get the relevant pages from a huge amount of information available in the internet we use the focused crawler. This can bring out the relevant pages for the given topic with less number of searches in a short time. Here the input to the focused crawler is a topic specified using exemplary documents, but not using the keywords. Focused crawlers avoid the searching of all the web documents instead it searches over the links that are relevant to the crawler boundary. The Focused crawling mechanism helps us to save CPU time to large extent to keep the crawl up-to-date.

Files

08 14180.pdf

Files (208.7 kB)

Name Size Download all
md5:7e7b644f73f0ba19d466b453139741fc
208.7 kB Preview Download