Skip to content

eco4ndly/BasicWebCrawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 

Repository files navigation

# Web Crawler in JAVA 

This is a simple a `web crawler`.

The current implementation is `SimpleUrlCrawler` what it does is

 It start crawling the given root url and discovers all the interconnected url in the page and nested pages.
 
 What it does basically it enters the given root discovers all the neighbouring http|https| urls and starts
 crawling the neighbours so on and so forth.
 
 Usage:
 
`WebCrawler webCrawler = new SimpleUrlCrawler();
 webCrawler.startCrawling("the root url you want to start crawling");`

About

Simple web crawler using java

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages