AbotX Web Crawler 1.3.73 Ultimate
A powerful C# web crawler that makes advanced crawling features easy to use. AbotX builds upon the open source Abot C# Web Crawler by providing a powerful set of wrappers and extensions.
Crawl multiple sites concurrently
Pause/resume live crawls
Render JavaScript before processing
Avoid getting blocked by sites
Automatically tune speed/concurrency
Parallel Crawler Engine
A crawler instance can crawl a single site quickly. However, if you have to crawl 10,000 sites quickly you need the ParallelCrawlerEngine. It allows you to crawl a configurable number of sites concurrently to maximize throughput.
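The idea of crawling a configurable number of sites at once can be illustrated with a minimal sketch built only on standard .NET types. This is not the AbotX ParallelCrawlerEngine API: `ParallelSiteRunner`, `RunAsync`, `maxConcurrentSites`, and the `crawlSite` delegate are hypothetical names chosen for illustration, and the per-site crawl is left to the caller.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Threading;
using System.Threading.Tasks;

// Illustrative only: a bounded-concurrency runner, not part of AbotX.
public static class ParallelSiteRunner
{
    // Runs crawlSite once per site, with at most maxConcurrentSites in flight.
    // Returns the highest number of sites that were being crawled at the same time.
    public static async Task<int> RunAsync(
        IEnumerable<Uri> sites,
        int maxConcurrentSites,
        Func<Uri, Task> crawlSite)
    {
        using var gate = new SemaphoreSlim(maxConcurrentSites);
        var counterLock = new object();
        int inFlight = 0;
        int peak = 0;

        var tasks = sites.Select(async site =>
        {
            // Wait for a free slot before starting this site.
            await gate.WaitAsync();
            lock (counterLock)
            {
                inFlight++;
                if (inFlight > peak) peak = inFlight;
            }
            try
            {
                await crawlSite(site);
            }
            finally
            {
                lock (counterLock) { inFlight--; }
                gate.Release(); // Free the slot for the next site.
            }
        }).ToList();

        await Task.WhenAll(tasks);
        return peak;
    }
}
```

Raising `maxConcurrentSites` increases throughput only until the host machine or network becomes the bottleneck, which is why the limit is a parameter rather than a constant.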
Easy Override
Easy Override lets you plug in your own implementation of any key interface through an easy-to-use object wrapper that handles nested dependencies for you, no matter how deep.
Pause And Resume
There may be times when you need to temporarily pause a crawl, for example to clear disk space on the machine or to run a resource-intensive utility. Whatever the reason, you can confidently pause and resume the crawler, and it will continue as if nothing had happened.
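One common way to make a worker loop pausable is a gate that workers check before each request. The sketch below shows that general pattern with a standard `ManualResetEventSlim`; it is an assumption about how such a feature can be built, not a description of AbotX internals, and `PauseGate` and its members are hypothetical names.

```csharp
using System.Threading;

// Illustrative only: a pause/resume gate for crawl workers, not part of AbotX.
public sealed class PauseGate
{
    // Set (signaled) means "running"; reset means "paused".
    private readonly ManualResetEventSlim _running = new ManualResetEventSlim(true);

    public bool IsPaused => !_running.IsSet;

    public void Pause() => _running.Reset();

    public void Resume() => _running.Set();

    // Workers call this before each page request.
    // Returns immediately while running and blocks while paused.
    public void WaitIfPaused(CancellationToken token = default) => _running.Wait(token);
}
```

Because workers only block between requests, any queued URLs and in-memory state stay untouched during the pause, so the crawl picks up where it stopped once `Resume()` is called.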
JavaScript Rendering
Many web pages today rely on JavaScript to produce the final rendered page. Most web crawlers do not execute that JavaScript and instead process only the raw HTML returned by the server. Use this feature to render JavaScript before processing.
Auto Throttling
Most websites you crawl cannot or will not handle the load of a web crawler. Auto Throttling automatically slows the crawl if the website being crawled shows signs of stress or becomes unwilling to respond at the current frequency of HTTP requests.
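A rough sketch of the throttling idea: lengthen the delay between requests when responses look stressed, and shorten it again when they look healthy. The stress signals used here (HTTP 429, any 5xx status, or a slow response) and the doubling and 10% recovery factors are assumptions made for illustration, not AbotX's actual rules; `AdaptiveDelay` and `Record` are hypothetical names.

```csharp
using System;

// Illustrative only: an adaptive per-site request delay, not part of AbotX.
public sealed class AdaptiveDelay
{
    private readonly TimeSpan _min;
    private readonly TimeSpan _max;
    private readonly TimeSpan _slowResponse;

    // Delay to wait before sending the next request to the site.
    public TimeSpan Current { get; private set; }

    public AdaptiveDelay(TimeSpan min, TimeSpan max, TimeSpan slowResponse)
    {
        _min = min;
        _max = max;
        _slowResponse = slowResponse;
        Current = min;
    }

    // Feed in the outcome of each request; returns the updated delay.
    public TimeSpan Record(int statusCode, TimeSpan elapsed)
    {
        // Assumed stress signals: rate limiting, server errors, or slow responses.
        bool stressed = statusCode == 429 || statusCode >= 500 || elapsed > _slowResponse;

        double ms = stressed
            ? Math.Max(Current.TotalMilliseconds, 1) * 2  // back off quickly
            : Current.TotalMilliseconds * 0.9;            // recover gradually

        // Keep the delay within the configured bounds.
        ms = Math.Min(Math.Max(ms, _min.TotalMilliseconds), _max.TotalMilliseconds);
        Current = TimeSpan.FromMilliseconds(ms);
        return Current;
    }
}
```

Backing off faster than recovering is a deliberate choice in this sketch: it reduces load on a struggling site immediately while avoiding a rapid return to the rate that caused the stress.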
Auto Tuning
It's difficult to predict what your machine can handle when the sites you crawl and process each require different levels of machine resources. Auto Tuning automatically monitors the host machine's resource usage and adjusts crawl speed and concurrency to maximize throughput without overloading the machine.