Octoparse - Free Web Scraper

This project has already launched.

Octoparse is a free, client-side web scraping application for Windows that turns unstructured or semi-structured data from websites into structured data sets, with no coding needed. To collect data, click the information you want on a page in the built-in browser and run the extraction; Octoparse returns it as structured data. Visit http://www.octoparse.com/ for more information.

Octoparse's most powerful feature is large-scale, simultaneous scraping based on distributed computing. After you upload your configuration project to the cloud, you can run the extraction concurrently on many cloud servers. If you need to scrape 10,000 web pages within a short time, the Octoparse cloud service is the best fit.
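The idea of splitting a large job across several workers can be sketched locally with a thread pool. This is only an illustration of concurrent extraction in general, not Octoparse's cloud service or API: the names scrape_page and scrape_all are hypothetical, and scrape_page is a stand-in that does no real network access.

```python
from concurrent.futures import ThreadPoolExecutor


def scrape_page(url):
    """Hypothetical placeholder for fetching and parsing one page."""
    # A real worker would download the page and extract fields here.
    return {"url": url, "length": len(url)}


def scrape_all(urls, max_workers=10):
    """Run scrape_page over all URLs concurrently, preserving input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields results in the same order as the input URLs.
        return list(pool.map(scrape_page, urls))


if __name__ == "__main__":
    pages = [f"https://example.com/page/{i}" for i in range(25)]
    results = scrape_all(pages)
    print(len(results), "pages processed")
```

A thread pool suits this kind of work because page downloads spend most of their time waiting on the network, so several can be in flight at once.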

Crawlers run in Octoparse are driven by the extraction rules you configure. An extraction rule tells Octoparse which website to open, where the data you plan to crawl is located, and so on. Octoparse provides high-speed data collection, running up to 10 concurrent threads.
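To make the notion of an extraction rule concrete, here is a minimal sketch using only the Python standard library. It is not Octoparse's rule format, which is configured visually; FieldRule, RuleExtractor, and extract are hypothetical names, and the sketch assumes each field sits in a single, non-nested element identified by its tag and class.

```python
from dataclasses import dataclass
from html.parser import HTMLParser


@dataclass(frozen=True)
class FieldRule:
    """Hypothetical rule describing where one piece of data lives."""
    name: str       # column name in the output record
    tag: str        # HTML tag to match, e.g. "span"
    css_class: str  # class attribute value to match


class RuleExtractor(HTMLParser):
    """Collects the text of the first element matching each rule."""

    def __init__(self, rules):
        super().__init__()
        self.rules = rules
        self.record = {}
        self._active = None  # rule whose element is currently open

    def handle_starttag(self, tag, attrs):
        if self._active is not None:
            return  # already inside a matched element
        classes = (dict(attrs).get("class") or "").split()
        for rule in self.rules:
            unseen = rule.name not in self.record
            if unseen and rule.tag == tag and rule.css_class in classes:
                self._active = rule
                self.record[rule.name] = ""
                break

    def handle_data(self, data):
        if self._active is not None:
            self.record[self._active.name] += data

    def handle_endtag(self, tag):
        if self._active is not None and tag == self._active.tag:
            self._active = None


def extract(html, rules):
    """Apply the rules to an HTML string and return one structured record."""
    parser = RuleExtractor(rules)
    parser.feed(html)
    parser.close()
    return {name: text.strip() for name, text in parser.record.items()}
```

For example, given rules for an h1 with class "title" and a span with class "price", extract returns a dictionary with one entry per rule that matched something on the page.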

Octoparse is a Windows application that works well with both static and dynamic websites, including those whose pages use Ajax. It simulates human browsing behavior to interact with web pages: opening a page, logging into an account, filling out forms, entering a search term into a text box, pointing and clicking on web elements, and so on, which makes web data much easier to extract. A visual operation pane keeps the workflow user-friendly and straightforward. You can run an extraction project either on your own machines (Local Extraction) or in the cloud (Cloud Extraction). Results can be exported as CSV, Excel, HTML, or TXT, or written to a database (MySQL, SQL Server, or Oracle).
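As an illustration of what a structured export looks like, the sketch below writes a list of extracted records to CSV text with Python's standard csv module. It does not reproduce Octoparse's own exporter; records_to_csv and the sample field names are assumptions made for the example.

```python
import csv
import io


def records_to_csv(records, fieldnames):
    """Serialize a list of dict records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


if __name__ == "__main__":
    rows = [{"title": "Blue Widget", "price": "$9.99"}]
    print(records_to_csv(rows, ["title", "price"]))
```

Writing to an in-memory buffer keeps the example self-contained; the same writer works with a file opened with newline="" when the output should go to disk.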

 

 
