Colly
Colly is an elegant and lightning-fast scraping framework for Golang. It provides a clean interface to write crawlers, scrapers, and spiders for extracting structured data from websites.
Description
Colly is a scraping framework for Gophers (Go developers) that makes it easy to extract structured data from websites. The extracted data can be used for a wide range of applications, including data mining, data processing, and archiving.
The framework offers a clean interface for developing any kind of crawler, scraper, or spider, which makes it a good fit for Go developers who need an efficient tool for web scraping tasks.
Two packages are available: 'colly', which provides the main program, and 'golang-github-gocolly-colly-dev', which provides the development files.
How It Works
Colly is a Go framework that provides a clean interface for building web crawlers and scrapers. It builds on Go libraries such as htmlquery, xmlquery, and goquery to parse content and match patterns, and on robotstxt to honor robots.txt rules while extracting structured data from websites.
Installation
sudo apt install colly
Examples
colly -h
colly new
colly COMMAND --help
colly
sudo apt install colly
sudo apt install golang-github-gocolly-colly-dev