
Description

unix-way web crawler

Programming language: Go
License: MIT License
Tags: Command Line, Go Tools, Web Crawling, CLI, Crawler

README


crawley

Crawls web pages and prints any link it can find.

features

  • fast HTML SAX-parser (powered by golang.org/x/net/html)
  • small (below 1500 SLOC), idiomatic, 100% test-covered codebase
  • grabs most useful resource URLs (images, videos, audio, forms, etc.)
  • found URLs are streamed to stdout and guaranteed to be unique (with fragments omitted)
  • configurable scan depth (limited by starting host and path; default: 0)
  • can crawl rules and sitemaps from robots.txt
  • brute mode - scans HTML comments for URLs (this can lead to bogus results)
  • makes use of the HTTP_PROXY / HTTPS_PROXY environment variables and handles proxy auth
  • directory-only scan mode (aka fast-scan)
  • user-defined cookies, in curl-compatible format (e.g. -cookie "ONE=1; TWO=2" -cookie "ITS=ME" -cookie @cookie-file)
  • user-defined headers, same as curl: -header "ONE: 1" -header "TWO: 2" -header @headers-file
  • tag filter - limits crawling to the given tags (single: -tag a -tag form, multiple: -tag a,form, or mixed; see the combined example after this list)
  • URL ignore - skips URLs containing the given substrings (e.g. -ignore logout)
  • JS parser - extracts API endpoints from JS files; this is done by regexp, so results can be messy
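
A combined sketch of the cookie, header, tag-filter and ignore options above; the host, cookie and header values are placeholders, and only flags documented in the usage section below are used:

# crawl only links and forms, skip logout URLs, send a session cookie and a custom header:
crawley -depth -1 -tag a,form -ignore logout -cookie "SESSION=abc123" -header "X-Test: 1" http://some-test.site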

examples

# print all links from first page:
crawley http://some-test.site

# print all js files and api endpoints:
crawley -depth -1 -tag script -js http://some-test.site

# print all endpoints from js:
crawley -js http://some-test.site/app.js

# download all png images from site:
crawley -depth -1 -tag img http://some-test.site | grep '\.png$' | wget -i -

# fast directory traversal:
crawley -headless -delay 0 -depth -1 -dirs only http://some-test.site
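
One more sketch, not from the original README: crawling through a proxy relies on the HTTP_PROXY / HTTPS_PROXY environment variables and the -proxy-auth flag described below; the proxy address and credentials are placeholders.

# crawl through an authenticated proxy (placeholder proxy address and credentials):
HTTPS_PROXY=http://proxy.local:3128 crawley -proxy-auth user:password -depth 1 http://some-test.site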

installation

  • binaries for Linux, FreeBSD, macOS and Windows - just download and run.
  • Arch Linux - use your favourite AUR helper to install it, e.g. paru -S crawley-bin (see also the source-install sketch below).
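
If you prefer building from source with the Go toolchain, something like the following should work; the module path github.com/s0rg/crawley is an assumption not stated in this README, so verify it on the project page first.

# install from source (module path is an assumption - check the project page):
go install github.com/s0rg/crawley/cmd/crawley@latest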

usage

crawley [flags] url

possible flags:

-brute
    scan html comments
-cookie value
    extra cookies for request; can be used multiple times, accepts files with '@'-prefix
-delay duration
    per-request delay (0 - disable) (default 150ms)
-depth int
    scan depth (-1 - unlimited)
-dirs string
    policy for non-resource urls: show / hide / only (default "show")
-header value
    extra headers for request; can be used multiple times, accepts files with '@'-prefix
-headless
    disable pre-flight HEAD requests
-help
    show this help message (flags and their defaults)
-ignore value
    patterns (in urls) to be ignored in crawl process
-js
    scan js files for endpoints
-proxy-auth string
    credentials for proxy: user:password
-robots string
    policy for robots.txt: ignore / crawl / respect (default "ignore")
-silent
    suppress info and error messages in stderr
-skip-ssl
    skip ssl verification
-tag value
    tags filter, single or comma-separated tag names allowed
-user-agent string
    user-agent string
-version
    show version
-workers int
    number of workers (default - number of CPU cores)
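
To tie several of these flags together, here is a hedged sketch using only the options documented above; the target host is a placeholder and the chosen values are illustrative, not recommendations.

# respect robots.txt, skip TLS verification, slow requests down, and keep stderr quiet:
crawley -robots respect -skip-ssl -delay 500ms -workers 4 -silent -depth -1 https://some-test.site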

