GitHub - dmavrotas/mirroring-web-crawler: A go implementation of the wget --mirror

This is a simple bash script to crawl a website and download all the files in it. It is useful for downloading a website for offline viewing.

go run wget-mirror <url> <destination>

go run wget-mirror https://developer.mozilla.org/en-US/docs/Web/HTML destination

Cobra is used for the CLI. PuerkitoBio/goquery is used for searching inside the HTML pages. Testify is used for testing and assertions.

Make sure you have go 1.20 installed. Then just do :

    go get -d ./...

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
cmd		cmd
.DS_Store		.DS_Store
.gitignore		.gitignore
ReadMe.MD		ReadMe.MD
go.mod		go.mod
go.sum		go.sum
main.go		main.go

Provide feedback