Skip to content

an html crawler utilizing <a> tag expression matching to discover hyperlinks and a shared memory buffer to support fully distributed html parsing, content indexing, broken-link detection, and page invalidation. this project served as a handy tool I developed only for myself while working as webmaster in OU's dept. of physics & astronomy

Notifications You must be signed in to change notification settings

clay-curry/python-html-crawler

Repository files navigation

Crawler

No instructions; just the following advice:

EXPECT BUGS

About

an html crawler utilizing <a> tag expression matching to discover hyperlinks and a shared memory buffer to support fully distributed html parsing, content indexing, broken-link detection, and page invalidation. this project served as a handy tool I developed only for myself while working as webmaster in OU's dept. of physics & astronomy

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages