This website requires JavaScript.
Explore
Help
Sign In
misc
/
web-scraper
Watch
1
Star
0
Fork
0
You've already forked web-scraper
Code
Issues
Pull Requests
Releases
Wiki
Activity
55
Commits
2
Branches
0
Tags
9e754a55846bf22a682c456de4116ca3f8b3be3d
T
Code
Clone
HTTPS
Tea CLI
Open with VS Code
Open with VSCodium
Open with Intellij IDEA
Download ZIP
Download TAR.GZ
Download BUNDLE
simon
9e754a5584
improve handling of gzip/deflated data detection
2018-09-09 11:21:46 +01:00
templates
report runtime of script in generated sitemap
2018-09-06 17:20:59 +01:00
utils
improve handling of gzip/deflated data detection
2018-09-09 11:21:46 +01:00
.gitignore
ignore generated file
2018-09-06 17:08:56 +01:00
crawler.py
report runtime of script in generated sitemap
2018-09-06 17:20:59 +01:00
notes.md
tick off gzip encoding
2018-09-09 10:52:37 +01:00
README.md
adjusted title
2018-08-28 09:12:48 +01:00
requirements.txt
use lxml as the parser and only find links on a page if we've got the source
2018-09-09 10:06:25 +01:00
test_helpers.py
remove testing url with requests and assume that the user is correct
2018-08-28 17:22:52 +01:00
README.md
Concurrent web scraper
Reference in New Issue
View Git Blame
Copy Permalink
S
Description
No description provided
Readme
1.3
MiB
Languages
Python
100%