web-scraper/notes.md
2018-08-28 22:34:05 +01:00

## Thoughts
For each URL, do the following:
* mark it as crawled
* fetch the page content
  * if that fails, mark the link as invalid
* find all links in the content
* check each link for dupes
  * add new links to the pool, discard the rest
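
The loop above could be sketched roughly like this (a minimal, stdlib-only draft, not the actual scraper; the `crawl` function, the injected `fetch` callable, and the `LinkParser` helper are all assumed names for illustration):

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkParser(HTMLParser):
    """Collects href values from <a> tags in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start_url, fetch):
    """Breadth-first crawl from start_url.

    `fetch(url)` returns the page HTML or raises on failure; it is
    injected so the loop can be tested without hitting the network.
    Returns (crawled, invalid) sets of URLs.
    """
    pool = [start_url]
    crawled, invalid = set(), set()
    while pool:
        url = pool.pop(0)
        crawled.add(url)               # mark it as crawled
        try:
            content = fetch(url)       # get page content
        except Exception:
            invalid.add(url)           # if that fails, mark it invalid
            continue
        parser = LinkParser()
        parser.feed(content)           # find all links in the content
        for href in parser.links:
            link = urljoin(url, href)  # resolve relative links
            # dupe check: discard anything already crawled or queued
            if link not in crawled and link not in pool:
                pool.append(link)      # add to pool
    return crawled, invalid
```

One design choice worth noting: an unfetchable URL still ends up in `crawled` (matching the step order in the notes, where marking happens before fetching), so it is never retried.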