I needed to return to this topic, so here are my notes, starting with some products. The related pages widget has also been tuned.

  1. http://docs.seleniumhq.org/projects/ide/
  2. https://www.seleniumhq.org/projects/webdriver/
  3. http://wwwsearch.sourceforge.net/mechanize/
  4. http://maxq.tigris.org/
  5. http://twill.idyll.org/

I found these articles in the summer of 2020:

  1. https://thenextweb.com/syndication/2020/07/22/how-to-use-python-and-selenium-to-scrape-websites/, "Web scraping has been used to extract data from websites almost from the time the World Wide Web was born. In the early days, scraping was mainly done on static pages – those with known elements, tags, and data."
  2. https://towardsdatascience.com/web-scraping-a-less-brief-overview-of-scrapy-and-selenium-part-ii-3ad290ce7ba1, "The first rule of web crawling is you do not harm the website. The second rule of web crawling is you do NOT harm the website."
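
As a note to self on what "do not harm the website" might look like in practice, here is a minimal sketch using only the Python standard library. It is my own illustration, not code from either article: the class name `PoliteGate`, the user agent `notes-bot` and the one second default delay are all assumptions. It checks a site's robots.txt rules before a fetch and spaces requests out.

```python
import time
from urllib import robotparser


class PoliteGate:
    """Decide whether, and how soon, a crawler may request a URL.

    Hypothetical helper for illustration; names and defaults are assumptions.
    """

    def __init__(self, robots_lines, user_agent="notes-bot", default_delay=1.0):
        self.user_agent = user_agent
        self.parser = robotparser.RobotFileParser()
        # parse() takes the lines of a robots.txt file that was already fetched.
        self.parser.parse(robots_lines)
        # Honour Crawl-delay when the site declares one, else use our default.
        declared = self.parser.crawl_delay(user_agent)
        self.delay = float(declared) if declared is not None else default_delay
        self._last_request = None

    def allowed(self, url):
        """True if robots.txt permits this user agent to fetch the URL."""
        return self.parser.can_fetch(self.user_agent, url)

    def wait_time(self, now=None):
        """Seconds still to wait before the next request is polite."""
        now = time.monotonic() if now is None else now
        if self._last_request is None:
            return 0.0
        return max(0.0, self.delay - (now - self._last_request))

    def record_request(self, now=None):
        """Remember when the most recent request was sent."""
        self._last_request = time.monotonic() if now is None else now


robots = ["User-agent: *", "Disallow: /private/", "Crawl-delay: 2"]
gate = PoliteGate(robots)
```

In a real crawl loop I would call `gate.allowed(url)` first, then `time.sleep(gate.wait_time())`, then fetch the page and call `gate.record_request()`. The same gate could sit in front of a Selenium or mechanize session just as well as a plain HTTP client.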

Also on this wiki:

Browser Scripting
