What is scrapy_proxies?
scrapy_proxies is a Python module (preinstalled in our images) that handles proxy management in our scrapers built with Scrapy.
Our View on scrapy_proxies
Usage Rating of scrapy_proxies
1. BEST CHOICE: This is among the preferred tools we use
It's the easiest way to use an external list of proxies and to manage them (removing non-functioning proxies, retrying failed requests, and so on).
Settings
The settings the module needs are documented in the project's GitHub repository (see the link under "Reference and documentation"). We use a slightly modified version that adds one more option: PROXY_MODE = -1, which means no proxies are used.
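As a rough guide, a minimal settings.py sketch based on the upstream README might look like the following. The retry values and the proxy list path are placeholders rather than our production configuration, and the middleware priorities are the ones suggested upstream as far as we recall, so check them against the repository. PROXY_MODE = -1 exists only in our modified version.

```python
# settings.py (sketch; values are illustrative, not our production config)

# Proxies fail often, so retry generously.
RETRY_TIMES = 10
# Retry on most error codes, since proxies fail for different reasons.
RETRY_HTTP_CODES = [500, 503, 504, 400, 403, 404, 408]

# Order matters: retry first, then pick a proxy, then apply it.
DOWNLOADER_MIDDLEWARES = {
    "scrapy.downloadermiddlewares.retry.RetryMiddleware": 90,
    "scrapy_proxies.RandomProxy": 100,
    "scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddleware": 110,
}

# Text file with one proxy per line, for example:
#   http://host1:port
#   http://username:password@host2:port
PROXY_LIST = "/path/to/proxy/list.txt"  # placeholder path

# Proxy mode:
#   0 = a different proxy for every request
#   1 = take one proxy from the list and use it for every request
#   2 = use the proxy given in CUSTOM_PROXY
#  -1 = do not use proxies (our modified version only)
PROXY_MODE = 0

# Only needed when PROXY_MODE = 2:
# CUSTOM_PROXY = "http://host1:port"
```

Setting PROXY_MODE = -1 in a project that does not need proxies should let the rest of the configuration stay in place unchanged.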
When to use scrapy_proxies
Whenever an IP gets blocked by the target site after several requests.
Reference and documentation
https://github.com/aivarsk/scrapy-proxies