scrapy + tor always returns 403 but I can curl and browse -


i'm trying setup scrapy + tor

i'm using scrapy 0.24.6

  1. i first tried using polipo able access tor http proxy (https://pkmishra.github.io/blog/2013/04/16/scrapy-run-using-tor-and-multiple-agents-part-2-ubuntu/) i'm able configure web browser use polipo , i'm able browse using tor , can curl. tried httpproxymiddleware , using env var or writing own custom middleware, same result: scrapy returns 403

  2. then tried use tor directly, again can configure web browser use socks proxy , can curl torsocks, scrapy returns 403

anyone has idea might wrong ?

it looks error come scrapy because have exact same headers/user-agent , without tor, through tor 403


Popular posts from this blog

c# - ODP.NET Oracle.ManagedDataAccess causes ORA-12537 network session end of file -

matlab - Compression and Decompression of ECG Signal using HUFFMAN ALGORITHM -

utf 8 - split utf-8 string into bytes in python -