LinkExtractor changing case of URL (but didn't used to) #6329
Regression?
I have an HTML file that contains a link like:
<a target="_blank" href="http://MYURL/SomePath/services/words/MorePath?abc">Words</a>
I'm extracting with code that looks like this:
But my URL comes back as:
http://myurl/SomePath/services/words/MorePath?abc
Note that MYURL has become myurl.
I've just upgraded from Scrapy 1.7.x to 2.11.1. In 1.7 and earlier it would come out as MYURL. There's nothing in the LinkExtractor docs about changing case, nor can I see anything in the changelogs (though I may be missing it).
This may or may not be intentional behaviour, but if it is intended, the docs should probably be updated to say that the case will change.
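For anyone comparing extracted links against expected values in the meantime, here is a minimal sketch using only the standard library. It assumes the only difference is the case of the host, as in the two URLs above; host names are case-insensitive under RFC 3986, while the path and query are not. The helper name same_url_ignoring_host_case is hypothetical and is not part of Scrapy, and this says nothing about how LinkExtractor produces the lowercase host.

```python
from urllib.parse import urlsplit


def same_url_ignoring_host_case(a: str, b: str) -> bool:
    """Hypothetical helper: compare two URLs, ignoring case in the host only."""
    pa, pb = urlsplit(a), urlsplit(b)
    return (
        # urlsplit() already lowercases the scheme, and .hostname is
        # returned lowercased, so these comparisons ignore case.
        pa.scheme == pb.scheme
        and pa.hostname == pb.hostname
        and pa.port == pb.port
        # Credentials, path, query and fragment stay case-sensitive.
        and (pa.username, pa.password) == (pb.username, pb.password)
        and (pa.path, pa.query, pa.fragment) == (pb.path, pb.query, pb.fragment)
    )


original = "http://MYURL/SomePath/services/words/MorePath?abc"
extracted = "http://myurl/SomePath/services/words/MorePath?abc"

print(same_url_ignoring_host_case(original, extracted))
```

With this comparison the two URLs from this report count as equal, while a URL whose path case differs would not.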