You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- base_spider.py: fix an oom issue caused by empty strings produced by `img_re.findall()`. check for empty string before string replacing.
- base_spider.py: add image url validation to prevent potential bugs.
- base_spider.py: change `open()` in `save_item()` from synchronous to `anyio.open_file()`.
- spiders: set timeout in aiohttp client.
- optimize images downloading.
- add __main__.py in python package
0 commit comments