Scrapy v2.4.1 Release Notes

Release Date: 2020-11-17 // over 5 years ago

    🛠 Fixed feed exports overwrite support

    🛠 Fixed the asyncio event loop handling, which could make code hang

    🛠 Fixed the IPv6-capable DNS resolver CachingHostnameResolver for download handlers that call reactor.resolve

    🛠 Fixed the output of the genspider command showing placeholders instead of the import part of the generated spider module (issue 4874)


Previous changes from v2.4.0

  • Hihglights:

    👍 Python 3.5 support has been dropped.

    📄 The file_path method of media pipelines can now access the source item.

    This allows you to set a download file path based on item data.

    The new item_export_kwargs key of the FEEDS setting allows to define keyword parameters to pass to item exporter classes.

    📄 You can now choose whether feed exports overwrite or append to the output file.

    📄 For example, when using the crawl or runspider commands, you can use the -O option instead of -o to overwrite the output file.

    👍 Zstd-compressed responses are now supported if zstandard is installed.

    In settings, where the import path of a class is required, it is now possible to pass a class object instead.

    👀 See the full changelog