meta The initial values for the Request.meta attribute. If given, the dict passed in this parameter will be shallow copied.该参数也是字典形式 , 是为了在spider类的多个parse函数之间传递信息 , 见 知乎 。 注意Response对象也有一个它对应的Request对象 :The Request object that generated this response. This attribute is assigned in the Scrapy engine, after the response and the request have passed through all Downloader Middlewares In particular, this means that:HTTP redirections will cause the original request (to the URL before redirection) to be assigned to the redirected response (with the final URL after redirection).Response.request.url doesn’t always equal Response.urlThis attribute is only available in the spider code, and in the Spider Middlewares, but not in Downloader Middlewares (although you have the Request available there by other means) and handlers of the response_downloaded signal.But Unlike the Response.request attribute, the Response.meta attribute is propagated along redirects and retries, so you will get the original Request.meta sent from your spider.Response obejct这里仅介绍一些reponse对象的属性:
- url 即该response的来源url
- status 即该response的状态码
- headers response的响应头 , 形式为dict
- body response的相应数据体 , 形式为bytes
- request response对应的Request对象 , 对于它上文已经介绍 , 即Response.url可能不等于Reponse.request.url , 因为redirection的原因
SettingsSettings can be populated using different mechanisms, each of which having a different precedence. Here is the list of them in decreasing order of precedence:
- Command line options (most precedence)
- Settings per-spider
- Project settings module(settings.py)
- Default settings per-command
- Default global settings (less precedence)
一般我们直接在settings.py文件中对其进行修改 , 常见需要增改的有:user-agent指定 , ITEM_PIPELINES解除注释以开启pipeline功能 , LOG_LEVEL和LOG_FILE指定 , ROBOTSTXT_OBEY设为False等等 。
【Scrapy框架初探】
推荐阅读
-
林肯|还买什么四缸奔驰GLC 新款林肯航海家上市:大V6、40万!
-
科学科学证据表明:镜子中的你,并不是你真实的相貌!
-
|兰帕德:奇克为了回到赛场非常努力,因一场比赛批评他很残酷
-
驱动之家 反击微软:索尼要求所有的PS4新作必须兼容PS5主机
-
-
-
王者荣耀彪悍不解释|国产FPS《边境》试玩影像公开 自定义要素丰富
-
「海贼资讯」如今攀不起”,鞠婧祎变美了,杨超越才是实力逆袭,“以前不搭理
-
英雄联盟|TES半决赛对决SN,JKL直播进行回应,这8个字也太真实了
-
Intel首席架构师晒多款独显:GPU黄金时代来了
-
-
地下城与勇士|地下城与勇士:提升深渊爆率新方法,如果一直光头不妨一试!
-
海蛎子配大头菜,过油一炸,香滑鲜香,孩子一口气能吃3个
-
[健身]今年河北省将建设全民健身场地设施1700处
-
央视新闻客户端|意大利西西里大区要求清空非法移民接收设施
-
-
-
cnBetaTB|未来iPhone之间的AirDrop可能使用激光快速传输大文件
-
tiktok|52家!美国科技巨头联合出手,重拳指向特朗普!
-