我想爬取某一个网页,一个div下的每一条a,但是第一条是标题,而且和剩下的结构不同,会造成如下错误:
我的想法是爬取的内容应该为:
{“省”:["a","b","c"],“市”:["d","e","f"],“区”:["g","h","i"]},但会变成:
{"省":["a","b","c"],“市”:["d","e","f"],“区”:["地区","g","h"]
应该怎么办,我如何从第二条开始爬取。我本想在定义sites时改为 //div/a[2], 但是不成功。
scrapy新手求助!!!
我的想法是爬取的内容应该为:
{“省”:["a","b","c"],“市”:["d","e","f"],“区”:["g","h","i"]},但会变成:
{"省":["a","b","c"],“市”:["d","e","f"],“区”:["地区","g","h"]
应该怎么办,我如何从第二条开始爬取。我本想在定义sites时改为 //div/a[2], 但是不成功。
scrapy新手求助!!!