错误:
ImportError: No module named items
程序:
items.py代码:
# Define here the models for your scraped items
#
# See documentation in:
#
http://doc.scrapy.org/en/latest/topics/items.htmlfrom scrapy.item import Item, Field
class XsItem(Item):
# define the fields for your item here like:
# name = Field()
url = Field()
xs.py代码:
from scrapy.contrib.spiders import CrawlSpider, Rule
from scrapy.contrib.linkextractors.sgml import SgmlLinkExtractor
from scrapy.selector import Selector
from xs.items import XsItem
class XsSpider(CrawlSpider):
name = 'xs'
allowed_domains = ['
bbs.uc.cn']
start_urls = ['
http://bbs.uc.cn']
rules = (
Rule(SgmlLinkExtractor(allow=('\thread-\d+-\d-\d.html', )),
callback='parse_page', follow=True),
)
def parse_page(self, response):
item = XsItem()
sel = Selector(response)
item['url'] = response.url
return item
这是一个专为移动设备优化的页面(即为了让你能够在 Google 搜索结果里秒开这个页面),如果你希望参与 V2EX 社区的讨论,你可以继续到 V2EX 上打开本讨论主题的完整版本。
https://www.v2ex.com/t/144062
V2EX 是创意工作者们的社区,是一个分享自己正在做的有趣事物、交流想法,可以遇见新朋友甚至新机会的地方。
V2EX is a community of developers, designers and creative people.