Selenium 爬网页的问题， css selector - V2EX

Home Sign Up Sign In

推荐学习书目

› Learn Python the Hard Way

Python Sites

› PyPI - Python Package Index

› http://diveintopython.org/toc/index.html

› Pocoo

值得关注的项目

› PyPy

› Celery

› Jinja2

› Read the Docs

› gevent

› pyenv

› Stackless Python

› Beautiful Soup

› 结巴中文分词

› Green Unicorn

› Sentry

› Shovel

› pytest

Python 编程

› pep8 Checker

Styles

› PEP 8

› Google Python Style Guide

› Code Style from The Hitchhiker's Guide

This topic created in 2138 days ago, the information mentioned may be changed or developed.

大佬们，我想爬个网页练练手，现在碰到问题了，目标无法被 CSS 选择器选中，麻烦看下问题出在哪里
网页是这样的
<ul>
<type=1><start=1>
<li><a href="Papers/XXX.pdf">Preface</a></li>
<li><a href="Papers/XXX.pdf">Chapter 1</a></li>

使用 find_element_by_css_selector 可以选中到 ul 这里
但是再往下 type=1 start=1 怎样都无法选中（ ul > type=1 > start=1 ）
想问下问题出在哪里

6 replies • 2020-08-11 21:55:50 +08:00

1

yejianmail

Aug 11, 2020 via Android

不行就试试 xpath 选择器吧

2

jeeyong

Aug 11, 2020

type=1 这个不是元素就是个空标签属性是 type 值=1
你非得选这个
试试 find

3

j0shfan

OP

Aug 11, 2020

@yejianmail 一样选不中，捂脸

4

j0shfan

OP

Aug 11, 2020

@jeeyong 实际我想批量选的是 a href 后面那个文件的连接。
请问 find 是个什么概念，是 find_element(s)吗

5

tikazyq

Aug 11, 2020

用 puppeteer，直接 js 操作，比 selenium 简单很多

6

jeeyong

Aug 11, 2020

不是...之前回复的时候再打 pubg...
你这<type=1>是什么标签啊?
没有这种标签啊...这个根本写错了吧..
还是你爬取的场景遇到这种情况了?

About · Help · Advertise · Blog · API · FAQ · Solana · 2534 Online Highest 6679 ·

Select Language

创意工作者们的社区

World is powered by solitude

VERSION: 3.9.8.5 · 49ms · UTC 07:00 · PVG 15:00 · LAX 00:00 · JFK 03:00
♥ Do have faith in what you're doing.