1
wong2 2012-05-21 20:46:43 +08:00 1
|
2
VeryCB 2012-05-21 20:48:23 +08:00 1
|
3
VeryCB 2012-05-21 20:49:33 +08:00
BeautifulSoup http://www.crummy.com/software/BeautifulSoup/
|
4
lackrp 2012-05-21 20:54:58 +08:00 1
过滤是指要去掉么?
import re pattern = re.compile(r'<.*?>') pattern.sub('', html) |
6
phuslu 2012-05-21 20:57:02 +08:00 1
python readability
|
8
eric_q 2012-05-21 21:03:18 +08:00
我……我还是想用shell
|
12
eerie 2012-05-21 21:33:40 +08:00 1
|
15
cute 2012-05-22 20:41:05 +08:00
|
17
magicshui 2012-05-22 21:08:14 +08:00
BeautifulSoup感觉简单些~
|