python 正则匹配内容求助

b= 'hello this is book . \n \
test . \n \
abc 礼拜 test \n \
doing what ? \n \
abc 理财 ddd \n \
this is the end '
print b
print re.search(u'礼拜(.*?)理财',b)
这里输出是 None ，但是如果不加后面的理财，就可以匹配上。

理财

礼拜

匹配

11 replies • 2016-02-26 12:47:37 +08:00

v2014

Feb 26, 2016

re.search(u'礼拜(.*?)理财',b, re.DOTALL))
https://docs.python.org/3/library/re.html#re.DOTALL
默认.不包括换行，加了 DOTALL 标志才是所有字符

popok

Feb 26, 2016

@v2014 礼拜([^理财]*)理财
换行也包括了，哈哈

lonelinsky

Feb 26, 2016

@popok 但是你这样的写法，如果中间出现单独的理字或财字就会出问题了…

thinkmore

Feb 26, 2016

(?s)(?<=(礼拜)).*(?=理财)

python3.4 下测试通过

thinkmore

Feb 26, 2016

@luyg 匹配的内容不包括礼拜和理财

lxy

Feb 26, 2016

print re.search('礼拜(.+?)理财', b, re.S).group(0)
如果匹配之间的字符就 group(1)

wentian

Feb 26, 2016

regex = re.compile(ur'(?!礼拜).+(?=理财)'), re.UNICODE | re.DOTALL | re.MULTILINE)

popok

Feb 26, 2016

@lonelinsky 对的，回完贴就发现了，哈哈

wentian

Feb 26, 2016

(?<=礼拜).+(?=理财)
上面用错了环视功能,楼主试试这个, 我已经测试通过了
:)

luyg

Feb 26, 2016

@v2014 测试通过谢谢。

luyg

Feb 26, 2016

在这统一感谢，谢谢大家的帮助。问题圆满解决。