How do I scrape data from an HTML page with Python regular expressions?
From the HTML below, how can I match and extract the values of both tags, <h2>Zyh,miu</h2> and <span>好好发挥特长。。。</span>, at the same time?
<div class="article block untagged mb15" id='qiushi_tag_118869202'>
<div class="author clearfix">
<a href="/users/28426004/" target="_blank" rel="nofollow">
<img src="//pic.qiushibaike.com/system/avtnew/2842/28426004/medium/2016091403452689.JPEG" alt="Zyh,miu"/>
</a>
<a href="/users/28426004/" target="_blank" title="Zyh,miu">
<div class="article block untagged mb15" id='qiushi_tag_118869202'>
<div class="author clearfix">
<a href="/users/28426004/" target="_blank" rel="nofollow">
<img src="//pic.qiushibaike.com/system/avtnew/2842/28426004/medium/2016091403452689.JPEG" alt="Zyh,miu"/>
</a>
<a href="/users/28426004/" target="_blank" title="Zyh,miu">
<h2>Zyh,miu</h2>
</a>
<div class="articleGender womenIcon">24</div>
</div>
<a href="/article/118869202" target="_blank" class='contentHerf' >
<div class="content">
<span>好好发挥特长。。。</span>
</div>
</a>
1 reply
ID王大伟 - Life is short, I choose Python. Answered 2017-04-12
re2 = r'<span>(.*?)</span>'
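To pull out both values in one pass, the `<span>` pattern above can be paired with a matching `<h2>` pattern. The following is only a minimal sketch: `re1`, `pair_pattern`, and `extract_pairs` are names made up for illustration and are not part of the original reply, and the `html` string is a trimmed copy of the snippet from the question rather than a live page.

```python
import re

# Trimmed copy of the HTML from the question; in practice this would be
# the downloaded page source.
html = """
<a href="/users/28426004/" target="_blank" title="Zyh,miu">
<h2>Zyh,miu</h2>
</a>
<div class="articleGender womenIcon">24</div>
</div>
<a href="/article/118869202" target="_blank" class='contentHerf' >
<div class="content">
<span>好好发挥特长。。。</span>
</div>
</a>
"""

# re1 is an assumed companion pattern; only re2 appears in the reply.
re1 = r'<h2>(.*?)</h2>'
re2 = r'<span>(.*?)</span>'

# Join the two patterns with a lazy gap. re.S lets "." match newlines,
# so the gap can span the lines between the two tags.
pair_pattern = re.compile(re1 + r'.*?' + re2, re.S)


def extract_pairs(page):
    """Return a list of (author, content) tuples, one per matched post."""
    return [(author.strip(), content.strip())
            for author, content in pair_pattern.findall(page)]


print(extract_pairs(html))
```

Because the pattern has two capture groups, `re.findall` returns one tuple per post, so the author name and the post text stay paired. Regular expressions like this are brittle against markup changes; if the page structure varies, an HTML parser is usually the safer choice.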