Python正则抓取html页面数据?

0
同时匹配取出<h2>Zyh,miu</h2>和<span>好好发挥特长。。。</span>这两个标签的值
<div class="article block untagged mb15" id='qiushi_tag_118869202'> <div class="author clearfix"> <a href="/users/28426004/" target="_blank" rel="nofollow"> <img src="//pic.qiushibaike.com/system/avtnew/2842/28426004/medium/2016091403452689.JPEG" alt="Zyh,miu"/> </a> <a href="/users/28426004/" target="_blank" title="Zyh,miu"> <div class="article block untagged mb15" id='qiushi_tag_118869202'> <div class="author clearfix"> <a href="/users/28426004/" target="_blank" rel="nofollow"> <img src="//pic.qiushibaike.com/system/avtnew/2842/28426004/medium/2016091403452689.JPEG" alt="Zyh,miu"/> </a> <a href="/users/28426004/" target="_blank" title="Zyh,miu"> <h2>Zyh,miu</h2> </a> <div class="articleGender womenIcon">24</div> </div> <a href="/article/118869202" target="_blank" class='contentHerf' > <div class="content"> <span>好好发挥特长。。。</span> </div> </a>
同时匹配取出<h2>Zyh,miu</h2>和<span>好好发挥特长。。。</span>这两个标签的值


<div class="article block untagged mb15" id='qiushi_tag_118869202'>
<div class="author clearfix">
<a href="/users/28426004/" target="_blank" rel="nofollow">
<img src="//pic.qiushibaike.com/system/avtnew/2842/28426004/medium/2016091403452689.JPEG" alt="Zyh,miu"/>
</a>
<a href="/users/28426004/" target="_blank" title="Zyh,miu">
<div class="article block untagged mb15" id='qiushi_tag_118869202'>
<div class="author clearfix">
<a href="/users/28426004/" target="_blank" rel="nofollow">
<img src="//pic.qiushibaike.com/system/avtnew/2842/28426004/medium/2016091403452689.JPEG" alt="Zyh,miu"/>
</a>
<a href="/users/28426004/" target="_blank" title="Zyh,miu">
<h2>Zyh,miu</h2>
</a>
<div class="articleGender womenIcon">24</div>
</div>
<a href="/article/118869202" target="_blank" class='contentHerf' >
<div class="content">
<span>好好发挥特长。。。</span>
</div>
</a>
已邀请:
0

ID王大伟 - 人生苦短,我选Python。 2017-04-12 回答

re1=<h2>(.*?)</h2>
re2=<span>(.*?)</span>

要回复问题请先登录注册