Python爬虫程序执行后报错“IndexError: list index out of range”
0
#!/usr/bin/env python
#-coding:utf-8-
import urllib.request
import re
headers = ("User-Agent","Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:43.0) Gecko/20100101 Firefox/43.0")
opener = urllib.request.build_opener()
opener.addheaders = [headers]
urllib.request.install_opener(opener)
for i in range(0,2):
url = "http://tieba.baidu.com/p/4645322258?pn=" + str(i)
data = urllib.request.urlopen(url).read().decode('utf-8')
pat = '<img class="BDE_Image" src="(.*?)" size'
urllist = re.compile(pat).findall(data)
for j in (0,len(urllist)+1):
thisurl = urllist[j]
fn = './' + str(i) + str(j) + '.jpg'
urllib.request.urlretrieve(thisurl,filename=fn)
程序执行后会有这样的报错:
Traceback (most recent call last):
File "tieba.py", line 26, in <module>
thisurl = urllist[j]
IndexError: list index out of range
我的urllist是不为空的,print测试过了。
#-coding:utf-8-
import urllib.request
import re
headers = ("User-Agent","Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:43.0) Gecko/20100101 Firefox/43.0")
opener = urllib.request.build_opener()
opener.addheaders = [headers]
urllib.request.install_opener(opener)
for i in range(0,2):
url = "http://tieba.baidu.com/p/4645322258?pn=" + str(i)
data = urllib.request.urlopen(url).read().decode('utf-8')
pat = '<img class="BDE_Image" src="(.*?)" size'
urllist = re.compile(pat).findall(data)
for j in (0,len(urllist)+1):
thisurl = urllist[j]
fn = './' + str(i) + str(j) + '.jpg'
urllib.request.urlretrieve(thisurl,filename=fn)
程序执行后会有这样的报错:
Traceback (most recent call last):
File "tieba.py", line 26, in <module>
thisurl = urllist[j]
IndexError: list index out of range
我的urllist是不为空的,print测试过了。
没有找到相关结果
重要提示:提问者不能发表回复,可以通过评论与回答者沟通,沟通后可以通过编辑功能完善问题描述,以便后续其他人能够更容易理解问题.
2 个回复
randolph_huangjiayi - UI设计师 2016-12-26 回答
赞同来自:
seng - 从事BI、大数据、数据分析工作 2016-12-27 回答
赞同来自: