Python: How to Get a Page's Tags

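from urllib.request import urlopen
from urllib.error import HTTPError, URLError
from bs4 import BeautifulSoup


def getTitle(url):
    # Fetch the page; return None if the server responds with an error
    # or cannot be reached at all.
    try:
        html = urlopen(url)
    except (HTTPError, URLError):
        return None
    # Parse the HTML. bsObj.head is None when the page has no <head>,
    # so reading .title from it raises AttributeError.
    try:
        bsObj = BeautifulSoup(html.read(), "html.parser")
        title = bsObj.head.title
    except AttributeError:
        return None
    return title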

title = getTitle("http://www.baidu.com")
if title is None:
    print("Title could not be found!")
else:
    print(title)

The result is shown in the figure below.
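
Separately from the run above, the "return None when the tag is missing" logic can be tried without a network connection. The following is a minimal sketch that swaps in the standard library's html.parser.HTMLParser in place of BeautifulSoup, so it runs with no third-party packages. The names TitleParser, title_from_html, and both sample strings are hypothetical and are not part of the original script.

```python
from html.parser import HTMLParser


class TitleParser(HTMLParser):
    """Hypothetical helper: collects the text of the first <title> element."""

    def __init__(self):
        super().__init__()
        self.in_title = False   # True while we are inside <title>...</title>
        self.done = False       # True once the first <title> has been closed
        self.parts = []         # text fragments found inside the title

    def handle_starttag(self, tag, attrs):
        if tag == "title" and not self.done:
            self.in_title = True

    def handle_endtag(self, tag):
        if tag == "title" and self.in_title:
            self.in_title = False
            self.done = True

    def handle_data(self, data):
        if self.in_title:
            self.parts.append(data)


def title_from_html(html_text):
    """Return the title text, or None when the page has no <title>."""
    parser = TitleParser()
    parser.feed(html_text)
    parser.close()
    if not parser.done:
        return None
    return "".join(parser.parts).strip()


# Made-up sample pages, used only for illustration.
sample_with_title = "<html><head><title>Example page</title></head><body></body></html>"
sample_without_title = "<html><body><p>No head here</p></body></html>"

print(title_from_html(sample_with_title))
print(title_from_html(sample_without_title))
```

Unlike getTitle, which returns the whole tag object, this sketch returns only the text inside the tag. The caller can still use the same "is None" check to detect a missing title.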

END!
