[Learning Exchange] Scraping Jokes

© 小檀 (Junior Heima) / 2018-8-1 13:12 / 1315 views / 0 replies / 0 favorites / Reposting must follow the CC license; commercial use of this post is prohibited

This post was last edited by 小檀 on 2018-8-1 13:15

# coding=utf-8
import requests
from bs4 import BeautifulSoup


# Fetch the HTML document
def get_html(url):
    """Return the content of the url as text."""
    response = requests.get(url)
    response.encoding = 'utf-8'
    return response.text


# Extract a joke
def get_certain_joke(html):
    """Return the text of the first div.content element in the html."""
    soup = BeautifulSoup(html, 'lxml')
    joke_content = soup.select('div.content')[0].get_text()
    return joke_content


url_joke = "https://www.qiushibaike.com"
html = get_html(url_joke)
joke_content = get_certain_joke(html)
# print() with parentheses works on both Python 2 and Python 3
print(joke_content)
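
The script above indexes `[0]` directly, so it raises `IndexError` whenever the page has no `div.content` element, and it sends no request headers or timeout. Below is a minimal sketch of a more defensive variant. It is not the original author's code: the names `HEADERS`, `fetch_html`, and `extract_jokes`, the `User-Agent` value, and the 10-second timeout are all assumptions made for illustration. It also uses the built-in `html.parser` instead of `lxml`, so no extra parser needs to be installed.

```python
from bs4 import BeautifulSoup

# Assumption: a browser-like User-Agent; the original post sends none.
HEADERS = {"User-Agent": "Mozilla/5.0"}


def fetch_html(url, timeout=10):
    """Fetch a page and return its text, raising on HTTP errors."""
    import requests  # imported here so extract_jokes() can be used offline

    # timeout=10 is an illustrative value, not taken from the post
    response = requests.get(url, headers=HEADERS, timeout=timeout)
    response.raise_for_status()
    response.encoding = "utf-8"
    return response.text


def extract_jokes(html):
    """Return the stripped text of every div.content element (may be empty)."""
    soup = BeautifulSoup(html, "html.parser")
    return [div.get_text().strip() for div in soup.select("div.content")]
```

A caller could then run `jokes = extract_jokes(fetch_html(url_joke))` and check that `jokes` is non-empty before printing `jokes[0]`, which avoids the crash if the site layout changes.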
