我正试图从一个网站的css类中获取数字。 下面代码的输出返回none
。
url = "https://www.reddit.com/r/" + subreddit
content = requests.get(url)
soup = BeautifulSoup(content.text, 'html.parser')
active_users = soup.find("div", {"class":"_3XFx6CfPlg-4Usgxm0gK8R"})
print(active_users)
我要找的类是网站上当前活跃用户的数量。 我该怎么做呢?
请尝试使用soup.select()
。 例如:
import requests, bs4, os
content = requests.get('https://getbootstrap.com/')
soup = bs4.BeautifulSoup(content.text, 'html.parser')
active_users = soup.select("div", {"class":"row"})
print(active_users)
for elem in active_users:
print(elem)
希望有帮助!
您可以使用他们的JSON api来获取活动用户计数,订阅者等。
例如:
import json
import requests
subreddit = 'python'
headers = {'User-Agent': 'Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:77.0) Gecko/20100101 Firefox/77.0'}
data = requests.get('https://www.reddit.com/r/{}/about.json'.format(subreddit), headers=headers).json()
# uncomment this to print all data:
# print(json.dumps(data, indent=4))
print('Subscribers :', data['data']['subscribers'])
print('Active user count :', data['data']['active_user_count'])
打印:
Subscribers : 604566
Active user count : 2719