
What is a Python web crawler used for?



# This example shows how to use the requests and Beautiful Soup libraries
# to fetch a web page and extract data from it:
import requests
from bs4 import BeautifulSoup
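# Step 1: get the content of the page
url = 'https://www.example.com'
response = requests.get(url, timeout=10)  # without a timeout, a stalled server can block forever
response.raise_for_status()  # stop early on HTTP errors such as 404 or 500
content = response.content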
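# Step 2: extract data from the content
soup = BeautifulSoup(content, 'html.parser')
title = soup.title.string if soup.title else None  # the page may have no <title>
paragraphs = [p.get_text(strip=True) for p in soup.find_all('p')]  # text only, not Tag objects
links = [a.get('href') for a in soup.find_all('a', href=True)]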
# Step 3: do something with the extracted data
print('Title:', title)
print('Paragraphs:', paragraphs)
print('Links:', links)

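The code above demonstrates how to use the requests and Beautiful Soup libraries to extract data from a web page. We first fetch the page content with requests, then parse the HTML with Beautiful Soup and extract the title, paragraphs, and links. Finally, the extracted data can be used for analysis, modeling, visualization, and similar tasks.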
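
Before the extracted links can be analyzed, they usually need some cleanup, because href values are often relative, duplicated, or not web URLs at all. The following is a minimal sketch of one way to do that using only the standard library. The helper names `normalize_links` and `links_to_csv` and the sample hrefs are hypothetical, chosen for illustration; they are not part of the original example.

```python
import csv
import io
from urllib.parse import urljoin, urlparse


def normalize_links(base_url, hrefs):
    """Resolve hrefs against base_url, keep http(s) only, drop duplicates in order."""
    seen = set()
    result = []
    for href in hrefs:
        absolute = urljoin(base_url, href)  # turns '/about' into a full URL
        if urlparse(absolute).scheme not in ('http', 'https'):
            continue  # skip mailto:, javascript:, and similar schemes
        if absolute not in seen:
            seen.add(absolute)
            result.append(absolute)
    return result


def links_to_csv(links):
    """Return the links as CSV text with a header row and one URL per row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(['url'])
    for link in links:
        writer.writerow([link])
    return buffer.getvalue()


# Sample hrefs standing in for the `links` list built by the scraping code (assumed data)
sample = ['/about', 'https://www.example.com/about',
          'mailto:me@example.com', 'page.html', '/about']
cleaned = normalize_links('https://www.example.com', sample)
print(cleaned)
print(links_to_csv(cleaned))
```

In the scraping code, `normalize_links(url, links)` could be called right after Step 2, and the CSV text could then be written to a file or loaded into an analysis tool.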
