Answers for "bs4 extract"

8

use beautifulsoup

#start


from bs4 import BeautifulSoup
import requests

req = requests.get('https://www.slickcharts.com/sp500')
soup = BeautifulSoup(req.text, 'html.parser')
Posted by: Guest on April-30-2020
0

BeautifulSoup - scraping list from html

from bs4 import BeautifulSoup

# Simple HTML
SIMPLE_HTML = '''<html>
<head></head>
<body>
<h1>This is a title</h1>
<p class="subtitle">Lorem ipsum dolor sit amet.</p>
<p>Here's another p without a class</p>
<ul>
    <li>Sarah</li>
    <li>Mary</li>
    <li>Charlotte</li>
    <li>Carl</li>
</ul>
</body>
</html>'''

simple_soup = BeautifulSoup(SIMPLE_HTML, 'html.parser')      # use html.parser in order to understand the simple HTML

# Find list from html
def find_list():
    list_items = simple_soup.find_all('li')
    my_list = [e.string for e in list_items]  # convert list_items to string
    print(my_list)
    
find_list()
Posted by: Guest on August-14-2020

Code answers related to "Shell/Bash"

Browse Popular Code Answers by Language