Python BeautifulSoup: List of all the h1, h2, h3 tags from the webpage python.org
BeautifulSoup: Exercise-11 with Solution
Write a Python program to a list of all the h1, h2, h3 tags from the webpage python.org.
Sample Solution:
Python Code:
import requests
from bs4 import BeautifulSoup
url = 'https://www.python.org/'
reqs = requests.get(url)
soup = BeautifulSoup(reqs.text, 'lxml')
print("List of all the h1, h2, h3 :")
for heading in soup.find_all(["h1", "h2", "h3"]):
print(heading.name + ' ' + heading.text.strip())
Sample Output:
List of all the h1, h2, h3 : h1 h1 Functions Defined h1 Compound Data Types h1 Intuitive Interpretation h1 Quick & Easy to Learn h1 All the Flow You’d Expect h2 Get Started h2 Download h2 Docs h2 Jobs h2 Latest News h2 Upcoming Events h2 Success Stories h2 Use Python for… h2 >>> Python Enhancement Proposals (PEPs): The future of Python is discussed here. RSS h2 >>> Python Software Foundation
Python Code Editor:
Have another way to solve this solution? Contribute your code (and comments) through Disqus.
Previous: Write a Python program to find all the link tags and list the first ten from the webpage python.org.
Next: Write a NumPy program to convert a list of numeric value into a one-dimensional NumPy array.
What is the difficulty level of this exercise?
Test your Programming skills with w3resource's quiz.
Python: Tips of the Day
Find current directory and file's directory:
To get the full path to the directory a Python file is contained in, write this in that file:
import os dir_path = os.path.dirname(os.path.realpath(__file__))
(Note that the incantation above won't work if you've already used os.chdir() to change your current working directory, since the value of the __file__ constant is relative to the current working directory and is not changed by an os.chdir() call.)
To get the current working directory use
import os cwd = os.getcwd()
Documentation references for the modules, constants and functions used above:
- The os and os.path modules.
- The __file__ constant
- os.path.realpath(path) (returns "the canonical path of the specified filename, eliminating any symbolic links encountered in the path")
- os.path.dirname(path) (returns "the directory name of pathname path")
- os.getcwd() (returns "a string representing the current working directory")
- os.chdir(path) ("change the current working directory to path")
Ref: https://bit.ly/3fy0R6m
- New Content published on w3resource:
- HTML-CSS Practical: Exercises, Practice, Solution
- Java Regular Expression: Exercises, Practice, Solution
- Scala Programming Exercises, Practice, Solution
- Python Itertools exercises
- Python Numpy exercises
- Python GeoPy Package exercises
- Python Pandas exercises
- Python nltk exercises
- Python BeautifulSoup exercises
- Form Template
- Composer - PHP Package Manager
- PHPUnit - PHP Testing
- Laravel - PHP Framework
- Angular - JavaScript Framework
- Vue - JavaScript Framework
- Jest - JavaScript Testing Framework