Taking specific text from a div in python3

Question

1 Answer

supriya · Answer 1 · 2020-11-04T03:43:17+0000

Here is the correct approach:

from bs4 import BeautifulSoup
html_src = \
'''
<html>
<body>
<div class="small subtle link">
<a href="https://example.com" nofollow="" target='"_blank"'>
Example
</a>
This text!
</div>
</body>
</html>
'''
soup = BeautifulSoup(html_src, 'lxml')
print(soup.prettify())
div_tag = soup.find(name='div', attrs={'class': 'small subtle link'})
div_content_text = []
for curr_text in div_tag.find_all(recursive=False, text=True):
curr_text = curr_text.strip()
if curr_text:
div_content_text.append(curr_text)
print(div_content_text)

Edit: The solution by Sushil is quite clean, too.

Want to gain skills in Data Science with Python? Sign up today for this Data Science with Python and be a master in it

Taking specific text from a div in python3

1 Answer

Related questions

Browse Categories