পাইথনে BeautifulSoup ব্যবহার করে খালি ট্যাগগুলি কীভাবে সরিয়ে ফেলা যায়?

BeautifulSoup হল একটি পাইথন লাইব্রেরি যা HTML এবং XML ফাইলগুলি থেকে ডেটা বের করে৷ BeautifulSoup ব্যবহার করে, আমরা HTML বা XML ডকুমেন্টে উপস্থিত খালি ট্যাগগুলিকেও সরিয়ে দিতে পারি এবং প্রদত্ত ডেটাকে মানবে রূপান্তর করতে পারি৷ পঠনযোগ্য ফাইল।

প্রথমে, আমরা কমান্ডটি ব্যবহার করে আমাদের স্থানীয় পরিবেশে BeautifulSoup লাইব্রেরি ইনস্টল করব:pip install beautifulsoup4

উদাহরণ

#Import the BeautifulSoup library

from bs4 import BeautifulSoup

#Get the html document
html_object = """
<p>Python is an interpreted, high-level and general-purpose
programming language. Python's design
philosophy emphasizes code readability with its notable use of
significant indentation.</p>
"""

#Let us create the soup for the given html document
soup = BeautifulSoup(html_object, "lxml")

#Iterate over each line of the document and extract the data
for x in soup.find_all():
   if len(x.get_text(strip=True)) == 0:
      x.extract()

print(soup)

আউটপুট

উপরের কোডটি চালানোর ফলে আউটপুট উৎপন্ন হবে এবং প্রদত্ত HTML ডকুমেন্টকে এর মধ্যে থাকা খালি ট্যাগগুলি সরিয়ে মানব পাঠযোগ্য কোডে রূপান্তরিত করবে।

<html><body><p>Python is an interpreted, high−level and general−purpose programming
language. Python's design
philosophy emphasizes code readability with its notable use of significant indentation.</p>
</body></html>