Most Pythonic way to find the sibling of an element in XML

Question

asked Jul 26, 2019 in Python by Rajesh Malhotra (19.9k points)

Problem: I have the following XML snippet:

...snip...
DEFINITION
This, these. 
PRONUNCIATION 
..snip...

I need to search the totality of the XML, find the heading that has text DEFINITION, and print the associated definitions. The format of the definitions is not consistent and can change attributes/elements so the only reliable way of capturing all of it is to read until the next element with attribute p_cat_heading.

Right now I am using the following code to find all of the headers:

for heading in root.findall(".//*[@class='p_cat_heading']"):
if heading.text == "DEFINITION":
<WE FOUND THE CORRECT HEADER - TAKE ACTION HERE>

Things I have tried:

Using lxml's getnext method. This gets the next sibling which has the attribute "p_cat_heading" which isn't what I want.

following_sibling - lxml is supposed to support this but it throws "following-sibling is not found in prefix-map"

My Solution:

I haven't finished it, but because my XML is short I was just going to get a list of all elements, iterate until the one with the DEFINITION attribute, and then iterate until the next element with the p_cat_heading attribute. This solution is horrible and ugly, but I can't seem to find a clean alternative.

What I'm looking for:

A more Pythonic way of printing the definition which is "this, these" in our case. Solution may use either xpath or some alternative. Python-native solutions preferred, but anything will do.

1 Answer

Related questions

0 votes

1 answer

Most pythonic way to delete a file which may not exist

asked Dec 7, 2020 in Python by ashely (50.2k points)

0 votes

1 answer

Shorter, more pythonic way of writing an if statement

asked Jun 8, 2019 in Python by Anil (1.1k points)

0 votes

1 answer

What is the pythonic way to calculate dot product?

asked Oct 3, 2019 in Python by Sammy (47.6k points)

0 votes

1 answer

How to find all occurrences of an element in a list ?

asked Nov 23, 2020 in Python by laddulakshana (16.4k points)

0 votes

4 answers

What is the most efficient way to find amicable numbers in python?

asked May 9, 2021 in Python by laddulakshana (16.4k points)

Anirudh Singh · Answer 1 · 2019-07-27T09:54:34+0000

You can use xpath:

//*[@class='p_cat_heading'][contains(text(),'DEFINITION')]/following-sibling::*[1]

Or you can use lxml:

from lxml import html

data = [your snippet above]
exp = "//*[@class='p_cat_heading'][contains(text(),'DEFINITION')]/following-sibling::*[1]"

tree = html.fromstring(data)
target = tree.xpath(exp)

for i in target:
print(i.text_content())

Most Pythonic way to find the sibling of an element in XML

1 Answer

Related questions

Browse By Domains

Popular Courses

Popular Tutorials

Popular Resources