Skip to content

JATS Parser only captures the first award-id tag in each award-group #166

@seasidesparrow

Description

@seasidesparrow

award_id_tag = ag.find("award-id")

The current JATS Parser assumes that there will be only one award-id in a given award-group but this is not necessarily the case. As an example, the XML for the A+A paper 10.1051/0004-6361/202453501 has the following funding-group object:

<funding-group><award-group><funding-source><named-content content-type="funder-name">Australian Research Council</named-content><named-content content-type="funder-id">https://doi.org/10.13039/501100000923</named-content></funding-source><award-id>DP230102280</award-id><award-id>DP250101526</award-id><award-id>FL220100020</award-id></award-group></funding-group>

In this case the existing parser only returns one funding element for award number DP230102280, but there are two others that are skipped.

There should be a loop in this section of jats parser where we do a find_all("award-id") instead of a singular find

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions