Skip to content

Commit

Permalink
extend text to <back> part of the document to capture funding, data a…
Browse files Browse the repository at this point in the history
…vailability and other information
  • Loading branch information
lfoppiano committed Apr 3, 2024
1 parent 168d47b commit 6f06e96
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions document_qa/grobid_processors.py
Original file line number Diff line number Diff line change
Expand Up @@ -815,6 +815,8 @@ def get_xml_nodes_body(soup: object, use_paragraphs: bool = True, verbose: bool
# nodes.extend([subchild.find_all(tag_name) for subchild in child.find_all("body")])
nodes.extend(
[subsubchild for subchild in child.find_all("body") for subsubchild in subchild.find_all(tag_name)])
nodes.extend(
[subsubchild for subchild in child.find_all("back") for subsubchild in subchild.find_all(tag_name)])

if verbose:
print(str(nodes))
Expand Down

0 comments on commit 6f06e96

Please sign in to comment.