Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Sample pdf Bank Document (Broker Contract Note) (#106)
* Sample pdf Bank Document (Broker Contract Note) Close #105 * Align consideration column
- Loading branch information
d269555
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The donor pdf is an actual broker contract note appropriated from another person with the pdf code edited manually. The pdf file structure is not trivial but an overview is given at https://brendanzagaeski.appspot.com/0005.html.
This will often work: https://www.pdflabs.com/docs/pdftk-cli-examples/
Unfortunately, the data is contained in a compressed form so it was [sort of] uncompressed as follows:
E:\pdfbox3>java -jar pdfbox-app-3.0.0.jar decode donor.pdf SelfwealthBuy01.pdf
File was edited in np++ using search to find locations (see line numbers below) and HEX-Editor Plugin for Notepad++ v8.2.2 v0.9.12 to edit data. The main data with changes are shown below. Some desired font characters were not contained in the pdf so the bold font was dropped for security owner's name and the
generated at
was changed from the original Buy01.txt.I didn't worry about looking up the character mapping but for the main text the digits 0..9 were hex 00 followed by hex 13..1c.