Only then you'll be able to reverse-map the encoded characters to character names. You can only reverse a CustomEncode-d text, if there is a /ToUnicode table defined inside the PDF. On top of that, there can also be a CustomEncoding (which comes into play when the embedded font is a subset, and does not contain all glyphs defined by the font, but only those glyphs required by the document). There are 5 spec-defined standard encodings which may be used: You will learn, that in order to reverse the PDF source code to text contents, you have to reverse-apply the encoding used by the font. Be warned: this is a 756 page document, and it refers to about 90 other documents, which it declares to be also "normative" for PDF. Before you start an ambitious project like this, you should make yourself familiar with the complete official PDF-1.7 specification.
0 Comments
Leave a Reply. |