Bug #72

GET_TEXT doesn't detect all unicode characters

Added by Pierre Marc over 12 years ago. Updated about 12 years ago.

Status:ClosedStart date:08/01/2012
Priority:NormalDue date:
Assignee:Pierre Marc% Done:

100%

Category:-
Target version:2.13
Operating System:Any Tested:Yes
Version:

Description

In some situations, the GET_TEXT command cannot convert unicode characters.

History

#1 Updated by Pierre Marc over 12 years ago

  • Status changed from New to In Progress

This is a bug. The unicode characters are in a pdf object of type ToUnicode. When this object caontains more that one range of characters, the PDF service cannot detect all characters.

#2 Updated by Pierre Marc over 12 years ago

  • Status changed from In Progress to Resolved
  • % Done changed from 0 to 100
  • Tested changed from No to Yes

The unicode detection has been enhanced and the bug is corrected in v 2.13.

#3 Updated by Pierre Marc about 12 years ago

  • Status changed from Resolved to Closed

Also available in: Atom PDF