Sign In  Sign Up Live-Chat

Extract text from PDF is broken

Last post 06-06-2007, 7:48 AM by AdeelTaseer. 3 replies.
Sort Posts: Previous Next
  •  06-05-2007, 2:53 PM 79450

    Extract text from PDF is broken

    Using 2.51, when extracting text from the PDF, every word has extraneous spaces inserted into it so that the text is completely unusable.  For instance:  "JEFF JONES"  may come out as "J E F F   J O N E S" or something even more random.  Since that is all we use the product for, it is completely broken from our perspective.

    Is there a fix?

    TIA

     
  •  06-05-2007, 8:38 PM 79471 in reply to 79450

    Re: Extract text from PDF is broken

    Hi,

    Thank you for considering Aspose.

    I am unable to reproduce the error. I have checked with Aspose.Pdf.Kit for .Net v2.5.1.0 and runtime v2.0.50727. Please make sure you are using the latest version. If you still get problems, then please send your sample Pdf file to us. We will investigate this issue in detail.

    Thanks.

    Adeel Ahmad
    Support Developer
    Aspose Changsha Team
    http://www.aspose.com/Wiki/default.aspx/Aspose.Corporate/ContactChangsha.html

     
  •  06-05-2007, 11:02 PM 79488 in reply to 79450

    Re: Extract text from PDF is broken

    Attachment: Present (inaccessible)
    Attached.  Every PDF I tested had the problem and all with different spacing problems.
     
  •  06-06-2007, 7:48 AM 79554 in reply to 79488

    Re: Extract text from PDF is broken

    Hi,

    I have reproduced the error and logged it as PDFKITNET-3143. I will discuss this with the developers and we will inform you as soon as a solution is found.

    Thanks.

    Adeel Ahmad
    Support Developer
    Aspose Changsha Team
    http://www.aspose.com/Wiki/default.aspx/Aspose.Corporate/ContactChangsha.html

     
View as RSS news feed in XML