How to find the x,y location of text in a PDF

Posted 2019-03-30 03:35

Is there a tool that can find the x,y position of a piece of text in a PDF file?

Tags: pdf text pdf-generation acrobat pdflib

3 answers

何必那么认真

Reply #2 · 2019-03-30 04:21

In Acrobat, run "Preflight..." and choose PDF Analysis -> List page objects, grouped by type of object.

Locate the text objects in the results list. Each one shows a position value (in points) under Text Properties -> * Font.
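
The positions reported there are in PDF points. As a minimal sketch, assuming the default PDF user space (1 point = 1/72 inch, origin at the bottom-left corner of the page) and an unrotated page, the helpers below convert a point value to millimetres and flip the y-axis to a top-left origin. The function names, the 792 pt page height, and the sample position are illustrative assumptions, not something Acrobat outputs.

```python
POINTS_PER_INCH = 72.0  # default PDF user space unit: 1 pt = 1/72 inch
MM_PER_INCH = 25.4


def points_to_mm(value_pt):
    """Convert a length in PDF points to millimetres."""
    return value_pt / POINTS_PER_INCH * MM_PER_INCH


def to_top_left_origin(x_pt, y_pt, page_height_pt):
    """Flip the y-axis so that (0, 0) is the top-left corner of the page."""
    return x_pt, page_height_pt - y_pt


# Hypothetical position read from the Preflight results, on a 612 x 792 pt page.
x, y = 72.0, 720.0
print(points_to_mm(x))                  # 25.4
print(to_top_left_origin(x, y, 792.0))  # (72.0, 72.0)
```

The y-flip is only needed when the consumer of the coordinates (an image viewer or an HTML overlay, for example) measures from the top of the page.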


Summer. ? 凉城

Reply #3 · 2019-03-30 04:30

The Docotic.Pdf library can do this. See the C# sample below:

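using System;
using BitMiracle.Docotic.Pdf;

// Open the document; the second argument is the password, if one is needed.
using (PdfDocument doc = new PdfDocument("your_pdf.pdf", "password_if_need"))
{
    // Print the position and text of each text chunk on the first page.
    foreach (PdfTextData textData in doc.Pages[0].Canvas.GetTextData())
        Console.WriteLine(textData.Position + " " + textData.Text);
}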


狗以群分

Reply #4 · 2019-03-30 04:42

TET, the Text Extraction Toolkit from the PDFlib family of products, can do that. TET has a command-line interface, and it is the most powerful text extraction tool I'm aware of. (It can even handle ligatures.)

Geometry
TET provides precise metrics for the text, such as the position on the page, glyph widths, and text direction. Specific areas on the page can be excluded or included in the text extraction, e.g. to ignore headers and footers or margins.
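
The include/exclude idea can be illustrated independently of TET. The following is a minimal sketch, not TET's API: it assumes some tool has already extracted text chunks as records with x,y positions in points (bottom-left origin), and it keeps only the chunks whose position falls inside a rectangle. `TextChunk`, `in_region`, and the sample data are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class TextChunk:
    text: str
    x: float  # left edge, in points
    y: float  # baseline, in points from the bottom of the page


def in_region(chunk, left, bottom, right, top):
    """Return True if the chunk's position lies inside the rectangle."""
    return left <= chunk.x <= right and bottom <= chunk.y <= top


# Hypothetical chunks from a 612 x 792 pt page.
chunks = [
    TextChunk("Page header", 72.0, 770.0),
    TextChunk("Body text", 72.0, 400.0),
    TextChunk("Page 1 of 3", 280.0, 20.0),
]

# Keep the body area, dropping a 50 pt band at the top and bottom of the page.
body = [c for c in chunks if in_region(c, 0.0, 50.0, 612.0, 742.0)]
print([(c.text, c.x, c.y) for c in body])  # [('Body text', 72.0, 400.0)]
```

The same filter, inverted, gives an exclusion region; a real tool would typically test the full bounding box of each chunk instead of a single anchor point.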
