Is there an open source library that helps me read/parse PDF documents in .NET/C#?
There is also LibHaru:
http://libharu.org/wiki/Main_Page
iText is the best library I know. It was originally written in Java, and there is a .NET port as well.
See http://www.ujihara.jp/iTextdotNET/en/
The project at http://www.c-sharpcorner.com/UploadFile/psingh/PDFFileGenerator12062005235236PM/PDFFileGenerator.aspx is open source and may be a good starting point for you.
PDF Clown might help, but I would not recommend it for a large or heavily used application.
I have used iTextSharp in the past to manipulate, split, and reassemble PDF documents. It's pretty straightforward, and it's also open source.
iTextSharp is your best bet. I used it to build a spider for Lucene.Net so that it could crawl PDFs.