Pdf To Text是一個(gè)PDF文檔轉(zhuǎn)換器,它可以在任何.NET程序中從PDF文檔中提取TEXT文本。而且它可以非常方便的整合到現(xiàn)有的.NET應(yīng)用程序中,而無(wú)需安裝。
The ExpertPDF Pdf to Text Converter can be used in any type of .NET application to extract the text from a PDF document. The integration with existing .NET applications is extremely easy and no installation is necessary in order to run the converter.
通過(guò)下載可直接獲得一個(gè).net2.0的程序集和一個(gè)可直接使用的控制臺(tái)示例程序。并且Samples文件夾中的示例程序均包含了完整的C#源碼。所提供的示例程序均可在Visual Studio 2005或Visual Studio 2008上進(jìn)行調(diào)試。使用ExpertPDF執(zhí)行轉(zhuǎn)換后,你可以將所提取的文本作為.NET字符串對(duì)象,以便進(jìn)行搜索或?qū)⑵浯嫒氡镜匚募小?/p>
特征:
- .NET2.0開發(fā)庫(kù)文件和C#的示例程序
- 可從PDF文件或PDF流中提取text
- 提取的text可保持PDF原有頁(yè)面布局
- 可按PDF閱讀順序提取text
- 可只提取頁(yè)面中的指定部分
- 可將提取的text存儲(chǔ)為html格式并添加元標(biāo)簽描述
- 以html元標(biāo)簽形式添加從PDF獲得的標(biāo)題,關(guān)鍵字,作者等關(guān)鍵字
- 可為提取后的text添加特殊字符標(biāo)記分頁(yè)符
- 支持從密碼保護(hù)的PDF文檔中提取text
The downloadable archive contains the assembly for .NET 2.0 and a ready-to-use sample console application. The full C# source code for the sample application is available in the Samples folder. The sample application can be built with Visual Studio 2005 or Visual Studio 2008. The result of conversion is a .NET String object that you can use for example in search operations or save into a file on disk.
Features:
- .NET 2.0 development library and C# samples
- Extract text from PDF stream or a PDF file
- Extract text preserving the original PDF layout
- Extract text in PDF reading order
- Specify the range of pages to be extracted
- Save the extracted text in a HTML format and add description meta tags
- Add the title, keywords, author from PDF description in HTML meta tags
- Mark the page breaks in the extracted text with a special character
- Extract text from password protected PDF documents