|
About:
PDFTextStream is a PDF text and metadata extraction library available for Java, Python, and .NET. It supports all versions of the PDF document specification, (including v1.7, used by Acrobat 8), extraction of text encoded using double-byte character sets (including Chinese, Japanese, and Korean), decryption of 40-bit and 128-bit encrypted documents, and extraction of all document metadata provided by PDF documents (including form data, bookmarks, and annotations). Easy integration with Jakarta Lucene is included, as well as interactive form update capability.
Author:
Snowtide Informatics Systems, Inc. [contact developer]
Homepage:
http://snowtide.com/
Changelog:
http://snowtide.com/ChangeLog
Purchase:
http://snowtide.com/Purchase
Demo site:
http://snowtide.com/PDFTextOnline
Trove categories:
[change]
| [Development Status] | | 5 - Production/Stable | | [Environment] | | Other Environment, Web Environment, Win32 (MS Windows) | | [Intended Audience] | | Developers | | [License] | | Other/Proprietary License with Free Trial | | [Operating System] | | MacOS X, Microsoft :: Windows :: Cygwin, Microsoft :: Windows :: Windows NT/2000/XP, OS Independent, POSIX :: BSD, POSIX :: BSD :: BSD/OS, POSIX :: BSD :: FreeBSD, POSIX :: BSD :: NetBSD, POSIX :: BSD :: OpenBSD, POSIX :: HP-UX, POSIX :: Linux, POSIX :: SunOS/Solaris, Unix | | [Programming Language] | | Java | | [Topic] | | Information Management :: Document Repositories, Internet :: WWW/HTTP :: Indexing/Search, Software Development :: Libraries :: Java Libraries |
Dependencies:
[change]
Apache Lucene (optional)
[download links]
|
|
» Rating:
(not rated)
» Vitality: 0.01% (Rank 5064)
» Popularity: 0.97% (Rank 5931)

(click to enlarge graphs)
Record hits: 14,307
URL hits: 3,649
Subscribers: 19
|
|