PDFlib TET extracts text, images and metadata from PDF documents. It exposes the text content of a PDF as Unicode strings, along with detailed glyph and font information and the position of the text on the page. Images are extracted in common raster formats. TET can optionally convert PDF documents to TETML, an XML-based format that contains text and metadata as well as resource information.
Full Specifications
What's new in version 4.2
General
Release: May 23, 2013
Date Added: May 24, 2013
Version: 4.2
Operating Systems
Operating Systems: Windows 8, Windows Vista, Windows, Windows 7, Windows XP