PDFlib TET (Text and Image Extraction Toolkit) reliably extracts text, images and metadata from PDF documents. TET provides the text contents of a PDF as Unicode strings, along with detailed color, glyph and font information and the position of the text on the page. Raster images are extracted in common image formats. TET can optionally convert PDF documents to an XML-based format called TETML, which contains text and metadata as well as resource information. TET includes advanced content analysis algorithms for determining word boundaries, grouping text into columns, identifying table structures and removing redundant items such as shadow text.
With PDFlib TET you can:
The TET family comprises the following products: