Citation Software Inc.
Specialists in variable-data publishing since 1986
www.CitationSoftware.com info@CitationSoftware.com 508-436-2543
Solution Finder
PDF data mining (extraction and repurposing of text, graphics & metadata in PDF files)
These are the products that we offer in this Solution Finder
category. If you're not sure which product you need, call us!
XpdfText SDK
The XpdfText SDK is a developer's library that extracts text from PDF files and converts it to plain text.
The PDF file can be read from memory or from disk, and the extracted text can be written
to memory or directly to disk.
The XpdfText SDK generates plain ASCII, 8-bit Latin1, UTF-8 Unicode, and various standard
CJK encodings, and it can be configured to generate custom encodings.
The XpdfText SDK is available as a COM component or a DLL for Windows platforms and as a
shared library for Unix platforms. Portable C++ source code is also available.
The software is easy to use:
PDFHandle pdf;
char *buf;
int length;
pdfLoadFile(&pdf, "MyFile.pdf");
// convert to a text file on disk...
pdfConvertToTextFile(pdf, 1, 5, "MyFile.txt");
// ... or convert in memory
buf = pdfConvertToTextString(pdf, 1, 5, &length);
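
The 8-bit Latin1 and UTF-8 outputs mentioned above differ only for non-ASCII characters: Latin1 stores each character in one byte, while UTF-8 stores characters above 0x7F in two or more bytes. The sketch below illustrates that difference with a standalone conversion routine. It is not part of the XpdfText SDK, and the name latin1_to_utf8 is a hypothetical helper chosen for this example.

```c
#include <assert.h>
#include <stddef.h>

/* Convert `len` bytes of Latin1 text in `src` to UTF-8 in `dst`.
 * `dst` must have room for at least 2 * len + 1 bytes.
 * Returns the number of bytes written, not counting the trailing NUL. */
size_t latin1_to_utf8(const unsigned char *src, size_t len, unsigned char *dst)
{
    size_t out = 0;

    assert(src != NULL && dst != NULL);

    for (size_t i = 0; i < len; i++) {
        unsigned char c = src[i];
        if (c < 0x80) {
            /* ASCII range: identical in both encodings. */
            dst[out++] = c;
        } else {
            /* 0x80..0xFF: becomes a two-byte UTF-8 sequence. */
            dst[out++] = (unsigned char)(0xC0 | (c >> 6));
            dst[out++] = (unsigned char)(0x80 | (c & 0x3F));
        }
    }
    dst[out] = '\0';
    return out;
}
```

For instance, the four Latin1 bytes of "caf\xE9" become five UTF-8 bytes, because the final character expands to the pair 0xC3 0xA9. Text that is pure ASCII is byte-for-byte the same in all three of the ASCII, Latin1, and UTF-8 outputs.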
Pricing starts at $475.00 USD for a developer's license and $18.00 USD
per unit for runtime licenses.
(Pricing is subject to change without notice.)
Volume discounts are available. Call us at
508-436-2543 to get a price quote.
XpdfInfo SDK
The XpdfInfo SDK is a developer's library/SDK that extracts page count, page sizes, and
metadata from PDF files.
The XpdfInfo SDK is easy to use:
PDFHandle pdf;
char *title;
int length;
pdfLoadFile(&pdf, "MyFile.pdf");
title = pdfGetTitle(pdf, &length);
printf("%s\n", title);
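
The sample above receives the title together with a length. This page does not state whether the returned buffer is NUL-terminated or who owns it, so a cautious caller might copy the bytes into a string of its own before printing. The following is a minimal sketch under that assumption. The helper copy_as_c_string is hypothetical and is not part of the XpdfInfo SDK.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Copy `length` bytes from `buf` into a newly allocated,
 * NUL-terminated string. Returns NULL if `buf` is NULL or if
 * allocation fails. The caller must free() the result. */
char *copy_as_c_string(const char *buf, int length)
{
    char *copy;

    assert(length >= 0);  /* a negative length is a caller bug */

    if (buf == NULL)
        return NULL;

    copy = malloc((size_t)length + 1);
    if (copy == NULL)
        return NULL;

    memcpy(copy, buf, (size_t)length);
    copy[length] = '\0';
    return copy;
}
```

With a helper like this, the printf call would print the copy rather than the raw pointer, and a NULL result would signal that no title was available.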
The XpdfInfo SDK is available as a COM component or a DLL for Windows platforms
and as a shared library for Unix platforms. Portable C++ source code is also available.
Pricing starts at $235.00 USD for a developer's license and $9.00 USD
per unit for runtime licenses.
(Pricing is subject to change without notice.)
Volume discounts are available. Call us at
508-436-2543 to get a price quote.
PDF-repurposing solutions
We offer software that lets you extract text and images from PDF files in a format
that maintains the relationships among the various content elements
(headings, subheadings, etc.).
This software is available in two forms:
- Storypad: a collection of PDF-extraction solutions
that run on a desktop or on a server
and can be customized for you if needed
- JPedal: a Java developer's library for
software engineers who are building their
own extraction solutions
Pricing varies depending on your requirements.
Didn't find exactly what you need? Not sure exactly what you need?
Contact us by phone at 508-436-2543, or send e-mail to
info@CitationSoftware.com.
We can help you find appropriate software for your requirements.
Copyright © 2008 Citation Software Inc.