extraction and repurposing of
text, graphics & metadata
in PDF Files
(PDF Data Mining):

Citation Software

 


Looking for a product or
service? Our Solution Finder
can help you find it fast!
 
   • Products & Services     • Buy software   
   • Downloads     • Support   
   • Mailpiece-design site     • Client list   
   • Company information     • Links   
   • Free newsletter     • FAQ   
   • Case studies     • Contact us   
   • News archives     • Press   
   • Customer testimonials   
   • Home   
   
 
 
  Click here to sign up for our free newsletter.  






































































































































































































































































   Citation Software Inc.
 Specialists in variable-data publishing since 1986
  
  
 www.CitationSoftware.com               info@CitationSoftware.com               508-436-2543
  
  
  
 

   Search
    
* Click here to sign up for our free newsletter.
 

 

  
Solution Finder

 
  PDF data mining (extraction and repurposing of text, graphics & metadata in PDF files)  

Click here to go to the Solution Finder Menu
 
 
 
  These are the products that we offer for this Solution Finder
category.  Click on the product that you want, or scroll down the
page to read about all the products.  If you're not sure which
product you need, call us!

  XpdfInfo SDK   PDF-repurposing solutions
  XpdfText SDK










XpdfText SDK

The XpdfText SDK is a developer's library/SDK that converts PDF files to plain text. In other words, it extracts text from PDF files.

The PDF file can be in memory or on disk; and likewise, the text can be extracted to memory or directly to disk.

The XpdfText SDK generates plain ASCII, 8-bit Latin1, UTF-8 Unicode, and various standard CJK encodings, and it can be configured to generate custom encodings.

The XpdfText SDK is available as a COM component or a DLL for Windows platforms and as a shared library for Unix platforms. Portable C++ source code is also available.

The software is easy to use:

PDFHandle pdf;
char *buf;
int length;

pdfLoadFile(&pdf, "MyFile.pdf");

// convert to a text file on disk...
pdfConvertToTextFile(pdf, 1, 5, "MyFile.txt");

// ... or convert in memory
buf = pdfConvertToTextString(pdf, 1, 5, &length);
RELATED PRODUCTS:
 • XpdfInfo SDK

Pricing starts at $475.00 USD for a developer's license and $18.00 USD per unit for runtime licenses. (Pricing is subject to change without notice.)

Volume discounts are available. Call us at 508-436-2543 to get a price quote.


If you're not using Windows,
call us at 508-436-2543 to get your free trial version.









XpdfInfo SDK

The XpdfInfo SDK is a developer's library/SDK that extracts page count, page sizes, and metadata from PDF files.

The XpdfInfo SDK is easy to use!

PDFHandle pdf;
char *title;
int length;

pdfLoadFile(&pdf, "MyFile.pdf");
title = pdfGetTitle(pdf, &length);
printf("%s\n", title);
 
RELATED PRODUCTS:
 • XpdfText SDK

The XpdfInfo SDK is available as a COM component or a DLL for Windows platforms and as a shared library for Unix platforms. Portable C++ source code is also available.

Pricing starts at $235.00 USD for a developer's license and $9.00 USD per unit for runtime licenses. (Pricing is subject to change without notice.)

Volume discounts are available. Call us at 508-436-2543 to get a price quote.


If you're not using Windows,
call us at 508-436-2543 to get your free trial version.









PDF-repurposing solutions

We offer software that lets you extract text and images from PDF files in a format that maintains the relationships among the various content elements (headings, subheadings, etc.).  
RELATED PRODUCTS:
 • XpdfInfo SDK
 • XpdfText SDK
This software is available in two forms:
  1. Storypad: a collection of PDF-extraction solutions that run either on a desktop or on a server (and can be customized for you if need be),

    and

  2. JPedal: a Java developers library that can be used by software engineers that are creating their own extraction solutions.
Click on the illustrations below to get an idea of what you can get when you use this software to extract text and images from PDF files.

Storypad displaying extracted graphic with clipping and scaling     Storypad displaying a newspaper page     Storypad displaying extracted text with XHTML markup

Pricing varies depending on your requirements.

To get more details, click the "Get detailed information about PDF-repurposing solutions" button.









 
Didn't find exactly what you need? Not sure exactly what you need? Contact us by phone at 508-436-2543, or send e-mail to info@CitationSoftware.com. We can help you find appropriate software for your requirements.
 
 




    
Click here to go to the Solution Finder Menu

• Products & Services   • Buy software   • Downloads   • Support
• Mailpiece-design site   • Client list   • Company information   • Links
• Free newsletter   • FAQ   • Case studies   • Contact us
• News archives   • Press   • Customer testimonials   • Home


   Search

Copyright © 2008 Citation Software Inc.
info@CitationSoftware.com
508-436-2543
www.CitationSoftware.com
print on demand
PDF data mining PDF text extraction PDF image extraction PDF repurposing Storypad JPedal PDF data mining PDF text extraction PDF image extraction PDF repurposing > Storypad JPedal PDF data mining PDF text extraction PDF image extraction PDF repurposing Storypad JPedal PDF data mining PDF text extraction PDF image extraction PDF repurposing Storypad JPedal PDF data mining PDF text extraction PDF image extraction PDF repurposing Storypad JPedal