DocuFreezer 3.0 – OCR Your Images & Scans to Text and Get Better Performance

Our batch file converter DocuFreezer gets a new major update! It follows version 2.0, released in June 2018. Version 2.0 changed the interface entirely, whereas DocuFreezer 3.0 brings many performance improvements for faster conversion of your files. We also added new features: OCR functionality and new input and output file formats.

DocuFreezer 3.0 - Major update

What’s new in DocuFreezer 3.0:

Added:

  • New ability to OCR scans, images, and other data to searchable PDF or TXT
  • New output file format: plain text TXT
  • New ability to convert DWG and DXF drawings to searchable PDF and images
  • New ability to split Excel files into separate worksheets
  • New ability to convert to monochrome TIFF and PNG
  • New ability to add text watermarks
  • New ability to process an unlimited number of files per session

Improved:

  • Faster file conversion to PDF
  • Faster file conversion to JPEG, PNG, TIFF
  • Improved processing of high-resolution files with large DPI
  • Improved conversion of EML and MSG with attachments to a single PDF
  • Improved processing of archives with a multi-hierarchical structure
  • Minor improvements and fixes

New ability to OCR scans, images, and other data to searchable PDF or TXT

Version 3.0 adds a groundbreaking optical character recognition (OCR) feature. DocuFreezer can now convert images and document scans in PDF, TIFF, JPEG, and PNG to a searchable PDF or a TXT file. You can also get text from AutoCAD (DWG, DXF) and Excel files (XLS, XLSX). The list of supported languages includes English, Russian, German, Japanese, Spanish and Hebrew.

OCR can be turned on if you select TXT or PDF as the output file format. If the same document contains text in several languages, specify them in DocuFreezer settings:

OCR scans, images, and other data to searchable PDF or TXT with DocuFreezer

English is set by default. For better results, select only the languages that actually appear in your documents (don’t select All if the document text is just in English and German).

Please note that text recognition is not perfect: some characters may be corrupted or recognized incorrectly. For the best results, make sure the original files are high-resolution images.

Convert image to text with DocuFreezer

New output file format: plain text TXT

Batch convert scans, images and text documents to TXT

Now you can convert Word DOC, DOCX, RTF and other documents to the TXT format.

Based on our customers’ requests, we decided to introduce plain text (TXT) as an output file type instead of the less popular XPS format. Now you can convert your text documents to plain text (and images containing text too – using OCR!). The possible combinations are:

  • DOC to TXT
  • DOCX to TXT
  • PDF to TXT
  • RTF to TXT
  • ODT to TXT
  • XLSX to TXT

If you want to have more input file formats available here, please let us know.

New ability to convert DWG and DXF drawings to searchable PDF and images

The new DocuFreezer can convert DWG and DXF drawings to the PDF format and popular types of images. With the support for technical drawings, DocuFreezer is now a true batch DWG to PDF converter. It allows you to convert DWG / DXF to PDF or images (JPEG, TIFF, PNG) without the need for AutoCAD. What is more, the program keeps the original text of your DWG or DXF drawings in the output PDF, making it searchable instead of converting the text to curves.

New ability to split Excel files into separate worksheets

Split Excel files into separate worksheets

With the new version, you can have your XLS or XLSX workbooks split into separate worksheets during the conversion to PDF, text or image format. Conveniently, output files are automatically named as [filename]-[worksheet name]. To turn this option on, go to Settings > Advanced and enable Split worksheets into multiple files.
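The naming pattern can be illustrated with a short Python sketch. This is not DocuFreezer's code: the function name `split_output_names` is hypothetical, and it assumes that [filename] means the workbook name without its extension and that the output extension is appended at the end.

```python
from pathlib import PurePath


def split_output_names(workbook_path, sheet_names, extension):
    """Build one output name per worksheet as [filename]-[worksheet name]."""
    # Assumption: [filename] is the workbook name without its extension.
    stem = PurePath(workbook_path).stem
    return [f"{stem}-{sheet}.{extension}" for sheet in sheet_names]


print(split_output_names("budget.xlsx", ["Q1", "Q2", "Summary"], "pdf"))
# ['budget-Q1.pdf', 'budget-Q2.pdf', 'budget-Summary.pdf']
```

A workbook with three worksheets therefore produces three output files, one per sheet.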

New ability to add text watermarks

DocuFreezer can now put text watermarks on the pages of your documents in bulk while converting them. The watermark is set up in the Ini File Editor by adding new values.

New ability to convert to monochrome TIFF and PNG

Grayscale or monochrome? You choose – with DocuFreezer

In the new version, you can select more color models for TIFFs and PNGs. We added a new monochrome color space, which can be selected in Settings > Output file > Color space. Overall, four different color models are available for output PNG and TIFF files: RGB, RGBA, Grayscale, and Monochrome.
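To show how Grayscale and Monochrome differ, here is a minimal Python sketch of the two mappings for a single pixel. It assumes standard BT.601 luma weights and a fixed threshold of 128; how DocuFreezer itself produces monochrome output (thresholding, dithering, or something else) is not stated here, so treat this only as an illustration of the concept.

```python
def to_grayscale(pixel):
    """Map an (R, G, B) pixel to one of 256 gray levels (BT.601 luma weights)."""
    r, g, b = pixel
    return round(0.299 * r + 0.587 * g + 0.114 * b)


def to_monochrome(pixel, threshold=128):
    """Map an (R, G, B) pixel to pure black (0) or pure white (255)."""
    # Assumption: a simple fixed threshold, with no dithering.
    return 255 if to_grayscale(pixel) >= threshold else 0


row = [(255, 255, 255), (200, 30, 30), (20, 20, 120), (0, 0, 0)]
print([to_grayscale(p) for p in row])   # [255, 81, 31, 0]
print([to_monochrome(p) for p in row])  # [255, 0, 0, 0]
```

Grayscale keeps intermediate shades, while monochrome keeps only two values, which usually gives much smaller files for text-only scans.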

New ability to process an unlimited number of files per session

If you run the software on a server and process large numbers of files, available random-access memory can become an issue. We made an improvement that lets DocuFreezer operate 24/7 without any problems with allocated memory. If the amount of leaked memory exceeds a certain limit, the program engine restarts automatically, so the user will not even notice it. This approach allows you to handle any number of files per session.
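The restart-on-limit idea can be sketched in a few lines of Python. This is a simulation under stated assumptions, not the actual engine: the `Engine` class, the 5 MB leak per file, and the 50 MB limit are all made up for illustration.

```python
class Engine:
    """Stand-in for a conversion engine that leaks a little memory per file."""

    def __init__(self):
        self.leaked_mb = 0

    def convert(self, file_name, leak_mb=5):
        self.leaked_mb += leak_mb  # simulated leak, not a real measurement
        return f"{file_name}.pdf"


def convert_all(files, limit_mb=50):
    """Convert every file, replacing the engine once leaked memory exceeds limit_mb."""
    engine, restarts, outputs = Engine(), 0, []
    for name in files:
        outputs.append(engine.convert(name))
        if engine.leaked_mb > limit_mb:
            engine = Engine()  # restart: the fresh engine has no leaked memory
            restarts += 1
    return outputs, restarts


outputs, restarts = convert_all([f"doc{i}" for i in range(100)])
print(len(outputs), restarts)  # 100 9
```

Because the check runs between files, the queue keeps going across restarts and the leaked memory never grows without bound.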

Faster batch file conversion

Faster file conversion to PDF

We achieved a significant increase in the speed of converting files. We ran a performance comparison for popular file types, with Word and PDF files as input and PDF and JPEG as output. The results are shown below:

120 files converted with DocuFreezer 2.0 vs. 3.0

Elapsed time is given as mm:ss.

| Conversion | Version | Run 1 | Run 2 | Run 3 | Average | Result |
|---|---|---|---|---|---|---|
| Word DOC, DOCX to PDF | DocuFreezer 2.0 | 7:05 | 8:53 | 7:48 | 7:55 | |
| Word DOC, DOCX to PDF | DocuFreezer 3.0 | 2:45 | 2:40 | 2:43 | 2:43 | 2.9x faster |
| Merge JPEG to Single PDF | DocuFreezer 2.0 | 2:30 | 2:32 | 2:30 | 2:31 | |
| Merge JPEG to Single PDF | DocuFreezer 3.0 | 1:50 | 1:46 | 1:45 | 1:47 | 1.4x faster |
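The averages and speed-up factors for the two conversions to PDF can be reproduced from the individual run times. The following Python sketch does the arithmetic; the helper names are our own, and the only inputs are the run times listed above.

```python
def to_seconds(mmss):
    """Convert an 'mm:ss' string to a number of seconds."""
    minutes, seconds = mmss.split(":")
    return int(minutes) * 60 + int(seconds)


def to_mmss(total_seconds):
    """Format a number of seconds as 'm:ss', rounded to the nearest second."""
    total = round(total_seconds)
    return f"{total // 60}:{total % 60:02d}"


def summarize(runs_v2, runs_v3):
    """Return (average for 2.0, average for 3.0, speed-up factor)."""
    avg_v2 = sum(map(to_seconds, runs_v2)) / len(runs_v2)
    avg_v3 = sum(map(to_seconds, runs_v3)) / len(runs_v3)
    return to_mmss(avg_v2), to_mmss(avg_v3), round(avg_v2 / avg_v3, 1)


# Word DOC, DOCX to PDF
print(summarize(["7:05", "8:53", "7:48"], ["2:45", "2:40", "2:43"]))
# ('7:55', '2:43', 2.9)

# Merge JPEG to Single PDF
print(summarize(["2:30", "2:32", "2:30"], ["1:50", "1:46", "1:45"]))
# ('2:31', '1:47', 1.4)
```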

Faster file conversion to JPEG, PNG, TIFF

The newly introduced component for converting PDF files into JPEG, PNG and TIFF images works faster, showing an approximately 2-3x increase in speed compared to version 2.0.

120 files converted with DocuFreezer 2.0 vs. 3.0

Elapsed time is given as mm:ss.

| Conversion | Version | Run 1 | Run 2 | Run 3 | Average | Result |
|---|---|---|---|---|---|---|
| Word DOC, DOCX to JPEG | DocuFreezer 2.0 | 52:31 | 49:00 | 51:55 | 51:15 | |
| Word DOC, DOCX to JPEG | DocuFreezer 3.0 | 14:48 | 16:24 | 15:17 | 15:30 | 3.3x faster |
| PDF to JPEG | DocuFreezer 2.0 | 35:05 | 34:53 | 37:00 | 36:05 | |
| PDF to JPEG | DocuFreezer 3.0 | 13:43 | 14:08 | 13:27 | 13:45 | 2.6x faster |

The tests were run under the same conditions on a Windows 8.1 PC with an Intel Core i5 processor and 16 GB of RAM. The CPU load on all test runs was only about 30-35%.

Improved processing of high-resolution files with large DPI

Previously, the size of a single rasterized page in the memory cache was limited to 1 GB in DocuFreezer. Now you can convert various documents (PDF, Word documents, drawings, and more) into images – JPEG, PNG, TIFF – with almost any DPI (dots per inch) resolution. This is a significant improvement, especially for high-res drawings, posters, and other large documents.

Which DPI is the best? Briefly speaking, 150 is good for simple texts, 300 is a standard value, and 600 and higher is for high-quality images. And yes, we know that strictly speaking the resolution here should be called PPI 🙂 But people say “DPI” when they mean “PPI” so often that it’s become an established convention.
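To get a feel for why a per-page memory limit matters for large formats, here is a rough Python estimate of the size of one uncompressed rasterized page. It assumes 4 bytes per pixel (RGBA, one byte per channel) and takes 1 GB as 1024³ bytes; how DocuFreezer actually stores pages in memory is not described here, so these are ballpark figures only.

```python
def raster_bytes(width_in, height_in, dpi, bytes_per_pixel=4):
    """Uncompressed size of one rasterized page (assumes RGBA, 1 byte per channel)."""
    width_px = round(width_in * dpi)
    height_px = round(height_in * dpi)
    return width_px * height_px * bytes_per_pixel


GIB = 1024 ** 3  # assumption: "1 GB" read as 1024^3 bytes

pages = [
    ("Letter page, 300 DPI", 8.5, 11, 300),
    ("A0 poster, 300 DPI", 33.1, 46.8, 300),
    ("A0 poster, 600 DPI", 33.1, 46.8, 600),
]
for label, width_in, height_in, dpi in pages:
    size = raster_bytes(width_in, height_in, dpi)
    print(f"{label}: {size / GIB:.2f} GiB")
```

Under these assumptions an ordinary page at 300 DPI takes a few dozen megabytes, while an A0 poster at 600 DPI comes to roughly 2 GiB, well past a 1 GB per-page cap.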

Improved conversion of EML and MSG with attachments to a single PDF

A unique ability of DocuFreezer is processing files with a container structure (e.g., emails with attachments, archives) and merging their contents into a multipage PDF. Examples are Outlook EML and MSG files with attachments and ZIP / RAR / 7ZIP archives. For EML to PDF conversion, just add the files to the program’s list, go to Settings and select Multipage > Merge into one PDF.

Improved processing of archives with a multi-hierarchical structure

This feature is connected to the previous one. DocuFreezer 3.0 does a better job of converting multi-level files, such as archives within archives or files with attachments. Now the program correctly processes all embedded contents within a file or archive.

Minor improvements and fixes

Version 3.0 had more than 60 pre-release builds with many fixes and improvements. Fixes were made to OCR, conversion to TXT, Excel file processing, PDF to PDF conversion, output page sizing for PDF, program interface resolution, converting MSG emails with attachments to a single PDF, and much more.

Please download the free version of DocuFreezer for evaluation. Version 3.0 is available for free to those who already purchased version 2.0. Users of version 1.x can get a 50% discount.