- see also Microsoft Document Imaging Format
Microsoft Office Document Imaging ( MODI ) - a component that is part of the Microsoft Office suite and is used to recognize and edit documents scanned in Microsoft Office Document Scanning .
First appeared in Microsoft Office XP and included in subsequent versions of the package, including Office 2007 . Not available since Office 2010 . To install Document Imaging in Office 2010, Microsoft released the SharePoint Designer 2007 package [1] .
MODI can read and write small TIFF images. It can also save recognized text to the original TIFF file. By default, the OCR engine requires the correct page orientation for recognition. If you call the objectname.save () method, it can write the corrected position of the page to the source file.
It has a low accuracy of character recognition, in addition, it places high demands on the quality of input images.
Content
Programming
Through the COM interface, MODI provides an object model interface such as a document or image. An important feature is the ability to integrate into the web interface for image recognition.
The MODI object is accessible from development tools through the COM model programming interface. MODI controls are accessible through development tools that support the ActiveX interface. A software module that implements this functionality is called Microsoft Office Document Imaging Viewer Control 11.0 or 12.0 (MDIVWCTL.DLL).
Example usage on Visual Basic .NET :
Dim inputFile As String = "C: \ test \ multipage.tif"
Dim strRecText As String = ""
Dim Doc1 As MODI . Document
Doc1 = New MODI . Document
Doc1 . Create ( inputFile )
Doc1 . OCR () 'recognition of all pages if the file is multi-page
Doc1 . Save () 'saves the reoriented image to the original file and the text there, back to the input file
For imageCounter As Integer = 0 To ( Doc1 . Images . Count - 1 ) 'works on every page
strRecText & = Doc1 . Images ( imageCounter ). Layout Text 'putting the results of work in a line
Next
File AppendAllText ( "C: \ test \ testmodi.txt" , strRecText ) 'save the recognized file to disk
Doc1 . Close () 'clear
Doc1 = Nothing
Changes with Office 2003 SP3
In Office 2003 Service Pack 3, Microsoft excluded associations with .TIFF files with Microsoft Office Document Imaging as part of its security fix package. Also, TIFF files no longer use JPEG compression [2] . No explanation was given in this regard.
In Office 2010, MODI is completely rejected. This change also affected the installation tree, which does not show the MODI Help, OCR item. The Internet Fax feature in Office 2010 uses the Windows Fax printer driver to generate TIF format images. MODI and all its components are removed in the office version for 64-bit Office 2010 [3] .