Customization on Abbyy FineReader through programming

Customization on Abbyy FineReader through programming - ocr

We are very new to Abby Fine Reader. When we are doing OCR by using this product, the recognized text will be transferred to text editor window. Here we would need to read the text from text editor window through programming as and when it comes to the text window.(Please refer attachment). Please suggest that how can we do this by programming? Thanks in advance.

FineReader Pro is designed as an interactive end-user tool and it prevents such programmatic access to its objects intentionally.
If your goal is the detection of text presence: You easily could use another 'screen scraping' or screenshot application and perform OCR or histogram comparison on the captured Bitmaps to detect (Boolean) if the text appeared in the designated area or not. Before recognition, that area displays a static "Text not recognized" message.
But if you intend to capture and export the text itself without using FineReader's export features, then there are much better tools to do OCR and pass the results programmatically, such as the ABBYY Recognition Server from the same manufacturer (I can provide additional setup and integration help), like FineReader on steroids, automated OCR tool I have used in many integration projects.

Related

Looking for 3D CAD Viewer Webkit

We want to embed a 3D CAD viewer into our web-based collaboration platform. It would work similar to the one offered by Opencascade. You can see it here.
We do not charge for the use of our platform and we would prefer an open source option if it is available (Opencascade viewer is not open source).
Does Autodesk Forge have a similar offering?
Thanks!

CAD viewer solution for a WEB can be split into several pieces:
On-server converter of CAD model.
Conversion of CAD file into intermediate format. You might need a lot of Data Connectors in case if you need supporting wide range of CAD formats, or restrict functionality to import vendor-neutral formats like STEP.
Generation of Web-friendly triangulation-based 3D model. This step might include model simplification in case if you have to deal with large models.
Server backend implementing business logic and backend for 3D viewer. The complexity of 3D Viewer backend depends on functionality provided with it and ability of viewer to handle extra-large models. In simplest case, this might be just a file server providing 3D model in format like glTF.
Client part, e.g. HTML+JavaScript code for displaying 3D model using WebGL, as well as GUI and application-specific logic. The viewer might just display 3D model (in which case it is barely distinguishable from non-CAD viewers), or give many features like Shaded and Wireframe (not just mesh edges) modes, picking sub-parts, Clipping Boxes, clipping with capping, annotations, dimensions, explode view, HLR view, a bunch of them!
Although it is technically possible loading some small CAD models directly in the Browser, in general this task is expected to reside on server to improve user experience and give some protection to intellectual properties (of CAD models). The converter may be based on Open CASCADE Technology, which is an open source framework, in case if STEP/IGES input file formats are enough (output could be your custom file format or glTF 2.0, for instance). But in case of native CAD formats support (like CATIA and others) you will unlikely find any free open-source libraries.
The Web viewer itself relies on WebGL and normally can be based on any traditional JavaScript-based library like Three.js (or existing 3D Viewer on top), or may rely on C++-based engine (again, Open CASCADE Technology can be build as WebAssembly module). WebAssembly-based viewer may give some performance benefits and it is more friendly to C++ developers (but apparently - less friendly to JavaScript developers). There is a hidden issue, though, as many open-source C++ libraries commonly have LGPL license - pretty good for desktop, but might have legality issues in case of a Web application. In contrast, WEB-based libraries are normally more proprietary-friendly with permissible licenses ignoring open-source software ideas put into licenses like (L)GPL.
In general, 3D CAD Viewer definition is very wide, and requirements list in your question are very basic and unclear - just to embed 3D Viewer in your WEB application. You may write your own Web Viewer based on open source components, or consider commercial options already implementing features you are looking for, and aware of / providing some solutions for handling tricky problems.

Looks like Forge Viewer fits the bill perfectly and covers most of the features available with the demo ;)
You may experience/preview Viewer following this link using a model of your own here and follow here to build up a MVP here...
For the features that are not readily/built-in out of the box Viewer is highly extensible and offers API making it pretty straight forward to develop extensions following here and also the blogs/tutorials available here.

PDF Accessibility checker

We are busy with improving our PDF files so that it's accessible for everyone, also for people with sight loss. At the moment we are using the following tool to verify if a PDF is accessible.
http://www.access-for-all.ch/en/pdf-lab/pdf-accessibility-checker-pac/downloading-pac.html
We are thinking that this is quite a good tool to verify if the PDF is accessible for people with sight loss, however we were wondering what other options we have to verify if a PDF is valid for people with sight loss.
The problem is that we have to solve the issues that are reported from the tool, like 'Path object not tagged', we were wondering how we should solve this in HTML as we are generating the PDF based on HTML.
Thanks

First off, no automated accessibility tool can substitute for a skilled person who understands the intent and practical implementation of WCAG / 508 standards.
For example, your "Path object not tagged" error is likely referring to a line segment or other vector shape that didn't correctly get tagged as an artifact.
If money is no object, then I recommend CommonLook tools, which are amazing, but very pricey. If you do this kind of work much, then these tools are absolutely worth the money.
Otherwise, I'd recommend using the accessibility checker in Acrobat Pro. Lots of government and enterprise organizations use nothing more than Acrobat Pro in their accessibility programs.

Any free OCR engine or API to identify hand written scan document?

I am using google Tesseract engine python binder https://code.google.com/p/python-tesseract/ to extract the text in a image(http://ceoarunachal.nic.in/eci/affidavits/s02/ge/1/KIREN%20RIJIJU/KirenRijiju_SC1.jpg). I am trying to make it digitized for thousands of images similar to it. But Tesseract is not able to extract the handwritten text in it correctly, as its mainly designed for machine text.
Any way to optimize current image which will help in improving the recognition by training the data or is there any other better tools to do it?

voice based translation from one language to another in windows phone os

My project is to do a translator based on the voice only .The user need to give the input in the form of voice in their native language. Then the system need to produce the translation in the user preferred language. The problem for is the i don't know how to set the voice recognition in different language . please help me. thanks in advance i am doing this in windows phone 8 .

There are APIs for Windows Phone, but you will need to use two to accomplish what you are trying to do. First, you need to use the speech recognition to convert the speech to text. Then, run this through the Microsoft Translator to translate to another language. Note that for the speech recognition, you should query for the appropriate speech recognizer for the language that is spoken.
See the following links:
Speech Recognition
Microsoft Translator

There's an example here of how to build the Windows Phone 8 app for Translator:
http://blogs.msdn.com/b/translation/p/windowsphone8.aspx
You could use this with voice-captured instead of text-entered input.

rtf and html can live in the same application?

I have what I think it is a very popular problem. I have a legacy app that contains rtf fields. rtf in my case is used only for very basic features (font: size, style, color, ... hyperlinks). Anyway since I allow to paste from word sometimes some more complex things can go in there.
Now rtf is ok, even because with Report Builder I can get reports where rtf is printed.
The problem comes with web application. I am developing a web application that is basically an optional interface for my legacy application.
At this stage I solved the probelm by removing the rtf formatting (I use the TRichEdit trick discussed here).
Anyway ideally I would like to be able to see and edit formatted text in both the web application and the win32 application.
What can be the solution?
In this question I found a link to the ScroogeHTML converter.
But even if I convert there are these points that i cannot reconcile:
Storage: what to store? Currently of course I store rtf.
Should I convert rtf to html every time I show the rich text in web app and then convert it back to rtf for storage?

Your could implement a RTF editor in your web application.
http://en.wikipedia.org/wiki/Online_rich-text_editor
This way you don't have to change the storage format.

We Keep Coding

html mysql json google-apps-script actionscript-3 ms-access google-chrome google-maps reporting-services sql-server-2008