I am working on a project on handwriting recognition in Android Studio.
From online source Neuroph OCR seems is the best to recognise hand-written char.
My input is a list of x,y coordinate and draw them in line using opencv drawline function
then create an image showing the char
the image can be recognised by hand-written OCR.
Anyone can have some instructions or example or source how can i use this library?
i only know it is a useful library but seldom find any example
Related
I have attempted to do the following
When I tried to check either Bullet or FreeType or Both, the following pops up
I do not get why?
The Bullet and FreeType libraries both depend on native code (C or C++) that isn't available in the HTML Libgdx backend.
The HTML Libgdx backend uses Google's GWT project to cross-compile Java bytecodes into Javascript. Thus, only Java code, or APIs that exist in both places can be used. See http://www.badlogicgames.com/wordpress/?p=2308 for more details.
I have recently started to use the Microsoft OCR Library for loading text from images. And I have a question. Is it possible to automatically detect the language of the picture? Or I need set all possible languages by myself?
AFAIK,MS OCR lib cannot detect the language of image/pic.
If you need the languages compatibility, you can include all language that your app support in the OCR resource file.
I am working on a math quiz using flash CS3 with ActionScript 3.0
I have the source document in Microsoft Word. They have all been typed in equation nicely.
I cannot copy them straight into flash document.
I do not want to printscreen those equation. There a lot of fraction,index and symbols.
Is there a way so that I can copy the equations straight into flash?
Is there something like, install plug in so that flash can read it?
You should consider using LaTeX2SWF library instead of trying to import Microsoft Equation directly.
Example here
If you can't download it, you can try something like this
You should also convert all of your Math Equations from Microsoft format to LaTeX. There are a lot of automated programs and plugins, like this, but there are lots of others
I use tesseract 3.0.1 on windows 7 64 bit.
The documentation on training says:
Each font should be put in a single multi-page tiff (only if you are
using libtiff!)
I'm not familiar with libtiff. I use ImageMagick to create multi-page tiff. So far this is working well, or at least seems to be. Am I expected to get some road blocks later on? If so what to do with libtiff - is it enough to run its setup or do I need to configure something?
Tesseract doesn't care how you produced your multi-page tiff as long as it can read it with leptonica (which internally depends on libtiff). If tesseract can handle your tiff now, it can do the same for the rest of training process as well as run for OCR, so you are good to go.
I've produced my multi-page tiff with .Net standard library and tesseract had no problem with it.
I'd like to add character recognition functionality to my application that's why asking you what's the best available and affordable OCR SDK . I looked at ABBY FineReader Engine 10.0 but haven't got trial version yet as I requested from the official site!
I've downloaded Asprise OCR SDK but it's doesn't recognize Cyrillic symbols..
How to implement character recognition on my application ? By using what kind of libs, SDKs, APIs and so on..
There's Cunieform and Google's Tesseract OCR, both of which are free. Personally I've used Tesseract, the SDK was giving a lot of trouble so finally decided to simply call the command line interface of Tesseract with arguments from within my C program using the system() function.
Lots of people face difficulties with the Tesseract installation, so here's a short summary (version 2 works for me, insert appropriate version if necessary):
Download the following from the svn: tesseract-2.00.tar.gz, tesseract-2.00.exe6.tar.gz, tesseract-2.00.eng.tar.gz
Unzip tesseract-2.00.tar.gz to a folder
Unzip tesseract-2.00.exe6.tar.gz and move to where tesseract-2.00.tar.gz was unzipped. A few files will be replaced this way
Similarly unzip tesseract-2.00.eng.tar.gz and move to tesseract-2.00.tar.gz where tessdata folder will be replaced.
After all this is done, open the tesseract.dsw workspace, select All Files and do "Rebuild All." This'll take a while with loads of warnings but hopefully no errors.
The command using DOS shell is tesseract picture.tif textfile -l eng. So basically save your image as a TIFF file, run the command from within your program and then read in the OCR output strings from the text file.
I can recommend you Crystal OCR if you don't need to recognize a very complex documents, they sent me C++ Builder sample by request. IMHO, Tesseract is still buggy, though it's the best free OCR of course.
You can try KSAI-Toolkits. It has a completely ocr application, which include C++ API, OCR model, benchmark and test data. And it supports different platforms.