Training Tesseract-OCR with JTessBoxEditor - ocr

I have problem with this program that we want to download the jTessBoxEditor for training tesseract language file, but when we try to download it ,it downloads ocr program "VietOCR.NET and also how can i download jTessBoxEditor and install it on windows OS ?
can anyone help me ? it is so important

Go the page for training...
http://vietocr.sourceforge.net/training.html
Click the link on the first word on the page: jTessBoxEditor
This should bring you to their download site - files are in ZIP format.
The ZIP file contains the jTessBoxEditor and is executed from there.

Related

How to save PDF generated by chromehtml2pdf to local folder?

I'm using this tool to export PDF from web page. From the author's instruction
Once installed, simple command line usage is:
chromehtml2pdf --out=out.pdf https://wikipedia.org
to generate a PDF of that web page.
For local files use:
chromehtml2pdf --out=out.pdf file:///path/to/file/file.htm
Could you please explain how to use the command chromehtml2pdf --out=out.pdf file:///path/to/file/file.htm to save the PDF of https://wikipedia.org to folder 'C:\Users\Dung Le\Downloads\Documents?
Thank you so much for your help!
Oh my bad, I've just figured it out :)
chromehtml2pdf --out="E:\Trial\test.pdf" hhttps://wikipedia.org

Microsoft Custom Translator: File upload not working for SC tmx files

I am testing the new Microsoft Custom Translator and trying to train a system EN-SC, but the file upload fails.
I am trying to upload unzipped TMX files, all less than 50 MB in size, as specified in the user's guide. The file names I have tried are:
Master1_en-us_zh-cn.tmx, Master2_en_zh.tmx, Master3_zh-cn.tmx,
in case the filename format had something to do with this.
I have successfully uploaded files and trained an EN-Spanish system using the same procedure, so I am not sure what I may be doing wrong.
I would be grateful for any suggestions on where to look for errors.
We indeed identified an issue related to Chinese tmx files that we are working to resolve now. The problem is that many of the abbreviations used in the tmx files to identify Chinese simplified are not resolving properly. The work around for now is exactly what you did: to change the abbreviation used in your tmx file to identify Chinese TUs to zh-hans rather than zh, zh-chs or zh-cn.
Solved - I have managed to submit files for training by editing the TMX to globally search&replace the target language code from zh-CN to zh-Hans. – maria c laguardia 5 hours ago

Convert .json to ipynb

I am taking a course that uses ipython notebook. When I try to download the notebook (through File -> Download as -> ipython notebook), I get a file that ends with ".ipynb.json". It doesn't open as an ipython notebook but as a .json file so something like this:
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"---\n",
"\n",
"_You are currently looking at **version 1.1** of this notebook. To download notebooks and datafiles, as well as get help on Jupyter notebooks in the Coursera platform, visit the [Jupyter Notebook FAQ](https://www.coursera.org/learn/python-data-analysis/resources/0dhYG) course resource._\n",
"\n",
"---"
]
},
...
}
I've tried deleting the ".json" in the file name and it doesn't work. How can I convert this file back to something that can be opened and run as an ipython notebook? Thank you very much!
My Solution: just remove the filename extension .json. for example, change myfile.ipynb.json to myfile.ipynb. Then, you can open it by a click in jupyter notebook !
I have encounter the same problem as you did. I found a link that describe what ipynb exactly is. see here http://ipython.org/ipython-doc/rel-1.0.0/interactive/nbconvert.html. It says ipynb file is actually json file. Hope this
On the Mac you could go and
Right click on the filename.ipynb.json
Click on Get Info from the list.
From the Get Info window, find the section Name&Extension remove the extension/suffix .json from the file name.
Hope that helps!
Are you trying download this from Github? Especially on Google Chrome browsers, I've had issues download .ipynb files using right click > Save link as... I'm not sure if other browsers have this issue (Microsoft Edge, Mozilla Firefox, Safari, etc.).
This causes issues since when downloading, it doesn't completely download the file usually and it becomes corrupted so you can't download an IPython notebook that may run properly. One trick when trying to download .ipynb files on Github is to click it, click Raw, then copy everything (Ctrl + A) and paste it into a blank file (using text editors such as Notepad, Notepad++, Vim, etc.) and save it as "whatever_file_name_you_choose.ipynb". Then you should be able to properly run this file, assuming a non-corrupted file was uploaded to Github.
A lot of people with very large, complicated IPython notebooks on Github will inevitably run into this issue when simply trying to download with Save link as.... Hopefully this helps!
I opened it as/with nbviewer and then selected it all and saved it as a "txt" file that I then opened in Notepad++. I then resaved it as a file with the extension ipynb and opened it in my jupyter notebook ok.
The easy thing to do is to copy the JSON contents into a notepad and save it again with .ipynb extension
Just remove the .json file extension leaving the .ipynb one, as pointed out by the following related post: https://superuser.com/questions/1497243/why-cant-i-save-a-jupyter-notebook-as-a-ipynb. As #jackie already said, you should consider them as .json files meant only to be edited by the IPython Notebook app itself, not for hand-editing.
Use a simple trick. Let that file get downloaded automatically. Re-download it again then it will prompt you to download and replace that file. At that time, you save that by replacing .json to .ipynb
After downloading the file with ipynb.json, Take the following steps:
Go your terminal/command line window
Navigate to the directory where your file is
Type:
windows OS: rename yourfile.ipynb.json to yourfile.ipynb
Unix/Linux: mv yourfile.ipynb.json to yourfile.ipynb
This work perfectly for me.
i tried this method and it worked. Just copy, paste it in notepad and save as "file_name.ipynb". hope this works for you too.

How to run JM reference software using visual studio 2010

I have downloaded the H.264 source code and I can view it in Visual studio 2010. I don't know how to run the JM Reference software. How can I do that, and how to view the output?
First of all right click on the project in VS and rebuild solution.
After that from command prompt go to bin folder of JM reference software.
Which is in JM->bin.
Now write lencod.exe in command prompt.
This will run your encoder for default video file selected in encoder configurations. And same goes for decoder that is ldecod.exe.
For other encoding/decoding parameters read manual.
These slides will also help you to get started.
Firstly, I recommend you to download the latest JM reference software for H.264/AVC form this link. After downloading just click on jm_vc11.sln file (solution file) and it will open in Microsoft Visual Studio (MVS) as this:
Solution Explorer Image in MVS
Then from toolbar menu, click on Build>Batch Build and you will see something like this:
Batch Build in MVS
Then click on Rebuild button and it will compile and build all of the projects of this solution in your current folder. After seconds or so, you can go through a folder called "bin" in the current folder. In this folder you see lots of config files which are in .cfg format. Please open encoder.cfg file and you can see all the encoder configuration for you H.264 codec and at first lines of it there is InputFile name, FramesToBeEncoded, FrameRates and the deafult values is as this picture. Just please make sure the file "foreman_part_qcif.yuv" existed in the bin folder. Then just open command prompt (type cmd in windows search bar) and go to the current directory>bin folder where you can see lencod.exe file. Type this in your command line: "lencod.exe -d encoder.cfg" and it will encode your InputFile and generate test_ref.yuv as a reconstruction file that you can see the result of your encoding process. Also you can see your .yuv files using third-party apps like "yuvplayer". For decoding, Please open bin>decoder.cfg and check that the inpuFile is "test.264" and the output is "test_dec.yuv". Then in command prompt type: "ldecod.exe -d decoder.cfg" and the decoded output file will be generated as "test_dec.yuv" which you can play it in yuvplayer, for example.
For more details about the parameters of encoders and decoders please check the documentation file in your currnet directory>doc folder.

CHM file unreadble if saved to hdd

I've just downloaded JSon.Net for framework 4.0 as a zip file.
Opening zip (using WinRar) I'm able to open Documentation.chm file by double-clicking over the name listed and the help file is well done.
Anyway if I extract this file to hdd and I open it, help is unreadable.
Sounds silly, but it's happening.
Thanks for your help!!!
EDITED:
Another tip: unreadble chm file is located in D:\.... (second NTFS partition of first hdd); if I save it on my desktop, chm file becomes readable !!! Crazy...
------------------------------------
---------- SOLUTION ----------
------------------------------------
I answer my question to close it and to help someone who finds in my same trouble.
Using tip suggested by #Marco van de Voort I searched Google for "chm unblock" and found this link:
Your CHM files are stored in the folder with '#' (hash) character in
the path Many C# developers discovered that their documentation
and e-books in CHM format cannot be read because they were storing
their CHM files in the directories like 'C:\E-books\C#\'. The hash
character signifies an anchor in HTML so the CHM viewer fails to
resolve the path properly and to retrieve the content.
That was my problem: I had path with a #... and I'd never thought this could be a great mess for CHM files!!!
Thanks to SO mates for the help given !!
Windows keeps track of downloaded files on NTFS systems, and puts certain restrictions on them. If you extract with windows explorer, this status propagates from archive to file. Using a third party tool (winrar in your case, I use INFO zip) can circumvent this.
Some descriptions and other links can be found
http://wiki.lazarus.freepascal.org/chm_backend_for_fpdoc#Troubleshooting
Free Pascal (2.4.4) has some tool to unlock them. (chmls unblock). It is done by truncating an alternate filestream (therefore it only works with NTFS). I found this info somewhere on stackoverflow, but can't quickly find the link.
It might be that virtual folders like desktop don't fully support this.
maybe the references to some content is to web or no chm file. maybe if you can disassamble in CHM editor and change paths can fix it.
maybe this can help you
There are security permissions applied to this file type by a recent patch.
you need to unblock the file:
http://www.helpscribble.com/chmnetwork.html
instead of thinking many possibilities of "windows security permission", get a CHM reader or wrap your CHM using winrar, and doubleclick to view it like you did.
It's save your time a lot!!
So, two simple solutions are:
have a CHM reader: https://blog.kowalczyk.info/articles/chm-reader-viewer-for-windows.html
OR right click CHM file/s, > Add to RAR Archive.
If you want to read it, double click RAR file > double click CHM file. BINGO!!