Is it possible to extract text from handwritten notes in OneNote? - onenote

For a project I am looking for ways to extract text from handwritten notes in OneNote. The idea is that one can quickly write notes in OneNote and then my application reads the notes and saves it as a text file somewhere else.
I have read into the REST API and I know that there are many ways to submit content to OneNote, but I am not sure about extracting handwritten ("ink" ?) content from it.

The OneNote apps across multiple platforms have varying degrees of ability to analyze ink strokes and recognize text. For e.g. in the Windows Desktop app, you can convert handwriting to text.
But the extracted text is not supported by the REST APIs today.

Related

how migrate from office documents to modern web technologies based documents - advice welcome

Currently, all documentation is based on MS office. This makes it quite challenging if you want to integrate some functionality. Then you have either the option to go with VBA or VSTO. First is not that comfortable, second could be like taking a sledgehammer to crack a nut.
Simple things like simple controls, hiding text or basic maths can be easily realized by HTML.
So I would need an HTML text processor what focus on content (text) and allow me to add interactivity when I need it. That means switching to source code or showing additional panels only on request so that the author can focus on the textual content (A more programming familiar person would do the formatting/interactivity).
In the long term, I want to have the ability to integrate things like SQLite and API calls.
In addition, the output has to be in a single file otherwise it isn't portable in a practical way and users (who only fill in data) won't accept it.
I conduct some research and figured there isn't an all-in-one solution instead there are several options to meet some of my requirements.
I wonder which is best to realize my long-term goals.
HTML5 offline app
It looks like that I either develop an offline HTML5 app what is well explained well, e.g. here:
Offline web applications: a working example
Tutorial: How to make an offline HTML5 web app, FT style
Creating HTML5 Offline Web Applications
How to Build an Offline Single-page Website
plus some background information on Single page apps in depth
markdown
The content could be generated in a markdown editor as recommended in What's a good, auto-saving, WYSIWYG HTML word processor? or just simple convert office documents to html5.
HTML editor / site designer
Alternatively, I could use an HTML editor or and visual site designer
but the selection isn't exactly a small one for me to choose from.
I found some help in:
Battle of the Text Editors: Atom, Sublime & Brackets
26 Tools and Frameworks for HTML-based Desktop and Web App Interfaces
Free HTML Editors: The 16 Best for Web Developers on Windows
14 Best Free HTML Editors
or even simpler for the standard office user: GrapesJS - Next generation tool for building templates without coding
single file website (app)
In my understanding there is still the problem that the output won't be a single file, will it?
I could make use of the archive formats but there aren't supported by all applications as well explained on What's the best “file format” for saving complete web pages (images, etc.) in a single archive?.
That could work if I do only simple things and integrated media object as Base64 encoded objects but it comes with the disadvantage of large overhead.
Desktop web apps
If I want to realize more complex things I would need to develop desktop apps using HTML/CSS/JavaScript but once again the selection is little as you can see in How to develop Desktop Apps using HTML/CSS/JavaScript? [closed].
I haven't found anything yet telling me if I can deploy a desktop web app as a single file.
A4 layout
The layout would be realized on the information given in How to make an HTML Page in A4 paper size page(s)?
office to HTML / html word processor
XSweet - The open .docx to HTML conversion tool
Wax / Wax II (web-based word processor)
HTML to office
For the transition phase, it would be nice to be able to transfer HTML to e.g. docx. Some options are mentioned in the references below:
How to convert HTML file to word?
Convert html to docx using pandoc
html-docx-js
Convert Html to Docx in c#
So how shall I proceed?

Looking for an existing iGoogle/OpenSocial gadget comparable to iGoogle's default native RSS gadget

iGoogle's standard RSS feed gadget is not OpenSocial, so it can't be embedded in other web sites. So I am hoping there is an alternative solution already available somewhere.
In iGoogle's gadget list there are other RSS gadgets, but none of them seem as nice as the default one by Google which is native to iGoogle.
The main difference between the standard gadget and most others is the ability to expand a headline and see more of the article by clicking the arrow next to the story. Also a clean layout.
It must be a widget/gadget that is OpenSocial compatible.
(I am aware that iGoogle will be closing, that is not relevant to my needs.)
MyYahoo:
Appears overall best alternative to iGoogle in terms of setup speed, "just working", and widgets for average or novice users.
appears not to import opml, but can support rss feeds
by hand can setup the same rss feeds as igoogle or netvibes
supports upto 9 tabs. every tab has an advertisement
only supports Yahoo search
appears no wikipedia search widget
good support for display of stock portfolio quotes in widgets provided by Yahoo
several good yahoo widgets provided, e.g. tv guide, local movies
can have different theme for each tab, or all the same.
can use uploaded image for wallpaper.
can make backgrounds transparent.
can set background and text colors for all parts of a theme, separately.
has spectrum chooser for colors. cannot type in hex codes.
very user friendly interface for configuring themes.
supports user-created applications (widgets) for MyYahoo tabs.
This is how to get started with it
Try Skim.Me as an alternative to embedding iGoogle's RSS gadget

Recommended approach for presenting formatted text in Android?

I am developing an application that will provide instructions to making a product. The text will have bullets and/or numbered steps as well regular text paragraphs. I may have headings for various sections. The text will be placed into a scrollable TexView.
I was originally planning on loading the text from a resource text file and then applying formatting via xml. However, I just learned about WebView and the ability to load local html files. I could easily format the text in html and load it into a WebView for the various activities.
My question is, is there a performance issue with using WebView vs. TextView? Are there other ways to easily format text for a TextView?
Thanks,
WebView definitely takes longer to load the first time into your process. It also is not designed to go in a ScrollView, since it scrolls itself. OTOH, you get excellent HTML support.
TextView can display limited HTML, converted into a SpannedString via Html.fromHtml(). Here is a blog post where I list the HTML tags supported by the Android 2.1 edition of fromHtml(). Note that these are undocumented, and so the roster of tags may be different in other Android releases.

How do I convert PDF to HTML programmatically?

Are there any classes, COM objects, command line utilities, or anything else that I can make an API for that can convert a PDF to an HTML document? Obviously the conversion might be a little rough since PDFs can contain a lot more than HTML can describe. I found a utility called pdftohtml on Source Forge, but quite honestly it does a horrible job with the conversion. I don't care if the software is free or commercial, but is there anything out there at all that I can incorporate with my own software to do this sort of conversion at least decently? I know Google's developed their own method of doing this, since you can click "View as HTML" on a PDF attached to an email through Gmail, but I was hoping there was something out available to the public.
Remember, PDF to HTML. I'm NOT worried about HTML to PDF.
well one solution i can think of is to write little program that reads pdf text using library called iText and then generate html files.
well for java based PDF solutions...we dont have a clean way i guess-still.. all solutions are primitive and kind of workarounds... No easy solution for
1. Designing a template of a PDF
2. Then at runtime using java, populate data into this template...either using xml or other datasources...
such a simple requirement and NONE has a good "open-source and free" solution yet !
Eclipse BIRT comes close.. but does not handle Barcode elements ..OOB.
You were looking for pdf2htmlEX (C++), which converts PDF to HTML without losing text or format.
To convert further to semantic HTML, you can process pdf2htmlEX output using my project Transcript (Python). It is however not lossless anymore and works best on documents not deviating too much from conventional visual layout.

OCR graph paper

I would like to take a pdf of a scanned graph paper notebook (with handwriting) and turn it into a text file.
How can I do this?
Thanks
Check out an OCR library, like OCRopus. I don't think it takes PDF, so you may have to convert it to a TIFF or JPEG first.
There are OCR libraries that convert typing (OCRopus, tesseract, etc.)
There are also Java based handwriting libraries. I am not sure if OCRopus has that ability, one library I was looking into to do handwriting recognition was:
Online Video
Java Neural Networks
Conceivably you could take the pdf, convert it into a tiff if need be (according to the software), and it would give you something..
Good luck!
If it is the notebook as a PDF file you could e-mail it to a gmail account and then gmail allows you to "view" the PDF from within your browser as an HTML file. Still the pages remain images.
If you would like the text out of it OCR might work but it may also be uncapable of getting the text out of it.