Google Vision Api text recognition for iOS [duplicate] - ocr

I'm looking at using Google Mobile Vision with an iOS app. Their documentation talks about face recognition, barcode scanning, and text recognition (OCR). However, the code samples for iOS development only include facial recognition, and I don't see anything in the API documentation for iOS other than facial recognition support.
Is it the case that they only support OCR and barcode scanning on Android?
If this were the case, I would have expected that fact to be clearly stated somewhere in the introduction to the product or the getting started page, but it is not, which surprises me, and leaves me wondering whether I'm just confused and not finding it.
Thanks.

The Google Mobile Vision website at the time of this post says:
Note that iOS supports only face detection and tracking at this time.
Source: https://developers.google.com/vision/reference

In March 2017, it find faces and barcodes in photos and videos.
https://developers.google.com/vision/ios/getting-started

As of April 2018 it does support text recognition however in I/O 2018 Google deprecated it in favor of ML.Kit which is part of Firebase. ML.kit supports both cloud and on device text recognition.

Related

Client-side Speech Recognition on a mobile browser?

I am working on a project that is targeted for browsers on smart phones. And I can't seem to find any way to do a client-side speech recognition, as the mobile version of chrome doesn't even support their own Web Speech API. Does anybody know how to have speech recognition working on a mobile browser like Chrome or Firefox? Or is there a work around that can be used? like a 3rd-party service that provide APIs to be called.
Potential duplicate: Safari Speech Recognition
NOPE: https://caniuse.com/#feat=speech-recognition
NOPE: https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API
Not client side but might be of some assistance: https://cloud.google.com/speech-to-text/docs/streaming-recognize
Also this: https://github.com/Kitt-AI/snowboy (seems like need to set up your RasPi)
And this: https://github.com/tensorflow/tfjs I can totally imagine doing all the machine learning in the browser, just like you can run Windows 95 and what not.

Configure WebVR in google chrome mobile for specific google cardboard / VR device

I've created a simple WebVR app using A-Frame. On my phone it will launch into VR mode however it doesn't seem to be well configured for my model of google cardboard. The two images are two far apart and I just get a double vision rather than 3d effect.
I can get a good VR experience on the google cardboard app by configuring cardboard to the specific model of google cardboard. Is there an equivalent that can be done in google chrome to tell it the model of cardboard I'm using / display distance etc?
Take a look at this issue thread on GitHub:
https://github.com/aframevr/aframe/issues/2117
The solution toward the bottom from garrethwilliams explains what you can do to tweak A-Frame to work with your model of the phone until the webvr-pollyfill is updated with specs from the newer Android devices.

Is it possible to use a Google Earth 3D view, when embedding a google map?

Or can you embed Google Earth directly even though the Earth API is deprecated and their documentation page is mostly disabled. Im trying to have a website that works in the current version of Chrome and uses a 3D google earth. Any way on how I can do this, or are there any alternatives to utilize a 3D globe in your website.( I'm building on rails) Also any know why it was depreciated?
At the moment it is not possible to include the Google Earth 3D View with the maps API, but Google already announced that 3D is "in their blood" and so there probably will be an integration into the maps api some day (because the Earth view actually is already included in the official maps web application).
One reason the Google Earth Plugin was deprecated was the fact that the plugin is based on NPAPI and Chrome doesn't support it anymore because of security issues.
A alternative could be Cesium.

Firefox OS 'HELLO WORLD' development

i want to develop apps for Firefox-OS but i simply cant understand where to start from. Okay i have firefox Os simulator installed, i got my notepad open, I know html 5, css3, JS, jQuery bla bla bla...I googled but simply couldnt find the tutorial for HELLO WORLD FIREFOX OS app. any help guys. thanks
As far as "Hello World!"-type of beginner-applications concerned, I think fxosstub is as close as it gets. Also as you will most definitely want to work with some of the Open Web API-s, so I'd recommend studying Robert Nyman's Firefox OS Boilerplate App, which itself, is based on the above mentioned fxosstub application, and is a great way to learn a thing or two about the Open Web API-s and about accessing the mobile device hardware through JavaScript API calls in Firefox OS.
Those two articles mentioned both appeared on Mozilla Hacks, mozilla's developer blog, which is treasure trove of - among other things - Firefox OS related articles and resources. Another great place to start would be the Quick start guide in the Developer section of the Firefox Marketplace.
For technical reference you should check out the Mozilla Developer Network - it contains a host of very useful information about Web Applications, Web API-s and general web development, too.
For Firefox OS UI-design samples, resources & best practices you might want to check out the Mozilla Brick Library (a successor to the former Building Firefox OS website) if you want your app to blend in visually with the core applications in Firefox OS.
Also there is a pretty active community at the Mozilla webapps mailing list where you might find lots of helpful chaps, well-versed in the intricacies of Firefox OS, and would gladly help you out whenever you might find yourself stuck during development.
This took me about a minute to find. A step by step guide. More info related to app development for FF OS can be found at their official page
Edit: A video tutorial with the use of the jQuery library.
Here is a BMI calculator webapp that follow Firefox OS build-in apps (Gaia) structure and use Firefox OS build-in styles. Check the demo to see how real hosted webapp looks like on any device.
(For a web platform, you can use whatever you are familiar with, but BMI calculator is a kind of hello world type app though...)
Or here is the webapplate project, which has preconfigured webapp template for hosted or package
reuse. BMI calculator webapp is modified from that.
(Note Firefox OS build-in styles are not cross-browser friendly yet. )
I think the above answers are a bit out of date. I think the fastest/easiest way to start development is to
1) Download Firefox Developer Edition and start it up
2) From Firefox developer edition, launch the WebIDE (click on the little spanner, and choose WebIDE). From there you can create a hello world application.
For more information see the WebIDE documentation

Does Chrome have built-in speech recognition for "x-webkit-speech" input elements?

I'm wondering how
<input type="text" x-webkit-speech speech />
Is there a speech recognition enging built into Chrome or is it accessing an underlying speech recognition facility in the operating system?
Yup, Chrome does speech recognition via Google's servers. But there's no reason that other browsers couldn't choose to implement it differently (for example using some speech recognition facility in the OS).
Balu, your link is actually a bit out of date. The latest Google proposal can be found here: http://www.w3.org/2005/Incubator/htmlspeech/2010/10/google-api-draft.html
Although speech recognition has been available in the Chrome dev channel for some time, it has not shipped yet and we're not yet sure when it will ship. We definitely want people to play with the API and offer feedback on it, but we don't think it's quite ready for prime time yet.
According to the code it sends the audio data as a POST request to:
https://www.google.com/speech-api/v1/recognize?client=chromium&lang=??&lm=??&xhw=??&maxresults=3
lm is grammar in the code, xhw is hardware_info which is optional according to a comment. The audio appears to be speex, x-speex-with-header-byte:
// Encode the frame and place the size of the frame as the first byte. This
// is the packet format for MIME type x-speex-with-header-byte.
It looks like it would be pretty trivial to modify the chrome code to use in your own app.
Update:
You also need to get a speech recognition API key and they are limited to 50 requests per day. There is no way to increase that limit - not even by paying.
There is an experimental fork of speexenc that can encode x-speex-with-header-byte MIME binary format, its referenced on the QXIP Wiki and is available on GitHub. Does the job fine by placing the size of the frame as the first byte of packets.
They are using their own API for speech recognition. Ex: sending a post request to there servers.
Speech recognition is a proposal by Google. https://docs.google.com/View?id=dcfg79pz_5dhnp23f5
The feature ships with Chrome 8+ and it looks like it sends the data to google servers to perform the actual recognition.
This feature now works on chrome 11 beta.
check this out..
http://slides.html5rocks.com/#speech-input
This might be of interest https://github.com/taf2/speech2text ruby bindings for the google speech to text API
Yes, Chrome does have built-in speech support through WebKit; just look at the Google homepage (which now has a microphone to the right of the search box). I wonder, however, if the Chrome team is working on Omnibox speech support. After all, Chrome is a WebKit-based browser!
I just confirmed this on my Chrome Cr-48, it works.
There is also a working group that produced http://www.w3.org/TR/xhtml+voice/ but I don't believe this is implemented in any browser except Opera.