Do any FLOSS projects utilize human input for speech synthesis? [closed] - open-source

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
Are there any open source, open content projects that use recorded speech data to generate synthesized speech? (With the goal of synthesising/simulating the speech of a particular individual. As a side note, is there a name for this process, goal or the data extracted? "voice signature"?)
I imagine the workflow would be something like:
record speech from standardized text ("The teddy sat on the mat.")
pick out phonemes ("a" of cat), accounting for accent
get the data that makes Alice's "eh" sound different to Betty's "eh"
render text to speech using accent-appropriate phonemes plus voice signature
Answering this question is a critical step in petitioning Jack Angel (Teddy, Wonkers) to donate his soothing voice signature to the public domain for the sake of humanity.

Here is an open source project called festvox sponsored by Carnegie Mellon University that has a goal of synthesized voice built on a particular speaker. There concept is described here and it sounds like a very time consuming process to get it tuned correctly. There is a good list of Text-To-Speech open source projects on BableFish.org. There is a good discussion on Text To Speech Blog about building a TTS engine around a particular speaker.

Related

OCR solution that be able to recognise fields from identity card/passport/driving license [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 2 years ago.
Improve this question
I'm looking for an OCR solution (library/service/API) that be able to recognise fields from identity card/passport/driving license. Preferable a paid cloud service.
Edit:
To clarify, I'm looking for any solution (open source or not) that be able to detect: first name, last name, dob etc...
Go for BlinkID iOS SDK (https://microblink.com/products/blinkid/mobile-sdk) It is basically providing the best results.
You can also download the demo iOS app available at Appstore and check for various documents.
Accura Scan (https://accurascan.com/) is also competitor in the market which provides the accurate results recognising the MRZ as well as PDF417 code format. Accura Scan scans the below documents:
All countries Passports (with MRZ code)
DL of all countries except that if India and Philippines.
Alien cards/ID cards
Also, scans QR code/Bar-code.
I have not personally used any but some cool ones are:
Google OCR
Tesseract OCR (open source)
#QuangLihnLe, I just want to clarify, first you need detect passport / id card text with OCR, then validate or parse text as MRZ Text data. MRZ is a standard format for passport / id cards it contains sets of personal data including name, surname, birthday, expiry date etc. I have used MLKit to recognize MRZ data on passport and I made samples.
Android: https://github.com/alimertozdemir/EPassportNFCReader
iOS: https://github.com/alimertozdemir/IDCardPassportNFCReader

Is there a better solution than google speech-api? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
I'm working on Sbire which use google speech-api. There is a some things which I don't like with this api.
First, this is not in streaming mode. It receive small audio files but there is no real pipes.
Secondly, it's not precise. Google does not record our voice to learn our accent. I think it's very important to have a good quality of recognition.
I'm looking for a free api or an open-source tool.
Actually there is a streaming mode, but it requires an API key. Here is an example of how to use it: http://mikepultz.com/2013/07/google-speech-api-full-duplex-php-version/
If you want something that learns, you'll need to create a neural network to handle the data. The best open source tool, however, is CMUSphinx or pocketSphinx(a mini version of CMUSphinx). Here is a link to that tool: http://sourceforge.net/p/cmusphinx/code/HEAD/tree/trunk/
The best part of CMU-Sphinx, however, is that you can set a dictionary of keywords. Instead of outputting gibberish, you can have it ONLY output those commands and finds the command closest to what it thinks you are saying. This make speech recognition incredibly accurate, but only work for a small set of words.

Open source machine vision code to locate a human in a room [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
Has anybody seen a resource covering specifically the use of machine vision for locating a human in a room, including cases where they may be partially obscured? I know about OpenCV and other machine vision packages, but I'm looking for a paper/library/code example that focuses solidly on identifying a human in a room; something that is readily integrated into code rather than being a rough idea that would require weeks or months of tweaking or enhancing to do a reasonable job of succeeding at the task. I'd prefer something that does not require binocular vision and can work with a single camera.
First: you should read this about human detection. Then you can find/use opencv code implementing the training step of this kind of detector. You will need to access to a database for training like INRIA Person dataset.
Second: if you need to handle partial obstruction, you may need to implement a human body tracker that can handle this problem. Particle filter could do the work.
If you have problems with implementation feel free to ask more specific questions.

Open-source tool for gender-recognition using voice [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
Good evening,
I'm working on a project with Kinect and I need to perform a gender-recognition using the voice information caught by the microphone of the sensor.
The recognition could also be text-based, i.e. the sentence said by the user could always be the same. I "just" need the binary answer male/female.
I've checked the quite popular open-source Sphinx but I can't understand if it could be used for my needs.
Any idea of what I could use?
Thanks in advance.
For sake of completeness, finally I solved the recognition task without analyzing the voice input.
Gender can be detected with an acceptable precision also by learning and then classifying visual biometric traits. I used speech recognition only as "interface" to activate the demo.
There are several ways to achieve this. You can track pitch (lower pitch values will be male, otherwise female). Or try to build a GMM (Sphinx cannot do this, but HTK can), with one model for male, other for female and another to children.

What's the best tool to allow non-developers to create a ponycorn adventure game? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
This is the ponycorn game: http://www.untoldentertainment.com/games/sissy/
A couple of my non-programmer friends who are teachers are interested in building games with a similar look and feel with their kids.
What would be a good (the best) tool to get started with?
Independently of the particular game in question, there are some projects aiming to bring programming closer to non-programmers. The all differ in the level of abstraction though. Some of the most interesting ones :
Scratch visual, lego-like, great for kids, developed by the MIT
Lego Mindstorms provides visual programming, but I'm afraid, this is not suitable for programming ponys as it concentrates on the mindstorms robotics project.
processing - a former MIT project. Cross-platform (windows, linux, mac, android) scripting environment, is a rather thin layer over a java framework. very versatile, providing even OpenGL rendering, but your friends will have to write code still - simpler than with most other frameworks but still