I'm working on a research project in which we are building a new text-to-speech (TTS) engine that converts text to spoken audio.
As the engine already performs well, we want to make it usable by a large number of applications, so we would like it to show up as a TTS voice on Windows 10.
In Microsoft's developer documentation, all I found was information on how to use existing, already installed voices in my application. I didn't find any information on how to implement a voice so that it shows up as a Windows voice and can be used by any application using the Speech SDK or SAPI.
Which interface do I have to implement, or which API do I have to connect to, in order to get our new TTS engine to work with Windows Speech?
I already crawled the documentation of the Microsoft Speech SDK as well as developer sites like https://learn.microsoft.com/en-us/dotnet/api/system.speech.synthesis.ttsengine
You should look at the TTS Engine Vendor Porting Guide. You need to implement ISpTTSEngine, which does all the work, and ISpObjectWithToken, which manages registration and creation.
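For the registration side, SAPI 5 discovers voices through object tokens stored in the registry. The Python sketch below only builds the key/value layout such a token commonly has, so it can be reviewed before anything is written; it does not touch the registry. The token id, display name, vendor, and CLSID are hypothetical placeholders, and the exact keys are an assumption that should be checked against the porting guide.

```python
# Sketch of the registry layout commonly used for a SAPI 5 voice token.
# All concrete values (token id, names, CLSID) are hypothetical placeholders.

TOKENS_ROOT = r"HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Speech\Voices\Tokens"


def voice_token_layout(token_id, display_name, clsid,
                       lcid_hex="409", gender="Female",
                       age="Adult", vendor="ExampleVendor"):
    """Return {registry_key: {value_name: value}} for one voice token.

    The empty value name "" stands for the key's default value.
    clsid is the class id of the COM object that implements
    ISpTTSEngine and ISpObjectWithToken.
    """
    base = TOKENS_ROOT + "\\" + token_id
    return {
        base: {
            "": display_name,   # default value: name shown to users
            "CLSID": clsid,     # COM class SAPI instantiates for this voice
        },
        base + r"\Attributes": {
            "Name": display_name,
            "Gender": gender,
            "Age": age,
            "Language": lcid_hex,  # hex LCID, e.g. 409 for en-US
            "Vendor": vendor,
        },
    }


if __name__ == "__main__":
    layout = voice_token_layout(
        "ExampleResearchVoice",
        "Example Research Voice",
        "{00000000-0000-0000-0000-000000000000}",  # placeholder CLSID
    )
    for key, values in layout.items():
        print(key)
        for name, value in values.items():
            print("   ", name or "(Default)", "=", value)
```

SAPI reads the CLSID from the token, creates the COM object, and hands the token back to it through ISpObjectWithToken, which is how the engine learns which voice it is supposed to be.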
Related
I am having trouble finding an answer to this question on the web.
The project I am developing requires that I save a recorded audio file and then transcribe the audio to text in order to find predefined keywords of interest.
I am using the Windows.Media.SpeechRecognition framework, and it works fine when you are transcribing the speech during the recording process. I can't find, in the same framework, a function which I can use with an audio file as input.
Does anybody know a good approach for this problem? Or another [free] framework for Windows Apps?
For online recognition, and in particular in JS projects, you can use Microsoft Cognitive Services directly; they are what powers online recognition in the SpeechRecognition API in Windows. It is free under some limits.
In particular, there is an open-source wrapper for JavaScript on GitHub: Oxford.Speech.JS. It can deal with both WAV files and the microphone. The sample code is designed as a website, but I'm pretty sure you can easily convert it into an HTML/JS-based UWP app.
I am making a Windows 8 application using C++/CX, but I cannot figure out how to include Azure features in my app. Everywhere I look, tutorials are given in C#, and I cannot find a tutorial in C++/CX anywhere.
Most of the samples for accessing Windows Azure Mobile Services are written in managed code, and it is true that there are no samples available in C++/CX. However, that does not mean you cannot do it. Keep in mind that you will need to do extra work to get it done.
Windows Azure Mobile Services use a RESTful interface, so you can access them from any language that can call a REST API. Here is the documentation for the Windows Azure Mobile Services REST API.
With C++/CX, you just need to find an appropriate framework for calling REST APIs; once you have that working, call the Windows Azure Mobile Services REST API to achieve your objective.
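To illustrate what such a REST call looks like independently of the client language, here is a minimal Python sketch that builds (but does not send) an insert request for a Mobile Services table. The host pattern `<service>.azure-mobile.net`, the `/tables/<table>` path, and the `X-ZUMO-APPLICATION` header reflect my understanding of the Mobile Services REST API and should be verified against its documentation; the service name, table name, and key are hypothetical.

```python
import json
from urllib.request import Request


def build_insert_request(service, table, app_key, item):
    """Build a POST request that inserts `item` into a Mobile Services table.

    Assumed URL shape: https://<service>.azure-mobile.net/tables/<table>
    Assumed auth header: X-ZUMO-APPLICATION carrying the application key.
    """
    url = "https://{0}.azure-mobile.net/tables/{1}".format(service, table)
    body = json.dumps(item).encode("utf-8")
    headers = {
        "X-ZUMO-APPLICATION": app_key,
        "Content-Type": "application/json",
        "Accept": "application/json",
    }
    return Request(url, data=body, headers=headers, method="POST")


if __name__ == "__main__":
    # Hypothetical service, table, and key; nothing is sent over the network.
    req = build_insert_request(
        "exampleservice", "TodoItem", "example-app-key",
        {"text": "Buy milk", "complete": False},
    )
    print(req.get_method(), req.full_url)
    for name, value in req.header_items():
        print(name + ":", value)
    print(req.data.decode("utf-8"))
```

Any HTTP client available to a C++/CX app can issue the same request: the method, URL, headers, and JSON body are all that the service sees.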
This blog post has some examples of how to perform various Azure Mobile Services tasks using C++ for Windows 8.1 Store apps. The IDE tooling mentioned in that post is not available for Windows 8 apps.
I guess the question pretty much says it all?
And I would prefer not to access cloud services such as Microsoft Translator and Project Hawaii?
Is there any direct API I can access?
(For metro apps)
I was able to get the Microsoft Speech Platform working on my Windows 7 laptop (both voice recognition and text-to-speech). You just need to install the SDK and the runtime. You can also download additional voice and language packs. I would think it should work on Windows 8 as well. Here is a good sample on how to set it up to recognize some basic phrases like "Find restaurants near Seattle".
There are some new APIs for Windows 8.1:
http://msdn.microsoft.com/en-us/library/windows/apps/windows.media.speechsynthesis.aspx
Text-To-Speech sample:
http://code.msdn.microsoft.com/windowsapps/Speech-synthesis-sample-6e07b218
//build video on Channel9:
http://channel9.msdn.com/Events/Build/2013/2-171
There are no text-to-speech or speech-to-text libraries available in .NET for Windows 8 apps. System.Speech and similar namespaces are not available. You will need to roll your own or find a compatible third-party library.
I use the Bing Translator service in my apps, but it has been removed from existence. The text translation services were brought into Azure, but speech was not brought over, and the old website for getting Bing Translator API keys was disabled. I hadn't heard of Project Hawaii before; I will have to check it out.
Microsoft's speech API seems available in Windows 7. Was it taken out of Windows 8?
You can now access Bing Services for Windows 8. The service has just been released and is in the Beta 1 stage.
Does anyone know a good library or sample code for adding a new entry to a domain using REST?
Amazon has a full .NET SDK.
Edit
I stand corrected! I wasn't aware Windows Phone was lacking full .NET support.
There is however a blog series here discussing accessing SimpleDb from Windows Phone 7. It covers the signature, which is probably the trickiest part to get right on your own!
It's also probably worth looking at the official .NET SDK in Reflector to see what is happening.
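To make the signature step mentioned above concrete, here is a minimal Python sketch of AWS Signature Version 2, which is the scheme SimpleDB's query API uses as far as I know. It builds a signed query string for a PutAttributes call without sending it. The endpoint host `sdb.amazonaws.com`, the API version `2009-04-15`, and all credentials, domain, and item names are assumptions or placeholders to verify against the SimpleDB documentation.

```python
import base64
import hashlib
import hmac
from urllib.parse import quote


def _enc(value):
    # RFC 3986 encoding: only unreserved characters (A-Z a-z 0-9 - _ . ~) stay literal.
    return quote(str(value), safe="-_.~")


def sign_v2(params, secret_key, host="sdb.amazonaws.com", method="GET", path="/"):
    """Return the canonical query string with a Signature parameter appended."""
    # 1. Sort parameters by name and percent-encode names and values.
    canonical = "&".join(
        "{0}={1}".format(_enc(k), _enc(v)) for k, v in sorted(params.items())
    )
    # 2. String to sign: method, lowercase host, path, canonical query.
    string_to_sign = "\n".join([method, host.lower(), path, canonical])
    # 3. HMAC-SHA256 with the secret key, then Base64.
    digest = hmac.new(
        secret_key.encode("utf-8"), string_to_sign.encode("utf-8"), hashlib.sha256
    ).digest()
    signature = base64.b64encode(digest).decode("ascii")
    return canonical + "&Signature=" + _enc(signature)


def put_attributes_query(domain, item, name, value, access_key, secret_key, timestamp):
    """Signed query for adding one attribute to an item in a domain."""
    params = {
        "Action": "PutAttributes",
        "DomainName": domain,
        "ItemName": item,
        "Attribute.1.Name": name,
        "Attribute.1.Value": value,
        "AWSAccessKeyId": access_key,
        "SignatureVersion": "2",
        "SignatureMethod": "HmacSHA256",
        "Timestamp": timestamp,   # ISO 8601, UTC
        "Version": "2009-04-15",  # assumed API version
    }
    return sign_v2(params, secret_key)


if __name__ == "__main__":
    # Placeholder credentials and names; the request is only printed.
    query = put_attributes_query(
        "ExampleDomain", "item-001", "color", "light blue",
        "EXAMPLEACCESSKEY", "example-secret", "2012-01-01T00:00:00Z",
    )
    print("https://sdb.amazonaws.com/?" + query)
```

The parts that usually go wrong are the parameter ordering and the percent-encoding (a space must become `%20`, not `+`), which is why both are isolated in small functions here. The same steps translate to any language that offers HMAC-SHA256 and Base64.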
The Amazon Web Services SDK for Windows Phone has recently been released. This SDK provides developers with a speed dial that lets you quickly connect and integrate Windows Phone applications with AWS S3, SimpleDB and SQS services.
The SDK can be found here.
These articles are not about the SDK. They are about manually connecting to SimpleDB from Windows Phone, and they were written before the SDK was released.
Here is a series of articles written in Turkish:
Original: http://muhammedbaykal.com/post/Windows-Phone-75-Amazon-AWS-SimpleDb-Part-I.aspx
Google Translated: http://translate.google.com/translate?sl=tr&tl=en&js=n&prev=_t&hl=tr&ie=UTF-8&layout=2&eotf=1&u=http%3A%2F%2Fmuhammedbaykal.com%2Fpost%2FWindows-Phone-75-Amazon-AWS-SimpleDb-Part-I.aspx&act=url
Part II: http://muhammedbaykal.com/post/Windows-Phone-75-Amazon-AWS-SimpleDb-Part-II.aspx
Google Translated: http://translate.google.com/translate?sl=tr&tl=en&js=n&prev=_t&hl=tr&ie=UTF-8&layout=2&eotf=1&u=http%3A%2F%2Fmuhammedbaykal.com%2Fpost%2FWindows-Phone-75-Amazon-AWS-SimpleDb-Part-II.aspx&act=url
Part III: http://muhammedbaykal.com/post/Windows-Phone-75-Amazon-AWS-SimpleDb-Part-III.aspx
Google Translated: http://translate.google.com/translate?sl=tr&tl=en&js=n&prev=_t&hl=tr&ie=UTF-8&layout=2&eotf=1&u=http%3A%2F%2Fmuhammedbaykal.com%2Fpost%2FWindows-Phone-75-Amazon-AWS-SimpleDb-Part-III.aspx&act=url
Can we call the built-in Windows Phone Mango voice feature programmatically, the way we use Launchers and Choosers?
Is there a way I can use this feature in my translator application, so that the application recognises speech in different languages and converts it to text in that language?
Does the Bing Translator service have this feature? If yes, how do I use it?
Also, how can I use the built-in voice commands of WP Mango to build my application?
Can we call the built-in Windows Phone Mango voice feature programmatically, the way we use Launchers and Choosers?
No. There is no speech API for Windows Phone as of right now. See Microsoft TellMe for an upcoming API.
Is there a way I can use this feature in my translator application, so that the application recognises speech in different languages and converts it to text in that language?
No
Also, how can I use the built-in voice commands of WP Mango to build my application?
You can't.
Not sure if you have found an answer for this, but Wade Wegner from the Azure team has a translation and OCR app built using the Bing translation services. I have not tried it, but it seems like something you could use.