Google Assistant on Raspberry Pi: text input, speech output

I'm trying to use Google Assistant on a Raspberry Pi. I'm using text as input, and it returns its output as text. Is it possible to get the output as speech instead?

If you're using the Google Assistant Service, you should be able to read the AudioOut data from the responses, which gives you the raw speech output.
The pushtotalk sample shows how to play that audio output.
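
As a rough sketch of what handling that raw output could look like: assuming the AudioOut chunks arrive as 16-bit little-endian PCM, mono, at 16000 Hz (these values are assumptions and must match whatever audio output config you request), you can join the chunks and wrap them in a WAV container using Python's standard wave module. The save_audio_chunks helper below is hypothetical, and its chunks argument stands in for the audio bytes collected from each response.

```python
import wave

# Assumed output format; it must match the audio output config sent in the request.
SAMPLE_RATE_HZ = 16000
SAMPLE_WIDTH_BYTES = 2  # 16-bit linear PCM
CHANNELS = 1


def save_audio_chunks(chunks, destination):
    """Write raw PCM chunks to destination (a path or binary file object) as WAV.

    Returns the number of PCM bytes written.
    """
    pcm = b"".join(chunks)
    with wave.open(destination, "wb") as wav_file:
        wav_file.setnchannels(CHANNELS)
        wav_file.setsampwidth(SAMPLE_WIDTH_BYTES)
        wav_file.setframerate(SAMPLE_RATE_HZ)
        wav_file.writeframes(pcm)
    return len(pcm)
```

The resulting file can then be played with any audio player on the Pi. Writing to a file first is only one option; the sample streams the chunks directly to the sound device instead.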

Related

Speech to Text Converter in Xamarin UWP

I need to implement an audio-to-text converter in the UWP platform-specific part of a Xamarin app. I am able to record the audio and save it, but now I need to convert that audio file into a text file. If anyone can suggest a good working example, that would be really helpful.
There is a Speech-To-Text API in UWP, but it's mostly for commanding, e.g. executing actions based on what the user is saying. At least that's what I've seen it used for. You can read about that here.
What you probably want is something like the speech recognition that Microsoft Cognitive Services offers. You can upload an audio file of up to 15 minutes and retrieve a transcript of the file. You can read about that here.

Access to input data from the microphone in WP7?

How can I access input data from the microphone in Windows Phone 7.0 or 7.1 (like a volume meter)? I want to detect the start and end of input from the microphone.
Have a look at How to: Access the Microphone in a Silverlight Application for Windows Phone for basic usage.
If you want to detect the start and stop of certain audio input you'll have to parse/process the returned stream yourself.
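
One common way to do that processing is a simple energy threshold: split the stream into short frames, compute each frame's RMS amplitude, and treat runs of frames above a threshold as active input. The sketch below is in Python rather than the phone's C#, purely to illustrate the idea. It assumes 16-bit little-endian mono PCM, the threshold of 500 is an arbitrary assumption you would tune, and frame_rms and detect_activity are hypothetical helper names.

```python
import math
import struct


def frame_rms(frame_bytes):
    """Root-mean-square amplitude of one frame of 16-bit little-endian mono PCM."""
    count = len(frame_bytes) // 2
    if count == 0:
        return 0.0
    samples = struct.unpack("<%dh" % count, frame_bytes[:count * 2])
    return math.sqrt(sum(s * s for s in samples) / count)


def detect_activity(frames, threshold=500.0):
    """Return (start, end) frame-index pairs, end inclusive, where RMS >= threshold."""
    frames = list(frames)
    segments = []
    start = None
    for index, frame in enumerate(frames):
        loud = frame_rms(frame) >= threshold
        if loud and start is None:
            start = index  # input just started
        elif not loud and start is not None:
            segments.append((start, index - 1))  # input just ended
            start = None
    if start is not None:
        # The stream ended while input was still active.
        segments.append((start, len(frames) - 1))
    return segments
```

The same RMS value also works as a basic volume meter. In practice you would likely require several consecutive quiet frames before declaring the end of input, so short pauses do not split one utterance into many segments.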
This blog post also has a great tutorial (and code) showing how you access the microphone in WP7 via XNA: http://blogs.msdn.com/b/mikeormond/archive/2010/08/27/xna-from-silverlight-on-windows-phone-7-the-microphone.aspx
Good luck!

How to show audio details when played using MediaPlayerLauncher?

In my application I am playing an audio file (say, an .mp3) using MediaPlayerLauncher. The file is downloaded from the network and stored in IsolatedStorage.
I managed to play the audio, but it was a bit disappointing to see that there are absolutely no details about the track being played.
Is there a way to show a picture and/or the track name when launching with MediaPlayerLauncher? Or am I forced to use MediaElement?
Unfortunately, if you're just playing an audio stream using the MediaPlayerLauncher, there is no way to specify additional (meta) data for the launcher to display.
If you want additional information displayed, you'll need to create the UI yourself and play the track via a MediaElement (or background audio).

Accessing the output audio buffer on Windows Phone 7

The MediaPlayer.GetVisualizationData method does not work for Windows Phone 7, so I'm trying to find another way to visualize music playing through the MediaPlayer.
The post Extracting sound spectrum data in WP7 gives a method for doing beat detection given an audio buffer, but it doesn't explain how to get this buffer on the phone.
How can I access audio samples/frequency data from the music MediaPlayer is playing? I'm not trying to access a buffer from the microphone - I need access to the output audio on the device.
See my answer to this other SO question:
Extracting sound spectrum data in WP7
"Visualization data is not available on Windows Phone. Any attempt to get visualization data on the phone will populate the VisualizationData class with 0s."

Device driver to act as a virtual web camera

I'm looking to write a virtual camera driver. Does anybody have any ideas?
Any helpful book or link would be appreciated.
Adding more details:
I have developed a device driver which saves the image to disk and the display uses the device driver to display the image. The performance does not seem good.
The functions I have used are:
//to capture
GetDesktopWindow()
CreateCompatibleBitmap()
Save()
//to display
WM_MOUSEMOVE
with a call to capture and display every time.
However, the display is not continuous; it only updates after the window loses focus and gains it again.
Should I use some other technique to record or display images? What would give better results? Please help.
Thanks,
-mitesh
What do you mean by virtual camera driver?
It is possible to write a virtual capture device using DirectShow. Such a virtual capture device can then be used by applications such as Skype. If that suffices for your needs, you can download vcam from http://tmhare.mvps.org/downloads.htm under the "Capture Source Filter" link.
Edit:
In order to use the capture device from the link I posted, you need to download the Windows SDK. The Windows SDK has a tool called "GraphEdit". If you search online, I'm sure you can find a quick GraphEdit tutorial. Basically, GraphEdit allows you to construct a multimedia pipeline by connecting a bunch of filters. (This is what happens in the background, for instance, when you play a movie on your computer.) This could be something like
web cam -> renderer
or
file source -> some decoder -> renderer
and would result in you seeing the video captured by the web cam or the content of the file. The example download shows how you can construct a virtual capture device, i.e. it looks like media is coming from a 'real' capture device, but you can actually generate any video you want if you adapt the code to your needs, e.g. take a screen grab and output that. Applications like Skype can pick up your virtual capture device if it is registered correctly.
The easiest way to find out whether this is sufficient for your needs is to download the capture source filter, register it with the regsvr32 command, and then use GraphEdit to insert the capture source into a graph, connect the source to a video renderer, and hit the play button. A lot of the concepts and keywords mentioned above might be new to you, but you can do some reading on each topic, and perhaps this will give you a starting point.
Edit 2:
Is the capture source filter approach not sufficient for your requirements?
1) AFAIR you stated in your (now deleted) answer that you would like to take a screen grab and use that as a virtual camera device in applications such as Skype.
If that is all you require, you do NOT have to write a device driver. DirectShow can do that perfectly well by means of the capture source filter. You would then need to learn some basic DirectShow and modify the source code of the capture filter to take screen grabs, etc.
As for books on writing a device driver to accomplish the same thing, I have no idea. The point I'm trying to make is that you need to determine whether you actually need to write a device driver, or whether simply modifying the open source capture filter is sufficient.
