Scenarios for On-Device Speech-to-Text Support - cobalt

// Whether the current platform has microphone supported.
#define SB_HAS_MICROPHONE 0
// Whether the current platform has speech recognizer.
#define SB_HAS_SPEECH_RECOGNIZER 0
// Whether the current platform has speech synthesis.
#define SB_HAS_SPEECH_SYNTHESIS 0
I'm curious about #define for these.
YouTube search box
Speak the video you want on the microphone
Search video
Ouput video thumbnail after voice data analysis
Is it implemented in this way?

Go to YouTube search page
Speak out the search key word
Voice recognition system recognizes the voice and returns it as text
Search video by the text and show thumbnails
More details are described in https://sites.google.com/a/google.com/youtube-leanback-partners/integration/voice-search
To test the voice search, you should use the following parameter. This parameter should not be used in the product though.
https://www.youtube.com/tv?env_supportsVoiceSearch=1
To launch voice search in the product, work with your technical PoC.

Related

Is it possible to switch between voices using SSML tags when doing text-to-speech on Windows?

I have Windows 7 with several Ivona voices (harposoftware.com) installed. I have an app that starts SpVoice from Sapi and makes it read a text. Now I also have a piece of text that I'd like to turn into a dialog between multiple voices.
Previously, to change between voices, my app had to cut a text into phrases and speak each phrase individually, selecting the required voice for it.
I was wondering if there is a better solution.
So, is it possible to embed some SSML tags in my text to enable switching between voices on the fly?
You should be able to use the <voice> element to change voices on SAPI 5.3.
Make sure the call to Speak() has the SPF_PARSE_SSML flag set.
Note that the name attribute in the <voice> tag is processor dependent, and can't have embedded spaces. (E.g., <voice required="Name=Microsoft Mark"> will definitely not do what you expect.)
This is described in the W3C specification, but it's easy to miss.
There's a much better description of how the voice tag works in this document by Claro Software.

how to create our own custom filter so that we get an access over there using directshow

I have a question regarding Media Playback which is as below .
Please solve my problem
"i have an audio stream which is successfully compressed by using direct show, now before entering it to the rendering filter
i need to create my own custom filter so that i have full access over it , because by using existing filter i am not able to get access over the file."// This is my question
I have read about Direct show.The Microsoft Direct Show application programming interface (API) is a media-streaming architecture for Microsoft Windows. Using Direct Show, your applications can perform high-quality video and audio playback or capture.
The Direct Show headers, libraries, SDK tools, and samples are available in the Windows SDK.
Please suggest..
Windows SDK also offers you samples. Gargle Filter Sample in \Samples\multimedia\directshow\filters\gargle is close to what you need: mid-point filter for audio with full control over streamed data.
// Summary
//
// A simple, in-place transform, audio effect which modifies the data
// in the samples that pass through it. The effect is an amplitude
// modulation with a synthesised secondary wave function.
// The secondary wave can be a triangular or square wave. A properties
// sheet allows the shape and frequency of the secondary wave to be chosen.
//
// At low modulation frequencies it sounds like a tremolo, at higher
// modulation frequencies it sounds like a distortion, adding extra
// frequencies above and below the original unmodulated sound.

Embedded youtube video with "autoplay=1". Does it count towards views?

I can cause an embed youtube video to begin playing automatically using "autoplay=1"
When a user views the embedded video does it count towards my views count?
I have attempted to test but had inconclusive results. Viewed embedded videos to not appear to consistently convert to views even when play is manually clicked.
My tests have been from a single machine and IP which may be the problem? Browser session does not seem to be relevant.
There's been some discussions regarding this topic. Here's a thread with some contradictory answers:
Does the views in the embed player count?
And here's an article about an official response from Youtube claiming that autoplay views does not count:
Youtube Says Autoplay Don't Count
Here's another one that claims that embedded videos with autoplay doesn't count:
How does YouTube video view count work?
(found through this SO answer)
Note 1: Note that there seems to exist "cheats" and bugs, but Youtube claims to be working to prevent it. This means that, if you're using their API in a regular way, your autoplay views probably doesn't count.
Note 2: The above note is simply Youtube's official statement. Who knows what's really going on. Some people (including you, OP) are getting mixed results.
The bottom line is, don't count on it (pun intended).
Short answer: NO
I found this official documentation on the subject (however it's regarding the Custom gadget development guidelines)
Rule of Thumb:
Any time a user initiates a playback using the native player controls, that playback should count. Using an API method to PLAY a video does not count. BUT, you can use cueVideoById and then have the user click a native built in play button and it will count.
Views will NOT be counted if:
The user initiates playback via an API method like playVideo or loadVideoById
The video autoplays via the autoplay parameter
You can try adding the attribute "enablejsapi=1" . This will increase the view count in youtube.com site
No. It was a long search, but the answer seems to be hidden here. Follow the link to check it out.
Taken from here.
https://support.google.com/youtube/answer/171780?hl=en&ref_topic=3014330

How to capture screen activities in windows phone 8?

i'm new to windows phone 8 and need your help to capture screen activities in a video. I've to make a video of the activities performing on screen?
one solution to this that strike in me is to capture the screen in image form by dispatching a timer at a instance of time but this is not a right way to do as i've to make a video of screen activities? suggest your opinion how to handle this problem.
There's no built in way of doing what you want.
You will need two things:
Do a dispatch timer as you describe
Find code that will encode these frames into a movie. That's not an API that the phone supports - you will need to find existing code and use it. I am not aware of such code existing, but I have only looked for it once or twice and not very hard. You could, potentially, create an MJPG which is a fairly simple video format, but even that's not trivial and the ending file size can be prohibitive.

How do I create a simple video effect filter for DirectShow or DMO?

How do I create a simple video effect filter for DirectShow or DMO?
What I need is simple - a tutorial or tutorials on how to create simple filter (like a brightness/contrast adjustment filter or any other pixel-per-pixel kind of filter) for filtering Direct Show Video astream (so I want to have a graph like "my Web Kamera" -> "My photoshop like filter" -> "rendering (or saving to file)".
What you are looking for is probably a DirectShow transform filter, there is a sample filter in the Windows SDK; EZRGB24 Filter Sample
http://msdn.microsoft.com/en-us/library/dd375468(VS.85).aspx

Resources