voice actions??? - XPERIA X10 Themes and Apps

I thought there was a 2.1 port of voice actions somewhere, but my googling has provided me with nothing, am I just being daft? or did that project end?

There's always Vlingo (now free) which I did see in the market for 2.1.

I have Voice Actions V1.4 wich works good on 2.1 but Vlingo is really good too.

Related

MS VoiceCommand and Voice Dialer Software Experience

I have spent some time looking at various shareware and commercial voice dialer apps this week.
I was really impressed with the specification of MS VoiceCommand No training required, context dependent actions (ie you say "Call So and So" and it will dial immediately if there is only one number, or will say "Mobile or Work?" if there is ambiguity, or you can just say"Call So and So at Work" and off it goes. Terrific.
Except it doesn't work on the XDA. Can't see why - it hears something (so "Show Calendar", "Help", "Yes" etc. work) but does not have enough mic sensitivity to do name recognition. This is so whether you hold the XDA 1 inch or 18 inches from your mouth, whether AGC is on or off.
So for a quarter of the cost I have bought NeuVoice Dialer, which works pretty well. Problems? Needs to be trained, no context dependency, so you either have to associate a specific number (Work, Mobile etc) with a voice tag, or you have to recognise the name first, the specific tag second, which means pressing the action button twice. Nause. Also not very easy in the car w/o a hands-free profile, which is what we are all after, isn't it?
So are my experiences unique? Would love to hear!
I'm using Voice Command with XDA II and it works great. How many processes are you running? I found that if you get close to the 32 process max then it doesnt work as well (see other posts in this forum about max processes). At 26 processes no problem.
Only two or three!! It is really strange - I can't see why it works so much worse than NeuVoice. Not noticed any particular probs with the phone either , so I don't think that the microphone itself is faulty, although it could be that.
How well is VOice Command working really? How many contacts do you have and does it really discrimate? I have about 500 contacts+.
Cheers
Stephen,
I doubt you only have 2 or 3 processes running, you mean applications visible through the memory option in settings. You need a third party tool like Taskmanager (available on Handango) to see the processes running (a bit like the Windows taskmanager) I have 500+ contacts and Voice Command really does a great job of discriminating between them. Try the stereo headset that comes in the box to rule out any microsphone issues. Also I run Voice Command with default settings, I didnt mess with the AGC Microsphone settings at all.
Out of the programs I tried for VoiceDialing, I found Fonix VoiceDial was the most accurate (using XDA 1)
Tried with the stereo lead - very, very slightly better, but no way is this stuff usable.
I'm pretty sure there is an architecture issue here - the fact that the AGC cannot be manually set, strange behaviour using teh recorder during phone calls etc. VoiceCommand may be great, but it don't work too well on my XDA2. Maybe I have a funny voice!!!
Cheers
XDAII - MS Voice Command experience
I am using MS VC with i-MATE for already several months and I have over 700 contacts...
Based on my experience - try to reinstall MS Voice Command - some times it works better.... Also do not speak to loud; do not keep it very close to your mouth. It works fine even if it is on a console between sits and I am driving (looking straight on the road).
And yes, you should check number of process running... http://www.scarybearsoftware.com/ppc_cn_overview.html - would be the best bet...
Good luck.
Paul
stephen_oliver said:
I'm pretty sure there is an architecture issue here - the fact that the AGC cannot be manually set, strange behaviour using teh recorder during phone calls etc. VoiceCommand may be great, but it don't work too well on my XDA2. Maybe I have a funny voice!!!
Click to expand...
Click to collapse
Don't forget, you need to speak with an American accent it you want this thing understand you...!
Yes, that has improved it - 135 notifications.
Not sure how to check active processes - got any suggestions on utilities?
I can now get recognition about 60-70% of the time. Sadly it still has problems with my "Yes"!! Maybe I should coach my voice to sound more patient :?
Thanks for the advice ...
How do you BUY VoiceCommand?? Won;t accept non-US addresses!!!!
As far as I know you can get trial for one or several days.
As per "buy it" I am not sure how you can do that not from states or Canada.
About YES - possibility that you just saying it too soon. Try pause for 1,5 2 seconds and after repeat if necessary.
Thanks - I've discovered Handandgo/uk so I'm all set now.
Cheers everyone!
Hate to say this but I tried Voice Command and gave up!
Sometimes it got it right, but for most of the time it got it wrong. I think I had better results when I tried an American Accent with a Scouse Accent (Livererpudlian for our international fans), but even then it was a burger less than a MacDonalds.
I seemed to think it was down to the XDA II only having AGC on and off choices and not a manual audio gain setting. I did try it with both the free version and the *cough* cracked version (for evaluation purposes of course) to little success. Saying that, didn't have much success with Via Voice either.
Anybody willing to "share" this software.... long term.. hehehehe
e-mail me at [email protected]
fella's please persit with and you will be rewarded with porbably the best voice recognition program available....
i nearly gave up but i'am glad i did'nt
It helped me going to the website and reading the help files and watching the demostration video...
Zetex
MS Voice Command and Fonix
G'day Guys,
I've persisted with MS Voice Command and agree with the previous posts.. If you've got your alerts sorted out and processes running low, it's a REALLY good package...
The Fonix VoiceDial and Voice Commander has excellent potential but I couldn't get the accuracy up and the volume was lousy - Crackly and distorted.. So I'm sticking with MS VC...
Still can't get "DIAL 1 2 3 4" etc. working AT ALL.. It's ALWAYS wrong?!?! But "Call CONTACT NAME" is consistently excellent.. So that's all I need it for!
Beats the heck out of having accidents on the road while trying to stylus the iMate!!!!
P.S. For the SmartPhone, Fonix VoiceDial is absolutely brilliant - I've also used that and it's consistently good.. Just a shame that there seems to be an issue with the iMate PDA (O2 XDA II) hardware and Fonix VoiceDial...
Cheers
Andy
Voice Command in Oz
Andy,
How did you get VC in Oz?. I have used the trial version and found it great (after learning to americanise my "strine"), but non US addresses just can't download. I am happy to pay for my programs, but if MS don't want my money...
Che?
Voice Command in Oz
Andy,
How did you get VC in Oz?. I have used the trial version and found it great (after learning to americanise my "strine"), but non US addresses just can't download. I am happy to pay for my programs, but if MS don't want my money...
Che?
usa purchase
Its very simple if you are only purchasing a software download - Give your real email address but gìve any usa mail address that you can see on packaging or from the web and the sale will go through! important the ZIP Code must be real
Neuvoice for XDA II
Thanks for that Sam. In the meantime, I have tried Neuvoice, and have actually found it is a better product. Apart from the need to train it, the recognition is better and I don't have to alter my accent

Speech Recognition API for Windows Mobile 6.1?

I've started developing an application that allows the user to compose and send an email completely hands-free... by voice command only.
However I'm having trouble finding a decent, open source (free) speech recognition (speech-to-text) engine / API to use.
Does anyone know of one? I tried PocketSphinx but had trouble compiling it in Windows using VS2008.
I'm wondering what API the Windows Live Search app uses? Its speech recognition capabilities are already decent, and if it's included with Windows Mobile or .NET Compact Framework 3.5 or Windows Mobile 6.1 itself, then I would prefer to use that. But I'm having trouble determining if this speech recognition is available to 3rd-party developers and, if so, how to interface with it.
Any help would be greatly appreciated!
OMG I hate timeouts lol
So I had this nice long post about how I thought it might be one of three things and I whipped out my omnia and disconnected the network and blah blah.
When I hit post, I got a not logged in timeout.
So here's the short of it:
It uses a server, that's probably related to UC aka Office Communications Server aka Speech Server 2007... you can get to it (and all the Microsoft Speech technologies, including Voice Command) here:
http://www.microsoft.com/speech/speech2007/default.mspx
A little more searching lead me to read the MSDN Channel 9 blog on said subject:
http://blogs.msdn.com/speech/archiv...h-for-mobile-now-with-speech-recognition.aspx
which states:
"The speech recognition functionality for the application doesn't actually sit on the Windows Mobile phone. Instead, the phone takes your speech input, sends it to a server, the server does it's recognition magic, and sends the results back to the phone. "
Speech Server 2007
Thanks for the reply MerlinJim... sucks about the timeout! That's why on a long post I always copy the text to the clipboard... that way if it times out I can just paste it in! (It's happened to me too many times for me to not do that now!)
Yeah I've looked at Speech Server 2007 as well... and I was thinking that maybe Live Search offloaded the speech recognition to a server. There's a little lag between what you say and when it guesses what you said.
I guess something like that would work. If you're writing an email then you need an Internet connection, and so sending the voice data to a speech server would be plausible. The only downside would be if it used up a lot of data transfer/bandwidth, and the user was on metered bandwidth.
The lag would be a bit of a drawback, because if the Speech Server guessed incorrectly what you said, but you kept talking (due to the processing lag), then you would have to go back and correct what you had said.
And also sometimes the Live Maps speech recognition is WAY off. Like I'll say "1 Jefferson Parkway" and it will come back with something like "Did you say 'Parkstone Apartments?'"
It's also speaker-independent, so you don't do any training. I would rather train an app to recognize my voice specifically, because I would be the only user of it.
But it may be my only solution for right now. Thanks for the info! I was beginning to think that no one knew the answer.
acrosser said:
Thanks for the reply MerlinJim... sucks about the timeout! That's why on a long post I always copy the text to the clipboard... that way if it times out I can just paste it in! (It's happened to me too many times for me to not do that now!)
Yeah I've looked at Speech Server 2007 as well... and I was thinking that maybe Live Search offloaded the speech recognition to a server. There's a little lag between what you say and when it guesses what you said.
I guess something like that would work. If you're writing an email then you need an Internet connection, and so sending the voice data to a speech server would be plausible. The only downside would be if it used up a lot of data transfer/bandwidth, and the user was on metered bandwidth.
The lag would be a bit of a drawback, because if the Speech Server guessed incorrectly what you said, but you kept talking (due to the processing lag), then you would have to go back and correct what you had said.
And also sometimes the Live Maps speech recognition is WAY off. Like I'll say "1 Jefferson Parkway" and it will come back with something like "Did you say 'Parkstone Apartments?'"
It's also speaker-independent, so you don't do any training. I would rather train an app to recognize my voice specifically, because I would be the only user of it.
But it may be my only solution for right now. Thanks for the info! I was beginning to think that no one knew the answer.
Click to expand...
Click to collapse
perhaps, but there IS a speech application loaded ON a Windows Mobile 6.1 which has text-to-speech capabilities and speech recognition
(my Blackjack II loaded with Wm6.1 has this capability)
can't find any API to use it though... only way to activate this TTS capability is to
1) sms announcing
2) appointment announcing
3) call announcing
no actual program to do TTS...
Any progress on this or any other speech-to-text program? I'm really interested in finding one.
Wouldn't mind being a beta tester, either.
*Double Post*
DELETE

SIP dialing for Gingerbread 2.3 Discussion

In reviewing the new page on the nexus s, the only thing I really am interested in is the internet dialing/SIP. I have tried SIPdroid on the captivate, and it works ok, but very staticy on the other end. I am curious as to whether anyone knows where the SIP client for 2.3 came from. Is it SIPdroid? Did Google buy SIPdroid? I would also be very interested in flashing only the SIP part of Gingerbread 2.3, the rest of the update isnt all that interesting to me. If anyone comes across info or files related to this, post here.
Thanks
Benny1234 said:
In reviewing the new page on the nexus s, the only thing I really am interested in is the internet dialing/SIP. I have tried SIPdroid on the captivate, and it works ok, but very staticy on the other end. I am curious as to whether anyone knows where the SIP client for 2.3 came from. Is it SIPdroid? Did Google buy SIPdroid? I would also be very interested in flashing only the SIP part of Gingerbread 2.3, the rest of the update isnt all that interesting to me. If anyone comes across info or files related to this, post here.
Thanks
Click to expand...
Click to collapse
Google bought Gizmo5 about 1 year ago. This is probably where they got the SIP client.
From what I read on the official document, internet calling works via wifi only.
I hope and i'm sure the devs will modify this to make it work via 3G.
A few things:
1) You are using VoIP over 3g? It's probably not goind to sound great, and any free client you have is not going to implement g729 b/c it requires a license so GSM is your best bet for low bandwidth, and it doesnt sound great.
2) a. No it is not sip droid. b. Sipdroid sucks - it is a terrible client. Basically the PBXes.org people who "wrote" sipdroid took the example "client" from mjsip and jammed an android interface on it. The backend sip implementation is pretty crappy, doesnt support call transfer, does multiple registrations and can flood a registrar.
2) CSipSimple is a much more promising client (IMO). And while has not yet implemented transfers yet, it is based on pjsip, a very well implemented sip stack that does fully support these features. Also, I have found that CSipSimple has less fuzzy audio too.
3) I have been looking at the 2.3 SIP stuff - It's not like you can just take that part from 2.3 and "flash" it to your device.
Besides that, the implementation that is in the 2.3 SDK looks very simplistic so far - and it is unclear to me whether or not the dialer will allow native SIP calling, or if google has just put in sip libraries for people to develop SIP applications. I see no where in any settings that allow you to specify "phone wide" sip credentials.
This would seem to be further backed by the sample "walkie talkie" application that is available with the 2.3 sdk.
4) I doubt google bought gizmo5 for their sip stack/client. There are many highly compliant open source sip stacks freeley available: sofia-sip, pjsip, jain-sip etc. etc. I dont know what is "under the hood" but what google has exposed via the SipProfile so far looks to be sub-par.
5) "Internet calling" support isnt going to be enabled on all devices, although i think the nexus s probably will be one of the few, in fact even in the AVD I get an "internet calling not supported" message when I try to call a sip URI from the dialer.
Your best bet right now - try out CSipSimple. Native SIP support is going to be a bit raw for a while is my guess.
SIP dialing
Thanks etamme ! Thats some good information! Yeah I would like to use SIP for international calls, wifi and 3G data. I use the phone when we travel and have foreign sims. So making GV /G5 calls would be perfect. I used to use an iphone, but unlocking became a PITA recently. (read as 6.15.00 BB)
Will check out CSSipSimple, I assume it will work with G5. I'm glad someone else realizes Sipdroid is terrible. Every time I use it the other party complains. With all the updates theyve done they have failed to address a major problem.
Thanks again!
have you tried this?
i'm trying it now
http://androidandme.com/2010/12/news/how-to-place-and-receive-internet-calls-with-android-2-3/
Benny1234 said:
I am curious as to whether anyone knows where the SIP client for 2.3 came from.
Click to expand...
Click to collapse
Gingerbread has a built-in SIP stack.
etamme said:
A few things:
1) You are using VoIP over 3g? It's probably not goind to sound great, and any free client you have is not going to implement g729 b/c it requires a license so GSM is your best bet for low bandwidth, and it doesnt sound great.
2) a. No it is not sip droid. b. Sipdroid sucks - it is a terrible client. Basically the PBXes.org people who "wrote" sipdroid took the example "client" from mjsip and jammed an android interface on it. The backend sip implementation is pretty crappy, doesnt support call transfer, does multiple registrations and can flood a registrar.
2) CSipSimple is a much more promising client (IMO). And while has not yet implemented transfers yet, it is based on pjsip, a very well implemented sip stack that does fully support these features. Also, I have found that CSipSimple has less fuzzy audio too.
3) I have been looking at the 2.3 SIP stuff - It's not like you can just take that part from 2.3 and "flash" it to your device.
Besides that, the implementation that is in the 2.3 SDK looks very simplistic so far - and it is unclear to me whether or not the dialer will allow native SIP calling, or if google has just put in sip libraries for people to develop SIP applications. I see no where in any settings that allow you to specify "phone wide" sip credentials.
This would seem to be further backed by the sample "walkie talkie" application that is available with the 2.3 sdk.
4) I doubt google bought gizmo5 for their sip stack/client. There are many highly compliant open source sip stacks freeley available: sofia-sip, pjsip, jain-sip etc. etc. I dont know what is "under the hood" but what google has exposed via the SipProfile so far looks to be sub-par.
5) "Internet calling" support isnt going to be enabled on all devices, although i think the nexus s probably will be one of the few, in fact even in the AVD I get an "internet calling not supported" message when I try to call a sip URI from the dialer.
Your best bet right now - try out CSipSimple. Native SIP support is going to be a bit raw for a while is my guess.
Click to expand...
Click to collapse
Running Gingerbread right now...I can tell you that "Internet Calling" works straight from the dialer. However, my experience with built in SIP over wifi on Pbxes.org is that cSipSimple is more usable. There is a noticeable lag using the built in SIP configuration and only UDP works for PBXes since it doesn't allow a hybrid mode like cSipSimple (TCP for registration, UDP for connection due to PBXes incomplete TCP implementation). The problem I have with cSipSimple is that the mic gain is way too high and it picks up all the background noise.
Benny1234 said:
In reviewing the new page on the nexus s, the only thing I really am interested in is the internet dialing/SIP. I have tried SIPdroid on the captivate, and it works ok, but very staticy on the other end. I am curious as to whether anyone knows where the SIP client for 2.3 came from. Is it SIPdroid? Did Google buy SIPdroid? I would also be very interested in flashing only the SIP part of Gingerbread 2.3, the rest of the update isnt all that interesting to me. If anyone comes across info or files related to this, post here.
Thanks
Click to expand...
Click to collapse
Looks to me that it is based on JAIN-SIP from inspecting the AOSP source. Have look at http://android.git.kernel.org/?p=platform/external/nist-sip.git;a=tree;hb=HEAD
JAIN-SIP was developed by NIST and the AOSP references NIST also. Haven't dug deeper to confirm however.
etamme said:
A few things:
3) I have been looking at the 2.3 SIP stuff - It's not like you can just take that part from 2.3 and "flash" it to your device.
Click to expand...
Click to collapse
What would it take to extract the SIP bits from the AOSP and use it as a reference library for a SIP enabled application? I have never tried but it should be possible no?
Wrong mic (top)
OCedHrt said:
The problem I have with cSipSimple is that the mic gain is way too high and it picks up all the background noise.
Click to expand...
Click to collapse
You can set the mic gain:
While in a call, click the menu button (bottom left)
Select Media
Move the Mic slider left to lower the volume (quickly before the screen goes back to default).
However, this does not really solve the problem because cSipSimple uses the mic on top of the phone which does not pick up your voice very well, and pics up the sound from the receiver (speaker) causing the person on the other end to hear an echo of everything they say.
I have not been able to find a way to change this behavior.
anyone having an issue with phone calls being rejected with sip dialing? I've tried sipdroid, csipsimple and native sip. using google voice forwarding a number i recieved from ipkall, and using iptel.org instead of pbxes

Any way to change the android voice?

I have tasker run functions, and it sometimes talks to me, now coming from a 2.3 device to a 4.3 I see the voice is different, and it's not terribly nice, are there any alternative voices to try? I know in the past other makers (Samsung) have had different voices)
I'll answer my own question in case anyone else finds this useful.
If you're unhappy with text to speech on your android device, there's several options to try both paid & unpaid.
For me (after extensive research & testing) I found this solution works best.
Svox Classic Text To Speech Engine
https://play.google.com/store/apps/details?id=com.svox.classic
and the voice I used for maximum clarity & easy on the ears was this:
SVOX US English Grace Voice
https://play.google.com/store/apps/details?id=com.svox.classic.langpack.eng_usa_fem
Naturally your mileage will vary, but this one worked for me. It doesn't seem to work in google now speech, but in maps (navigation), tasker & FBReadet TTs it does.

Google Speech API: distinguish voices

Hello
I am working on my dissertation & I want to use Google Speech API to distinguish users by there voice.
I have seen and imported the Google Speech API Sample from
https://github.com/GoogleCloudPlatform/android-docs-samples/tree/master/speech/Speech
Before I was trying to use DialogFlow (used to be API.AI), didnt have much luck. Thought it might be better to use Google Speech API instead.
is this even possible to start with?
I was also looking at "Actions on Google"
https://developers.google.com/actions/identity/user-info
thanks for any input
Have you figured out how to achieve it?

Categories

Resources