[Q] Question about android TTS Engines (newbie) - Samsung Infuse 4G

Looked all over for an answer, hope someone can enlighten me. Tried both the pica and SVOX text-to-speech engines in my Infuse4G. Both work great EXCEPT certain words like "the" and "of" are spelled out, rather than pronounced. It's maddening to hear t-h-e instead of the completed word; this makes the TTS function basically unusable. I am using word docs in Quickoffice for testing.
Any way to modify either engine so they stop spelling words?

Wow I've used SVOX and I've never had that problem before

@3GBeyotch Thanks for your response. Upon further checking I found that the TTS works fine if I originate a document in Quickoffice. Problems occur when I paste text from the clipboard. I took this pasted text and stripped all hidden characters by passing it through notepad, then pasted straight ASCII to a file. When transferred to android, the TTS engine still screws up words. Very bizarre. Generate text natively in Quickoffice and all is OK. Bring clean text into Quickoffice and TTS takes a dump on certain short words such as 'the' and 'of' by spelling them.
This may be one of those computer things that is beyond explanation

Related

Speech Recognition API for Windows Mobile 6.1?

I've started developing an application that allows the user to compose and send an email completely hands-free... by voice command only.
However I'm having trouble finding a decent, open source (free) speech recognition (speech-to-text) engine / API to use.
Does anyone know of one? I tried PocketSphinx but had trouble compiling it in Windows using VS2008.
I'm wondering what API the Windows Live Search app uses? Its speech recognition capabilities are already decent, and if it's included with Windows Mobile or .NET Compact Framework 3.5 or Windows Mobile 6.1 itself, then I would prefer to use that. But I'm having trouble determining if this speech recognition is available to 3rd-party developers and, if so, how to interface with it.
Any help would be greatly appreciated!
OMG I hate timeouts lol
So I had this nice long post about how I thought it might be one of three things and I whipped out my omnia and disconnected the network and blah blah.
When I hit post, I got a not logged in timeout.
So here's the short of it:
It uses a server, that's probably related to UC aka Office Communications Server aka Speech Server 2007... you can get to it (and all the Microsoft Speech technologies, including Voice Command) here:
http://www.microsoft.com/speech/speech2007/default.mspx
A little more searching lead me to read the MSDN Channel 9 blog on said subject:
http://blogs.msdn.com/speech/archiv...h-for-mobile-now-with-speech-recognition.aspx
which states:
"The speech recognition functionality for the application doesn't actually sit on the Windows Mobile phone. Instead, the phone takes your speech input, sends it to a server, the server does it's recognition magic, and sends the results back to the phone. "
Speech Server 2007
Thanks for the reply MerlinJim... sucks about the timeout! That's why on a long post I always copy the text to the clipboard... that way if it times out I can just paste it in! (It's happened to me too many times for me to not do that now!)
Yeah I've looked at Speech Server 2007 as well... and I was thinking that maybe Live Search offloaded the speech recognition to a server. There's a little lag between what you say and when it guesses what you said.
I guess something like that would work. If you're writing an email then you need an Internet connection, and so sending the voice data to a speech server would be plausible. The only downside would be if it used up a lot of data transfer/bandwidth, and the user was on metered bandwidth.
The lag would be a bit of a drawback, because if the Speech Server guessed incorrectly what you said, but you kept talking (due to the processing lag), then you would have to go back and correct what you had said.
And also sometimes the Live Maps speech recognition is WAY off. Like I'll say "1 Jefferson Parkway" and it will come back with something like "Did you say 'Parkstone Apartments?'"
It's also speaker-independent, so you don't do any training. I would rather train an app to recognize my voice specifically, because I would be the only user of it.
But it may be my only solution for right now. Thanks for the info! I was beginning to think that no one knew the answer.
acrosser said:
Thanks for the reply MerlinJim... sucks about the timeout! That's why on a long post I always copy the text to the clipboard... that way if it times out I can just paste it in! (It's happened to me too many times for me to not do that now!)
Yeah I've looked at Speech Server 2007 as well... and I was thinking that maybe Live Search offloaded the speech recognition to a server. There's a little lag between what you say and when it guesses what you said.
I guess something like that would work. If you're writing an email then you need an Internet connection, and so sending the voice data to a speech server would be plausible. The only downside would be if it used up a lot of data transfer/bandwidth, and the user was on metered bandwidth.
The lag would be a bit of a drawback, because if the Speech Server guessed incorrectly what you said, but you kept talking (due to the processing lag), then you would have to go back and correct what you had said.
And also sometimes the Live Maps speech recognition is WAY off. Like I'll say "1 Jefferson Parkway" and it will come back with something like "Did you say 'Parkstone Apartments?'"
It's also speaker-independent, so you don't do any training. I would rather train an app to recognize my voice specifically, because I would be the only user of it.
But it may be my only solution for right now. Thanks for the info! I was beginning to think that no one knew the answer.
Click to expand...
Click to collapse
perhaps, but there IS a speech application loaded ON a Windows Mobile 6.1 which has text-to-speech capabilities and speech recognition
(my Blackjack II loaded with Wm6.1 has this capability)
can't find any API to use it though... only way to activate this TTS capability is to
1) sms announcing
2) appointment announcing
3) call announcing
no actual program to do TTS...
Any progress on this or any other speech-to-text program? I'm really interested in finding one.
Wouldn't mind being a beta tester, either.
*Double Post*
DELETE

Looking for a speaking ebook reader

Hi,
Does such a PocketPC application exist:
An ebook reader, capable of handling at least *.TXT files, that shows the text on the screen like any other ebook reader, but featuring like a play button to read the text with syntesized voice?
Even better would be to be able to output that voice through a bluetooth headset or carkit.
I read a lot of ebooks, but obviously while i.e. driving, it is not possible to do so. Instead of hearing radio or trying to burn an audio CD with rendered reading (such software exists for PC's), I would love to have the book read to me by my PDA.
I have searched Google for such an application, but did not found what I am looking for... Any ideas?
Cheers,
vma
TapText makes some text-to-speech products that I think can be paired with a reader and read audibly. They even have a trial you can download that allows for testing a small excerpt at a time (160 characters, I think) but you have to listen to the admonition and demo prompt prior to hearing your text. You can disable this annoyance by (here's a thought) purchasing the product. I am a bit in a rush but wanted to get this out to you so that you had an answer to your question. If you have trouble finding this app. please PM me and I'll send you a copy of the trial cab. I may also have some alternatives for you as well.
On another point. Look into GoogleVoice as an alternative option for visual voice mail. I use voice recognition and text-to-speech, as well as dictation software quite a bit. So I'll have to dig up some of my old cabs for you. Many are now quite hard to find, so contact me directly for copies of the same. Happy to share. Gotta run.
Regards,
LWBIIPLLC

Android - Voice To Text - Preferred Words?

I've been using the voice to text feature a lot lately, mostly because I am a heavy emailer and texter. Once you get used to specifically saying "hello comma there period" to yield hello, there. it isn't so bad.
The only issue I have is this. I haven't ran into it with other words but I'm curious in particular if this is possible.
Whenever I text and say my other half's name, it comes up as "Christy" since that is the most common spelling. Problem is, that's not it... it's "Kristi."
Any idea how I can get the system to prefer the K spelling instead of CH?

Any way to change the android voice?

I have tasker run functions, and it sometimes talks to me, now coming from a 2.3 device to a 4.3 I see the voice is different, and it's not terribly nice, are there any alternative voices to try? I know in the past other makers (Samsung) have had different voices)
I'll answer my own question in case anyone else finds this useful.
If you're unhappy with text to speech on your android device, there's several options to try both paid & unpaid.
For me (after extensive research & testing) I found this solution works best.
Svox Classic Text To Speech Engine
https://play.google.com/store/apps/details?id=com.svox.classic
and the voice I used for maximum clarity & easy on the ears was this:
SVOX US English Grace Voice
https://play.google.com/store/apps/details?id=com.svox.classic.langpack.eng_usa_fem
Naturally your mileage will vary, but this one worked for me. It doesn't seem to work in google now speech, but in maps (navigation), tasker & FBReadet TTs it does.

[Q] Perfect Keyboard Pro question ... ?

I've tried dozens of keyboards but always end up landing on Perfect Keyboard Pro as my default. Its T9 key and voice key are appropriately placed, and its basic, yet customizable layout is superb!
There is but one little annoyance. It seems to come preloaded with dictionary words that I can't delete.
For example, if I type the word after, then AfterDawn pops up as a suggested word (as much as I like AfterDawn, I just don't type it enough for it to be in my dictionary). Another example would be if I type the word only, I get Molybdenum as a suggestion (I had to look that word up).
These suggestions are not in the stock dictionary, and don't pop up as suggestions in any other keyboard I install, so it must be exclusive to PKP.
Tried emailing you at [email protected] but no response, has anybody had any luck with this app's user dictionary?

Categories

Resources