backtop


Print 8 comment(s) - last by ZardozFrayn.. on Apr 14 at 11:34 AM

My hobby project for years has been a Babel device. In a relatively short time I am not too far away from a sample -- where is my competition?

When I was ten or eleven I read one of the first books that changed my life -- The Hitchhiker's Guide to the Galaxy.  Although later in life I tended to shy away from Douglas Adam's point of view, the concept of the Babel Fish was perhaps the most interesting and exciting things I had ever read about, even if it was made up.

The Babel Fish was a very silly creature adventurers would put in their ears. Once in your ear, the fish would interpret audible language from other creatures and quasi-telepathically put the meaning in your head.  The idea was so elegant and succinct that AltaVista later named its automatic translation service Babel Fish.

Several years ago I started working on my own Babel Fish -- a device you could actually put in your ear and get live translation of the world around you.  I thought as long as I had a blog on DailyTech, I could document the progress of my ultimate hobby project. Even back in 2000 we had more or less the components needed to do something like this:

  • Microphone attached to some sort of mobile device LI>
  • Speech to Text Software, a la Dragon Naturally Speaking (and others)
  • Automatic Text Translation, AV Babel Fish, Google Translation, etc
  • Text to Speech Software, AT&T Natural Speaking
  • Bluetooth headsets

The components have gotten a little better over the years, but the basic elements are more or less the same (though maybe I would opt for one of these bad boys instead of a headset). However, since 2000 processing power has increased dramatically.  My 200MHz Qtek 9100 PDA has no problem going on the Internet and actively translating text through a parser.

Unfortunately, there are some horrible walls I've run into while creating my Babel Fish.  The first is the delay in transmission.  Even when my speech to text software picks up a good block of conversation, there is a half second or so delay before it reaches my text parser.  From there it is another several seconds while the parser attempts to determine the language, connects to the internet and parses the text back to English.  The written English to audible English translation takes another second or two.  After all is said and done, the turnaround time for a basic translation is upwards of ten seconds.

There are some considerable steps I can take to reduce this -- moving the translation software onto the PDA, threading the input stream and increasing the processor speed.  I am working on all of these and I will report my progress in the next few months.

The much larger problem I have run into is getting the speech to text software to readily recognize languages correctly.  While I have no problem "selecting" a module for Chinese or a module of Japanese beforehand, every text to speech package I have dealt with has a horrible recognition rate for Asian languages (which, quite frankly are the only ones I am interested in developing this software for).  Spanish, so far, has been the easiest.  I have heard good things about the Microsoft Speech SDK, but the majority of my programming is already done on Linux.  More on this problem in the next few months as well.

With all the hurdles I need to overcome, the progress I have made has been relatively easy for a single programmer working in spare time.  If Sony or Samsung decided to put some effort into a project like this, everyone would have Babel Fish devices within a year.  I wonder why no major vendor has picked this up and run with it.



Comments     Threshold


This article is over a month old, voting and posting comments is disabled

Great minds...
By jskirwin on 4/12/2006 9:27:38 AM , Rating: 2
Think alike. I too was inspired by the Babelfish after living in several foreign countries and having as much of a knack for language as a pig does for flying. I even went so far as to spec out a system along the lines of yours, but I never made it into the prototype phase. Congratulations! Work out the bugs and I'll buy one!

That said, some points:
1. Why not skip the language determination phase and load modules instead? When I travel to France, I need French - not Farsi. You are making it unnecessarily complicated for little gain methinks.

2. Instead of connecting to the Internet, why not connect to a translation program loaded locally? That would speed things up.

Why hasn't it been done? Because it hasn't. Just do it yourself and make it real. There is a definite need for this device.

Good luck!




RE: Great minds...
By KristopherKubicki (blog) on 4/12/2006 10:10:05 AM , Rating: 2
Yeah I agree with you about the modules. Thanks for that confirmation :)


RE: Great minds...
By dilz on 4/13/2006 5:53:22 PM , Rating: 2
Sorry to be the bearer of bad news, but I think you're going to have a difficult time moving forward on this project - not because it isn't a good idea, but because I'm the the IP has been tied up already.

Check this out:
http://www.phraselator.com/


Next step....
By Cocytus on 4/12/2006 1:19:17 PM , Rating: 2
KK,

Have you beaten the bushes for capital? Are you planning to contact one of those companies in the near future or after you actually have a sample?

If there aren't currently any major competing projects out there...then potentially you could be sitting on a gold mine.




better find a patent lawyer
By ElFenix on 4/13/2006 11:59:09 AM , Rating: 2
even if your device isn't commercially successful, one of those big boys you mentioned might have one that's similar.




Stopp patting yourselves on the back
By jmunjr on 4/13/2006 12:51:36 PM , Rating: 2
While a "Babel Fish" would indeed be quite a marvel, the gizmo you speak of is nothing special nor terribly novel. Maybe if you put up a few pages along with a review about it there might be a little room for applause, as none of the dozens of other people I know with the same idea haven't done so either.. Then again, they've got thousands of ideas they either never started or if started, they got bored, and never completed.

Let's see your project in action.




Inventor
By Alphafox78 on 4/13/2006 4:42:36 PM , Rating: 2
You should go on american inventor and get some $$ for R&D




Most likely scenario...
By ZardozFrayn on 4/14/2006 11:34:39 AM , Rating: 2
Seems like with the convergence of functionality into the mobile phone, that the first practical application would be to build speech recognition / translation into cell phones, if this hasnt happened already. Seem to recall reading about the US military already has a version of this, with handy phrases like "drop the gun" or "kiss your ass goodbye" and such.

Perhaps when we get 64 cpu multi-core processors chips with holographic storage that fits in a cell phone, and some pattern matching software that can use the multiple CPU threads and massive storage to quickly produce decent untrained speech recognition and translation it will happen.

I would guess thats about 5 years out.

... And yes, its a bitch trying to understand the tonal inflections that change meaning in Chinese.




"If they're going to pirate somebody, we want it to be us rather than somebody else." -- Microsoft Business Group President Jeff Raikes

Related Articles
















botimage
Copyright 2014 DailyTech LLC. - RSS Feed | Advertise | About Us | Ethics | FAQ | Terms, Conditions & Privacy Information | Kristopher Kubicki