Bing

How do you spell Levenshtein?

December 06, 2004, 04:08 PM by Bing | 13 Comments

Doing a good job of helping Search users to correct misspelled queries is super important for two main reasons: a) 5 billion crawled docs and a bleeding edge ranking algorithm can’t do much if the query isn’t spelled right and b) more than 10% of all searches are misspelled! So we made sure our new search engine included a revamped spelling correction system that’s much better than our old one.

To improve the speller we worked with Silviu Cucerzan and Eric Brill from Microsoft Research’s Text Mining, Search and Navigation Group. Silviu and Eric have developed some novel techniques for using search query statistics and iterative transformation of query strings to improve spell correction. Their published paper on this topic – Spelling correction as an iterative process that exploits the collective knowledge of web users –goes into much more detail on some of the technical thinking that inspired the spelling correction system we built.

Among others, we also got help from Greg Hullender, a Search engineer who previously worked on some of the earliest spell correction systems and who more recently architected the handwriting recognition algorithms for Microsoft’s Tablet PC software. My main contribution to the spelling project was that I’m a uniquely terrible speller.  :)  This “talent” enabled me to rigorously put the speller through its paces before we made it public.

Here are a few examples of misspelled searches plucked from the query logs for which our new speller does much better than the old one:
• cheaspeake wall coverings
• how to surive in alaska
• federal plus studend loan program

Despite being a significant improvement for MSN Search users, the new spelling correction system is still far from perfect. Looking through user feedback, there are still a few areas in which we have room for improvement. For example, we often suggest corrections for correctly spelled names, and we are seeing this a lot during our beta phase since many users query their own name to evaluate our new search engine. There are also cases where we’re not making simple corrections, such as Helo 2 or caffine levels of sodas. (Yes, you guessed it, both of these errors were sent to me from folks on the Halo 2 team.)

An accurate search spelling correction system is essential to helping users find the information they’re looking for. So in the coming months we’ll be working to increase the accuracy of the spelling correction system. We’ll also be expanding the system to include many more languages, since the English and German-speaking users of our search engine aren’t the only imperfect spellers on the planet.  :)   Please continue to send us feedback via the “help us improve” link when you encounter the spelling system not doing the right thing. We try diligently to read everything that comes in!

Lastly, if you haven’t yet seen the documentary Spellbound that was in theatres about 2 years ago, you should drop everything and rent it! It follows a bunch of students from all over the USA competing to win the 1999 National Spelling Bee competition. Spellbound is an inspiration to bad spellers everywhere.

- Oliver Hurst-Hiller, Program Manager

subscribe

Comments

iamrussell

Posted On December 06, 2004, 04:39 PM
Launch already!! Google needs a good butt kickin'!

Cygnus

Posted On December 06, 2004, 05:32 PM
"So in the coming months we’ll be working to increase the accuracy of the spelling correction system."

I hope you aren't planning on getting every single bell and whistle set before launching live; the results in the beta are more relevant than the main site results...would like to see MSN take the proverbial plunge -- it is more than ready to go against Y! and G.

BTW, that was an interesting cookie test with the beta results temporarily going live -- I hope it satisfied the scalability tests.

Cygnus

Michael Griffiths

Posted On December 06, 2004, 08:51 PM
Nice

N. P.

Posted On December 07, 2004, 01:51 PM
Google has launched Google Scholar. With it you can search through academic paperss and journals.
I would like MSN Search to do the same, plus include a way to search through online free books like the Online Books Page or search for non-free electronic books in general like the www.mslit.com page. There are thousands and thousands of free books provided by the Project Gudenburg and they need a way of searching. A way to compare e-book prices from various publisher is also useful. Add journal search as well. This will need MSN paying fees to publishers that it would be something that no other search engine does.
This would be extremely useful since the Internet can often give unreliable information on a subject. Adding scholarly papers and free ebooks for reference at the top of the search list would enable us to be more certain for the information we gather from the Internet.
It would help students, teachers and academics a lot!

Alexandre Kavinski

Posted On December 08, 2004, 04:32 AM
"We’ll also be expanding the system to include many more languages"

Whichh languages come next???

Sam

Posted On December 08, 2004, 05:57 AM
There is another problem with improper spelling:

With so many folks posting web content a lot of the web pages contain spelling errors, too. And searching using the correctly spelled word will not find these misspelled pages.

Is there a solution (or part of one) to this problem, too?

Sam

Joey

Posted On December 09, 2004, 06:36 AM
Cygnus> One would think that the misspelled words would render in the results with a prompt to check for the correct (suggested) spelling - google'esqe

Nektar

Posted On December 09, 2004, 11:22 AM
I would like to see support for the Greek language in the next version!
Really now, MSN Search should employ the same tactic as Google by which people can volunteer to translate the interface into their native language. I have many friend who use Google because it is the only one that displays in Greek and by default, without having to click on a link and choose the default language.

Sir Seek

Posted On December 09, 2004, 06:27 PM
I have created a artificial intelligence search engine too (it thinks for you and offers keyword suggestion that you may never have thought of), check out <a href="http://www.SirSeek.com">">http://www.SirSeek.com"> http://www.SirSeek.com </a> where you see the working beta (I am not using our own index of websites YET, our search &amp; seek web fetching bot is still in development, will even retrieve results when none indexed to index). This will index general web sites and pages linked to by those sites, any topic of data will be accepted. I know its got too many affiliate ads (I need the money), I know it doesn't have billions of web pages indexed (YET), I know my internet access provider can be a little slow at time, I know I am not comparable (YET) but I am working on it. Any suggestions are appreciated so I can continue making the World's best search engine (will be better than the rest once complete).<p>Currently implememting an automatic spelling fixer for common misspellings (will auto fix misspelt words with correct ones or give you a choice to select alternate spellings).<p>I have been making specific topic search engines since 1998 (now moving to a caching, spidering and non topic engine design). Some examples are <a href="http://www.canuckster.com">">http://www.canuckster.com"> http://www.canuckster.com </a>, <a href="http://www.sparesomechange.com">">http://www.sparesomechange.com"> http://www.sparesomechange.com </a>, <a href="http://www.whak.com">">http://www.whak.com"> http://www.whak.com </a>, etc...

Queen Mary

Posted On December 10, 2004, 12:06 AM
I'm ready for the new engine to drop too!

Hidden Nook

Posted On December 10, 2004, 02:43 PM
It is good to see that you (and Yahoo!) finally have a blog. Now I can abandon the unofficial blogs of yours. A quick question though.

Although you have a great search feature (which may overtake Yahoo pretty soon) would you be able to integrate it with per say a shopping list search?

What I mean is that Google has Froogle and when you log into your Gmail account it and search you can create a wish list (which you can add to your blog). Would you be able to set up a similiar feature with hotmail as well?

Thanks!

Welcome

to the Microsoft Bing community

Remember, don't post your personal information!