Link Details

Link 199722 thumbnail
User 410289 avatar

By CodeJustin
via teddziuba.com
Published: Jul 07 2009 / 03:25

There's a question that comes up on Stack Overflow every couple of months: "How do I strip diacritic marks from Unicode characters?". Popular variants include "How do I remove special characters" and "How do I convert Unicode to ASCII", but the underlying motivation is the same: characters that don't have their own key on an American keyboard have no place in modern web software.
  • 9
  • 30
  • 1384
  • 656

Comments

Add your comment
User 211643 avatar

zynasis replied ago:

1 votes Vote down Vote up Reply

haha, i feel this pain every day with web programming
unfortunately for me, we have to accommodate for unicode

as for simply stripping the unicode chars down to their appropriate ascii representation, why not store in unicode so that it display's properally, but then allow the searcher to search on both ascii rep and unicode. that seems like a more suitable solution.

User 437079 avatar

solemah replied ago:

5 votes Vote down Vote up Reply

आप होमोसेक्सुअल बकवास,

User 296128 avatar

drehorgelmann replied ago:

1 votes Vote down Vote up Reply

This is the most short sighted article I've read in quite a while.

User 335891 avatar

onno.solin.eu replied ago:

0 votes Vote down Vote up Reply

Try distinguishing between the German words "schön" and "schon" without using something like UTF.

User 149937 avatar

ntpruett replied ago:

0 votes Vote down Vote up Reply

A DailyWTF waiting to happen...

Add your comment


Html tags not supported. Reply is editable for 5 minutes. Use [code lang="java|ruby|sql|css|xml"][/code] to post code snippets.