By CodeJustin
via teddziuba.com
Published: Jul 07 2009 / 03:25
There's a question that comes up on Stack Overflow every couple of months: "How do I strip diacritic marks from Unicode characters?" Popular variants include "How do I remove special characters?" and "How do I convert Unicode to ASCII?", but the underlying motivation is the same: characters that don't have their own key on an American keyboard have no place in modern web software.
Comments
zynasis replied:
Haha, I feel this pain every day with web programming.
Unfortunately for me, we have to accommodate Unicode.
As for simply stripping the Unicode characters down to their closest ASCII representation: why not store the text in Unicode so that it displays properly, but let the searcher search on both the ASCII representation and the Unicode form? That seems like a more suitable solution.
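A minimal sketch of that suggestion, assuming Python and its standard `unicodedata` module. The names `fold_marks` and `search` are hypothetical and not from the original article; the stored text stays in Unicode and only the comparison key is folded. Letters without a decomposition (such as "ø") are left untouched by this approach.

```python
import unicodedata


def fold_marks(text: str) -> str:
    """Build a search key: decompose, drop combining marks, case-fold."""
    decomposed = unicodedata.normalize("NFKD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


def search(records: list[str], query: str) -> list[str]:
    """Return the original (unmodified) records that match the query.

    Assumption for this sketch: a pure-ASCII query matches loosely against
    the folded key, while a query containing non-ASCII characters must
    match the Unicode text itself.
    """
    if query.isascii():
        key = fold_marks(query)
        return [r for r in records if key in fold_marks(r)]
    exact = unicodedata.normalize("NFC", query).casefold()
    return [
        r for r in records
        if exact in unicodedata.normalize("NFC", r).casefold()
    ]


records = ["schön", "schon", "café"]
loose_hits = search(records, "schon")   # ASCII query: folded comparison
exact_hits = search(records, "schön")   # Unicode query: exact comparison
```

Here the ASCII query "schon" finds both stored words, while the query "schön" finds only the word with the umlaut, and every result is returned exactly as stored.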
solemah replied:
You homosexual nonsense,
drehorgelmann replied:
This is the most short-sighted article I've read in quite a while.
onno.solin.eu replied:
Try distinguishing between the German words "schön" and "schon" without using something like UTF.
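To make the point concrete, here is a small sketch, assuming Python 3 and only built-in string methods. The helper name `survives` is hypothetical. It checks whether a word comes back unchanged after being encoded and decoded with a given encoding.

```python
def survives(text: str, encoding: str) -> bool:
    """True if text can be encoded and decoded back unchanged."""
    try:
        return text.encode(encoding).decode(encoding) == text
    except UnicodeEncodeError:
        # The encoding has no representation for at least one character.
        return False


utf8_ok = survives("schön", "utf-8")    # UTF-8 can represent "ö"
ascii_ok = survives("schön", "ascii")   # ASCII cannot

# Dropping what ASCII cannot represent does not even yield "schon":
lossy = "schön".encode("ascii", errors="ignore").decode("ascii")
```

With plain ASCII the two words either collapse into one (if the umlaut is stripped to "o") or the first is mangled (if the character is simply dropped). A single-byte encoding such as Latin-1 would also cover this particular word, which is presumably what "something like UTF" allows for.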
ntpruett replied:
A DailyWTF waiting to happen...
Voters For This Link (9)
Voters Against This Link (30)