This post originated from an RSS feed registered with Java Buzz
by dion.
Original Post: xenophobia: give me options to cut to english
Feed Title: techno.blog(Dion)
Feed URL: http://feeds.feedburner.com/dion
Feed Description: blogging about life the universe and everything tech
I dabble in Spanish and French (read: I took them in school, awhile back), but let's face it, I need content in English for it to mean much to me, and I sure as hell have no clue with non-latin alphabets.
I like other cultures, but I have xenophobia on the web.
I want an english filter. I really need it on:
Email Filter: Mr. spam filter, if it isn't in english, it probably aint for me. I am getting tons of asian language spam at the moment, and it drives me nuts when it gets through a spam filter
RSS Filter: Google Blog search has an option for english. Technorati let's you tell it if your blog is in english. I want searches on technorati and friends to automatically filter out the non-english.
This matters even more for me as I often have searches running for some form of 'Ajax'. Most of that feed results have to do with the dutch soccer team. If dutch was ignored, most of the content would instantly disappear. It doesn't solve the semantic web, and people speaking about the team Ajax in english would get through (but that is another problem).
How would you decide if a message is in English? X% of the words have to match a dictionary of some sort? :)