E-Threats

Translating Spamglish: Are All Spammers Grammatically-Challenged?

Spammers send almost 300 billion messages a day combined, mostly in English. That’s more than 90 percent of the world’s e-mail traffic. It costs billions of dollars a year in lost productivity, annoys hundreds of millions of people, and seriously harms some of those who fall into the trap.

It also means that the vast majority of the world’s e-mail is a babbling, eye-crossing, head-scratching mess of near-random vowels, consonants, exclamation marks, happy faces, misplaced commas and silly synonyms.

Are all spammers in serious need of grammar lessons? Did their parents not tell them about the spell check function? Or is it something sneakier?

Bitdefender writer Bianca Dima decided to find out. She spent most of the week talking to Bitdefender spam fighters. And reading spam messages. She read so much spam, in fact, that her story drove our spell checker mad and she lost her keyboard privileges until she re-reads the collected works of Shakespeare. 

Here’s what she found:

The spam that has invaded our day-to-day life is inhuman – literally. That is, it’s computer-generated. Computers can be silly writers.

To evade spam filters, spam is usually based on templates that change words automatically to random synonyms. This is meant to make the message look unique. A single template may spread in 100 “unique” messages by replacing letters with lookalike numerals or characters (“1nterest r@te”) or by adding letters in the middle of the word (“Some words about hea vx lth!”).

Don’t think anti-spam software specialists didn’t see that coming though. Software engineers have designed tools that screen text for words trying to pass filters. The complex dictionary checking looks for rude words and isn’t fooled easily by various spam tricks, such as the replacement of letters. Another anti-spam solution may use regular expressions to search and manipulate text based on patterns. With the matching, spam messages can be identified. Still, some weird synonyms may beat detection – and common sense.

Security specialists say most “colorful” messages fall into the dating category, which represents 8% of total spam. Besides the old detected spam, new messages try to push their luck and enter the market. Research turns up four to five major new spam campaigns a day, with thousands of variable messages spreading like bad-tasting mushrooms into the wild.

“Hello my dear”

Dating spam is the crème de la crème of grammar mistakes, many made on purpose, according to security engineers.

Let’s take for instance the old Russian-bride scam spam. What’s more authentic than a Russian woman excusing herself for “bad English”? “Hello my dear – I’ve had a web camera with good quality – sign up on my page – and I’ll do what you want for you, long time not seen you in a web camera. Why not send me letters? I am very sad one in Russia, write to me necessarily, it is waiting for your Stella.”

The structure of this popular spam is simple: a three to five line html message (not plain text) that includes tempting words like “pictures” or “webcam”, and one URL (subdomain.domain.tld). Some of the Russian bride spam also includes a message asking money for the bride-to-be’s transport.

The bride of Tarzan?

The grammar lesson spammers all seemed to skip is the one on “The Verb.” On purpose or not, many of the daily messages commit serious disagreement mistakes or don’t figure out what preposition goes with which verb.

“I want to be with you again, write me or call, come with me to the page – there all my contact details,” reads one of the messages. Another proudly announced: “We are open new online store!”

“My name is Klara Belova. Yes, and besides, I’m from another country. I am unmarried! Let’s?,” seductively asked another.

The verb usage of some of the available “brides” in the spam would seem to scare away any suitor but Tarzan.

“How do you do. I’m Ksenia I attractive. Give write,” read one of the spam messages. “Hey, lets associate this modern movement? You will surprise”, and “I really want you like,” said others.

According to security experts, such common mistakes are either made intentionally to increase credibility, or are automatically generated. When you randomly choose a verb from a set of 10 synonyms, the right preposition is just a lucky guess.

Spammers need grammar lessons for the articles as well. “I have ideas for an marriage”, suggested one of the Russian “brides”. “I am a optimistic and “right” women,” proudly stated another.

Nouns and pronouns are also a great source of stress for spammers:

“This informations is available to you after a brief and NO COST registration,” one message read. “If you are seeking their “second half”, then perhaps you are interested in viewing my profile and photos,” read another.

Not only dating spams have problems with English grammar. Banking messages that invade our inbox aren’t as smart as they think they are:

“Note: In a case were you think you are not familiar with the bellow banking information as presented by your legal representative and agent, we advice you send to us now without any delay your detail in full containing your Banking information and other personal information so we dont transfer to wrong account,” read one of the banking spams. Other messages try to lure users into clicking on vicious links on behalf of legitimate e-mail services: “Dear User, You Have reached the maximum number of login with your account. Clicks the button below to reactivate your account”.

Other grammatically-challenged messages come as comments to blog posts and articles: “I am please to find this post very useful for me, as it contains lot of information. I always prefer to read the quality content and this thing I found in you post. Thanks for your post.”

For the online masochist, here’s a little reader’s guide. At almost 300 billion spam messages a day, you need all the help you can get.

Spamglish Dictionary

Made-up or misspelled words

Trying to pass filters, spammers misspell or invent words.

advise “I’ll give some advise how could get it on for a day or two.”

to concider – “I concider myself to be a very positive person.”

count – “If you are looking for a bride, then possibly you are interested in viewing my profile and count.”

Bizarre or bombastic synonyms

Long live the English Dictionary of Synonyms! Automated templates use plenty of matching words to escape anti-spam software. Some work, even if they range from the antiquated to the unusual to the pompous and beyond.  Here are some of the more common words that may seem a tad out of place or overused in spam.

conversationalist = skilled at conversation (“I am a very interesting conversationalist, and you will not be bored.”)

to enkindle = to turn on (“Hi, dear. I believe that a man reserves his true and deepest love not for the species of woman in whose company he finds himself electrified and enkindled, but for that one in whose company he may feel tenderly drowsy.”)

to embody = to incarnate (“I’ll never forget because you are embodied in him all my secret desires! Waiting, write me!”)

familiarization = viewing (“If you are seeking for a wife, then perhaps you are interested in familiarization my info and photos.”)

frugal = temperate

good = handsome (“Hello my good – I had a dream and you were in it”)

incidentally = by coincidence (“Incidentally, I have a webcam and we can chat right now”)

methodically = in a systematic order

mountainous = big

practically = in a practical manner  (“Hello, dear.  I am a big thinker even though I work methodically and practically by taking my time to achieve what I set out to do.”)

to procure = to give (“Hi handsome – you were so good in bed with me – I want to again make love to you – no 1 man I had never done so well as you have done – I’ll procure very great pleasure – my body was looking at your pictures just on fire!”)

resourceful = creative (“I am frugal, resourceful and hard working.”)

solitariness = solitude


And here are some words that are downright weird.

ph ql ysi mi cally = physically

L py ev mb itr rg a or Vi fry a ngk g yfv ra =  Viagra
s whz ex vct ual = sexual

(“If you can not do it ph ql ysi mi cally, use some ataraxics. For example L py ev mb itr rg a or Vi fry a ngk g yfv ra… In my s whz ex vct ual practice it helped me not once…”)

s gxg ite =  site (“Visit our s gxg site…”)

Enjoy reading! At 300 billion spam messages a day, you probably have a lot of catching up to do.

All product and company names mentioned herein are for identification purposes only and are the property of, and may be trademarks of, their respective owners.

This article is based on the technical information provided courtesy of Bitdefender AntiSpam Lab.

 

 

About the author

Bianca STANESCU

Bianca Stanescu, the fiercest warrior princess in the Bitdefender news palace, is a down-to-earth journalist, who's always on to a cybertrendy story. She's the industry news guru, who'll always keep a close eye on the AV movers and shakers and report their deeds from a fresh new perspective. Proud mother of one, she covers parental control topics, with a view to valiantly cutting a safe path for children through the Internet thicket. She likes to let words and facts speak for themselves.

8 Comments

Click here to post a comment