Author

I am Joannes Vermorel, founder at Lokad. I am also an engineer from the Corps des Mines who initially graduated from the ENS.

I have been passionate about computer science, software matters and data mining for almost two decades.


Entries in spam (9)

Sunday
Jan 07, 2007

What's wrong with Google AdWords

I have been using AdWords for quite some time, having set up ad campaigns both for PeopleWords.com (freelance translation marketplace) and for Lokad.com (business time-series forecasting). Being both an AdWords consumer (yes, I do click on ads) and an AdWords publisher, I must say it’s quite surprising to see a company weighing a hundred billion dollars or more, with a business so crippled by long-standing intrinsic issues. Please note that this post is not a complaint against Google; they are quite good at what they are doing. My point is that the AdWords business model is really weak against adversarial behaviors.

Adversarial behaviors


First, let me clarify what I mean by “adversarial behavior” here. Put very simply, it’s a situation where an actor (a company or an individual) does not play by the rules (be they laws, ethics or traditions) and attempts to exploit the weaknesses of a system for its own profit, with no overall benefit for society as a whole. I would oppose this behavior to a “constructive behavior”, where an actor attempts to make a profit by building wealth, with a positive overall benefit for society.

Typical adversarial behaviors include stealing, cheating and lying. Most modern societies have laws to fight those behaviors. On the web, the situation gets more complicated. Not because the law does not apply to the internet (which is totally untrue), but because laws get really complicated to enforce. For example, if you’re caught stealing $10,000 from your neighbor, you’re in trouble. But what happens if you’re caught stealing $0.001 from each of 1,000,000 people spread over 30 countries around the world? Most probably nothing is going to happen, because nobody is going to complain over 0.1 cent, even though the total amounts to $1,000.

Such a situation is precisely what is causing spam. There are a lot of spammers; the impact of each spammer is low on the global scale, but all together the spammers are a real pain for everybody. E-mail, as it was designed several decades ago, is extremely weak against adversarial behaviors. Yet, although the problem is very well known, it’s still quite easy to get it wrong; see the Coding Horror story about trackbacks.

The Google AdWords case


That was quite a long introduction, but the point I will defend here is that AdWords is almost as broken as the e-mail system when it comes to fighting adversarial behaviors. Let’s review what I get for “open source ecommerce”. I have pasted below the results obtained on 2007-01-07.

Sponsored Links

Ecommerce Open Source
Open an ecommerce website or an
eshop. Try free!
storefront.homestead.com

Open Source Ecommerce
1000’s of Great Programmers Bidding
Fast & Simple Project Outsourcing.
www.GetACoder.com

Open Source Ecommerce
Find the Top Links on
Open Source Ecommerce
www.linx-best.com

Open Source Ecommerce
Open Source Ecommerce?
Get More Helpful Free Info Here.
yoursources.info


Ok, what’s wrong with this listing? Actually, the first result is just a blatant lie: Homestead.com does not feature any open source solution at all (just check their website if you do not believe me). Try to do that on a national TV show, and I am pretty sure that you would get a class action the next morning. The second ad, for GetACoder.com, is also a lie, but I would say that it’s a lesser lie, since it can be argued that nothing prevents you from finding freelancers to customize an open source ecommerce application. Google can’t be accused of willingly carrying lies; certainly they do not. Yet, they can discard their responsibility. Both Homestead.com and GetACoder.com are lying here, but their behavior is not totally adversarial: they both provide added-value solutions for their customers.

The next two results are more subtle but also more perverse. Those two links, Linx-best.com and Yoursources.info, are not real websites but just massive aggregations of advertising links. Yes, they do not feature any content; they just gather a huge number of sponsored links into a self-generated web directory (I suspect the directory architecture to be nothing more than the Dmoz.org directory itself). Whatever link you click on those websites is a sponsored link. I guess that the business idea is to buy cheap ads from Google (notice that they are not the top results) and to sell random clicks from lost web visitors who happen to land on their website. Those trash websites represent a very significant percentage of the Google AdWords results (I would bet on 20%).

Even Google is guilty

Just for fun, let’s see what we get for “make money fast” on Google (results obtained on 2007-01-05). Note that the results are split into two sections, the first being displayed before the actual search results, the second being displayed on the side, to the right of the search results.

Make Money Fast
www.google.com/adsense Have a website or blog? Earn revenue with Google AdSense.

Europe Trade LTD.
www.eutradeltd.com/ 0.3% up to 4.5 % daily. Risk Free Short and Long Term Investments

 

Sponsored Links

www.$3500weekly.com
Learn to be a super affiliate
Better Quit yr day job. It works!
www.3500weekly.com

Want to Make a Fortune?
Make $55,000 to 1,000,000 a Year!
Guaranteed success! - $49
www.club-fortune.com

Fast money work at home
Ultimate wealth package
try your luck and make1000’s
ragstoriches.com

NETWORK MARKETERS WANTED
JOIN NOW FOR ENDLESS OPPORTUNITIES
GROUND FLOOR COMPANY
www.trilogyrx.com/essentials

Sharemoney Investing
up to 385% return for 7 days
real short plan investment
sharemoney.net

Make Money Online
Secrets To Make Money Online
Revealed In Ebook. Free Download.
TurboChargedInternetMarketing.com

Make Money Online Today?
You can once you know this secret
Free audio shows how to get started
www.secretofmakingmoney.com

Well, I am quite astonished that Google runs an ad titled “Make Money Fast”. To the people of the marketing department at Google Inc.: “Under-promise and over-deliver!” and “Don’t be evil!”, does anybody remember? Frankly, I will not even bother to review those links; they are all crap.

As a conclusion, please don’t get me wrong: this post is not Google bashing. Any company that goes in this direction (an online automated ad marketplace) will encounter the exact same problems. The “naive” technical solution (i.e. AdWords in its present state) leads directly to this situation. Also, no laws and no regulations will solve this situation. IANAL, but in France, I am pretty sure that there are already enough laws to condemn every advertiser listed in this post.

Monday
May 01, 2006

Additional goodies from the blog spammers

An interesting thing about running a web application is that people never cease to surprise you. I have already discussed the behavior of the scammers within PeopleWords. Now, I am encountering a new kind of annoying people: the blog spammers.

Given that PeopleWords.com has no blog, there is no reason for blog spammers to get interested in PeopleWords, right?

Wrong, PeopleWords has no blog, but blog spammers don't care. What blog spammers are detecting is the link toward an XML feed.

<link rel="alternate" type="application/atom+xml"
title="PeopleWords, JobBot" href="AlertAtomFeed.aspx" />

This XML feed is not carrying blog posts, but PeopleWords translation jobs. As a side effect, the zombie army[*] commanded by spammers considers that any website providing an XML feed is a blog, and therefore tries to fill all the available forms with various crappy content.
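To illustrate how little it takes for a bot to decide that a page "is a blog", here is a minimal sketch of feed autodiscovery using only the Python standard library. The names `FeedLinkFinder` and `find_feeds` are hypothetical, and I have no knowledge of what the spammers' software actually does; this only shows that the `<link>` tag above is trivially detectable.

```python
from html.parser import HTMLParser

# MIME types commonly used to advertise a feed (assumption: Atom and RSS only).
FEED_TYPES = {"application/atom+xml", "application/rss+xml"}


class FeedLinkFinder(HTMLParser):
    """Collects the href of every <link rel="alternate"> that advertises a feed."""

    def __init__(self):
        super().__init__()
        self.feed_urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attributes = dict(attrs)
        rel = (attributes.get("rel") or "").lower().split()
        mime = (attributes.get("type") or "").lower()
        if "alternate" in rel and mime in FEED_TYPES and attributes.get("href"):
            self.feed_urls.append(attributes["href"])


def find_feeds(html):
    """Return the feed URLs advertised in the given HTML text."""
    finder = FeedLinkFinder()
    finder.feed(html)
    finder.close()
    return finder.feed_urls


page = ('<link rel="alternate" type="application/atom+xml" '
        'title="PeopleWords, JobBot" href="AlertAtomFeed.aspx" />')
print(find_feeds(page))  # ['AlertAtomFeed.aspx']
```

Nothing in the tag says whether the feed carries blog posts or translation jobs, which is consistent with the bots treating every feed-bearing website as a blog.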

On PeopleWords, the only form accessible without user registration is actually the user login form. As a consequence, I get a dozen blog spamming attempts every day, generating errors that are caught in the server logs.

[*] The PeopleWords server logs indicate that the blog spam attempts are coming from a large number of different IPs. This would indicate that the blog spammers are using a large number of regular (but infected) machines over the internet.

Thursday
Apr 20, 2006

Scammers going global leveraging freelance services

In my previous post, I discussed the various trust issues that are encountered when dealing with an online community, including the sabotage of translation jobs by scammers. My point was that the risks in freelance translation jobs are much higher on the customer side than on the translator side. The main argument was that it's hard to turn a small translation job into cash. Well, scammers have more imagination than I have, and it seems that this argument is partially wrong.

I have just recently banned a few translation jobs on PeopleWords. Basically, a scammer was trying to use freelancers to translate typical scam letters. Indeed, the main topic of the letter (the letter content was put on public display by the scammer) was a large amount of money somehow blocked in a country of Central Africa. It fitted the typical scam patterns perfectly (see 419Eater for an introduction to the common scam practices and how to fight them).

Moral of the story: scammers are fast movers; like any ambitious company, they want to go global. Freelance translators will be on the front line of this move.

Tuesday
Apr 04, 2006

Building a safe community for online translation works

This article focuses on the various issues related to online trust for freelance translation jobs and the various solutions adopted by freelance websites in this domain. As for all online activities, trust is a difficult yet critical element to obtain. My personal experience in this domain comes from the management of the PeopleWords website.

The naive approach: the rating system

Most freelance websites provide a rating system for all users (PeopleWords is no exception, see [1]). After each translation job, the customer can rate the translator and vice versa. In the long run, competent translators and reliable customers accumulate a large number of positive evaluations. Trust is built on those evaluations. Yet the downside of the rating system is the initialization phase. This part should not be neglected, because every customer or translator has to go through that phase in the first place (no matter how "established" the website may be). Most of the websites that I am aware of don't do much besides their rating system. For example, since PeopleWords is just starting at the moment, nobody has positive evaluations (well, I do have positive evaluations, see what the translators say about me, but being the website operator myself, it does not count for much). I will start with a short analysis of the risks involved for unrated translators or customers. The analysis is followed by an evaluation of a few approaches that can be used to solve those issues.

Risk analysis on the translator side

For example, on PeopleWords, a translator is paid only upon delivery of the translated documents. Therefore there is a risk for the translator of accepting a job and not getting paid in the end. The important question is: what is the risk of accepting a job from an unrated customer? My personal opinion on that matter is: not much. I am not saying that dishonest customers do not exist; I am just saying that I believe them to be quite infrequent, probably not more than a few percent. Why?

  • It's hard, as a customer, not to reveal your identity through the documents to be translated. In this respect, the situation is not symmetric between customers and translators. The online translator can easily disguise his identity, but the task is much harder for the online customer. As a crook, your identity is certainly the last thing you would like to reveal to your victims.

  • Almost no translation job comes as a one-shot. If you need a translator now, then, most probably, you will need a translator again in the near future. Therefore it is more interesting, even on a purely financial basis, to have a stable customer/translator relationship rather than trying to crook random translators again and again. Also, the web is not a very forgiving place: if you crook somebody, this person is probably not going to sue you (at least not for $100), but this person can hurt your reputation by spreading the word through forums or blogs. Additionally, if the job is a large and obvious one-shot, then the translator can ask for the payment to be split into installments as the work goes on.

  • As a customer, what you obtain from the translator has no financial value for anybody but yourself or your company. This point should not be overlooked. Indeed, if I refuse to pay an online vendor on eBay after receiving an ordered product, I can actually re-sell this product and obtain some cash from it. But in case of a translation job, I have no simple way to turn a translated document into cash.

As a side argument, I would add that online freelancers are well armed against online fraud. Professional freelancers are usually quite experienced in the art of dealing with "remote" people. Just as experienced web users easily spot phishing attacks, experienced freelancers can probably identify most online scams.

Risk analysis on the customer side

Typical freelance scam: Through many years experience, I translate all documents in many languages. All domains. Lowest prices. No delay. Please contact me at vishnu123@gmail.com.

As a customer, since you pay only upon delivery of the documents, there are no risks, right? Wrong, there are a lot of risks. Worse, the customer may not even realize that he has been crooked before the document is actually published in one form or another.

  • As a customer, you have no way to check the quality (or even the content) of the translated documents. How can I check that the translator didn't just use some software to produce an abysmal translation of my documents? This point is very important, because it means that crooks can actually turn translation jobs into cash through sabotage.

  • The costs incurred through missed deadlines (or through poor translation quality) can also vastly exceed the translation costs. In other words, if the customer does not get the translated documents by a given deadline (or if the quality is poor), the delay (or the bad impact) can cost him much more than the actual translator fees.

Translation sabotage is really a major issue. On PeopleWords, I have found that roughly 20% of the translation offers are just scams. This problem is far from being specific to PeopleWords. On most of the websites that I have tried, the scam rate was, in my experience, always roughly the same (actually, I would say, the more "established" the website, the higher the scam rate). The fact that customers are, on average, much less experienced than freelancers in dealing with scammers only aggravates the problem.

Solution n°1 that does not work: Screening job offers for pre-approval

Most of the freelance websites actually screen all offers for a pre-approval by their staff before publishing the offers. One obvious drawback of such an approach is the additional business day of delay to get a job offer published. The second drawback is that such a screening process requires human resources that are paid, in the end, by the customers.

There are basically two kinds of abusive job offers. The first kind is just plain classical spam (the content of the offer is simply totally irrelevant to any translation job). This kind of abuse is annoying but mostly harmless for the freelance translators. The second kind of abuse is the case of the never-paying customers who continuously create new accounts under different names to crook freelance translators on a regular basis.

Unfortunately, the second kind of abuse is never going to be filtered because, no matter how dedicated the website operators are, the offer "looks" right (there is nothing wrong with the actual offer content).

My opinion is that many websites rely on such a pre-approval step because it gives them a (false) sensation of control. But clearly this practice does not add any value either for the customer (whose job offers are delayed) or for the translators (because real abuse won't be filtered).

Solution n°2 that does not work: Escrow account

Some websites offer an escrow account where the customer can put the money at the beginning of the job. When the job is done, the customer releases the amount frozen in the escrow account. In case of disagreement, a neutral party is called in to "solve" the case. The important question is: does the escrow account reduce the overall risks or costs? My conclusions are actually quite counter-intuitive: I believe that the escrow account increases the risks and the costs for both customers and translators.

But if the escrow account provides no advantage for either the customer or the freelancer, why does it even exist? Why would web designers ever have bothered to implement such a system if that were the case?

My opinion (discussed below) is that customers and freelancers have no interest in relying on escrow accounts; but there is one person who actually has a strong financial interest in such a system: the website operator himself. Indeed, the "frozen" money in the escrow account is not frozen for everyone; the money is actually earning interest for the profit of the website operator. Moreover, that money provides a large amount of cash to run the website, to the detriment of both customers and translators. When there is no disagreement, the escrow account is clearly just a financial overhead for both of them.

Let us now see what happens in case of disagreement. A disagreement can only occur if the customer refuses to release the money from the escrow account. Why would the customer do that? Case A: the customer is a crook and wants his money back. Case B: the translator is a crook and the translation does not match the quality requirements. In either case, the customer, honest or not, will claim that the translated documents do not meet some "quality" requirements. For the neutral party who is supposed to "solve" the case, it's going to be tough, because unless you have 200 translators covering all technical domains in all languages, you have no (reliable) way to tell whether the customer is honest or not.

Worse, the neutral party has no interest in solving the case (well, no interest in solving it as fast as possible). The longer it takes to solve the case, the more profitable it is, because the money is earning interest in the neutral party's bank account. In the end, what happens? Since I tend to pay freelance translators when I require their services, I have never experienced the "disagreement resolution" part of the system myself; therefore the following is just a wild guess of my own (remember that we are considering people with no previous ratings). In case of disagreement, the website operator will first wait, and then he will take a random decision (like splitting the amount of money in two and sending 50% back to the customer). Why a random decision? Because, as a website operator, you cannot financially afford a "wise" decision for each disagreement. It is not possible to pay a reliable offline translator to evaluate disagreements when your profit margin is lower than 5%. As a consequence, the only option available to the website operator is to take random (or quasi-random) decisions.

Filtering abusive translators: a solution that might work

Abusive translation offers are the Number One issue for customers and translators alike. Why? Because abusive translators (i.e. scammers) hurt the freelance system as a whole. Being myself a customer of freelance translation services, I always have a feeling of walking through a minefield when I browse freelance translator offers. As I said, roughly 20% of the offers definitely look very suspicious. As a direct consequence, I would guess that most customers will simply be very reluctant to risk a translation job over the web because of the "minefield" answers that they get.

The situation is not desperate though. The good news is that liars tend to be greedy and lazy too. When I started to browse the system logs of PeopleWords (which include the times, the languages and the number of offers made by each translator), I realized that scammers were basically all following the same pattern: they answer all open translation job offers in less than 10 minutes. Languages do not matter (scammers can do anything ...), specialized technical documents are never an issue, and no matter the amount of work, it can always be done in 48h. As I said, scammers are too greedy to resist the urge of answering ALL open translation job offers, and too lazy to create accounts with ad hoc profiles that would actually match the translation jobs.
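The pattern described above can be sketched as a simple log filter. This is only an illustration under stated assumptions, not the actual PeopleWords code: the `Offer` record, the `suspicious_translators` function and the `min_open_jobs` guard are all hypothetical, and the only figure taken from the logs is the 10-minute delay.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class Offer:
    """One translator offer, as it could be read from the logs (hypothetical layout)."""
    translator: str
    job_id: int
    job_posted_at: datetime
    offered_at: datetime


def suspicious_translators(offers, open_job_ids,
                           max_delay=timedelta(minutes=10), min_open_jobs=3):
    """Return the translators who answered EVERY open job, each within max_delay."""
    open_jobs = set(open_job_ids)
    if len(open_jobs) < min_open_jobs:
        # Too few open jobs: answering all of them quickly proves nothing.
        return set()
    fast_answers = {}  # translator -> ids of the open jobs answered quickly
    for offer in offers:
        if offer.job_id not in open_jobs:
            continue
        if offer.offered_at - offer.job_posted_at <= max_delay:
            fast_answers.setdefault(offer.translator, set()).add(offer.job_id)
    return {name for name, jobs in fast_answers.items() if jobs == open_jobs}


t0 = datetime(2006, 4, 4, 9, 0)
offers = [
    Offer("fast_account", 1, t0, t0 + timedelta(minutes=2)),
    Offer("fast_account", 2, t0, t0 + timedelta(minutes=3)),
    Offer("fast_account", 3, t0, t0 + timedelta(minutes=4)),
    Offer("careful_account", 2, t0, t0 + timedelta(hours=3)),
]
print(suspicious_translators(offers, open_job_ids=[1, 2, 3]))  # {'fast_account'}
```

A real filter would probably also compare the languages of each job with the translator profile; the sketch leaves that out to stay short.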

What should be done with those scammers? Banning them is a bad idea. Indeed, once banned, the scammer will just come back under a different name, and we are back to the starting point. Therefore, in PeopleWords, I have opted for a more twisted solution: scammers are just shadowed. Since the scammer is not banned, he can continue to browse all translation jobs, and he can continue to post offers too. The only difference is that offers sent through a shadowed account are never visible from the customer side... This system is not perfect: a scammer can create a customer account, log in, post a job, log out, log in again, post an offer, log out, log into the customer account again, and check whether his offer is visible. But it's a time-consuming process, and scammers are already too lazy to provide a user profile that would simply match the translation jobs anyway.
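The shadowing rule boils down to a visibility filter. Here is a minimal sketch, assuming offers are stored as `(translator, job_id)` pairs and that shadowed accounts are kept in a plain set; the function name `offers_visible_to` and the account names are hypothetical.

```python
def offers_visible_to(viewer, offers, shadowed_translators):
    """Return the offers that the given viewer is allowed to see.

    Offers from shadowed translators are hidden from everybody except
    their own author, so a shadowed scammer notices nothing unusual.
    """
    return [
        (translator, job_id)
        for translator, job_id in offers
        if translator not in shadowed_translators or translator == viewer
    ]


offers = [("honest_translator", 7), ("scammer", 7)]
shadowed = {"scammer"}

print(offers_visible_to("customer", offers, shadowed))  # [('honest_translator', 7)]
print(offers_visible_to("scammer", offers, shadowed))   # both offers
```

The weakness mentioned above is visible here too: the scammer only discovers the filter by viewing the job through a second, unrelated account.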

I believe that this approach has a large added value for both the customers and the translators. Indeed, those scams are obvious only from the website operator's viewpoint (the operator can easily spot translators making inconsistent offers), but would probably be far from obvious for customers, who do not have access to the website logs.

Filtering abusive customers: a solution that might work

As I discussed above, the most "dangerous" kind of abusive customers are not the spammers (those are just annoying), but the never-paying customers seeking "free" translations, crooking a different translator each time. Since no particular pattern (at least no obvious pattern) can be used to distinguish such a customer, it is not possible to "catch" them before they actually crook at least one translator. Moreover, since the customer ends up with a negative evaluation after crooking a translator, he never uses the same account twice. Ratings are just useless to deal with those people.

The approach used by PeopleWords consists in relying on the only people who are actually able to detect the fraud: the translators themselves. In order to leverage this "distributed" knowledge, PeopleWords includes a system dedicated to abuse reports. Freelancers can actually help the website to prevent such a customer from crooking other freelancers.

What should be done with those crooks? Again, banning them is a bad idea because they will come back. I am still hesitating between several options to deal with such users. The option currently in use consists in shadowing their job offers. A shadowed job offer is no longer visible to the translators. The only issue is that those customers will quickly realize that something is wrong if they do not get any offers, because when you propose a freelance translation job, you always get offers. Therefore, as an additional option, I am thinking of using the scammers list, but for the reverse purpose. Instead of displaying the legit translator offers, only the scam offers are displayed to such customers. This option is so twisted that I am not sure I will ever actually use it, but using scammers to fight back against never-paying customers is still a very interesting perspective.
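Both options can be sketched as two small visibility rules. This is my reading of the idea rather than working PeopleWords code: the function names are hypothetical, and the sketch assumes that a shadowed customer's job stays visible to shadowed translators, since otherwise there would be no scam offers to display.

```python
def job_visible_to(translator, job_customer, shadowed_customers, shadowed_translators):
    """A job from a shadowed customer is hidden from legit translators.

    Assumption: it remains visible to shadowed translators, so that
    scam offers keep flowing toward the never-paying customer.
    """
    if job_customer not in shadowed_customers:
        return True
    return translator in shadowed_translators


def offer_shown_to_customer(customer, offer_translator,
                            shadowed_customers, shadowed_translators):
    """Regular customers see only legit offers; shadowed customers see only scam offers."""
    is_scam_offer = offer_translator in shadowed_translators
    if customer in shadowed_customers:
        return is_scam_offer
    return not is_scam_offer


bad_customers = {"never_payer"}
bad_translators = {"scammer"}

print(job_visible_to("honest_translator", "never_payer", bad_customers, bad_translators))  # False
print(offer_shown_to_customer("never_payer", "scammer", bad_customers, bad_translators))   # True
```

With these two rules, the never-paying customer still receives offers, so nothing looks wrong from his side, but every offer he can accept comes from a shadowed account.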

Notes

[1] PeopleWords only provides binary evaluations: I am satisfied or I am not satisfied (plus an additional but optional comment). This system is very rough compared to most of the other websites. For example, Guru.com provides no fewer than 10,000 possible rating combinations after each job (4 evaluation criteria, each ranging from 1 to 10, indeed give 10,000 combinations). If PeopleWords has such a simple system, it is not due to the lack of funds (although PeopleWords is dramatically under-funded) but to a design choice. If tomorrow PeopleWords provided 20 evaluation criteria, each of them ranging from 1 to 100, would the PeopleWords ratings become 10 times more accurate than Guru's ratings? Certainly not. As a customer, I have no clear idea of the "quality" of the translation job anyway. I can use the MS Word spell checker to see if the translator has left many obvious spelling mistakes, but I can't really do more. As a translator, there is usually not much to say about the customer either. My personal opinion is that providing more complicated rating systems actually decreases the quality of the ratings. Indeed, if the system is not totally idiot-proof, then users will start to make mistakes simply because they do not understand the system. For example, if you can evaluate somebody on a scale ranging from 1 to 10, what is the meaning of 10? In the German educational system, 1 is the best grade. In the French (primary) educational system, 10 is the best grade. Some other countries, like the USA, prefer the use of letters. Bottom line: do not try to be smart, you will just confuse your users.
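For the record, the counting behind those figures is a plain power: each criterion multiplies the number of combinations by the number of possible grades. The variable names below are only illustrative.

```python
# 4 criteria, each graded from 1 to 10 (the Guru.com case quoted above).
guru_combinations = 10 ** 4

# 20 criteria, each graded from 1 to 100 (the hypothetical PeopleWords case).
hypothetical_combinations = 100 ** 20

print(guru_combinations)                      # 10000
print(hypothetical_combinations == 10 ** 40)  # True
```

The number of combinations explodes, but the customer's ability to judge a translation stays exactly where it was.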
