This is why you should let someone experienced do surveys. Or, well, not.

July 1st, 2008

Surveys are complex. There is a lot that you can do wrong. Actually, looking at many surveys around, there is a lot that is done wrong. Time after time.

Sometime it’s the big stuff. Sometimes small.

Sometimes the surveys are not done to get answers, but to show what you want the answers to be, by skewing the questions. That’s bad for academic research, but very popular in politics.

And sometimes you really do want answers. Which is hard to do right. Ask the wrong question, ask them in the wrong way, or give the wrong options for answers, and the results may not say what you think they do, or may be impossible to analyse properly. That’s why there are those who deal professionally with surveys, know the theory (and, hopefully, statistics), have done it many times before, and should be able to avoid most of the mistakes.

They usually don’t do the really big mistakes.

A very long birth yearThey do, however, often do small, or really incomprehensible, mistakes. I guess finding a professional can be a problem as well.

Take, for example, a survey currently being run by iPerceptions , for InforWorld and ComputerWorld.

Both these clients are one client, belonging to the same company. And they do very similar things. So the surveys are practically identical (I did the ComputerWorld one originally, and just now noticed they also run it for InfoWorld. I progressed a little bit, and they’re the same questions in the same order with the same possible answers. Just the name of the company in the survey changed).

This survey has some strange points.

One main problem was that they apparently forgot that some questions may not apply. There was one (maybe 2-3) question where they did have an option to indicate the question is not relevant, or that I don’t know or can’t judge. For all the rest, and there were many of them, I was asked to rank the sites on many criteria, some of which really didn’t interest me and I didn’t know. But the options were just to rank.

Assuming that I’m not the only person who goes through a site that has many different sections, and doesn’t know (or use) all sections, this means that the answers they receive are worthless. What do you pick when you don’t know, or can’t rank? Do you say that it was excellent, since you don’t know it’s bad, causing a potential problematic part to appear good? Do you rank it as very lousy, since it didn’t do anything for you, thereby causing a potentially excellent service to appear bad? Do you rank it in the middle, trying not to judge either way, but still making anything really good, or really bad, seem more average and undeserving of attention?

This is why these things usually contain an answer to state that this question isn’t relevant for you, and you don’t have a real answer for it. But here, no. Good luck to them in the later analysis.

A second point is much less severe, but far more amusing and baffling.

In the personal details, at some point they asked for year of birth. And provided a field to type the year number in. With a maximum of 500 characters. Yes, you read that right, 500 characters to answer the question “In what year where you born?”. They also made the text box large enough to type a small essay in.

What sort of an answer where they expecting? Hmm… Maybe…

That’s a tough question, there. I don’t know what year I was born in. It was a cold, harsh, dark year for my family. My parents were moving a lot. I don’t have no birth certificate, ’cause they were always running from them cops. Who need a stinking certificate? My mom knew I was born. And I had a tough childhood, so people tell me I look 40, but I bet I’m younger. I don’t remember much from those years, really can’t say. Is this important? If it is, I can try remembering, just let me know. Yes?

That’s not a true story (for me, anyway), but it does have exactly 500 characters. For comparison, writing something like 2008, or 1912, takes 4 characters (as does “NOYB“). They could have even been generous, cover all their bases, and give 5-6 characters (You know, for time travellers, or for really really old people). Maybe 3 digits more, for a space followed by “AD”, in case they’re actually worried? A little longer still, so they can get “year of the dragon”? Why the heck 500 characters?

You want to know what’s even more strange? This is in the third part of the survey. In the first part they already asked most personal questions (gender, business, people working in same company, etc), including one about age. But there they just gave several age groups (e.g. 24-35 or something like that), so I suppose they really needed the birth year too.

Then again, if they ask for birth year, why bother asking for age? Odd, that.

Well, I was in a nice mood (this was more amusing than annoying), so I decided I’ll let them know. At the last page of the survey they had a link to provide feedback. I was actually impressed with that, since sometimes I have comments, and nobody official to tell them to. This was nice. Or so I thought.

It was a mailto style link, that contained an email address, and a prepared subject line with the code/number of this survey (Good idea, so they won’t have to wonder what survey it was, and I won’t have to try and describe it too much to ensure they identify it).

There was just one main problem with it. The email address they provided? It wasn’t correct.

I sent a message. I got back a bounce.

<info1@iperceptions.com>:
208.65.144.12 does not like recipient.
Remote host said: 553 mailbox info1@iperceptions.com is restricted (Mode: normal)
Giving up on 208.65.144.12.

Impressive. This is a company that specializes in running surveys. In getting feedback from people for their clients. Except they can’t seem to arrange to get feedback for themselves.

It seems like a typo. The “1″ in the email address does not belong. I checked later on their site, and this address is listed there, without the “1″, in their contact page. But, well, by that time I was out of the helpful mood, and into the annoyed and unimpressed one. Which I think is perfectly understandable.

If you can’t handle bidirectional text, don’t show bidirectional text

July 1st, 2008

Some ad companies think they can get better results by targeting the ads to the viewers. Some strategies are matching the ad with the viewed page, while others try to target the audience in different ways, based on location or language. Which makes sense.

One problem is finding the country of origin of the viewer. Most companies seem to have solved that by pretty accurate geotargeting. Though some, of course, are still stumbling in the dark. For example, as a Jew living in Israel, I still occasionally get ads for Muslim dating sites. Or for various deals which are only relevant to US residents. But these are becoming more rare.

When they do detect a location, the basic step is only to show ads relevant to people from that location. That’s the basic step, which most have been doing (or trying to do) for a while.

Ad with the Hebrew text going backwardThese ads often don’t only change content, but language as well. If the advertised product is sold internationally, people from different countries may pay more attention to ads in their own language[1].

One way to do it is to have a set of pre-made ads, and show them according to the location.

Another way, for those wanting to be more… efficient? is to have a single ad, with several localized text strings that can change inside this ad according to the source.

In theory, it’s nice. There is a need to keep only one copy of a picture, or interactive program, and yet still someone from the US will see English, and someone from, say, France, will see French. The main needed investment is to get the text lines translated into the relevant languages.

And then you have those that go the extra mile (backwards, usually, though) and pick languages that are harder to handle. They do the whole design with languages that go left to right, like English, and then put in right-to-left text, like Hebrew or Arabic.

In many of those cases that I saw, they then forget that the text has to be added to the pictures a little differently. And they don’t bother to show the finished result (calculated ad with the language) to someone who knows the language. They probably just verify the initial text strings, thinking that nothing can go wrong since the same exact text will go into the image.

The end result? Extremely unprofessional advertising, when all the words in the text, or even the whole sentence, go backward, letter by letter. ( !stoidi diputS )

Like this image taken from an ad I saw on several websites. It was on a page together with at least one more different ad, by the same advertiser, that contained the exact same problem.

Did I mention that it looks extremely unprofessional, silly, and pathetic? Because, well, it does. And it definitely gets you thinking that if they managed to screw the ads so bad, on something so basic, what else didn’t they bother to pay attention to, and was it important?

So, the advertising company (the one putting the ads, I don’t know who designed them) is fastclick.net , which redirects to ValueClick Media. Nice name, not so much value to the advertiser.

I thought I’d be nice, and let them know. So I went to their site, got the Contact page, and looked for an email address, or a form. No email address, but there is a contact form. A contact form where the required fields include things like phone number, company, location, how I heard about them, and so on.

This may be alright (OK, not really) for people who are potential customers. But for someone who just wants to do them a favour by dropping a quick helpful note? Completely unacceptable. I shouldn’t have to work, and provide lots of details, just to try and help them.

Required fields should be the message content, and a quick subject. Maybe not even the quick subject. Asking for email address is also fine, if the message may need a follow-up, but that should be left to the discretion of the person sending the message.

And this company is supposed to make money by selling things to people?! By marketing?! That’s supposed to be their strong side? Funny.

---
  1. Personally it annoys me, and I always feel more comfortable when it’s English, rather than Hebrew or other language, if I read on a computer, but I’m really not representative here[back]

Updates, and getting back

July 1st, 2008

After neglecting this blog for too long, I now updated the software (WordPress) to a new version, updated the several plugins it uses, and am also planning to get back to actually using it.

In the meantime, if anything seems broken, or just strange, in the way the blog behaves, please let me know. It can be because of the large amounts of updates.

Real content to follow soon, in more posts…

There is an expiration date on medical privacy

January 23rd, 2008

There are a lot of debates about privacy, and about the balance between privacy and research (or possibly privacy and anything else of utility that can be derived from the private information).

Of all the areas of privacy, one of those where people object the most to violations of their privacy is in health and medical information. You’d be very hard-pressed to find anyone willing to disclose diseases, health problems, and medical examination reports.

And when disclosure of private information occur, people get the most annoyed, or concerned, when it comes to children. It may involve a lot of over-sensitivity, but it’s still the case.

Well, now there’s a new service, for any interested researcher, or any interested Internet user with some free time and a little curiosity. A website that allows to browse a comprehensive set of complete and full, uncensored, medical records from a large hospital for children.

You can search and browse by the real full names of the children[1]. You can get the full diagnoses, what diseases the doctors found when examining these children. For some of the children you can get the full case notes of the doctors who checked the sick children. You can see what were the treatments and medication given to each of these children, and whether they helped.

And you can even get the full, and real, physical home address of the patients. The people who digitized the information on the site actually invested quite a lot to make sure that the addresses are correct and, for example, none of the street names would be misspelled.

Sounds lovely, does it?

So why isn’t there a huge outcry over it? A major violation of privacy and data protection laws. Not to mention children. And it didn’t even hit the news. Why?

Because the details are from 1852 to 1914. Meaning that youngest patients would potentially be 94 years old now. Certainly in no condition to care, or complain, if they’re even alive. Just some interesting past cases for research.

I’m sure their children won’t mind at all that mom’s chronic illnesses are online for everyone to see. Mom is dead, so there’s no point in keeping her secrets, right?

Welcome to Small and Special, the site showing you all the gritty details from the Great Ormond Street Hospital for Children, in Britain.

A unique database lies at the heart of the Small and Special website. It is based on the information contained in the In-Patient Admission Registers from Great Ormond Street Hospital for the period 1852 to 1914, which have survived intact. The Registers have been carefully transcribed and indexed to allow flexible and accurate searching of this important resource. Users can search for children by name (first and last names), age, sex, and address. Other searchable fields include date of admission and discharge, admitting doctor, outcome of treatment and subsequent referrals (if any).

The database is supplemented by a collection of scanned images from 14 volumes of patient case notes of the founding physician, Dr Charles West. The case notes, which cover a period between 1852 and 1874, contain a wealth of information on the treatment and management of sick children in the mid-Victorian period.

Some of the information is accessible freely. For the rest you need to register. But the registration is easy, free, and they don’t require that you’d prove (or even claim to be) you are a researcher, doctor, or anything.

And what does it say about the future? About my own medical records, or yours? We trust[2] in doctor-patient confidentiality. In privacy laws. In that even if the doctor has to share the details with insurance companies, none of them could, or would, ever just list everything on the Internet for the curious masses.

Anyone having these expectations of privacy about your medical and health records? Just wait about a hundred years or so, and we’ll see. We most definitely will see. Everything. Free for searching and browsing.

After all, our friends in Kingston University are still working:

This outstanding resource will be further enhanced by the inclusion, at a later date, of the surviving Registers for Cromwell House (the Hospital’s convalescent home at Highgate), from 1869 to 1910.

Amusingly enough, when you register to the site they have a privacy policy, and they clearly state they won’t share your personal details with anyone. Which is admirable, and I wish more online services would be so clear about their privacy policy. It’s just that, well, they’d keep my name and email secret, but don’t feel any problem with showing me the names, addresses, medical history, and diagnoses, of many many past-children who never thought to agree to release it.

---
  1. Some of the names have been anonymised. This is a minority of them. I’m not sure what were the criteria to choose. And even for the anonymous ones, you still have a full home address, just not the name[back]
  2. Want to trust, anyway?[back]

Some basic math for waiters

October 8th, 2007

When a group of several people eat together at a restaurant (or bar, coffee shop, etc…) there are common ways to split the bill:

  • One person pays everything.
  • Split evenly.
  • Each pays for their own portion.

The exact values are of course a bit fluid on the last two options, since the numbers may be rounded. Currency is discrete rather than continuous, after all. Not only that, but it’s often simpler to divide up to the main coin and not the sub-coins[1].

The payment can be done by cash. In that case the people would usually just collect enough, pay with it, and divide the change between themselves when the change comes back. The work on properly dividing the charge is on the customers in these cases.

Sometimes, though, people pay with credit cards. Which means that many times the waiters will just receive a bunch[2] of cards, with simple instructions on how to divide the charge between them.

The common one is of course “Split it evenly”. And these are the cases where money is often rounded to higher coins, since apparently most waiters have a problem with fractions. I can recall maybe 1-2 cases, ever, where the individual charges weren’t rounded with one person paying the extra.

When things are not split evenly, well, that’s when the fun begins. And by “fun” I mean an all too common tragic comedy of errors.

The simple case is when the customers still calculate the amounts in advance. In this case the waiter receives exact instructions in the style of “Put 100 on this card, and 150 on that card”. Simple. Easy.

And they still sometimes manage to get it wrong:

  1. The bill comes back split evenly.
  2. The amounts are charged correctly, but on the wrong cards. In this example, the first card is charged 150, and the second 100.
  3. All of the cards are charged the same amount, which is one of the sub-amounts. So, for example, for this 250 bill either both cards will be charged 100, or both will be charged 150.
  4. Some of the cards may be charged correctly, and some will be charged an unrelated amount. This is because the complexity of the task got the waiter confused and he/she charged an amount due for another customer entirely.

I had all of these happen to me, as a customer in restaurants.

One time I had two of them happen in a series. The waitress made a mistake (#3 above), I alerted her, and she came back with a “correction” that included another type of mistake (#4 above). When there’s a charge, and a cancellation, as a customer you’re requested to sign on both. If you simply don’t sign on the charge, it creates all sorts of complications. So I ended up having to sign five times for my bill that day. What did I tell you? Fun!

It also happens, though, that the job of dividing the charge is placed on the waiter. Sometimes the customers know the difference between what they’re supposed to be billed for, but not the final amount.

In which cases someone has to do the calculation. It’s a simple enough calculation, you know the total, and you know the differences.

And the natural tendency would be to let the waiter do it. People just had a meal, are finishing up, and they need to pay the bill. Why would they want to do the work, as easy as it is, when there’s a waiter that will have to process the charges anyway and is being paid for it?

Makes sense.

Except it doesn’t. Because many waiters seem a bit deficient in the math department.

The latest time this happened to me was a couple of weeks ago. I was finishing a meal with a friend. We basically shared the dishes, so almost everything was supposed to be split evenly. The only difference was that I had an extra glass of some medium-pricey alcohol.

The waitress arrived, and saw the two credit cards on the tray with the bill. The dialog between me and the waitress went something like that:

Waitress: Should I split this up?
Me: Yes, but it’s 70 more on this card.
Waitress: Right. 70 on this card, and the rest on the other card.
Me: No. Split it between the cards, so that this card is charged by 70 more than the other card.
Waitress: Eh…
Waitress: Hmm….
Waiterss: I’m…. err… not….
Me: It’s simple. Just split evenly, add 35 to this card, and reduce the other 35 from the other card.
Waitress: Ah. Yes. OK, sure.

And this is the math lesson for today. If you want to divide a sum X between N people so that everyone pays the same except for one who pays an extra Y, this is what you do:

  1. Divide X by N. Let’s call that A for average. You already know how to do that. This A would be what you’d charge each card if you had to split evenly.
  2. Divide Y by N. Let’s call that B. This value is like the average of the differences. Mathematically it’s the exact same process as the previous step, so if you knew how to do it, you know how to do that.
  3. Everyone, except the person who has to pay more, pays A-B. You know how to do subtraction already. It’s the same thing you’d do if someone paid part by cash and part by credit card, and you’d have had to reduce the cash amount from the total to get the credit card charge.
  4. The person who has to pay more pays A + [(N-1)*B]. Basically all the B’s you reduced from the bills of the other people, you add to this one’s bill. You already know how to do addition too. It’s just like what you’d do if someone asked you to charge the tip on the card as well, telling you how much is the charge and how much is the tip. You already know how to do multiplication as well, it’s what you’d do if you got everyone else’s cards and they all told you they have to pay B.

That’s it. Easy. Simple steps. And these are all things that waiters are supposed to know how to do already.

Except sometimes they don’t.

In this case, for example, I was indeed charged 35 more. The other card? Charged exactly the amount of an even split.

Wait, wait, I know what you’re thinking. In this case it would mean that the total would come to 35 more than the real total, right? So the waitress, or at least the cash register computer, should notice something is off, right?

Right.

But they had a simple solution for that. You see, the final bill came back printed with three items:

  • Credit card charge : A
  • Credit card charge : A+35
  • Refund : -35

So the total was absolutely correct, making the waitress feel perfectly happy about it. No problem if it all adds up, after all.

Except that, of course, we didn’t get that refund. The bill did not come back with 35 cash, nor did one of the credit cards get a refund (which would have kind of defeated the whole purpose, but at least would have meant the amount of money passed from us to the restaurant would have been correct).

Our poor waitress didn’t quite see the problem. It all adds up after all, and the total is right. Luckily another waitress/supervisor did see the light immediately after a very brief explanation.

Waiters should learn a little basic math. Me, I should learn not to trust waiters to do even the most basic math. I think I learned my lesson. Now it’s their turn.

---
  1. OK, Poor terminology here. I mean that, for example, to split 25$ between to people you’d sometimes, often, see one charged 13$ and one 12$, rather than 12.5$ each.[back]
  2. 2 is also valid for “bunch”[back]

Now even random spammers believe I think too highly of myself

September 19th, 2007

From a spam email I received today, after the link to their site:

Greeting yaron
get rid of that self-esteem once and for all.

I think I’ll keep my self-esteem, but nice of them to offer.

Release notes should really include the release notes

September 18th, 2007

A new version of the Firefox browser was released today. A minor update from version 2.0.0.6 to version 2.0.0.7.

Even more minor than that, actually, since what came out was just an RC version for testing. Sometime in the past I downloaded an update that was considered a beta or RC, so I’m on the list to keep getting them on the automatic updates.

The problem is that there was no information provided on what exactly the update includes, and what is the purpose behind it. The release notes page did not contain any relevant info (I’m not promising they won’t change the page in the future. It doesn’t contain the info now, and haven’t for quite a few hours so far).

It had lots of other things, the general outline they put on each release-notes page. But the actual release notes, what was changed from the last version, no. Nothing.

There wasn’t even any link to a page where this information could be found. Because, well, in theory it would have been that exact same page.

That’s a very very poor way to roll out an update. If you ask someone to install a new version of a software, and especially if it’s a beta/RC that you want people to test and provide feedback for, you have to tell them why and what has changed.

Seems very sensible to me. Apparently doesn’t seem so sensible to some of the people in the Mozilla foundation. Don’t get me wrong, they’re doing a great job, and Firefox is terrific. But most people don’t follow all the bugs and progress on every single application they use, so it’s far from obvious what an update is for.

I do hope they’ll do better next time. I’m more than willing to install updates, but I need to know why.

In this particular case, if someone is interested, it’s a single fix for a single security vulnerability. Well, a potential whole class of problems, but only a single known point. Which was now actually more of a problem with the Quicktime plug-in (on Windows) and not in Firefox itself, but in this case it’s a good idea to fix it in Firefox as well, to prevent any future problems from the same direction. You can look at the actual bug report for more technical information, if you really want to.