blogs.conchango.com

welcome to the conchango blogging site
Welcome to blogs.conchango.com Sign in | Join | Help
in Search

SSIS Junkie

Conchango are busy and need talented consultants in and around London. Interested? Email me or send me a message

Ensuring data quality

Richard Wand, a colleague of mine here at Conchango, has posted an interesting article here that talks about whether a web site owner should give their browsers the ability to type in their title (e.g. Mrs, Mr, Dr...etc) or just offer them a list of options instead in a drop-down box.

He cites an interesting case at BA where they chose a drop-down box which actually consisted of 204 items. Richard attributes this to "technical constraints" and states: "We are defining the specification so we shouldn't be contrained by technical bits." That's an understandable point of view because Richard works in our award winning web design team and in that discipline the User Experience is key.

As a data professional though I am a little bit more sympathetic to BA's situation. The "technical constraints" that Richard speaks of can doubtlessly be better described as a desire by BA to ensure data quality in their systems - unweildy as that data may be to the end user. They are trying to avoid what Donald Farmer has referred to as idiothetic data quality issues. Data quality, as anyone who has worked on a data integration project will tell you, is one of the biggest (I would say THE biggest) contributory factors to success or failure of those types of project. Allowing users to type in freetext fields ultimately causes headaches for us data guys when we are attempting to deduplicate or categorise customers. For example, allowing people to type in freetext could(and will) result in all of the following titles being entered:

  • Mr
  • Mister
  • Mr.
  • Dr
  • Dr.
  • Doc
  • Doctor

To the human eye its quite obvious that there are only really two distinct titles here but unfortunately machines generally aren't that smart and in the main these would be treated as seven seperate titles. From that point of view I applaud what BA have done - they are simply trying to improve the data quality in their systems.

 

However, I'm also a BA customer myself (or I might be) and as a user I don't want to have to select "Mr" from a list of 204 items. As Richard correctly states, "they have technical constraints battling with usability best practices" and therein lies the problem. This isn't an isolated case of course - incongruent data issues like this crop up all the time and will only increase as the web becomes all the more pervasive. A better solution to this problem needs to be found and indeed Richard himself suggests a very good one. I'll be interested to see how this one progresses, hoepfully Richard will keep us informed.

 -Jamie

 

 

Published 07 February 2007 00:28 by jamie.thomson

Comment Notification

If you would like to receive an email when updates are made to this post, please register here

Subscribe to this post's comments using RSS

Comments

 

Andy said:

Surely the solution here is a user control that allows you to type or select items. Typing "m" and "r" would highlight the relavent item "Mr". This kind of thing has been possible in conventional GUIs for ages and now with AJAX its becomming possible in web interfaces to.

I've got the same problem with my client's data but they've also been working in three different languages which makes the issue worth.

February 7, 2007 16:52
 

jamie.thomson said:

Andy,

You're right, a solution along these lines seems the best way to go - and that is the solution that Richard mentioned in his blog.

That solution solves the problem of having 204 items in the drop-down box. I would then ask:

"Is it still a freetext field? i.e. Do we let them type in values or do we still restrict them to a finite list? And if the answer is that we restrict them, how do you communicate to the user that they can't type in whatever they want?"

Because my concern would still be that people would add entries that we don't want them to.

Thanks for the comment.

-Jamie

February 7, 2007 18:20
 

Phil Brammer said:

I would have to ask if, in this case, the salutation field is even appropriate.  When it comes to salutations, do they really mean anything?  Does a Dr. designation differentiate me over Mr.?  Generally, I'd consider that piece of data garbage in terms of data integrity.

Granted, the general concept in your post is still valid.  The salutation field is merely an example...

February 12, 2007 22:24

Leave a Comment

(required) 
(optional)
(required) 
Submit

This Blog

Syndication

News

Powered by Community Server (Personal Edition), by Telligent Systems