Data Cleaning

Data cleansing is the repairing or removing of data in a database, or file, that's incorrect, incomplete or improperly formatted. Several large organisations specialise in cleaning up massive address databases where inconsistencies in addressing lead to multiple versions of the same record. For example, A.N Other, 35 st. Johns Road vs. Ann N. Other, Thirty Five, Saint John's Road. etc. If you need to clean up this kind of data then these companies are the ones you want and be prepared for the invoice. If, on the other hand, you've managed to somehow corrupt your Excel spreadsheet (Word, CSV etc) with strange data characters and now you appear to have someone called "Peter%20O'Carroll & Associates" incorrectly listed, you may need our services.

Perhaps you saved a text file and when you next opened it, it reads:

This is a relatively straightforward problem to correct, even for the not-so-technically inclined. But suppose you had several hundred files like this affected?.

 

How about you had the following entries in an Excel Spreadsheet cell: 
"O' Gorman Peter John"
"O'Reilly Patrick"
"Ryan Ann Mary"
and you want to reparse the entries into 2 cells, Firstname, Lastname. You have 2,000 records. Again, it's not rocket science, but most medium sized companies do not have the resources in-house to do this efficiently and are reluctant to go to an IT company for such a small issue. We may be able to help - just email/FTP us the files and we will return the cleaned version. As it can often be coded and parsed in minutes rather than hours, we can work out very economical 

 

I'm interested. -> Contact us with your requirements.