In the past there have been some misadventures with uploads of large numbers of articles whose data is mostly tabular.
I noticed that some of the conversions listed the place of birth as Earth. Listing names as "unknown" or "not found" on info pages is equally bogus: it cosmetically "bulks up" the infopage without adding any information. Besides being non-information, this results in redlinks and invites users to create articles of that name. Lastly, many articles have been uploaded with all-caps names. This is particularly surprising because easy-to-use tools such as Word can correct such problems in preloaded data with trivial effort.
Unless I hear any objections, I am going to send PhloxBot off to correct the all-caps names and the unknown/not found errors, but we perhaps need to put in place some guidelines on minimum quality requirements for bulk uploads.
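To give a rough idea of the kind of cleanup that could be run on tabular data before it is uploaded, here is a minimal Python sketch. It is not PhloxBot's actual code. The field names (`name`, `surname`), the placeholder list, and the function names are all made up for illustration, and the simple title-casing will get names like "McDonald" wrong, so treat it as a starting point only.

```python
# Hypothetical pre-upload cleanup; field names and placeholders are assumptions.
PLACEHOLDERS = {"unknown", "not found"}
NAME_FIELDS = {"name", "surname"}


def clean_value(value):
    """Return None for empty or placeholder values, else the stripped text."""
    text = value.strip()
    if not text or text.lower() in PLACEHOLDERS:
        return None
    return text


def fix_all_caps(name):
    """Title-case a name only when every cased letter in it is upper case."""
    return name.title() if name.isupper() else name


def clean_record(record):
    """Drop placeholder fields and normalise all-caps names in one record."""
    cleaned = {}
    for field, value in record.items():
        text = clean_value(value)
        if text is None:
            continue  # leave the field out instead of uploading "unknown"
        cleaned[field] = fix_all_caps(text) if field in NAME_FIELDS else text
    return cleaned
```

For example, `clean_record({"name": "JOHN SMITH", "father": "unknown"})` keeps only the name field and turns it into "John Smith". Leaving a field out entirely, rather than filling it with a placeholder, avoids creating the redlinks mentioned above.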
Any opinions, please chime in. I don't want to make it hard for folks to do bulk uploading, but I do want to prevent junk data from infecting our surname or place categories and other structures. Note that malformed category names cannot simply be redirected. Once other articles start to link to them, it becomes nontrivial to exorcise such malformed info. ~ Phlox 22:19, October 22, 2009 (UTC)