M
Mark
I have a table about people containing 25 fields. The table contains the usual
fields - first, last, address, city, state and zip. There is no primary key.
These fields all have data with the exception of a few records missing zip. A
person may be in one to five records in the database. If a person is in multiple
records, the other fields in the table in each record for that person may or may
not contain data. I have two problems:
1. I need to determine which people are in more than one record in the database
2. I need to build a master record for each person who has multiple records in
the database where the master record contains all the data in the database for
that person. This means merging the multiple records into one record. Upon
observation, there is no pattern as to which records contain which data. For
example, the social security number is not always in the second record. So my
conclusion is that the merge has to occur on a field by field basis after
determining which records belong to the same person.
Having a Charles Wilson and a Chuck Wilson is not an issue although they may be
the same person. However, I don't want to have more than one Charles Wilson or
more than one Chuck Wilson unless there are two different people named Charles
Wilson or two different people named Chuck Wilson.
I will appreciate all suggestions on how to clean up this database.
Mark
fields - first, last, address, city, state and zip. There is no primary key.
These fields all have data with the exception of a few records missing zip. A
person may be in one to five records in the database. If a person is in multiple
records, the other fields in the table in each record for that person may or may
not contain data. I have two problems:
1. I need to determine which people are in more than one record in the database
2. I need to build a master record for each person who has multiple records in
the database where the master record contains all the data in the database for
that person. This means merging the multiple records into one record. Upon
observation, there is no pattern as to which records contain which data. For
example, the social security number is not always in the second record. So my
conclusion is that the merge has to occur on a field by field basis after
determining which records belong to the same person.
Having a Charles Wilson and a Chuck Wilson is not an issue although they may be
the same person. However, I don't want to have more than one Charles Wilson or
more than one Chuck Wilson unless there are two different people named Charles
Wilson or two different people named Chuck Wilson.
I will appreciate all suggestions on how to clean up this database.
Mark