Generate composite key from 2 Foreign keys

ElectricBlueHorseman · October 18, 2012 1:03AM

I'm having the same problem as detailed in the forum for SQL Data Generator 2 here.

To recap I have these tables:
dbo.TableA (KeyA int primary key)
dbo.TableB (KeyB int primary key)
dbo.MapA2B (KeyA int, KeyB int, unique(KeyA, KeyB))

I've used SQL Data Generator (2.0.3.1) to generate 90 rows for dbo.TableA & 1,400 rows for dbo.TableB.

The next task I'm trying to do is generate data for dbo.MapA2B. For that I have set:
KeyA - repeat 1 - 2 times
KeyB - repeat 1 - 100 times

I've also set "When data is invalid" to "Skip row".

When I run the data generation I get this error:
Violation of UNIQUE KEY constraint 'XXX'. Cannot insert duplicate key in object 'dbo.MapA2B'. The duplicate value is (YY, ZZZ). The statement has been terminated.

I've found some old (4 years, SQL Data Generator 1) references to this being a problem but I can't find any solutions. To be honest I'm rather surprised it has not been fixed. The many-to-many relationship pattern is pretty common.

Brian Donahue · October 19, 2012 11:19AM

Hi,

The problem, as I see it, is not a bug in Data Generator that can be fixed, but a design limitation: generators apply to a single column and have no awareness of what data is being generated for a companion column in the same table. Data Generator therefore does not handle composite keys very well unless all values are unique in both columns.
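To make the limitation concrete, here is a minimal sketch in plain Python 3 (not a SQL Data Generator script). The key ranges, row count, and variable names are made up for illustration. Each column is filled without looking at the other, so with only 12 possible pairs and 20 rows, some pairs are bound to repeat, which is what trips the unique constraint.

```python
import random

# Hypothetical, deliberately tiny key ranges so the effect is easy to see.
keys_a = [1, 2, 3]
keys_b = [1, 2, 3, 4]

rng = random.Random(3128)

# Each column is generated on its own, with no knowledge of the other.
col_a = [rng.choice(keys_a) for _ in range(20)]
col_b = [rng.choice(keys_b) for _ in range(20)]

# Pair the columns up row by row, as they would land in the mapping table.
rows = list(zip(col_a, col_b))
duplicates = len(rows) - len(set(rows))
print("rows:", len(rows), "distinct pairs:", len(set(rows)), "duplicates:", duplicates)
```

With 3 x 4 = 12 possible pairs, 20 rows must contain at least 8 duplicates whatever the random draws are.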

This can usually be worked around with a Python script generator that generates predictable values.

Assuming the following schema:

CREATE TABLE TableA (
    identifier INT PRIMARY KEY
)
CREATE TABLE TableB (
    identifier INT PRIMARY KEY
)
CREATE TABLE LookupTable
(
    identifierA INT,
    identifierB INT
)
ALTER TABLE LookupTable ADD CONSTRAINT uq_LTable UNIQUE (identifierA,identifierB)

Set the seed value for TableA.identifier to 3128 and the seed for TableB.identifier to 3129, and set both distributions to sequential. Then use the following Generic->Python generator for LookupTable.identifierA:

__randomize__ = False
import System
def main(config):
    rowCounter=0
    myList=[]
    ltRnd=System.Random(3128)
    while (rowCounter < config["n_rows"]):
        rptNum=2 #repeat two times
        repeatCounter=0
        numNbr1=ltRnd.Next(0,9999999)
        while (repeatCounter < rptNum):
            myList.append(numNbr1)
            repeatCounter=repeatCounter+1
        rowCounter=rowCounter+rptNum
    return myList

...and the following Generic->Python generator for LookupTable.identifierB:

__randomize__ = False
import System
def main(config):
    rowCounter=0
    myList=[]
    ltRndA=System.Random(3129)
    while (rowCounter < config["n_rows"]):
        rptNum=100 #alternate two values over a block of 100 rows
        repeatCounter=0
        numNbr1=ltRndA.Next(0,9999999)
        numNbr2=ltRndA.Next(0,9999999)
        while (repeatCounter < rptNum):
            myList.append(numNbr1)
            myList.append(numNbr2)
            repeatCounter=repeatCounter+2 #two rows appended per pass
        rowCounter=rowCounter+rptNum
    return myList

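Now each value in the identifierA column should appear twice, and identifierB should alternate between two values across each block of 100 rows, so each identifierA value is paired once with each of the two identifierB values and the constraint is not violated.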
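For comparison, here is a rough sketch of another way to get the same guarantee when the rows are built outside SQL Data Generator. It is standard Python 3, not a Generic->Python generator script, and the function name unique_pairs and its parameters are hypothetical. It treats every (KeyA, KeyB) combination as one index into the cross product and samples indexes without replacement, so no pair can be picked twice as long as the input key lists themselves contain no duplicates.

```python
import random

def unique_pairs(keys_a, keys_b, n_rows, seed=3128):
    """Return n_rows distinct (a, b) pairs drawn from keys_a x keys_b.

    Assumes keys_a and keys_b each contain no duplicate values.
    """
    total = len(keys_a) * len(keys_b)
    if n_rows > total:
        raise ValueError("asked for more rows than distinct pairs exist")
    rng = random.Random(seed)
    # Sampling without replacement gives distinct indexes into the cross product.
    picks = rng.sample(range(total), n_rows)
    # Decode each index into one key from each list.
    return [(keys_a[i // len(keys_b)], keys_b[i % len(keys_b)]) for i in picks]

# Sizes taken from the original question: 90 rows in TableA, 1,400 in TableB.
# The key values 1..90 and 1..1400 are assumed for the example.
pairs = unique_pairs(list(range(1, 91)), list(range(1, 1401)), 5000)
print(len(pairs), len(set(pairs)))
```

The resulting list could then be bulk-inserted into the mapping table by whatever means is convenient. Unlike the per-column generators above, it does not control how many times each individual key repeats.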

I know this is a difficult road to go down, but hopefully you do not have too many of these kinds of tables. Hopefully this helps.

SinisterChinaPenguin · April 27, 2022 11:59AM

I usually create a copy of the table WITHOUT the composite key, use SQL Data Generator to populate it, and then manually sort out the key values.

Then I insert the data into the "real" table.

Obviously some tables will be a nightmare but in a lot of cases I've found this pretty easy to implement.
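A minimal sketch of that staging idea, with several assumptions: it uses Python's built-in sqlite3 module in place of SQL Server, the table name MapA2B_Staging is made up, random rows stand in for the generator's output, and "sorting out the key values" is reduced to dropping duplicate pairs with SELECT DISTINCT. A real clean-up may need more than that.

```python
import random
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- The "real" table, with the composite unique constraint.
    CREATE TABLE MapA2B (KeyA INT, KeyB INT, UNIQUE (KeyA, KeyB));
    -- Hypothetical staging copy without the constraint.
    CREATE TABLE MapA2B_Staging (KeyA INT, KeyB INT);
""")

# Stand-in for the generated data: random pairs that will contain duplicates.
rng = random.Random(0)
rows = [(rng.randint(1, 3), rng.randint(1, 4)) for _ in range(50)]
conn.executemany("INSERT INTO MapA2B_Staging (KeyA, KeyB) VALUES (?, ?)", rows)

# Copy only distinct pairs into the constrained table.
conn.execute(
    "INSERT INTO MapA2B (KeyA, KeyB) "
    "SELECT DISTINCT KeyA, KeyB FROM MapA2B_Staging"
)

staged = conn.execute("SELECT COUNT(*) FROM MapA2B_Staging").fetchone()[0]
loaded = conn.execute("SELECT COUNT(*) FROM MapA2B").fetchone()[0]
print("staged rows:", staged, "rows loaded into MapA2B:", loaded)
```

The INSERT ... SELECT DISTINCT statement itself is plain SQL, so the same shape should carry over to SQL Server with the appropriate schema prefixes.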
