Power Query: Self Merging
10 March 2021
Welcome to our Power Query blog. This week, I look at a way to reorganise data in a table by merging with itself.
Just for a change, I have some tent data:
data:image/s3,"s3://crabby-images/0bb17/0bb1772c221448829ec079dee453a716a8597bd8" alt=""
I want to have the description and price in separate columns. To this end, I begin by extracting my data to Power Query using ‘From Table/Range’ on the ‘Get & Transform’ section of the Data tab.
data:image/s3,"s3://crabby-images/226a0/226a003fbcf476636323211e8c60680c3a18078f" alt=""
There are a number of ways to solve this, but for this example, I am going to merge the table with itself. I don’t need to create a copy of the table to do this.
data:image/s3,"s3://crabby-images/7e4b1/7e4b190a43aaa8f9f6cd3e364cc826cba141337c" alt=""
Power Query will allow me to link my table to itself, as indicated by the Table1 (Current) in the table dropdown for the second table. I choose to use the Left Outer join, and I join on the Tent Size field, since I know this will always be unique in this table.
data:image/s3,"s3://crabby-images/68d3e/68d3e96bf2abf38183ce39c44ddf0903cd78bf43" alt=""
The name given to my new column is the name of the previous step since this is equivalent to the current table. I can then expand the table:
data:image/s3,"s3://crabby-images/86283/862839de3edd6abbab2a8f6e74d0f4dc14fd2b6e" alt=""
I only want to see Tent Colour/Price. I don’t need to use a prefix as I don’t have many columns, so it will be easy to distinguish between them.
data:image/s3,"s3://crabby-images/4ead9/4ead9b23d5883149e21dad4ac0cf6041d47504cb" alt=""
I rename my second and third columns to Tent Colour and Tent Price respectively.
data:image/s3,"s3://crabby-images/4098f/4098f3c9b7013069e64ca77ef881bddf4d47c843" alt=""
The Tent Price column is easy to sort out, so I modify this field first. I can just transform the column type to ‘Whole Number’, and this will give me errors for the colours.
data:image/s3,"s3://crabby-images/8bb4d/8bb4dbf4b895e1906ddc885196e215afa05fa2ac" alt=""
I can now right-click and choose to ‘Remove Errors’.
data:image/s3,"s3://crabby-images/816f2/816f2eb7445685201caff576beea023c69a0e90e" alt=""
Removing the errors removes the colours from this column:
data:image/s3,"s3://crabby-images/54bd7/54bd768d49876eb9f03fe930a4b82670ebed18d9" alt=""
I need to remove the prices from the Tent Colour column. I can do this by creating a new conditional column. Since I plan to compare Tent Colour and Tent Price, I start by transforming them to data type ‘Text’.
data:image/s3,"s3://crabby-images/ec0b6/ec0b64b2e84b5539c1a7619bd02916e5c9a8a840" alt=""
I create a new column which will be null if Tent Colour is the same as the value in Tent Price, otherwise it is set to Tent Colour.
data:image/s3,"s3://crabby-images/ae7fa/ae7fa05994c46724e10f9fa7d9ab4bd1b16f185e" alt=""
I click ‘OK’ to see my new column.
data:image/s3,"s3://crabby-images/c33bb/c33bbfc8c29397c4184cbc4589899d82d2030985" alt=""
I can now remove the null rows by filtering and choosing to ‘Remove Empty’ on Tent Colour (new).
data:image/s3,"s3://crabby-images/d1d9a/d1d9a9db004d53d047dd98cce625390f843639e5" alt=""
I can now remove my new column.
data:image/s3,"s3://crabby-images/3af24/3af246e5b267cbc1bea25df1b42169bc2a707b00" alt=""
I have my data in the required format. Next time, I will look at a nested merging problem.
Come back next time for more ways to use Power Query!