Power Query: If You Can’t Tell Them Apart, Join Them
11 October 2017
Welcome to our Power Query blog. This week I look at how to merge queries to compare two similar lists.
Sometimes, the tasks I have to do are less than thrilling. I could be given two long lists of employees, and told to find who is missing from each list:
data:image/s3,"s3://crabby-images/f0ead/f0ead0acf846cf308d4973d70c914bb16c2a2a19" alt=""
With Power Query, there is an easy way to do this.
I have two lists of employees which look virtually identical, but of course they are not. The first step I take is to use Power Query to load my lists. I begin by creating my first query.
data:image/s3,"s3://crabby-images/2eb79/2eb79ec0d3b97f40874bab8b79801c49f310d109" alt=""
On the ‘POWER QUERY’ tab, I use the ‘From Table / Range’ option in the ‘Excel Data’ section. I can either convert my data into a table before I use this option or let Power Query do this as part of my query creation.
data:image/s3,"s3://crabby-images/af6b2/af6b29b3baad9ed8b4abc75ced9f0d748d5040db" alt=""
My data appears in a new query, which I call ‘Employee Table 1’. I choose the ‘Close and Load to’ option on the ‘Close and Load’ section on the ‘Home’ tab.
data:image/s3,"s3://crabby-images/afea8/afea8559813dc3c06194dcef755abdd9e8c57c4f" alt=""
I only want to create the connection as I am going to use my queries for comparison. Having created the first query, I repeat the process for the second employee list.
data:image/s3,"s3://crabby-images/49bf3/49bf34ecd34eb1c7fce0628aaab6bf15f926f898" alt=""
My next step is to merge my queries and to do this I can right-click on one of my employee queries (I could have chosen to use the option from the ‘Combine’ section of the ‘POWER QUERY’ tab).
data:image/s3,"s3://crabby-images/a87e2/a87e2ce07b9e9e75dd6eb8fafe77d39be24c573c" alt=""
I select the ‘Merge’ option.
data:image/s3,"s3://crabby-images/61230/6123068bd77b8494ad15db1834137acf9bd2f862" alt=""
In the ‘Merge’ screen, I specify the second employee table, and I see a preview of the data in both queries. I want to merge my tables completely, so I select all columns in both tables.
At the bottom of the screen I have the option to specify the ‘Join Kind’, and this will determine what data I see when the queries are merged.
I am interested in the joins that will either find my matching or missing data, which are the last three on the drop down. I will run through each option in turn.
‘Inner’
This will give me those employees that are on both lists. I pick this option first.
data:image/s3,"s3://crabby-images/4d172/4d17214ecd30f4ea919845c755689c78968087f7" alt=""
The rows I can see initially are those rows which match the criteria in the first table, and I have an extra column which is labelled ‘Employee Table 2’, which contains a table. I choose to expand the second table to see which rows match the criteria in the second table:
data:image/s3,"s3://crabby-images/02454/02454e4d5cc2da62634d283887b3650adc6e4692" alt=""
I don’t need to use any original column names as a prefix, so I uncheck that box and choose ‘OK’.
data:image/s3,"s3://crabby-images/cc00d/cc00d19f8694e2e455253cbd2f43c986b42b58c5" alt=""
I have 42 employees in both tables.
‘Left Anti’
I opt to merge my tables again, and this time I pick join type ‘Left Anti’.
data:image/s3,"s3://crabby-images/faa1e/faa1e403338b518272c04a31570878a0ba1f6c12" alt=""
This time I am interested in the first table, since this will show me those employees in ‘Employee Table 1’ who are missing from ‘Employee Table 2’. I find four employees and two seem a little familiar… Notice Zoe, as she’ll come up again later…
Just to show that the second table will be empty (as I am looking for employees that don’t appear here), I expand the table viz.
data:image/s3,"s3://crabby-images/429c1/429c1c90742e6a0ae3619a8941a2133f842f42e3" alt=""
‘Right Anti’
My final choice is to create a merge with a ‘Right Anti’ join. In the Query Editor, this looks a little strange before I expand the table.
data:image/s3,"s3://crabby-images/60222/60222f87ef177604d8f5a36a6d7f6d879d47ca35" alt=""
This is because I am seeing the results from the first table, and since I am looking for values that are in the second table and not in the first table, it is empty. I choose to expand the second table:
data:image/s3,"s3://crabby-images/c78c4/c78c4b8700daae2754e86d36856d52c1526fc587" alt=""
I have two employees that are missing from the first table – and it wouldn’t take too much investigation to discover that Zoe has changed her name from Green to Brown!
Want to read more about Power Query? A complete list of all our Power Query blogs can be found here.
Come back next time for more ways to use Power Query!