Power Query: Emerging from Merging
13 November 2019
Welcome to our Power Query blog. This week we look at extracting from merged Excel cells.
I have some data from John, one of my imaginary salespeople:
data:image/s3,"s3://crabby-images/e5de6/e5de6655905427dd6debad7be95fbf3290304020" alt=""
John has almost followed the expected format, but he’s decided to merge the date cells instead of using auto fill. I need the date on each row. I begin by extracting the data to Power Query using the ‘From Table’ option in the ‘Get & Transform’ section of the Data tab.
data:image/s3,"s3://crabby-images/8582a/8582a03efb5bf7a37fd6b34b04b990e3a555c7e3" alt=""
I accept the defaults.
data:image/s3,"s3://crabby-images/8b3f3/8b3f3727d99e8e19861aedfcf1bc4d259a99dfb2" alt=""
I can sort out my data by using ‘Fill Down’, which is available from the ‘Transform’ tab or by right-clicking a selected column, viz.
data:image/s3,"s3://crabby-images/4c69d/4c69dfcece60a31c518bd0e688d60d7ba9efdaea" alt=""
I select ‘Fill’ and ‘Down’.
data:image/s3,"s3://crabby-images/72572/725722cbe63fbff196db0ffa5b7bfd4963228b4c" alt=""
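For readers who prefer to see the step in M, here is a minimal sketch of what ‘Fill Down’ does. The table, its column names and its values are all made up for illustration (they are not John's actual data); the only assumption carried over from the post is that merged cells load with the value on the first row and null on the rows beneath it.

```powerquery
let
    // Hypothetical sample standing in for John's sheet:
    // merged date cells arrive as a value followed by nulls
    Source = #table(
        {"Date", "Contact 1", "Amount"},
        {
            {#date(2019, 11, 1), "Mary", 100},
            {null, "Paul", 250},
            {#date(2019, 11, 2), "Anne", 75},
            {null, "Derek", 310}
        }
    ),
    // Copy each non-null date into the null cells directly below it
    FilledDown = Table.FillDown(Source, {"Date"})
in
    FilledDown
```

The step generated by the user interface should look much the same, except that it refers to the previous step in the query rather than to an inline table.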
The dates are now populated correctly. However, there are other ways that John likes to merge cells in Excel:
data:image/s3,"s3://crabby-images/8ff04/8ff0478afb7f1af749476455174cd33a058a0984" alt=""
In this case he’s merged ‘Contact 1’ and ‘Contact 2’ for two of the dates, as well as merging the rows. I extract this data again into Power Query:
data:image/s3,"s3://crabby-images/acd7d/acd7d03a361cb4f337cb89499cad43e026cd79d6" alt=""
I can see that the Contact 2 column is currently redundant, so I will remove it using the ‘Remove Columns’ option on the Home tab and create a new one.
data:image/s3,"s3://crabby-images/c2294/c22945a74e1252142a277b04b5e7b07151fa8dfd" alt=""
I can now fill down as before.
data:image/s3,"s3://crabby-images/354fc/354fce8359937e099925dfcf0c33dcb80edf5645" alt=""
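As a rough sketch, the two steps so far (removing the redundant column and filling down) might look like this in M. The sample rows and names are hypothetical and only mimic the shape described above, with the merged contacts landing in Contact 1 and Contact 2 left empty.

```powerquery
let
    // Hypothetical sample: merged contact cells land in Contact 1,
    // leaving Contact 2 empty; merged dates leave nulls below the first row
    Source = #table(
        {"Date", "Contact 1", "Contact 2"},
        {
            {#date(2019, 11, 1), "Mary and Paul", null},
            {null, "Anne and Derek", null},
            {#date(2019, 11, 2), "Kath and Mike", null}
        }
    ),
    // Drop the redundant column; it is recreated later by splitting Contact 1
    RemovedContact2 = Table.RemoveColumns(Source, {"Contact 2"}),
    // Repeat each date on the rows beneath it
    FilledDown = Table.FillDown(RemovedContact2, {"Date"})
in
    FilledDown
```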
I need to split Contact 1, so I use ‘Split Column’ from the ‘Transform’ tab.
data:image/s3,"s3://crabby-images/56406/56406eff8170849885a078a78db7ae22c40ae586" alt=""
I split by delimiter, but I need to create two new columns, not three, so I won’t split for every occurrence of SPACE.
data:image/s3,"s3://crabby-images/c48ce/c48cebd9d13d3faa8503ad9b6c650bb64029a54b" alt=""
I choose to split at the ‘Right-most delimiter’.
data:image/s3,"s3://crabby-images/2d536/2d5368b90524b9c61870b21d83dddf2d5ac723d2" alt=""
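Behind the scenes, splitting at the right-most delimiter is typically expressed with a splitter that starts from the end of the text. The sketch below assumes a hypothetical one-column table; the third argument of Splitter.SplitTextByEachDelimiter (startAtEnd) is what makes the split happen at the last SPACE rather than the first.

```powerquery
let
    // Hypothetical sample of merged contact names
    Source = #table(
        {"Contact 1"},
        {
            {"Mary and Paul"},
            {"Anne and Derek"}
        }
    ),
    // Split once on a space, searching from the right-hand end (startAtEnd = true),
    // so two columns are produced rather than three
    SplitAtLastSpace = Table.SplitColumn(
        Source,
        "Contact 1",
        Splitter.SplitTextByEachDelimiter({" "}, QuoteStyle.Csv, true),
        {"Contact 1.1", "Contact 1.2"}
    )
in
    SplitAtLastSpace
```

With data shaped like this, Contact 1.1 should keep the first name together with the trailing ‘and’, while Contact 1.2 holds the second name.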
Now I need to remove the ‘and’ from the names in Contact 1.1. There are several ways I can approach this. I can split the column again and delete the column with ‘and’. Another way is to create a custom column. In this example, I will use ‘Column From Examples’ on the ‘Add Column’ tab to see which method Power Query applies.
data:image/s3,"s3://crabby-images/8ad2a/8ad2a145d3c53304c4cec29c6988e46b9407f29a" alt=""
After only one example, Power Query has opted to transform the column so that only the characters before the SPACE delimiter appear:
= Text.BeforeDelimiter([Contact 1.1], " ")
data:image/s3,"s3://crabby-images/2b25f/2b25ff1160a7a8b5e4de451040314840bd5f43d3" alt=""
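The transformation Power Query chose corresponds to the M function Text.BeforeDelimiter. Here is a minimal sketch of the same idea written as an added column; the sample table and the new column name ‘Text Before Delimiter’ are assumptions for illustration.

```powerquery
let
    // Hypothetical sample: the state of the data after the right-most split
    Source = #table(
        {"Contact 1.1", "Contact 1.2"},
        {
            {"Mary and", "Paul"},
            {"Anne and", "Derek"}
        }
    ),
    // Keep only the text before the first space, which discards the trailing "and"
    AddedFirstName = Table.AddColumn(
        Source,
        "Text Before Delimiter",
        each Text.BeforeDelimiter([Contact 1.1], " "),
        type text
    )
in
    AddedFirstName
```

This works here because each first name is a single word; a name containing a space would be cut short, so splitting on " and " might be a safer choice for messier data.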
I remove the column I no longer need and rename the contact information. I also reorder my data.
data:image/s3,"s3://crabby-images/92c42/92c4252500aeebd70b7588e3be3d9faac263bf59" alt=""
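The tidy-up steps can be sketched in M as follows. The starting table and the final column names are hypothetical; the post does not spell out the names used after renaming, so ‘Contact 1’ and ‘Contact 2’ are assumed here.

```powerquery
let
    // Hypothetical sample: the data after adding the column from examples
    Source = #table(
        {"Date", "Contact 1.1", "Contact 1.2", "Text Before Delimiter"},
        {
            {#date(2019, 11, 1), "Mary and", "Paul", "Mary"},
            {#date(2019, 11, 1), "Anne and", "Derek", "Anne"}
        }
    ),
    // Remove the column that still contains the trailing "and"
    Removed = Table.RemoveColumns(Source, {"Contact 1.1"}),
    // Give the remaining contact columns their standard names (assumed names)
    Renamed = Table.RenameColumns(
        Removed,
        {
            {"Text Before Delimiter", "Contact 1"},
            {"Contact 1.2", "Contact 2"}
        }
    ),
    // Put the columns back into the expected order
    Reordered = Table.ReorderColumns(Renamed, {"Date", "Contact 1", "Contact 2"})
in
    Reordered
```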
I now have the data in a standard format so that I can append it to data from other salespeople.
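To illustrate why the standard format matters, here is a small sketch of an append using Table.Combine. Both tables and the second salesperson are invented for the example; the point is only that tables sharing the same column names stack cleanly.

```powerquery
let
    // Hypothetical cleaned tables for two salespeople with the same column layout
    John = #table(
        {"Date", "Contact 1", "Contact 2"},
        {{#date(2019, 11, 1), "Mary", "Paul"}}
    ),
    Jane = #table(
        {"Date", "Contact 1", "Contact 2"},
        {{#date(2019, 11, 4), "Anne", "Derek"}}
    ),
    // Append stacks the rows of both tables, matching columns by name
    Appended = Table.Combine({John, Jane})
in
    Appended
```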
Come back next time for more ways to use Power Query!